ATTRIBUTE GROUPING-BASED CATEGORICAL OUTLIER DETECTION USING CAUSAL COUPLING WEIGHT

Attribute grouping-based categorical outlier detection using causal coupling weight

Attribute grouping-based categorical outlier detection using causal coupling weight

Blog Article

Abstract For high-dimensional datasets, outlier objects can be effectively identified and extracted with the help of the coupling relationship between any two attributes.However, when all the coupling is used directly, there is a phenomenon of pseudo-correlation between attribute values that results in redundant coupling and affects Audio Video Connector the effectiveness of high-dimensional outlier detection.In this paper, a novel attribute group-based outlier detection approach for categorical data is proposed by using the attribute causal coupling weights to depict abnormal degree of the attributes.Firstly, according to the local and global correlation, all attributes are automatically divided into several groups, and all attributes in each group have a high correlation or association.Secondly, new concepts of causal pseudo-correlation are defined, and a case analysis that the pseudo-correlation is the main cause of attribute redundant coupling.

By constructing attribute causality graph using the graph structure, the pseudo-correlation is effectively avoided in each attribute group.Thirdly, attribute causal coupling weight formula, which effectively characterizes the abnormal degree of attribute and reflects the causal coupling between any two attributes, is constructed from the causality graph.An attribute group-based outlier detection algorithm powered by causal coupling weight is proposed for categorical data.In the end, experimental results on the UCI and synthetic datasets validate that the algorithm has good outlier detection performance and effectively alleviates the effect of redundant coupling among attributes.Importantly, compared with the competitive methods, the algorithm bolsters the AUC Board Game Barbie index and the detection efficiency by averages of 10.

97 and 42.84 $$%$$ % , respectively.

Report this page