TY - GEN
T1 - OIE
T2 - 2025 ACM SIGMOD/PODS International Conference on Management of Data, SIGMOD-Companion 2025
AU - Xu, Jingzhe
AU - Deng, Yuhao
AU - Chai, Chengliang
AU - Li, Zequn
AU - Wang, Yuping
AU - Cao, Lei
N1 - Publisher Copyright:
© 2025 ACM.
PY - 2025/6/22
Y1 - 2025/6/22
N2 - Outlier detection is crucial for preventing financial fraud, network intrusions, and device failures. However, while existing methods excel at identifying outliers, they often fall short of providing clear and interpretable explanations. This limitation forces users to manually analyze numerous outliers, making the process both time-consuming and inefficient. Additionally, current summarization approaches often focus only on data attributes, ignoring that outliers in the same subspace may have different causes. Consequently, these methods produce broad summaries that make quick diagnosis difficult. To address these challenges, we propose OIE, a system that generates interpretable, fine-grained rules to summarize and explain outlier detection results. OIE leverages decision trees to generate concise rules, balancing simplicity and classification accuracy. Additionally, it employs dynamic data partitioning and a boundary stabilizer to efficiently handle high-dimensional and complex datasets. Through multiple real-world scenarios, OIE demonstrates effective anomaly detection and summarization, providing actionable insights and enhancing the efficiency of outlier analysis. A demonstration video of OIE is available at:http://youtu.be/YzfDdF9f5HI.
AB - Outlier detection is crucial for preventing financial fraud, network intrusions, and device failures. However, while existing methods excel at identifying outliers, they often fall short of providing clear and interpretable explanations. This limitation forces users to manually analyze numerous outliers, making the process both time-consuming and inefficient. Additionally, current summarization approaches often focus only on data attributes, ignoring that outliers in the same subspace may have different causes. Consequently, these methods produce broad summaries that make quick diagnosis difficult. To address these challenges, we propose OIE, a system that generates interpretable, fine-grained rules to summarize and explain outlier detection results. OIE leverages decision trees to generate concise rules, balancing simplicity and classification accuracy. Additionally, it employs dynamic data partitioning and a boundary stabilizer to efficiently handle high-dimensional and complex datasets. Through multiple real-world scenarios, OIE demonstrates effective anomaly detection and summarization, providing actionable insights and enhancing the efficiency of outlier analysis. A demonstration video of OIE is available at:http://youtu.be/YzfDdF9f5HI.
KW - decision tree
KW - interpretable machine learning
KW - outlier explanation
UR - http://www.scopus.com/pages/publications/105010201617
U2 - 10.1145/3722212.3725120
DO - 10.1145/3722212.3725120
M3 - Conference contribution
AN - SCOPUS:105010201617
T3 - Proceedings of the ACM SIGMOD International Conference on Management of Data
SP - 259
EP - 262
BT - SIGMOD-Companion 2025 - Companion of the 2025 International Conference on Management of Data
A2 - Deshpande, Amol
A2 - Aboulnaga, Ashraf
A2 - Salimi, Babak
A2 - Chandramouli, Badrish
A2 - Howe, Bill
A2 - Loo, Boon Thau
A2 - Glavic, Boris
A2 - Curino, Carlo
A2 - Zhe Wang, Daisy
A2 - Suciu, Dan
A2 - Abadi, Daniel
A2 - Srivastava, Divesh
A2 - Wu, Eugene
A2 - Nawab, Faisal
A2 - Ilyas, Ihab
A2 - Naughton, Jeffrey
A2 - Rogers, Jennie
A2 - Patel, Jignesh
A2 - Arulraj, Joy
A2 - Yang, Jun
A2 - Echihabi, Karima
A2 - Ross, Kenneth
A2 - Daudjee, Khuzaima
A2 - Lakshmanan, Laks
A2 - Garofalakis, Minos
A2 - Riedewald, Mirek
A2 - Mokbel, Mohamed
A2 - Ouzzani, Mourad
A2 - Kennedy, Oliver
A2 - Kennedy, Oliver
A2 - Papotti, Paolo
A2 - Alvaro, Peter
A2 - Bailis, Peter
A2 - Miller, Renee
A2 - Roy, Senjuti Basu
A2 - Melnik, Sergey
A2 - Idreos, Stratos
A2 - Roy, Sudeepa
A2 - Rekatsinas, Theodoros
A2 - Leis, Viktor
A2 - Zhou, Wenchao
A2 - Gatterbauer, Wolfgang
A2 - Ives, Zack
PB - Association for Computing Machinery
Y2 - 22 June 2025 through 27 June 2025
ER -