Summarizing sequential data with serial episodes allows non-trivial insight into the data generating process. Existing methods penalize gaps in pattern occurrences equally, regardless of where in the pattern these occur. This results in a strong bias against patterns with long inter-event delays, and in addition that regularity in terms of delays is not rewarded or discovered---even though both aspects provide key insight. In this paper we tackle both these problems by explicitly modeling inter-event delay distributions. That is, we are not only interested in discovering the patterns, but also in describing how many times steps typically occur between their individual events. We formalize the problem in terms of the Minimum Description Length principle, by which we say the best set of patterns is the one that compresses the data best. The resulting optimization problem does not lend itself to exact optimization, and hence we propose Hopper to heuristically mine high quality patterns. Extensive experiments show that Hopper efficiently recovers the ground truth, discovers meaningful patterns from real-world data, and outperforms existing methods in discovering long-delay patterns.
History
Editor
Wooldridge MJ ; Dy JG ; Natarajan S
Primary Research Area
Trustworthy Information Processing
Name of Conference
National Conference of the American Association for Artificial Intelligence (AAAI)
Journal
Proceedings of the AAAI Conference on Artificial Intelligence
Volume
38
Page Range
8346-8353
Publisher
Association for the Advancement of Artificial Intelligence (AAAI)
Open Access Type
Gold
BibTeX
@inproceedings{Cüppers:Krieger:Vreeken:2024,
title = "Discovering Sequential Patterns with Predictable Inter-event Delays",
author = "Cüppers, Joscha" AND "Krieger, Paul" AND "Vreeken, Jilles",
editor = "Wooldridge, Michael J" AND "Dy, Jennifer G" AND "Natarajan, Sriraam",
year = 2024,
month = 3,
journal = "Proceedings of the AAAI Conference on Artificial Intelligence",
number = "8",
pages = "8346--8353",
publisher = "Association for the Advancement of Artificial Intelligence (AAAI)",
issn = "2159-5399",
doi = "10.1609/aaai.v38i8.28676"
}