We study the problem of predicting an event sequence given some meta data. In particular, we are interested in learning easily interpretable models that can accurately generate a se- quence based on an attribute vector. To this end, we propose to learn a sparse event-flow graph over the training sequences, and statistically robust rules that use meta data to determine which paths to follow. We formalize the problem in terms of the Minimum Description Length (MDL) principle, by which we identify the best model as the one that compresses the data best. As the resulting optimization problem is NP-hard, we propose the efficient CONSEQUENCE algorithm to discover good event-flow graphs from data.
Through an extensive set of experiments including a case study, we show that it ably discovers compact, interpretable and accurate models for the generation and prediction of event sequences from data, has a low sample complexity, and is particularly robust against noise.
History
Preferred Citation
Boris Wiegand, Dietrich Klakow and Jilles Vreeken. Discovering Interpretable Data-to-Sequence Generators. In: National Conference of the American Association for Artificial Intelligence (AAAI). 2022.
Primary Research Area
Trustworthy Information Processing
Name of Conference
National Conference of the American Association for Artificial Intelligence (AAAI)
Legacy Posted Date
2021-12-17
Open Access Type
Unknown
BibTeX
@inproceedings{cispa_all_3557,
title = "Discovering Interpretable Data-to-Sequence Generators",
author = "Wiegand, Boris and Klakow, Dietrich and Vreeken, Jilles",
booktitle="{National Conference of the American Association for Artificial Intelligence (AAAI)}",
year="2022",
}