CISPA
Browse
- No file added yet -

Learning Exceptional Subgroups by End-to-End Maximizing KL-Divergence.

Download (1.52 MB)
conference contribution
posted on 2024-09-24, 12:08 authored by Sascha XuSascha Xu, Nils WalterNils Walter, Janis KalofoliasJanis Kalofolias, Jilles VreekenJilles Vreeken
Finding and describing sub-populations that are exceptional in terms of a target property has important applications in many scientific disciplines, from identifying disadvantaged demographic groups in census data to finding conductive molecules within gold nanoparticles. Current approaches to finding such subgroups require pre-discretized predictive variables, do not permit non-trivial target distributions, do not scale to large datasets, and struggle to find diverse results. To address these limitations, we propose SYFLOW, an end-to-end optimizable approach in which we leverage normalizing flows to model arbitrary target distributions and introduce a novel neural layer that results in easily interpretable subgroup descriptions. We demonstrate on synthetic data, real-world data, and via a case study, that SYFLOW reliably finds highly exceptional subgroups accompanied by insightful descriptions.

History

Primary Research Area

  • Trustworthy Information Processing

Name of Conference

International Conference on Machine Learning (ICML)

Journal

ICML

Publisher

OpenReview.net

BibTeX

@conference{Xu:Walter:Kalofolias:Vreeken:2024, title = "Learning Exceptional Subgroups by End-to-End Maximizing KL-Divergence.", author = "Xu, Sascha" AND "Walter, Nils Philipp" AND "Kalofolias, Janis" AND "Vreeken, Jilles", year = 2024, month = 5, journal = "ICML", publisher = "OpenReview.net" }

Usage metrics

    Categories

    No categories selected

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC