We introduce HYPERBARD, a dataset of diverse relational data representations derived from Shakespeare’s plays. Our representations range from simple graphs capturing character co-occurrence in single scenes to hypergraphs encoding complex communication settings and character contributions as hyperedges with edge-specific node weights. By making multiple intuitive representations readily available for experimentation, we facilitate rigorous representation robustness checks in graph learning, graph mining, and network analysis, highlighting the advantages and drawbacks of specific representations. Leveraging the data released in HYPERBARD, we demonstrate that many solutions to popular graph mining problems are highly dependent on the representation choice, thus calling current graph curation practices into question. As an homage to our data source, and asserting that science can also be art, we present our points in the form of a play.1
History
Primary Research Area
Trustworthy Information Processing
Journal
Digital Scholarship in the Humanities
Volume
39
Page Range
74-96
Publisher
Oxford University Press (OUP)
Open Access Type
Hybrid
Sub Type
Article
BibTeX
@article{Coupette:Vreeken:Rieck:2024,
title = "All the world’s a (hyper)graph: A data drama",
author = "Coupette, Corinna" AND "Vreeken, Jilles" AND "Rieck, Bastian",
year = 2024,
month = 4,
journal = "Digital Scholarship in the Humanities",
number = "1",
pages = "74--96",
publisher = "Oxford University Press (OUP)",
issn = "0268-1145",
doi = "10.1093/llc/fqad071"
}