Isolating Unisolated Upsilons with Anomaly Detection in CMS Open Data

Rikab Gambhir, Radha Mastandrea, Benjamin Nachman, Jesse Thaler

arXiv:2502.14036·hep-ph·Published 2025-02-19·Updated 2025-02-27

We present the first study of anti-isolated Upsilon decays to two muons ($Υ\to μ^+ μ^-$) in proton-proton collisions at the Large Hadron Collider. Using a machine learning (ML)-based anomaly detection strategy, we "rediscover" the $Υ$ in 13 TeV CMS Open Data from 2016, despite overwhelming anti-isolated backgrounds. We elevate the signal significance to $6.4 σ$ using these methods, starting from $1.6 σ$ using the dimuon mass spectrum alone. Moreover, we demonstrate improved sensitivity from using an ML-based estimate of the multi-feature likelihood compared to traditional "cut-and-count" methods. Our work demonstrates that it is possible and practical to find real signals in experimental collider data using ML-based anomaly detection, and we distill a readily-accessible benchmark dataset from the CMS Open Data to facilitate future anomaly detection developments.

TopicsParticle & High Energy Physics

Tagsanomaly-detection

arXiv categorieshep-ph, hep-ex, physics.data-an

arXiv abstract pagePDF