The SATCHEL pipeline: A general tool for data classified through citizen science [IMA]

http://arxiv.org/abs/2203.09458


Citizen science is a powerful analysis tool, capable of processing large amounts of data in a very short time. To bridge the gap between classification data products from web-based citizen science platforms and statistically robust signal significance scores, we present the Search Algorithm for Transits in the Citizen science Hunt for Exoplanets in Lightcurves (SATCHEL) pipeline. This open-source, customizable pipeline was constructed to identify one-dimensional features marked by volunteers and assign significance estimates to them. We describe the functional capabilities of the SATCHEL pipeline by applying it to features in photometric time-series data from the Kepler Space Telescope, classified by volunteers as part of the Planet Hunters citizen science project hosted on the Zooniverse platform. We evaluate the SATCHEL pipeline’s overall performance based on its recovery of known signals (both simulations and signals corresponding to official Kepler Objects of Interest, or KOIs) and on its relative contamination by spurious features. We find that, for a range of pipeline hyperparameters and with a reasonable score cutoff, SATCHEL is able to recover volunteer identifications of over 98% of signals from simulations corresponding to exoplanets $>2~R_\oplus$ in radius and about 85% of signals corresponding to KOIs in the same size range. SATCHEL is transparently adaptable to other citizen science classification datasets and is available on GitHub.
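
The evaluation described in the abstract (recovery of known signals versus contamination by spurious features at a given score cutoff) can be illustrated with a short Python sketch. This is not the SATCHEL code and does not reproduce its scoring method. The `Feature` class, the interval-overlap matching rule, and the function name `recovery_and_contamination` are all assumptions introduced here for illustration only.

```python
from dataclasses import dataclass


@dataclass
class Feature:
    """A hypothetical scored one-dimensional feature (not SATCHEL's data model)."""
    start: float  # feature start time, e.g. in days
    end: float    # feature end time, e.g. in days
    score: float  # significance score assigned by some pipeline


def overlaps(a_start, a_end, b_start, b_end):
    """Return True if the closed intervals [a_start, a_end] and [b_start, b_end] intersect."""
    return a_start <= b_end and b_start <= a_end


def recovery_and_contamination(features, known_signals, cutoff):
    """Compute (recovery fraction, contamination fraction) at a score cutoff.

    Assumption: a known signal counts as recovered if any feature with
    score >= cutoff overlaps it in time; a kept feature counts as spurious
    if it overlaps no known signal.
    """
    kept = [f for f in features if f.score >= cutoff]
    recovered = sum(
        any(overlaps(f.start, f.end, s, e) for f in kept)
        for (s, e) in known_signals
    )
    spurious = sum(
        not any(overlaps(f.start, f.end, s, e) for (s, e) in known_signals)
        for f in kept
    )
    recovery = recovered / len(known_signals) if known_signals else 0.0
    contamination = spurious / len(kept) if kept else 0.0
    return recovery, contamination


# Made-up example values, purely to show usage.
features = [
    Feature(10.0, 10.5, 0.9),
    Feature(20.0, 20.4, 0.2),
    Feature(30.0, 30.3, 0.8),
]
known_signals = [(10.1, 10.4), (20.1, 20.3)]
recovery, contamination = recovery_and_contamination(features, known_signals, cutoff=0.5)
```

Raising the cutoff in a sketch like this generally lowers contamination at the cost of recovery, which is the trade-off behind the abstract's phrase "a reasonable score cutoff".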


E. Safron, T. Boyajian and N. Eisner
Fri, 18 Mar 22

Comments: 20 pages, 23 figures. Accepted for publication in MNRAS