Mapping Circumgalactic Medium Observations to Theory Using Machine Learning [GA]

http://arxiv.org/abs/2301.02001


We present a random forest framework for predicting circumgalactic medium (CGM) physical conditions from quasar absorption line observables, trained on a sample of Voigt profile-fit synthetic absorbers from the Simba cosmological simulation. Traditionally, extracting physical conditions from CGM absorber observations involves simplifying assumptions such as uniform single-phase clouds, but by using a cosmological simulation we bypass such assumptions to better capture the complex relationship between CGM observables and underlying gas conditions. We train random forest models on synthetic spectra for \HI and selected metal lines around galaxies across a range of star formation rates, stellar masses, and impact parameters, to predict absorber overdensities, temperatures, and metallicities. The models reproduce the true values from Simba well, with transverse standard deviations of $0.2-0.3$ dex in overdensity, $0.14-0.2$ dex in temperature, and $0.16-0.2$ dex in metallicity predicted from metal lines (not HI), across all ions. Examining the feature importance, the random forest indicates that the overdensity is most informed by the absorber column density, the temperature is driven by the line width, and the metallicity is most sensitive to the specific star formation rate. Alternatively examining feature importance by removing one observable at a time, the overdensity and metallicity appear to be more driven by the impact parameter. We introduce a normalising transform approach in order to ensure the scatter in the true physical conditions is accurately spanned by the network. The trained models are available online.

Read this paper on arXiv…

S. Appleby, R. Davé, D. Sorini, et. al.
Fri, 6 Jan 23
23/55

Comments: 16 pages, 14 figures, submitted to MNRAS. Comments welcome!