Towards cosmological inference on unlabeled out-of-distribution HI observational data
Date
2025
Authors
Andrianomena, S.
Hassan, S.
Publisher
Springer Science and Business Media B.V.
Abstract
We present an approach that accounts for the covariate shift between two datasets of the same observable with different distributions. This improves the generalizability of a neural network model trained on in-distribution samples (IDs) when inferring cosmology at the field level on out-of-distribution samples (OODs) with unknown labels. We use HI maps from the two simulation suites in CAMELS, namely IllustrisTNG and SIMBA. We consider two techniques, an adversarial approach and optimal transport, to adapt a target network whose initial weights are those of a source network pre-trained on a labeled dataset. Results show that, after adaptation, the salient features extracted by the source and target encoders are well aligned in the embedding space. This indicates that the target encoder has learned the representations of the target domain via adversarial training and optimal transport. Furthermore, in all scenarios considered in our analyses, the target encoder, which does not have access to any labels (Ωm) during the adaptation phase, is able to retrieve the underlying Ωm from out-of-distribution maps with an R² score ≥ 0.9, comparable to the performance of the source encoder trained in a supervised learning setup. We further test the viability of the techniques when only a few out-of-distribution instances are available for training and find that the target encoder still reasonably recovers the matter density. Our approach is critical for extracting information from upcoming large-scale surveys.
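The two quantities the abstract relies on, an optimal-transport distance between source and target embeddings and the R² score used to judge the recovered Ωm, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `sinkhorn_cost` and `r2_score`, the entropic (Sinkhorn) regularisation, the squared Euclidean cost, and the toy two-dimensional "embeddings" are all assumptions made purely for illustration.

```python
import math


def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def sinkhorn_cost(xs, xt, eps=0.1, n_iter=200):
    """Entropy-regularised OT cost between two point clouds.

    Assumes uniform weights on both clouds and a squared Euclidean
    ground cost (illustrative choices, not taken from the paper).
    """
    n, m = len(xs), len(xt)
    # Pairwise squared Euclidean cost matrix.
    cost = [[sum((a - b) ** 2 for a, b in zip(s, t)) for t in xt] for s in xs]
    # Gibbs kernel used by the Sinkhorn iterations.
    kernel = [[math.exp(-c / eps) for c in row] for row in cost]
    u = [1.0] * n
    v = [1.0] * m
    for _ in range(n_iter):
        # Alternately rescale rows and columns to match the uniform marginals.
        u = [(1.0 / n) / sum(kernel[i][j] * v[j] for j in range(m)) for i in range(n)]
        v = [(1.0 / m) / sum(kernel[i][j] * u[i] for i in range(n)) for j in range(m)]
    # Transport plan P_ij = u_i * K_ij * v_j; return its total transport cost.
    return sum(
        u[i] * kernel[i][j] * v[j] * cost[i][j]
        for i in range(n)
        for j in range(m)
    )


# Toy "embeddings": a source cloud, an aligned target, and a shifted target.
source = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
aligned = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
shifted = [(x + 0.5, y + 0.5) for x, y in source]

cost_aligned = sinkhorn_cost(source, aligned)
cost_shifted = sinkhorn_cost(source, shifted)
print("OT cost, aligned target:", cost_aligned)
print("OT cost, shifted target:", cost_shifted)

# Hypothetical true and predicted matter densities.
omega_true = [0.1, 0.2, 0.3, 0.4]
omega_pred = [0.12, 0.19, 0.31, 0.38]
print("R2 score:", r2_score(omega_true, omega_pred))
```

In an adaptation setting, a cost of this kind would be minimised with respect to the target encoder's weights so that its embeddings move towards those of the source encoder; the sketch only evaluates the cost on fixed points and performs no training.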
Keywords
Large-scale structure of Universe, Methods: numerical, statistical, Techniques: machine learning, Cosmological inference
Citation
Andrianomena, S. and Hassan, S., 2025. Towards cosmological inference on unlabeled out-of-distribution HI observational data. Astrophysics and Space Science, 370(2), p.14.