Towards cosmological inference on unlabeled out-of-distribution hi observational data

Loading...
Thumbnail Image

Date

2025

Journal Title

Journal ISSN

Volume Title

Publisher

Springer Science and Business Media B.V.

Abstract

We present an approach that can be utilized in order to account for the covariate shift between two datasets of the same observable with different distributions. This helps improve the generalizability of a neural network model trained on in-distribution samples (IDs) when inferring cosmology at the field level on out-of-distribution samples (OODs) of unknown labels. We make use of HI maps from the two simulation suites in CAMELS, IllustrisTNG and SIMBA. We consider two different techniques, namely adversarial approach and optimal transport, to adapt a target network whose initial weights are those of a source network pre-trained on a labeled dataset. Results show that after adaptation, salient features that are extracted by source and target encoders are well aligned in the embedding space. This indicates that the target encoder has learned the representations of the target domain via the adversarial training and optimal transport. Furthermore, in all scenarios considered in our analyses, the target encoder, which does not have access to any labels (Ωm) during adaptation phase, is able to retrieve the underlying Ωm from out-of-distribution maps to a great accuracy of R2 score ≥ 0.9, comparable to the performance of the source encoder trained in a supervised learning setup. We further test the viability of the techniques when only a few out-of-distribution instances are available for training and find that the target encoder still reasonably recovers the matter density. Our approach is critical in extracting information from upcoming large scale surveys.

Description

Keywords

Large-scale structure of Universe, Methods: numerical, statistical, Techniques: machine learning, Cosmological inference

Citation

Andrianomena, S. and Hassan, S., 2025. Towards cosmological inference on unlabeled out-of-distribution HI observational data. Astrophysics and Space Science, 370(2), p.14.