pdetclassifier: Gravitational-wave selection effects using neural-network

Check me out on github.com/dgerosa/pdetclassifier.

We present a novel machine-learning approach to estimate selection biases in gravitational-wave observations. Using techniques similar to those commonly employed in image classification and pattern recognition, we train a series of neural-network classifiers to predict the LIGO/Virgo detectability of gravitational-wave signals from compact-binary mergers. We include the effect of spin precession, higher-order modes, and multiple detectors and show that their omission, as it is common in large population studies, tends to overestimate the inferred merger rate. Although here we train our classifiers using a simple signal-to-noise ratio threshold, our approach is ready to be used in conjunction with full pipeline injections, thus paving the way toward including empirical distributions of astrophysical and noise triggers into gravitational-wave population analyses.

This repository contains models supporting arXiv:2007.06585. We are very happy if you find this useful for your research; please cite our paper. For a DOI pointing to this repository:

This code is developed and maintained by Davide Gerosa. To report bugs, please open an issue on GitHub. If you want to contact me, it's [email protected].

Data products

We provide three kinds of data products:

Our code: pdetclassifier.py. See below for a short description.
Pre-trained TensorFlow neural networks: trained_*.h5.
Training/validation samples: sample_*.h5.These can be downloaded from the github release page.

Models were trained on samples of N=2e7 binaries. This sample is divided in two chunks of 1e7 sources each used for training and validation.

The following models are described in arXiv:2007.06585. - trained_2e7_design_nonspinning_quadrupole_1detector.h5 - trained_2e7_design_precessing_higherordermodes_1detector.h5 - trained_2e7_design_precessing_higherordermodes_3detectors.h5 They are computed assuming LIGO/Virgo noise curves aLIGODesignSensitivityP1200087 and AdVDesignSensitivityP1200087 from lal.

The following additional models use noise curves for LIGO/Virgo O1+O2, O3, and forecasted O4. The training distributions and the network setup is the same as described in the paper. - trained_2e7_O1O2_precessing_higherordermodes_3detectors.h5 - trained_2e7_O3_precessing_higherordermodes_3detectors.h5 - trained_2e7_O4_precessing_higherordermodes_3detectors.h5 For O1+O2 we use the aLIGOEarlyHighSensitivityP1200087 and AdVEarlyHighSensitivityP1200087 noise curves from lal. For O3 and O4 we use the txt files from LIGO-T2000012.

Code and examples

First, install the following python packages: tensorflow, astropy, lalsuite, pycbc, tqdm, and deepdish.

Note: if used as it is, the pdetclassifier.py script assumes precessing systems, higher-order modes, and a three-detector network. If you want to do something different, you'll need to hack it a little bit.

Example 1: use a precomputed model

Here is a code snippet to use a precomputed model:

# Load sample
binaries= readsample('sample_2e7_design_precessing_higherordermodes_3detectors.h5')
# Split test/training
train_binaries,test_binaries=splittwo(binaries)
# Load trained network
model = loadnetwork('trained_2e7_design_precessing_higherordermodes_3detectors.h5')
# Evaluate performances on training sample
testnetwork(model,train_binaries)
# Evaluate performances on test sample
testnetwork(model,test_binaries)
# Predict on new sample
newbinaries = generate_binaries(100)
predictions = predictnetwork(model, newbinaries)
print(predictions)
# Regenerate the extrinsic angles and marginalize over them
pdets = pdet(model,newbinaries, Nmc=1000)
print(pdets)

The binaries object is a python dictionary with keys - mtot: detector-frame total mass - q: mass ratio - z: redshift - chi1x,chi1y,chi1z: dimensionless spin components of the primary - chi2x,chi2y,chi2z: dimensionless spin components of the secondary - iota: inclidation - ra,dec: sky location - psi: polarization. - snr: the SNR - det: detectability, equal to 1 if detectable or 0 if not detectable. The frame of the spins is defined such that z is along L at 20 Hz (as in lal).

The predictions one gets at the end is a list of 0s and 1s, encoding the predicted detectability. One can then marginalize over the extrinsic angles to compute the detection probability pdet (by default the pdet function assumes isotropic inclination, sky-location and polarization, Nmc is the number of Monte Carlo samples).

Example 2: train your own neural network

Here is an example where we generate a small sample of 1000 binaries, train a neural network, and evaluate the performances.

# Generate and store a sample
store_binaries('sample.h5',1e3,approximant='IMRPhenomXPHM',noisecurve='design',SNRthreshold=12)
# Load sample
binaries= readsample('sample.h5')
# Split test/training
train_binaries,test_binaries=splittwo(binaries)
# Train a neural network
trainnetwork(train_binaries,test_binaries,filename='trained.h5')
# Load trained network
model = loadnetwork('trained.h5')
# Evaluate performances on training sample
testnetwork(model,train_binaries)
# Evaluate performances on test sample
testnetwork(model,test_binaries)
# Predict on new sample
newbinaries = generate_binaries(100)
predictions = predictnetwork(model, newbinaries)
print(predictions)
# Regenerate the extrinsic angles and marginalize over them
pdets = pdet(model,newbinaries, Nmc=1000)
print(pdets)

Example 3: evaluate selection effects for your population

Here is an example where we just compute the detectability of an injected population using one of our neural networks

# Initialize
binaries={}
N = int(1e3)
binaries = generate_binaries(N)
# Populate with your distribution
binaries['mtot'] = np.random.normal(30, 3, N )
binaries['q'] = np.random.uniform(0.1,1,N)
binaries['z'] = np.random.normal(0.2, 0.01, N )
binaries['chi1x'] = np.random.uniform(0, 0.1, N )
binaries['chi1y'] = np.random.uniform(0, 0.1, N )
binaries['chi1z'] = np.random.uniform(0, 0.1, N )
binaries['chi2x'] = np.random.uniform(0, 0.1, N )
binaries['chi2y'] = np.random.uniform(0, 0.1, N )
binaries['chi2z'] = np.random.uniform(0, 0.1, N )
# Load trained network
model = loadnetwork('trained_2e7_design_precessing_higherordermodes_3detectors.h5')
# Compute detectability averaged over extrinsic parameters
pdets = pdet(model,binaries, Nmc=1000)
print(pdets)
# Integrate over entire population
predictions = predictnetwork(model, binaries)
integral = np.sum(predictions)/N
print(integral)