Enhancing IMR Consistency Tests for Gravitational Wave Transients

Gravitational Wave Analysis
Enhancing IMR Consistency Tests for Gravitational Wave Transients banner
Kristina Armitage, Quanta Magazine

Time-Domain Analysis

The time domain likelihood function for gravitational wave time series data d=[di]\textbf{d} = [d_i] and waveform model h=[hi]\textbf{h} = [h_i] with Gaussian noise is expressed as L(dh)=12πdetCexp(12(dh)C1(dh))\mathcal{L}(\textbf{d} | \textbf{h}) = \frac{1}{\sqrt{2\pi\det \textbf{C}}} \exp{\left(-\frac12(\textbf{d} - \textbf{h})^\dag \cdot \textbf{C}^{-1} \cdot (\textbf{d} - \textbf{h})\right)} where C\textbf{C} is the noise correlation matrix defined by Cij=ninjC_{ij} = \big\langle n_in_j \big\rangle, where nk=dkhkn_k=d_k-h_k, as the ensemble average.

We analyze the noise covariance matrix, Cij=limn1n=0n1sisΔji+iC_{ij}=\lim\limits_{n\rightarrow\infty}\frac{1}{n}\displaystyle\sum\limits_{\ell=0}^{n-1}s^{\ell}_is^{\ell}_{\Delta_{ji}+i}, where Δji=ji\Delta_{ji}=j-i, which usually is dependent on time ti=iΔtt_i=i\Delta t and displacement τij=ΔjiΔt\tau_{ij}=\Delta_{ji}\Delta t. However, one can make the assumption that the noise is wide-sense stationary, where the mean and variance are both constant in time. Under this assumption, any constant can be added to the indices in the above equation to obtain the same result. This makes C\textbf{C} symmetric since the factors commute, and Toeplitz since the elements along the diagonals are equal. Additionally, since C0 Δji=CΔji 0=C0 ΔjiC_{0\ \Delta_{ji}}=C_{-\Delta_{ji}\ 0}=C_{0\ -\Delta_{ji}}, the elements of C\textbf{C} are even functions of Δji\Delta_{ji}.

Additionally, one can assume that the data is ergodic, meaning that new realizations of n\textbf{n} are obtained via time. Under this assumption and the properties of the elements of C\textbf{C}, the ensemble averages can be replaced with time averages to give, Cij=limn1n=0n1ssΔji+=limn12n=nn1sisΔji+i=12ρ((ji)Δt)C_{ij}=\lim\limits_{n\rightarrow\infty}\frac{1}{n}\displaystyle\sum\limits_{\ell=0}^{n-1}s^{\ell}_{\ell}s^{\ell}_{\Delta_{ji}+\ell}=\lim\limits_{n\rightarrow\infty}\frac{1}{2n}\displaystyle\sum\limits_{\ell=-n}^{n-1}s^{\ell}_is^{\ell}_{\Delta_{ji}+i}=\frac{1}{2}\rho((j-i)\Delta t), where ρ\rho describes the correlation between points in the time series.

For a stationary random process, the covariance takes a symmetric Toeplitz form given by Cij=12ρ((ji)Δt)C_{ij} =\frac{1}{2}\rho((j-i)\Delta t) where ρ\rho is the autocovariance function. This is estimated empirically by autocorrelating a long stretch of noise-only data, where ρ(k)1Nρi=0Nρ1nini+k(NρN)\rho(k) \approx \frac1{N_{\rho}}\sum\limits_{i=0}^{N_{\rho}-1}n_in_{i+k} \quad (N_{\rho}\gg N) for k<Nρ|k|<N_{\rho} and zero otherwise. If, in addition to stationarity, we impose periodic boundary conditions, then ρ(k)=ρ(Nk)\rho(k) = \rho(N - k), and CC will be circulant. Circulant matrices are diagonalized by the discrete Fourier transform, implying the Fourier amplitudes are given by C~ij=n~in~j=12(NΔt)S(fij)δij\tilde{C}_{ij} = \big\langle \tilde{n}_i\tilde{n}_j \big\rangle = \frac{1}{2} (N\Delta t) S(|f_{ij}|) \delta_{ij} where the PSD is derived from the cyclic autocorrelation function through a discrete Fourier transform S(fj)=2Δtj=0N1ρ(k)exp2πijkNS(|f_j|) = 2\Delta t \sum\limits_{j=0}^{N-1} \rho(k)\exp{-\frac{2\pi ijk}{N}}.

Starting from a given PSD estimate S(f)S(|f|) sampled at NN frequencies fjf_j , we can invert the above equation to obtain the corresponding autocorrelation function estimate is given by ρ(k)=12Tk=0NFFT1S(fj)exp2πijkN\rho(k) = \frac{1}{2T} \sum\limits_{k=0}^{N_{\text{FFT}}-1} S(|f_j|)\exp{\frac{2\pi ijk}{N}} as long as the PSD was estimated from data segments of length NFFTN_{\text{FFT}} much longer than the analysis segment length NN. A long segment allows us to truncate the resulting NFFTN_{\text{FFT}}-long ρ(k)\rho(k) to length NN before constructing the covariance matrix. Thereby the covariance matrix can be estimated as a symmetric Toeplitz form.

Once we have constructed a covariance matrix, we can analyze a noisy data stream with the likelihood is evaluated as lnL(dh)=12i,j=0N1(d~ih~i)Cij1(d~jh~j)=i,j=0N1d~iCij1h~j12i,j=0N1d~iCij1d~j=SNRinner12SNRopt\ln\mathcal{L}(\textbf{d}|\textbf{h}) = -\frac{1}{2}\sum\limits_{i,j=0}^{N-1}\left(\tilde{d}_i-\tilde{h}_i\right)C^{-1}_{ij}\left(\tilde{d}_j-\tilde{h}_j\right) = \sum\limits_{i,j=0}^{N-1}\tilde{d}_iC^{-1}_{ij}\tilde{h}_j-\frac12\sum\limits_{i,j=0}^{N-1}\tilde{d}_iC^{-1}_{ij}\tilde{d}_j=\mathrm{SNR}^{\mathrm{inner}}-\frac12\mathrm{SNR}^{\mathrm{opt}}

Gating and In-Painting analyses

If we assume that a detector’s noise is wide-sense stationary and ergodic, only then its covariance C\textbf{C} is a symmetric Toeplitz matrix with elements given by the autocorrelation function of the data. If the autocorrelation function goes to zero in some finite amount of time that is less than T/2T/2, then the covariance matrix is asymptotically equivalent to a circulant matrix. The inverse of the covariance matrix can then be well-approximated by that of the equivalent circulant matrix.

Instead of considering the full set of time samples, if we wish to only evaluate the truncated set dtr={d0,...,da,da+M,...,dN1}\textbf{d}_{\rm tr} = \{d_0, ..., d_a, d_{a+M}, ..., d_{N-1}\}, with M>1M > 1. The data between the time steps [a,a+M)[a,a+M) is said to be gated.

The probability density function of the truncated noise is still a multivariate normal distribution (excising dimensions from a multivariate normal is equivalent to marginalizing over those dimensions). The challenge is that the covariance matrix of the truncated noise Ctr\textbf{C}_{\rm tr} is no longer Toeplitz. Its eigenvectors can no longer be approximated by that of a circulant matrix, and so the above expression for the likelihood is no longer valid. The inverse of the covariance matrix needs to be found by other means. We use gating and in-painting to find the likelihood of the truncated time series. This was applied to the problem of matched filtering; we apply it to parameter estimation.

Define n=ng+x\textbf{n}' = \textbf{n}_g + \textbf{x}, where ng\textbf{n}_g is the noise with the gated times t[a,a+M)Δtt\in [a, a+M)\Delta t zeroed out, and x\textbf{x} is a vector that is zero everywhere except in the gated times. If the non-zero elements of x\textbf{x} are such that (C1n)[k]=0(\textbf{C}^{-1} \textbf{n}')[k] = 0 for all k[a,a+M)k \in [a, a+M), then nC1n\textbf{n}'^\dag \textbf{C}^{-1} \textbf{n}' will be the same as the truncated version ntrCtr1ntr\textbf{n}_{\rm tr}^\dag \textbf{C}_{\rm tr}^{-1} \textbf{n}_{\rm tr}. Our aim is to solve the equation C1(ng+x)=0\textbf{C}^{-1}(\textbf{n}_g + \textbf{x}) = 0 in the gated region. Since x\textbf{x} is zero outside of the gated region, C1x\textbf{C}^{-1}x only involves the [a,a+M)[a, a+M) rows and columns of C1\textbf{C}^{-1}, which form an M×MM \times M Toeplitz matrix. We therefore solve for x\textbf{x} such that C1x=C1ng\overline{\textbf{C}^{-1}}\,\overline{\textbf{x}} = -\overline{\textbf{C}^{-1}\textbf{n}_g}, where the overbar indicates the [a,a+M)[a, a+M) rows (and columns) of the given vector (matrix). This can be solved numerically using a Toeplitz solver. Adding x\textbf{x} to the gated noise (in painting) will then yield the same result as if we had truncated the noise and the covariance matrix.

Note that if the gate spans the entire beginning of the data segment, the truncated covariance matrix Ctr\textbf{C}_{\rm tr} is Toeplitz, and so could be inverted numerically using a Toeplitz solver. The advantage of using in-painting is that it involves solving for an M×MM\times M matrix rather than an (NM)×(NM)(N-M)\times (N-M) matrix. In the standard in-painting approach, one first solves for the in-painting vector x\textbf{x} using a Toeplitz solver and then augments the gated noise ng\textbf{n}_g by setting n=ng+x\textbf{n}' = \textbf{n}_g + \textbf{x}.

Drafting is currently in progress. Sorry for any inconvenience. Please check back soon!