You are on page 1of 13

Mon. Not. R. Astron. Soc. 000, 000000 (0000) Printed 9 August 2016 (MN LATEX style file v2.

2)

Reconstruction of Missing Data using Iterative Harmonic Expansion

Atsushi
1
J. Nishizawa1,2 and Kaiki Taro Inoue 3
Institute for Advanced Research, Nagoya University, Aichi 464-8602, Japan,
2
Kavli Institute for the Physics and Mathematics of the Universe (WPI), The University of Tokyo, Chiba 277-8583, Japan
3
Faculty of Science and Engineering, Kindai University, Higashi-Osaka, 577-8502, Japan
arXiv:1305.0116v2 [astro-ph.CO] 8 Aug 2016

9 August 2016

ABSTRACT
In the cosmic microwave background or galaxy density maps, missing fluctuations in masked
regions can be reconstructed from fluctuations in the surrounding unmasked regions if the
original fluctuations are sufficiently smooth. One reconstruction method involves applying a
harmonic expansion iteratively to fluctuations in the unmasked region. In this paper, we dis-
cuss how well this reconstruction method can recover the original fluctuations depending on
the prior of fluctuations and property of the masked region. The reconstruction method is
formulated with an asymptotic expansion in terms of the size of mask for a fixed iteration
number. The reconstruction accuracy depends on the mask size, the spectrum of the under-
lying density fluctuations, the scales of the fluctuations to be reconstructed and the number
of iterations. For Gaussian fluctuations with the HarrisonZeldovich spectrum, the recon-
struction method provides more accurate restoration than naive methods based on bruteforth
matrix inversion or the singular value decomposition. We also demonstrate that an isotropic
non-Gaussian prior does not change the results but an anisotropic non-Gaussian prior can
yield a higher reconstruction accuracy compared to the Gaussian prior case.
1 INTRODUCTION of Avoidance) and simply ignoring the data in such regions. An-
other approach involves reconstructing the missing fluctuations in
After the first data release of the cosmic microwave background the masked region based on those outside the masked region and
(CMB) temperature fluctuations observed by the Wilkinson Mi- using the reconstructed data as well. However, to do so, it is neces-
crowave Anisotropy Probe (WMAP), multiple authors reported sary to make certain assumptions regarding the prior on the prop-
anomalous signatures, so-called largeangle anomalies, in the erty of the missing fluctuations. To estimate the power spectrum of
CMB on large angular scales (Ralston & Jain 2004; de Oliveira- the fluctuations from an incomplete sky, we can use deconvolution
Costa et al. 2004; Hansen et al. 2004; Hajian et al. 2005; Mof- techniques (e.g. Hivon et al. 2002) if we adopt a prior that the fluc-
fat 2005; Land & Magueijo 2006; Bernui et al. 2006; Copi et al. tuations are statistically isotropic. However, to estimate the density
2007; Eriksen et al. 2007; Montesern et al. 2008; Samal et al. field itself, it is necessary to develop methods that can reconstruct
2009) which were confirmed recently by the Planck Collaboration the phases (if expressed in complex numbers) as well as the ampli-
(Planck Collaboration 2014, 2015c). To date, the origin of these tudes of the missing fluctuations.
anomalies has not been addressed. They may be due to (a) a dif- To reconstruct missing fluctuations on the masked region, it is
ference between a priori and posteriori significance (Aurich et al. necessary to find the inverse of the masking operator. However, in
2010; Pontzen & Peiris 2010; Efstathiou et al. 2010; Bennett et al. general, the mask matrix is singular and therefore is not invertible.
2011) (b) incomplete subtraction of foreground emissions (Abramo It is necessary to make certain assumptions about the underlying
et al. 2006; Cruz et al. 2011; Hansen et al. 2012) (c) a contribution data such as isotropy and smoothness (Abrial et al. 2008; Kim et al.
from largescale structures via the integrated SachsWolfe effect 2012; Bucher & Louis 2012; Starck et al. 2013) or their derivatives
(Inoue & Silk 2006, 2007; Rassat et al. 2007; Afshordi et al. 2009; (Inoue et al. 2008) to regularize the inverse operator. Because the
Francis & Peacock 2010; Rassat et al. 2013; Rassat & Starck 2013; result depends on the choice of the prior, the mutual robustness of
Tomita & Inoue 2008; Sakai & Inoue 2008; Inoue 2012; Planck each reconstruction method should be checked.
Collaboration 2014), or kinetic SunyaevZeldovich effect (Peiris In this paper, we revisit the iterative harmonic expansion
& Smith 2010), (d) possible systematics from instruments (Hanson (IHE) for the regularization of the inverse of a masking opera-
et al. 2010) (e) incomplete treatment of masking (Kim et al. 2012; tor. This method is well known as the Jacobi iterative process and
Rassat et al. 2014), or (f) extensions of inflationary models (Au- has been applied to create CMB maps, as reported in the litera-
rich et al. 2007; Emir Gumrukcuoglu et al. 2007; Rodrigues 2008; ture (Prunet et al. 2000; Hamilton 2003) which is fast and easy to
Bernui & Hipolito-Ricaldi 2008; Cruz et al. 2008; Fialkov et al. use. It has already been implemented in the HEALPIX package as
2010; Zheng & Bunn 2010; Liu et al. 2013). map2alm iterative1 (Gorski et al. 2005). The IHE method is quite
The entire sky cannot be directly observed with a sufficiently robust against the statistical properties of the fluctuations. In this
high signaltonoise (S/N) ratio. A conservative approach involves
masking out the regions where the S/N ratio is low and the sig-
nal is highly contaminated by foreground emissions (e.g., the Zone 1 http://healpix.jpl.nasa.gov/


c 0000 RAS
2 Nishizawa and Inoue
paper, we show that the IHE method does not require statistical where d is the surface element on SN . Equation (3) can be written
isotropy or Gaussianity for the fluctuation to be reconstructed. We in terms of the true harmonic coefficients atrue
j s, as
also demonstrate that the underlying power spectrum of the fluctua- X
true
tions greatly influences the reconstruction accuracy. For simplicity, ai = a j Wi j , (4)
j
we ignore the noise components in our discussion. Because our
main purpose in the study described herein is to apply the IHE where Wi j is the (i, j) component
R of the mode coupling matrix due
method to reconstruct the largeangle CMB fluctuations contam- to the masking: Wi j = d WX j Xi .
inated by the foreground, this assumption is reasonable. Equation (4) implies that the true expansion coefficient atrue
This paper is organized as follows. In Sec. 2, we describe the can be obtained by inverting the mode coupling matrix W. How-
formulation of the IHE method using the IHE on an Ndimensional ever, in general, the matrix can be singular and noninvertible. To
unit sphere and provide a verification of the IHE method based on regularize a singular matrix, the IHE scheme can be employed. The
asymptotic expansion. In Sec. 3, we describe the simulation set we iteration process starts from a set of the pseudo harmonic coeffi-
used and show some numerical results to compare the reconstruc- cients
tion accuracy of the IHE method to that of the bruteforce inver-
a(0) P
i = ai . (5)
sion or the singular value decomposition (SVD) method. We also
discuss the masking effect and the reconstruction accuracy for dif- For n > 1, the nth set of ai s can be constructed from two maps: the
ferent and m modes. In Sec. 4, we present the application of the original map of the unmasked region that was obtained observation-
IHE method to the CMB sky and non-Gaussian fluctuations. In Sec. ally and the map that was reconstructed from the inverse transform
5, we give our conclusions. of the nth ai s in the masked region,
Z h i
a(n)
i = d ()W() + (n) ()R() Xi (), (6)
2 ITERATIVE HARMONIC EXPANSION where
In this section, we describe the IHE method of reconstructing miss- imax
X
ing fluctuations on a masked region. In Sec. 2.1, we formulate the (n) () = ai(n1) Xi () (7)
IHE method for an Ndimensional unit sphere. In Secs. 2.2 and 2.3, i

we discuss the asymptotic expansion of the mask matrix in terms and R = 1 W (see also Fig. 1). Note that the nth iterated real
of the size of a masked region on a circle and a two dimensional space map contains information equivalent to that conveyed by
sphere, respectively. In both cases, we show that the IHE method the (n 1)th harmonic coefficient. Therefore, we call the (NI ) as
I 1)
gives an exact solution in the limit where the size of the masked the NI th estimator together with a(N
i . In the following, we as-
region approaches zero when the iteration number is fixed. sume that fluctuations whose angular scales are larger than that of
the masked region are not significantly correlated with fluctuations
smaller than the masked region. In that case, the summations in
2.1 Formulation Eqs. (4) and (7) can be truncated at a certain multipole imax , which
Suppose a density fluctuation () on a unit Nsphere SN , where can be inferred from the size of the mask, as long as we concern
represents a unit vector pointing to a position in SN . The masking largeangle fluctuations corresponding to multipoles irec,max 6 imax .
function is defined by In the following, we sum the ai s up to the multipole imax , and we
omit the summation symbol when no confusion arises.
obs () = W()(), (1) Recursively substituting equation (7) into (6), we can obtain
the general formula for the NI th iterated harmonic coefficients as
where W = 0 inside the masked region and W = 1 elsewhere. The
a series of Ri j ,
fluctuation can be expanded in terms of spherical harmonics Xi ,
X imax
X
I 1)
() = ai Xi (), (2) a(N
i = a(0) K 2
j (i j + Ri j + Ri j + + [R
NI 1
]i j ), (8)
i j

where Xi () is a solution of the Helmholtz equation, (N +ki2 )Xi = 0 where iKj is the Kronecker delta, Ri j = iKj Wi j , and R2i j = k Rik Rk j
P
where N is the Ndimensional Laplacian on SN and the ki2 s are the and so on. This finite series, which is truncated at the NI th order
eigenvalues defined in ascending order k02 < k12 < k22 , . We use represents an asymptotic expansion of W 1 in terms of the masked
a single subscript index i to represent the scale of each mode. For region. As the area of the masked region approaches zero, the series
instance, for a twodimensional unit sphere S2 , the eigenfunction converges to the true value ai . As shown in Sec. 2.3, the difference
is Yi , the spherical harmonics and the eigenvalues are ki2 = ( + 1). between the NI th estimator and the true ai , (N I 1)
= | j a j [RNI ]i j |
P
i
The index i of Yi () is given by a multipole number l and a magnetic is of the order of O(bNI ). Therefore, in the limit of b 0, the
quantum number m i = 2 + + m + 1. reconstructed fluctuations converge to the true fluctuations. Even if
The sum in Equation (2) should be taken over all imodes. the area of the mask is finite, the series can converge depending on
However, in real applications, we can truncate the sum at a certain the following conditions:
scale if the modes on smaller scales are not physically relevant. The
coefficient ai is called the harmonic coefficient and can be obtained The mask size: As the volume/area of the mask increases, the
from the inverse transformation of Equation (2). If the fluctuations rate of convergence slows down because the residual matrix R sig-
on the masked region is set to zero, we will obtain the socalled nificantly deviates from zero and the contribution to (N i
I)
is not
pseudo harmonic coefficients aPi s, negligible. We shall discuss this issue in Secs. 2.2 and 2.3.
Z The scales we try to reconstruct: The minimum scale corre-
aPi = d ()W()Xi () (3) sponding to the highest imode irec,max to be reconstructed should
be equal to or less than the scale of the imax mode. The choice of


c 0000 RAS, MNRAS 000, 000000
Reconstruction of Missing Data using IHE 3

Figure 1. Schematic diagram of the IHE method. (a) Original map including high multipoles. (b) Original map in which an azimuthally symmetric region is
masked. (c) (b) is expanded in harmonics up to a given max . (d) Combination of the original map (b) outside the mask and the pseudo map (c) inside the mask.
(e) (d) is expanded up to max and inversely transformed to obtain the smoothed map. The process (d)(e) is repeated for NI 1 time. (f) We obtain an NI times
iterated map and modes up to rec,max are extracted. (g) The true fluctuations on largescales up to rec,max

the cutoff scale may change the speed of convergence because the simple and easy to implement. The IHE method may also work for
mode coupling with the high imodes may become important. We three (or higher) dimensional problems provided that the volumes
shall discuss this point in Sec. 3.6. of the missing regions are sufficiently smaller than the scales of
The spectrum of underlying density fluctuations: If an ensem- interest.
ble averaged density fluctuation has a blue spectrum, the effect of
mode coupling, especially from high imodes, becomes more con-
2.2 Asymptotic expansion on a circle
spicuous than in the cases with red spectra. We shall examine this
point in Sec. 3.5. In this section and the next section, we demonstrate that the IHE
method gives a finite inversion in the limit that the size of a masked
Therefore, equation (8) can approximate the underlying true den-
region approaches zero, even in the case when the size is finite. In
sity fluctuation. The optimal number of iterations, NI , under
this section, we consider the reconstruction of the missing fluctua-
the given conditions described above should be evaluated using
tions in a segment on a circle C with a perimeter L = 2 and show
MonteCarlo simulations; this procedure is discussed later in Sec.
that the IHE method is valid in the limit that the size of the segment
3.
converges to zero.
We can think of the reconstruction process as a mapping from
We chose an arbitrary point on C as the origin of the coordi-
an observable to an estimator,
nate , which runs from - to . The mask function is defined as
I 1)
FIHE : ai a(N
i . (9) W(, b) = 0 for || < b/2 and 1 otherwise, where b characterizes
the size of the masked region B: || < b/2. Because the eigen-
As we briefly mentioned above and will discuss in more detail in
function Xm corresponding to an eigenvalue km2 = m2 is given by
Sec. 3, the accuracy of the IHE method depends on the highest
exp(im), a density fluctuation defined on C can be decomposed
mode to be reconstructed imax , the spectrum index ns of the power
into discrete Fourier modes as
spectrum of , the mask size and the number of total iterations NI . X
Therefore, we can write FIHE = FIHE (imax , ns , W, NI ). In Sec. 4.1, () = am exp (im) , (10)
we will show that the details of the boundary shape do not signifi- m=0

cantly affect the reconstruction accuracy if the area of the masked where m is an integer with m = 1 corresponding to the largest
sky is similar. Fourier fluctuation mode on C. As before, the missing fluctuation
The IHE method can easily be applied to a sky with an az- in the masked region B can be constructed by multiplying the mask
imuthally symmetric mask. As shown in Fig. 1, the algorithm is function to the original fluctuation: obs (, b) = ()W(, b).


c 0000 RAS, MNRAS 000, 000000
4 Nishizawa and Inoue
The pseudoestimator for , which is equivalent to the NI = 1 Note that we have written equation (21) in terms of R = 1W rather
IHE is given by than W to limit the integral range to the vicinity of /2, which
m
X max facilitates analysis of the behavior of the estimator. Note also that
(1) () = a(0)
m exp (im) , (11) the subscript i denotes a set of parameters and m. Using equations
m=0 (4) and (5), we can write equations (20) and (21) as
where the Fourier component a(1) is
X
Z a(0)
i = ai a j Ri j (22)
j
a(0)
m = d()W(, b) exp (im) . (12)
where Ri j is the residual mask matrix,
The mask function can be expressed as X
R i1 i2 = si3 T i1 i2 i3 , (23)
W(, b) = 1 ( + b/2) + ( b/2), (13) i3

where () is the Heaviside step function. Then, equation (12) can and
R si denotes the harmonic expansion of the mask residual, si =
be rewritten in terms of a true density a as dR(, b)Yi (). The matrix T is explicitly given in Hivon et al.
(2002) as
X
a(0)
m = am Wmm , (14) Z
m
R T i1 i2 i3 = d Yi3 Yi2 Yi1
where Wmm = dW() exp[i(m m )] corresponds to the mode
" #1/2
coupling matrix in equation (4). Explicitly, it can be written as (21 + 1)(22 + 1)(23 + 1)
= (1)m2
(m m )b 4
" #
K 2
Wmm = mm sin . (15) ! !
m m 2 1 2 3 1 2 3
, (24)
In the limit of b 1, equation (15) can be expanded into a series 0 0 0 m1 m2 m3
as where (:::) is the Wigner 3 jsymbol. Note that the subscript i j de-
1 pends only on j and m j , i.e. i j = 2j + j + m j + 1.
Wmm mmK
b + (m m )2 b3 + . (16)
24 For an azimuthally symmetric mask, si can be analytically in-
Then, the first iterated IHE estimator is tegrated and expanded in terms of b( 1) as
" # Z 2 Z /2+b
(0)
X 1 2 3
am = am am b (m m ) b + ,
(17) si = d d cos Yi (, = 0)
m
24 0 /2b
r
which will recover the true density fluctuation when b 0. In a (2 + 1) cos2 (b) P1 [sin(b)] P1 [sin(b)]
= 2 (25)
similar manner, the second iterated estimator can be written as 4 ( + 1)
X
a(1)
m = am am c(1) (3) 3 4
i b + ci b + O[b ], (26)
m m
where the explicit form of the ci s is given by
(m m )2 + (m m )2 4
" #
b2 + b + . (18) p
24 c(1)
i = 2 (2 + 1)P0 (0), (27)
The second iterative estimator also converges to the true density (3) 1p
ci = (2 + 1)[P0 (0) P2 (0)], (28)
fluctuation in the limit of b 0; however, it converges faster than 3
the first iterative estimator because the order of the difference be- where Pm is the associated Legendre function. Because si is the
tween the estimator and the original value is b2 rather than b. After first order of b and the matrix Ri j is a linear combination of the
NI iterations, we obtain si s, the lowest order of Ri j is b. Therefore, the leading order of the
X
a(NI 1)
= am0 amNI (b)NI 1 difference between the first IHE estimator and the true fluctuation,
m0
m1 ...mNI (0) (0)
i |ai ai |, is also b. For the second iterated IHE estimator, we
( NI have
1 X

2 3
X
b (mn1 mn ) b + , (19) a(1)
i = ai ak Rk j R ji , (29)

24 n=1

j,k
where the difference between the NI th estimator and the true fluc-
tuation is of the same order as bNI . and, therefore, (1)
i
2
= O(b ). As in the onedimensional case, for
I 1)
a given NI , we expect that, (N i = O(bNI ). However, note that
(NI )
i scales to 0 as b 0 but does not approach 0 as NI
2.3 Asymptotic expansion on a twodimensional sphere due to the mode coupling between the different multipole modes,
In this section, we consider the IHE reconstruction of missing fluc- as shown in equation (19). Instead, it converges to the one for the
tuations on a two dimensional sphere. In what follows, we as- direct inversion as illustrated later in Fig. 3.
sume that the mask is azimuthally symmetric: i.e., W(, b) = 0 Fig. 2 presents the D2 accuracy of the IHE map reconstruction
for |/2 | < b and 1 otherwise. As described in Sec. 2.2, the as a function of the angular size of the masked region, which is
pseudo or first IHE estimator can be written as, compared to the accuracy of the truncated asymptotic expansion in
Z equation (22). To compute the D2 accuracies we used the fiducial
a(0)
i = d ()W(, b)Yi () (20) set of simulations described in Section 3.1 with various mask sizes.
Z The circles and diamonds represent the D2 accuracies defined by
= ai d ()R(, b)Yi (). (21) equation (30) for the IHE reconstruction and the truncated asymp-
totic expansion defined by equation (22), respectively with filled


c 0000 RAS, MNRAS 000, 000000
Reconstruction of Missing Data using IHE 5
100 over the unmasked sky and can be subtracted out before reconstruc-
tion. As a simple model of the zone of avoidance, we considered an
azimuthally symmetric mask with W() = 0 at |/2 | < b and
W = 1 otherwise with b = 20 . We used pixels that were suf-
10-1
ficiently smaller than the size of the mask and the reconstructed
fluctuation scales to reduce errors due to the pixelization effect. We
adopt the Healpix resolution Nside = 1024 (the total number of pix-
2
10-2 els in the entire sky is Npix = 12 Nside 1.2 107 ), where the pixel
D2

size corresponds to 3.4 arcmin. The input power spectrum should


be truncated at scales cut sufficiently small compared to the recon-
b
sin( )
structed scales max , i.e. max cut . In our simulated maps, we set
10-3 IHE (NI=1) cut = 30, which is a scale sufficiently smaller than the mask size.
asymp. exp. ( NI=1) We also conducted the same analysis with cut = 100; however, the
IHE (NI=4) result was unchanged. We use this set of fiducial simulations unless
asymp. exp. ( NI=4) otherwise stated. In addition to this fiducial set, we also generated
10-4
0 5 10 15 20 25 30 35 the same simulation set for the input spectrum indices ns = 0 and
b[degree] ns = 4.
The deviation from the original fluctuations can be measured
Figure 2. Accuracy of fluctuations reconstructed using IHE method and from the ratio of the L2 norms denoted by D2 ,i.e. the squared dif-
truncated asymptotic expansion. Filled and open circles represent the D2
ference between the density fluctuations of the original and recon-
accuracies of fluctuations reconstructed using IHE method and filled and
open diamonds are fluctuations obtained using the asymptotic expansion
structed maps divided by the squared density fluctuations of the
defined in equation (22), with filled and open symbols denoting 1 and 4 it- original map:
erations, respectively. Also shown with dotted line is the sin(b) curve which PNpix
[rec (i ) true (i )]2
corresponds to the expected D2 accuracy when masked region is filled with D i PNpix
2
(30)
zeros. i 2true (i )
where rec and true describe the reconstructed and the original den-
sity fluctuations up to the highest multipole:
symbols denoting 1 iteration and open symbols corresponding to
4 iterations. One can see that the filled and open circles and dia- irec,max
X
monds agree very well, suggesting that the convergence of the IHE true () atrue
i Yi (), (31)
method can be numerically verified. Furthermore, we expect from i=0

equation (29) that the D2 accuracy improves as the number of itera- irec,max
X
tions increases. This tendency is more clearly observable at smaller rec () a(N
i
I)
Yi (). (32)
b. In the following, we discuss the reconstruction of the missing i=0

fluctuations on a twodimensional sphere;however, in general, the


reconstruction method can be used for an Ndimensional sphere.
3.2 Comparison with W 1 and SVD
Using equation (4), we can compare the IHE method to the brute
force inversion of the matrix W and the SVD (Efstathiou 2004).
3 RESULTS If the mode coupling matrix W is invertible, we obtain a unique
solution for the underlying density fluctuation within the mask.
In this section, we describe the numerical simulation that we per-
However, if the matrix W contains some eigenvalues that are close
formed to test the IHE method and present the definition of the L2
to zero, where the matrix is close to being singular, the inversion
norm that we used to investigate the reconstructed fluctuations in
causes large errors. In such cases, we can remove the singulari-
Sec. 3.1. Then, we present comparisons of the IHE method with
ties by replacing the small eigenvalues with zero; this procedure
other inversion methods using the bruteforce inversion or the sin-
is called the SVD method. Note that the order of the matrix W is
gular value decomposition (SVD) in Sec. 3.2. We describe the fluc-
closely related to the singularity of W. If we take the order of W to
tuation conditions that can be successfully reconstructed by using
be sufficiently large to include most of the modes that are coupled
the IHE method in Sec. 3.5. In Sec. 3.6, we discuss the accuracy
to ai , the inverse of W, if it exists, gives an accurate solution of ai .
for each and m mode.
However, the matrix inversion is hampered by the ill-posed nature
of the inversion due to singularities. Conversely, if we truncate the
order of W at a sufficiently small scale max , the matrix is invertible;
3.1 Simulations however, the inversion matrix gives a biased solution of ai because
To assess the reconstruction accuracy, we generated 103 random the possible couplings from higher modes are discarded by the max
realisations of an isotropic Gaussian density field in the sky. We truncation. Here we define W as an imax imax matrix. As the first
used the code synfast, which is publicly available as a package step of the SVD method, we decompose the matrix W into three
in HEALPIX to generate random Gaussian maps. First, we used matrices:
the HarrisonZeldovich spectrum as the input power spectrum. W = U V, (33)
It gives an angular power spectrum C ns , where ns = 2 on
large angular scales, in the Einstein deSitter universe, which cor- where V and U are imax imax unitary matrices and the superscript
responds to the SachsWolfe plateau of the CMB power spectrum. denotes the Hermitian conjugate. is the diagonal matrix, which
We set the monopole power to zero because it is a uniform value consists of the eigenvalues of W, called the singular values. The


c 0000 RAS, MNRAS 000, 000000
6 Nishizawa and Inoue

(a) C 0 The choice of the threshold k is not trivial and should be


carefully determined a priori because the mapping FSVD de-
pends on various factors including the threshold k , i.e. FSVD =
101 101
FSVD (max , ns , W, k ). In this context, a singularity can only be de-
fined in terms of k . More specifically, the mapping FSVD is con-
2

taminated by the eigenvalues that are close to being singular if


D

100 100 D2 /k < 0. In such cases, the threshold of the eigenvalues should
be increased to enable more accurate reconstruction. The brute
force inversion corresponds to Finv = FSVD (max , ns , W, k = 0).
10-1 10-1 The optimal choice of max should depend on ns , W and even on
0.0 0.2 0.4 0.6 0.8 1.0 100 101 102
k N I
rec,max . Here we simply set max = rec,max .
In the left panels of Fig. 3, D2 is depicted as a function of
(b) C 2
the minimum nonzero eigenvalue k . The solid lines represent the
max
=3
different maximum multipoles to be reconstructed, max = 3, 5 and
max
=5

max
= 10 10. The three different rows show the different input power spec-
10 0
10 0 W 1

tra, which will be discussed in detail in Sec. 3.5. Let us focus on


b
sin( )

the middle left panel. For max = 3 and 5, D2 increases monoton-


2
D

ically as the threshold increases. Thus, FSVD is not contaminated


by singular eigenvalues because D2 /k > 0. On the other hand,
10-1 10-1 Conversely, for max = 10, D2 has a minimum near k 0.4. There-
fore, eigenvalues smaller than 0.4 may be the contaminants of the
0.0 0.2 0.4 0.6 0.8 1.0 100 101 102 mapping F, and we can better estimate the original density fluctua-
k N I tions when we limit the eigenvalues to < 0.4. In the right panel of
Fig. 3, we show D2 for the IHE method as a function of the number
(c) C 4 of iterations. The D2 has a minimum near NI < 10 that depends
0
10 100 on the maximum multipole. After a sufficient number of iterations,
D2 converges to the bruteforce inversion results, which are shown
10-1 10-1
as the horizontal dashed lines in Fig. 3, i.e. FIHE (NI ) = Finv .
The IHE method with a certain finite number of iterations is there-
2
D

fore always more accurate than the SVD method. However, in prac-
10-2 10-2 tice, we should know the optimal number of iterations a priori. This
number depends on the mask size, the maximum multipole to be re-
10-3 10-3
constructed and the underlying spectrum of the density fluctuation.
0.0 0.2 0.4 0.6 0.8 1.0 100 101 102 We can estimate the optimal iteration number using MonteCarlo
k N I simulations. In Sec. 3.5 we will see how the result changes for dif-
ferent types of underlying power spectra.
Figure 3. (Right) D2 accuracy versus iteration number. Panels (a), (b) and Note that the computation time for estimating a(NI ) can be sig-
(c) show the different underlying density fluctuations that have powerlaw
nificantly reduced if we use equation (6) instead of equation (8).
spectra of C 0 , 2 and 4 , respectively. We chose max = rec,max . For
The reason is as follows. Because the rank of the matrix Ri j is on
an ns = 0 spectrum, IHE, SVD, and bruteforce inversion worsen recon- 2 4
struction results. However, for ns = 2, there exists an optimal number of the order of max , it costs Npix max according to equation (24), and
6
iterations near NI = 10 that minimizes D2 and depends on max . For ns = 4 the matrix algebra of equation (8) costs NI max computations.
2
spectrum, D2 converges to a few percent accuracy after sufficient number If one uses (equation (6), it would be on the order of NI Npix max ,
of iterations.In all cases, after a sufficient iterations, D2 converges to value where NI is the number of iterations, Npix is the number of pixels
obtained by direct inversion. (Left) SVD method results shown for compar- that tile the sky and max is the maximum multipole to be recon-
ison versus eigenvalue threshold k . We also show the sin(b) = 0.34 curves structed. For example, for given max = 10, Npix = 12 10242
where masked region is filled with zeros. For ns = 0, doing nothing is pixels and five iterations, the computation time will be reduced by
optimal. a factor of 25.

order of the eigenvalues in is arbitrary; however, they are ar-


ranged in a descending order so that the decomposition is deter-
mined uniquely. Let the kth eigenvalue be k , and the eigenvalues
smaller than k be zero. Then, the pseudoinversion of W can be
3.3 Statistical properties
written in terms of + , the rankk diagonal matrix that consists of
the reciprocal of the nonzero eigenvalues that are equal to or larger The statistical isotropy of the CMB map (e.g. Planck Collaboration
than k : 2015c) and its Gaussianity (e.g. Planck Collaboration 2015d) has
previously been examined. When the map reconstruction method
W + = V + U , (34) is applied to the CMB map, the statistical properties of the CMB
temperature fluctuations should remain unchanged. Using our fidu-
which gives
cial simulation set (ns = 2, rec,max = 5 and b = 20 ), which is
X
aest a j Wi+j . described in Sec. 3.1, we discuss how the IHE method affects the
i = (35)
j underlying statistical properties via the reconstruction.


c 0000 RAS, MNRAS 000, 000000
Reconstruction of Missing Data using IHE 7
3.3.1 Statistical isotropy with six iterations, where an ensemble average was taken over 1000
random realizations. Thus, the PDFs of the reconstructed fluctua-
First, the statistical isotropy of a fluctuation can be measured from
tions and the original fluctuations were consistent within the 2
the ratio between the off-diagonal and corresponding diagonal
level.
terms in the correlation matrix of the expansion coefficients (Inoue
However, the significance decreased once we took into ac-
2000):
count the spatial correlation of the fluctuations on the pixels. Such

m
ham a m i cases have been observed in cosmological and astronomical signals
fm p , (36)
h|am |2 ih|a m |2 i (Olea & Pawlowsky-Glahn 2008). The covariance of the fluctua-
tions in the pixels in the sky is given by
where the ensemble average is taken over 1000 realizations. For
statistically isotropic fluctuations, we expect that the off-diagonal hx( ni )x( n j )i = C(i j ), (40)
terms satisfy f 1. To visualize the matrix, we contract the four
where ni points to the sky position of the i-th pixel, and cos i j =
dimensional subscript into twodimensional ones as before. Figure
m ni n j . We constructed the covariance matrix directly from our 1000
4 illustrates the distribution of fm . The right panel shows the val-

m realizations. If the PDF of the fluctuations is Gaussian or, more
ues for each element of fm calculated from the input map, which
generally, is fully described in the quadratic form of the data, we
contains only the largest modes, < 6. Because the matrix com-
m can diagonalize the covariance matrix as
ponent is symmetric, we present the fm elements for the original
fluctuation in the upper left triangle matrix and those for the IHE re- M = OT CO, (41)
constructed method in the lower right triangle matrix. The original
map clearly satisfies statistical isotropy. The left panel shows a his- where M = diag[1 , 2 , , n ], s are eigenvalues rearranged in
togram of f with the inset values being the mean and the standard descending order and the matrix O consists of n eigenvectors. If the
deviation. There are a few modes with large reconstructed fluctua- fluctuations are highly spatially correlated, the rank of the matrix
tions; however, f is statistically consistent with zero within 1. M is always less than n; therefore, for i > rM , i = 0, where rM is
the rank of M. Then the de-correlated data is shrunk such that
1  T 
3.3.2 Probability distribution xi xi = O x , for i 6 rM . (42)
i i

Next, we describe how the probability distribution function (PDF) With this de-correlated dataset, we found that a KS test yielded
of the fluctuations is modified by a reconstruction using the hi = 0.64 for the IHE reconstructions, still fully consistent with
Kolmogorov-Smirnov (KS) test. The KS test enables us to discrim- the original PDF.
inate the difference between the two underlying PDFs in a non- Finally, we applied a single-sample KS test to determine
parametric manner, or the difference between the PDFs taken from whether the PDF of the data was Gaussian. Using the rescaled and
a single sample of an assumed function. de-correlated data for this test, we found that hi = 0.57 and 0.58
First, we conducted two sample KS tests using a fiducial set for the original and IHE reconstructed fluctuations, respectively.
of 1000 random Gaussian simulations. Let xi be the pixel value of Therefore, we concluded that the original map was fairly consis-
a certain realization of an original simulated map at the position of tent with a Gaussian distribution and that the IHE reconstruction
the i-th pixel and yi be the corresponding reconstructed pixel value. did not change the underlying statistical properties significantly.
Then, we define the cumulative distribution functions (CDFs) for
these two data as
n
1X 3.4 Power spectrum reconstruction accuracy
Fn (x) = U(xi x),
n i
In Sec. 3.3.2, we demonstrated that the IHE reconstruction does
m
1 X not significantly change the statistical properties of the map. In this
Fm (x) = U(yi x), (37) section, we will show how a two-point statistic, the power spec-
m i
trum, is affected or recovered using the IHE method. Again, we use
where U(x) is the unit step function and n = m = Npix . In practice, our fiducial simulation set to access the reconstruction. The power
we divided the sky into Npix pixels of the Healpix with a resolution spectrum of the -th realization map can be estimated to be
of Nside = 32. The KS statistic is the supremum of the difference
1 X 2
between the two measured CDFs: C = a , (43)
2 + 1 m m
Dnm = sup |Fn (x) Fm (x)| . (38)
x where the true power spectrum can be estimated from the arith-
metic mean over 1000 realizations, C = hC i = C /1000 with
P
Based on the hypothesis that the reconstructed PDF is consistent
a variance of 2 = (C C )2 /999. To quantify the discrepancy
P
with the original one,
r the probability that the statistic has a value
nm between the power spectra of the reconstructed and original fluctu-
larger than nm = Dnm is ations, we use the relative difference summed over the multipoles,
n+m
X C rec !

C = org 1 , (44)
X
(nm < ) = 1 (1) j exp[2 j2 2 ]. (39) C

j=

To detect the difference in the shape of the PDF, we consider the de- and the Maharanobis distance,
generacy due to different statistical measures. To do so, we rescaled rec,max
X (Crec Corg )2
the data as y = (ym)/, where m is the mean and is the standard D2M = , (45)
deviation. We found that hi = 0.104 for the IHE reconstructions
2


c 0000 RAS, MNRAS 000, 000000
8 Nishizawa and Inoue

90 20 Original 1.00
80 original : 0.003 0.022
0.75
70 IHE : 0.001 0.023 15
60 0.50

index i
50
10 0.25
40
30 0.00
20 5
0.25
10 IHE
0 0 0.50
0.4 0.2 0.0 0.2 0.4 0.6 0.8 1.0 0 5 10 15 20
lm
index i
flm

Figure 4. (Left) Distribution of f over different modes. Dashed vertical lines show the 1 regions. Inset values are the mean and the standard deviation.
(Lower Right) Components of f in original map (upper left triangle) and the IHE reconstruction with Nite = 6 (lower right triangle).

3.5 Given the typical scale of the mask M , it is impossible to re-


3.0 input construct a fluctuation whose scale is smaller than max M >
IHE 180/M . Due to the mode coupling, fluctuations with angular scales
2.5 corresponding to M are strongly affected by fluctuations with
pseudo
2.0 smaller angular sizes. If the spectral index is negative, the ampli-
input

tude of a smaller scale fluctuation is weak and does not strongly


C /C

1.5 disturb the large-scale fluctuations. Therefore, the deconvolution


1.0 mapping F is less affected by singularities. However, if the spec-
trum is flat or has a positive slope, the large-scale modes are highly
0.5
contaminated by the small-scale fluctuations.
0.0 In Fig. 3, the top and bottom panels show the D2 accuracies for
1 2 3 4 5
multipole ns = 0 and ns = 4. For the ns = 0 case, the SVD has a minimum
at k 0.5 that is larger than when ns = 2. Thus, FSVD (ns = 0) is
more affected by singularities than FSVD (ns = 2). The IHE results
Figure 5. Power spectrum reconstruction accuracy for fiducial simulation
sets. Shown are reconstructed power spectra C normalized by spectrum in the right panels shows that NI = 1 gives the highest accuracy
from original map. Error bars are 1 regions computed from 1000 realiza- and that the reconstructed accuracies gradually degrade to the value
tions, and shaded regions are 1 and 2 regions for original spectrum. given by the bruteforce method. Conversely, FSVD (ns = 4) is not
affected by singularities because it always shows D2 /k < 0 for
the SVD method.
where Crec is estimated from either a pseudo-fluctuation or the IHE
reconstructed fluctuations. Fig. 5 shows the averaged power spec-
3.6 Reconstructed accuracies for each and m mode
trum of the fiducial simulations normalized by input spectrum. We
obtained C = 0.29 and 0.022 for the pseudo- and IHE recon- It is important to pay attention to the dependence of the recon-
structions and found that for 6 5, D2M = 0.88 and 0.48, respec- struction accuracy on the multipoles and m. If there are particular
tively. Therefore, the IHE method provides a less biased estimate modes that do not suffer from the masking effect, we can use them
of the power spectrum C . On large scales, the suppression of the to perform robust cosmological analysis.
power due to the masking is mitigated by the IHE reconstruction, Fig. 6 shows a scatter plot of the original and reconstructed
while on smaller scales, rec,max and the reconstruction slightly am s for an azimuthally symmetric 20 mask. The solid line rep-
overestimates the power spectrum. resents a relation in which the reconstructed am is identical to the
original one. The dashed red and blue lines in each panel are the
best linear fits for the pseudoam s and the IHE method, respec-
3.5 Dependence on underlying power spectrum tively. The statistical accuracy is also shown in each panel: ,
and r are the average differences between the original and recon-
The reconstruction accuracy depends on the underlying power
structed am s, the standard deviation, and the best fitted slope of the
spectrum C ns of the fluctuation because the harmonic modes
linear fit, respectively. We note the following three characteristics:
are not independent in the masked incomplete sky even if the under-
lying fluctuation is Gaussian. We consider three power law indices, For the odd modes ( + m = 2n + 1, where n is an integer), the
ns = 0, 2 and 4, which correspond to the following three cases. masking effect is sufficiently small to enable accurate map recon-
The projected two-dimensional galaxy or dark matter distribution is struction. In that case, the pseudoam s are already accurate and
g
approximated as C 0 (e.g. Frith et al. 2005) on large scales, and the IHE method slightly improves the accuracy.
the ordinary SachsWolfe spectrum is approximated as CSW 2 For the even modes ( + m = 2n, where n is an integer), the
(Sachs & Wolfe 1967). On very largeangular scales, the integrated masking systematically suppresses the amplitude of fluctuations.
SachsWolfe effect, which gives CISW 4 (e.g. Cooray 2002) For a given mode, the suppression is more significant for larger m
dominates the CMB in the standard CDM scenario. modes.


c 0000 RAS, MNRAS 000, 000000
Reconstruction of Missing Data using IHE 9

= 0.000
IHE(NI =10)
1.0

=0.181
r =0.909 Pseudoam
0.5
0
=

0.0
= 0.006
0.5
=0.376
1.0 r =0.660
1.0 = 0.000 = 0.001
=0.028 =0.159
0.5
r =1.000 r =0.783
1
=

0.0
=0.001 = 0.000
0.5
=0.044 =0.258
1.0 r =0.959 r =0.512
1.0 =0.000 = 0.001 =0.003
=0.184 =0.030 =0.145
0.5
r =0.910 r =1.001 r =0.841
2
=

0.0
=0.005 = 0.001 =0.003
0.5
=0.372 =0.041 =0.182
1.0
r =0.673 r =0.909 r =0.394
1.0 =0.000 =0.001 =0.000 = 0.006
=0.057 =0.127 =0.037 =0.139
0.5
r =0.994 r =0.846 r =0.999 r =0.752
3
=

0.0
= 0.002 =0.000 =0.001 = 0.001
0.5
=0.090 =0.196 =0.046 =0.150
1.0 r =0.824 r =0.709 r =0.850 r =0.329
1.0 = 0.000 =0.002 = 0.002 =0.001 = 0.001
=0.149 =0.047 =0.101 =0.044 =0.151
0.5
r =0.954 r =0.994 r =0.947 r =1.004 r =0.964
4
=

0.0
= 0.004 =0.001 = 0.003 =0.001 = 0.006
0.5
=0.275 =0.063 =0.119 =0.049 =0.119
1.0
r =0.798 r =0.781 r =0.709 r =0.788 r =0.275
1.0 = 0.000 = 0.001 = 0.000 =0.004 =0.002 =0.007
=0.073 =0.095 =0.051 =0.089 =0.047 =0.133
0.5
r =1.001 r =0.966 r =0.993 r =0.939 r =1.000 r =0.955
5
=

0.0
=0.002 = 0.000 = 0.001 =0.001 =0.002 =0.003
0.5
=0.112 =0.128 =0.062 =0.088 =0.049 =0.103
1.0
r =0.751 r =0.890 r =0.737 r =0.751 r =0.725 r =0.236
1.0 0.5 0.0 0.5 1.0 1.0 0.5 0.0 0.5 1.0 1.0 0.5 0.0 0.5 1.0 1.0 0.5 0.0 0.5 1.0 1.0 0.5 0.0 0.5 1.0 1.0 0.5 0.0 0.5 1.0

m = 0 m = 1 m = 2 m = 3 m = 4 m = 5
Figure 6. Reconstruction accuracy for individual (top to bottom) and m modes (left to right). We assume that |b| < 20 region is masked and that the
underlying density fluctuation obeys isotropic Gaussian statistics with the Zeldovich spectrum (ns = 2). In each panel, horizontal axis shows input am s and
vertical axis represents reconstructed am s. Each point describes one realization. Red and blue points are the pseudoam s and am s reconstructed with NI = 10
using the IHE method, respectively. For illustrative purposes, we show only 100 samples which were randomly picked from 1000 realizations, and whose am
magnitudes range from 1 to 1.

For a given mode, the masking effect is the most significant Therefore, the above dependency of the reconstruction accuracy is
for the = m mode and the effect is more significant for higher simply correlated with the choice of the basis function relative to
modes. the mask geometry. The circle symbols in Fig. 7 show the best fit
of the slope of the pseudoam s obtained in Fig. 6 as a function
These dependencies are closely related to the value of the di-
of Qm for the even (toppanel) and odd (bottompanel) modes,
agonal part of the mask matrix, i.e.
Z respectively. We can see a clear correlation between the slopes and
Qm s. For the odd modes, the Qm s are larger, while they are smaller
Qm Wmm = d cos d |Ym (, )|2 W(, b), (46)
for the even modes. In each panel, the different modes are distin-
guished by the color levels. It is evident that the lower modes tend
which quantifies how important the fluctuation outside the masked
to have a larger values of Qm and, therefore, larger slopes r. The
region is and takes values between 0 and 1. In the limit where the
blue squares in Fig. 7 are the same as before but for the am recon-
area of the masked region approaches zero, the mask matrix be-
structed using the IHE method. Note that the IHE reconstruction
comes an identity matrix; therefore, Qm 1, which means that
works pretty well for the odd modes, as the slope r 1 implies.
100% of the fluctuation distribution is outside the mask. For the odd
Conversely, the reconstruction for the even modes is less accurate
modes, because Ym is small near the equator, i.e. within the masked
than that for the odd modes.
region, the fluctuation in the masked region is not significantly dif-
ferent from zero. In this case, Qm is close to unity. Conversely, for Even though Qm can explain the strength of suppression of
the even modes, Ym takes relatively larger values near the equator; the am s, it does not have a perfect correlation with the slope r.
therefore, the fluctuation inside the mask becomes important com- Consequently, we need to consider other factors as well. As we
pared to that outside the mask. In this case, Qm is close to zero. mentioned above, Qm represents the amount of fluctuations leaking


c 0000 RAS, MNRAS 000, 000000
10 Nishizawa and Inoue
1.2 5 b = 1 b = 20
1.0 4
1.0
W
11 1

0.8
0.8
W
33 3
3 W
slope r

0.6
55 5

Wm m
0.6 0.4
2
0.4 0.2

1 0.0
0.2
even mode
0
0.2

0.0
1.2 5 1.0
W
53 3
1.0
0.8
W
54 4
4 0.6
W
55 5

Wm m
0.8
slope r

0.4

0.6 3 0.2

0.4 2
0.0

pseudoam 0.2
0.2
odd mode IHE(NI =10)
1
0 5 10 15 20 25 0 5 10 15 20 25

0.0
0.2 0.3 0.4 0.5 0.6 0.7 0.8

Qm
Figure 8. Mask matrix, W for (left) b = 1 and (right) b = 20 . Top and
bottom panels correspond to m = m = modes and m = m , = 5 modes,
Figure 7. Correlation between Qm defined in equation (46) and the mask-
respectively. Here, we show only nonzero components. For b = 1 case,
ing effect probed by the fitted slope r for pseudoam s (circle) and IHE
matrix is close to identity matrix, while for b = 20 case, diagonal parts
reconstructed am s (square). Colors of circles represent corresponding
leak into the offdiagonal parts, which induces mode coupling between dif-
modes shown in color bars at right. Error bars are 1 regions derived from
ferent , m modes in am s.
1000 random simulations. Pseudoam s are linearly correlated with Qm s
with some errors. Top and bottom panels are for even ( + m = 2n) and odd
( + m = 2n + 1) modes, respectively. Planck DR2 2 (Planck Collaboration 2015a), and neglected the
point source masks because the masks for each point source were
too small to affect the reconstruction of the large mode fluctua-
outside the masked region. In other words, Qm represents a fraction tions, < 10. However, note that the fluctuations inside the point
of the diagonal components in the mode coupling matrix Wm m , source mask could be well reconstructed by limiting the reconstruc-
tion area in the vicinity of the point source mask region instead of
P
where Wm m = 1 for a given and m mode. Note that for
azimuthally symmetric masks, the Wm m s have nonzero values the entire sky. In that case, we could redefine a local orthogonal
only for the m = m and = + 2n modes, where n = 0, 1, 2, . system, i.e. the harmonics in the finite two-dimensional flat space.
Fig. 8 shows the nonzero components of the mask matrix, Wm m . However, such analysis was beyond the scope of this study and
The top and bottom panels show the = m modes for different will be investigated in the future. Figure 9 illustrates the Galac-
and the different m modes for = 5, respectively. In the top panel, it tic masks we used. The colors (from black to white) show differ-
is evident that W at = m, which is Qm , decreases monotonically ent masking schemes in which the areas of the unmasked regions
with . It is also apparent that for the b = 1 case, the matrix are 60%, 70%, 80%, 90% and 97 % of the entire sky. We generated
is almost diagonal, while for the b = 20 case, there is a long 1000 random Gaussian simulations with an input spectrum from
tail towards higher modes. This tail can induce mode coupling the latest Planck CDM cosmological model Planck Collaboration
between different modes, and the strength of the coupling depends (2015b).
on each realisation of the map. The scattering of points centered at Figure 10 presents the reconstruction accuracy for the simu-
the dashed lines in Fig. 7 may be due to this effect. lated CMB maps masked by the Galactic plane. For all the rec,max ,
the IHE reconstruction with a finite number of iterations gives bet-
ter reconstructions than the direct inversion. By comparing the re-
sults with those shown in Figure 3, remembering that 65% of the
sky of a b = 20 mask is unmasked, it can be seen that the ns = 2
4 APPLICATION TO CMB SKY results are consistent with the case in which the Galactic plane is
4.1 Planck Galactic mask and CMB power spectrum masked. As mentioned, the detailed shape of the mask does not sig-
nificantly affect the reconstruction using the IHE method if the area
We again consider an isotropic Gaussian prior; however, now, we of the masked sky is similar. The optimal number of iterations de-
discuss the application of the IHE method to more realistic cases. pends on the size of the mask and the reconstruction scale rec,max ;
First, we describe a reconstruction of the CMB map, which is therefore, the number of iterations should be determined before re-
masked by the Galactic plane. In the concordant model, on super- constructing the CMB map.
horizon scales, the spectral index of the CMB power spectrum is
approximately given by ns = 4 because the ISW effect is dom-
inant. On horizon scales, the index increases to ns = 2, which 4.2 Non-Gaussian Prior
corresponds to the Harrison-Zeldovich spectrum because the or- In the past sections, we demonstrated that the IHE reconstruction
dinary Sachs-Wolfe effect is dominant. On subhorizon scales, the method could be applied to Gaussian isotropic fluctuations. In prac-
index increases to ns 1 if the scale is smaller than the sound
horizon at the last scattering time.
In this study, we used the Galactic masks provided by the 2 http://irsa.ipac.caltech.edu/data/Planck/release 2/ancillary-data/


c 0000 RAS, MNRAS 000, 000000
Reconstruction of Missing Data using IHE 11

Figure 10. Reconstruction accuracy for simulated CMB maps masked by Galactic plane. Colors from top to bottom indicate different masks shown in inset.
For each color, solid line marks central value and shaded region enclosed by thin lines indicates 1 region for the 1000 random realizations.

rec,max =3 rec,max =5 rec,max =10


1
10

D2
0
10

-1
10
0 1 2 0 1 2 0 1 2
10 10 10 10 10 10 10 10 10

NI
rec,max =3 rec,max =5 rec,max =10
0
10
D2

-1
10

Figure 9. Planck Galactic plane masks without apodization: from black to 10


-2
0 1 2 0 1 2 0 1 2
10 10 10 10 10 10 10 10 10
white, GAL60, 70, 80, 90 and 97 respectively.
NI

10
0
rec,max =3 rec,max =5 rec,max =10
tice, the map may contain non-Gaussian features. In this section, -1
10
we consider two types of non-Gaussian priors. First, we consider fNL =0
D2

a type of isotropic non-Gaussian fluctuation induced by primordial 10


-2 fNL =5000
non-Gaussianity in the density perturbation. The local type non- W1
-3
10
0 1 2 0 1 2 0 1 2
Gaussianity on large-scale CMB fluctuations can be written as, 10 10 10 10 10 10 10 10 10


NI
! !2 * !2 +
T T T

T

= 3 fNL , (47)

T T G T G
T G

Figure 11. Comparison of the map reconstruction accuracy between Gaus-
sian (red solid) and non-Gaussian (blue dashed) fluctuations. From top to
where hT/T iG is a Gaussian fluctuation of the CMB tempera- bottom, the spectra of the Gaussian fluctuations are ns = 0, 2 and 4 re-
ture in the sky. The map is scaled to its root mean square to be spectively. Even for large non-Gaussianity, fNL = 5000, IHE recovers same
reconstruction accuracies for all rec,max and ns .
hT/T iG = 104 so that readers can compare the value of the fNL
to the one introduced in the CMB analysis. We show the recon-
struction accuracies for fNL = 0 and 5000 for comparison. Given
that the recent CMB observation by Planck suggest that the value of ation, we first generated a Gaussian isotropic map and then added
fNL is consistent with zero (Planck Collaboration 2015d), the value a circular structure that had a Gaussian radial profile:
of fNL = 5000 assumed here might be too large. However, even in T T
! "
( n n0 )2
#
such an extreme case of non-Gaussianity, we found that the IHE ( n) = ( n) + A exp , (48)
T T G 2r2
reconstruction accuracy was not affected much. Here, we fixed the
azimuthal mask size to b = 20 as before. where A , r and n0 = (0 , 0 ) are free parameters that specify the
Figure 11 shows the map reconstruction accuracies as a func- amplitude, size and the center position of a non-Gaussian structure.
tion of the number of iterations for IHE. The results are compared A is the amplitude of a circular symmetric non-Gaussian structure
with the isotropic Gaussian prior case. It is evident that even if with a Gaussian profile with a radius r .
the non-Gaussianity is quite large, such as fNL = 5000, the re- It is more convenient to rewrite the parameter as A = A ( +
construction accuracy does not significantly change compared to 1)C /2, where = /r . We chose to use A = 1 and 10 and
the isotropic Gaussian prior case. This result implies that the IHE r = 10 and 40 in this study. Note that the azimuthal position of
method is robust against the probability distribution of the underly- the structure 0 does not affect the results because the background
ing fluctuation as long as the statistical isotropy holds. Gaussian field is statistically isotropic and the mask we consid-
Second, we consider an anisotropic non-Gaussian prior. For ered is independent of the azimuthal position. Therefore, we can
instance, if there is a large non-Gaussian structure in the universe, always set 0 = 0 without losing generality. 0 was chosen such
it can affect the CMB sky via the ISW effect. To simulate this situ- that the center of structure would correspond to the center of the


c 0000 RAS, MNRAS 000, 000000
12 Nishizawa and Inoue

10
1
0 =0 0 =70 0 =90 10
0
0 =0 0 =70 0 =90
-1
rec,max =3

rec,max =3
0 10
10
-2
10
-1
10 -3
10

-2 -4
10 10
1 0
10 10

-1
rec,max =5

rec,max =5
0 10
10
-2
10
-1
10 -3
10

-2 -4
10 10
1 0
10 10

A
rec,max =10

rec,max =10
-1
0 10
10
-2 B
-1
10
C
10
10
-3 D
G
-2 -4
10 10
0 1 2 0 1 2 0 1 2 0 1 2 0 1 2 0 1 2
10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10

NI NI NI NI NI NI

Figure 12. Reconstruction accuracy of anisotropic non-Gaussian maps for Figure 14. Same as Fig. 12 but for ns = 4 background spectrum. Red solid
ns = 0. From left to right, center of non-Gaussian structure is located at and dashed lines (denoted as A and B in the inset) show cases where r = 10
0 = 0 , 70 and 90 . From top to bottom, reconstruction multipoles are and A = 1 and 10, respectively. Blue solid and dashed lines (denoted as C
rec,max = 3, 5 and 10, respectively. Different lines indicate different sizes and D) show cases where r = 40 and A = 1 and 10, respectively. Black
and amplitudes structure shown in inset of equation (14). Gaussian cases are dashed line is case with only Gaussian fluctuation.
denoted with black dot-dashed lines together with 1 error regions from
1000 random simulations.
5 SUMMARY

10
0
0 =0 0 =70 0 =90 We investigated the map reconstruction accuracy with the IHE for
isotropic Gaussian fluctuations and isotropic and anisotropic non-
rec,max =3

Gaussian fluctuations as well as the realistic CMB fluctuation when


the sky was masked near the Galactic plane. Reconstructing the
10
-1
missing data in a masked region is known as an inverse problem.
We found that the IHE method is equivalent to bruteforce inver-
0
10
sion in the limit that the number of iterations approaches infinity.
However, in particular cases, finite truncation of the iterations re-
rec,max =5

sults in a better estimate of the underlying fluctuation. The recon-


10
-1
struction accuracy depends on the size of the mask b, the maximum
multipole mode to be reconstructed irec,max and the spectral index of
0
10 the underlying fluctuation ns The IHE method is equivalent to the
rec,max =10

asymptotic expansion in terms of the mask size b and it converges


to the correct values in the limit of b 0.
10
-1
As an example, we applied the IHE method to reconstruct
0 1 2 0 1 2 0 1 2
the data obscured by azimuthally symmetric masks. We consid-
10 10 10 10 10 10 10 10 10

NI NI NI ered three types of Gaussian fluctuations with powerlaw indices


of ns = 0, 2 and 4, which correspond to the matter or galaxy
Figure 13. Same as Fig. 12 but for ns = 2 background spectrum. power spectrum, the ordinary SachsWolfe spectrum and the inte-
grated SachsWolfe spectrum, respectively, in the context of cos-
mological analyses. For the ns = 2 case, we found that there exists
an optimal finite number of iterations that makes the reconstruction
mask (0 = 90[deg]), the edge of the mask (0 = 70[deg]), and more accurate than the SVD method or the bruteforce matrix in-
completely outside the mask (0 = 0). version method. For the ns = 0 case, the pseudoam is the best
Figures 12-14 show the map reconstruction accuracies as a estimator of the projected density fluctuations. For the ns = 4
function of the number of iterations for background fluctuations case, the bruteforce inversion method yields the highest accuracy.
with power-law indices ns = 0, 2 and 4. For the ns = 2 and 4 In that case, the IHE method can help reduce the computation time
backgrounds, the effect of the non-Gaussian structure is very small, for inversion.
while for ns = 0 background, the effect is prominent. We obtained We also found that for azimuthally symmetric masks, the am-
higher reconstruction accuracies for larger structures. Therefore, plitudes of the reconstructed fluctuations for the even ( + m = 2n)
adding a structure with r increases the amplitude of the power modes are significantly suppressed in comparison to the odd modes
spectrum at /r , e.g. 3 for r = 40 . For the ns = 0 spec- ( + m = 2n + 1). For a fixed mode, the m = mode is more
trum, adding power to large scales may greatly change the effective affected by the masking than by other m , modes, and the sup-
slope of the spectrum so that the spectrum becomes redder, which pression is more prominent for higher modes. Therefore, the IHE
acts to mitigate the modemode coupling. However, further studies method reproduces odd modes more accurately. The suppression
are necessary to examine the reason for this phenomenon. due to masking can be explained by the deviation of Qm from


c 0000 RAS, MNRAS 000, 000000
Reconstruction of Missing Data using IHE 13
unity; however, the strength of the mode coupling that changes at Emir Gumrukcuoglu A., Contaldi C. R., Peloso M., 2007, JCAP,
each realisation may also affect the suppression in a complex man- 11, 5
ner. Eriksen H. K., Banday A. J., Gorski K. M., Hansen F. K., Lilje
We demonstrated that the IHE method can be applied to recon- P. B., 2007, ApJL, 660, L81
struct realistic CMB observations. For large-scale modes, < 10, Fialkov A., Itzhaki N., Kovetz E. D., 2010, JCAP, 2, 4
the IHE method provides more accurate reconstructed maps than Francis C. L., Peacock J. A., 2010, MNRAS, 406, 14
direct inversion does, and the optimal number of iterations should Frith W. J., Outram P. J., Shanks T., 2005, MNRAS, 364, 593
be determined before reconstructing the CMB. For some special Gorski K. M., Hivon E., Banday A. J., Wandelt B. D., Hansen
cases, we investigated the IHE reconstruction accuracy for both F. K., Reinecke M., Bartelmann M., 2005, ApJ, 622, 759
isotropic and anisotropic non-Gaussian fluctuations. For isotropic Hajian A., Souradeep T., Cornish N., 2005, ApJL, 618, L63
non-Gaussian fluctuations, which are characterized by fNL , the re- Hamilton J.-C., 2003, ArXiv e-prints (astro-ph/0310787)
construction is not substantially affected by non-Gaussianity, which Hansen F. K., Banday A. J., Gorski K. M., 2004, MNRAS, 354,
only changes the amplitude of the power spectrum but does not 641
affect its tilt. As an example of anisotropic non-Gaussianity, we Hansen M., Kim J., Frejsel A. M., Ramazanov S., Naselsky P.,
added a single structure with a Gaussian radial profile onto an Zhao W., Burigana C., 2012, JCAP, 10, 59
isotropic background Gaussian fluctuation. For the ns = 0 spec- Hanson D., Lewis A., Challinor A., 2010, Phys. Rev. D, 81,
trum, adding such a non-Gaussian structure dramatically improves 103003
the reconstruction accuracy compared to the isotropic Gaussian Hivon E., Gorski K. M., Netterfield C. B., Crill B. P., Prunet S.,
case, while for the ns = 2 and 4 spectra, the effect of non- Hansen F., 2002, ApJ, 567, 2
Gaussianity is negligible. Inoue K. T., 2000, Phys. Rev. D, 62, 103001
It would be interesting to investigate how the significance Inoue K. T., 2012, MNRAS, 421, 2731
of the largeangle CMB anomaly changes when we use different Inoue K. T., Cabella P., Komatsu E., 2008, Phys. Rev. D, 77,
methods of map reconstruction. We will explore this problem in 123539
our future work. Inoue K. T., Silk J., 2006, ApJ, 648, 23
Inoue K. T., Silk J., 2007, ApJ, 664, 650
Kim J., Naselsky P., Mandolesi N., 2012, ApJL, 750, L9
Land K., Magueijo J., 2006, MNRAS, 367, 1714
ACKNOWLEDGMENTS
Liu H., Frejsel A. M., Naselsky P., 2013, JCAP, 7, 032
We thank Masahiro Takada, Eiichiro Komatsu, Issha Kayo and Moffat J. W., 2005, JCAP, 10, 12
Takahiro Nishimichi for the useful discussions. AN was supported Montesern C., Barreiro R. B., Vielva P., Martnez-Gonzalez E.,
in part by the FIRST program Subaru Measurements of Images Hobson M. P., Lasenby A. N., 2008, MNRAS, 387, 209
and Redshifts (SuMIRe), CSTP, Japan. This work was also sup- Olea R. A., Pawlowsky-Glahn V., 2008, Stochastic Environmental
ported in part by MEXT KAKENHI Grant Number 16H01096. Research and Risk Assessment, 23, 749
Peiris H. V., Smith T. L., 2010, Phys. Rev. D, 81, 123517
Planck Collaboration, 2014, Astronomy & Astrophysics, 571,
A23
REFERENCES Planck Collaboration, 2015a, ArXiv e-prints (1507.02704)
Abramo L. R., Sodre, Jr. L., Wuensche C. A., 2006, Phys. Rev. D, Planck Collaboration, 2015b, ArXiv e-prints (1502.01589)
74, 083515 Planck Collaboration, 2015c, ArXiv e-prints (1506.07135)
Abrial P., Moudden Y., Starck J.-L., Fadili J., Delabrouille J., Planck Collaboration, 2015d, ArXiv e-prints (1502.01592)
Nguyen M. K., 2008, Statistical Methodology, 5, 289 Pontzen A., Peiris H. V., 2010, Phys. Rev. D, 81, 103008
Afshordi N., Geshnizjani G., Khoury J., 2009, JCAP, 8, 30 Prunet S., Netterfield C. B., Hivon E., Crill B. P., 2000, ArXiv
Aurich R., Lustig S., Steiner F., 2010, Classical and Quantum e-prints (astro-ph/0006052)
Gravity, 27, 095009 Ralston J. P., Jain P., 2004, International Journal of Modern
Aurich R., Lustig S., Steiner F., Then H., 2007, Classical and Physics D, 13, 1857
Quantum Gravity, 24, 1879 Rassat A., Land K., Lahav O., Abdalla F. B., 2007, MNRAS, 377,
Bennett C. L. et al., 2011, ApJS, 192, 17 1085
Bernui A., Hipolito-Ricaldi W. S., 2008, MNRAS, 389, 1453 Rassat A., Starck J.-L., 2013, Astronomy & Astrophysics, 557,
Bernui A., Villela T., Wuensche C. A., Leonardi R., Ferreira I., L1
2006, Astronomy & Astrophysics, 454, 409 Rassat A., Starck J.-L., Dupe F.-X., 2013, Astronomy & Astro-
Bucher M., Louis T., 2012, MNRAS, 424, 1694 physics, 557, A32
Cooray A., 2002, Phys. Rev. D, 65, 083518 Rassat A., Starck J.-L., Paykari P., Sureau F., Bobin J., 2014,
Copi C. J., Huterer D., Schwarz D. J., Starkman G. D., 2007, JCAP, 8, 6
Phys. Rev. D, 75, 023507 Rodrigues D. C., 2008, Phys. Rev. D, 77, 023534
Cruz M., Martnez-Gonzalez E., Vielva P., Diego J. M., Hobson Sachs R. K., Wolfe A. M., 1967, ApJ, 147, 73
M., Turok N., 2008, MNRAS, 390, 913 Sakai N., Inoue K. T., 2008, Phys. Rev. D, 78, 063510
Cruz M., Vielva P., Martnez-Gonzalez E., Barreiro R. B., 2011, Samal P. K., Saha R., Jain P., Ralston J. P., 2009, MNRAS, 396,
MNRAS, 412, 2383 511
de Oliveira-Costa A., Tegmark M., Zaldarriaga M., Hamilton A., Starck J.-L., Fadili M. J., Rassat A., 2013, Astronomy & Astro-
2004, Phys. Rev. D, 69, 063516 physics, 550, A15
Efstathiou G., 2004, MNRAS, 348, 885 Tomita K., Inoue K. T., 2008, Phys. Rev. D, 77, 103522
Efstathiou G., Ma Y.-Z., Hanson D., 2010, MNRAS, 407, 2530 Zheng H., Bunn E. F., 2010, Phys. Rev. D, 82, 063533


c 0000 RAS, MNRAS 000, 000000

You might also like