Daniel J. Eisenstein
Institute for Advanced Study, Olden Lane, Princeton, NJ 08540
and
Emory F. Bunn
Physics and Astronomy Department, Bates College, Lewiston, ME 04240
A recent Letter [1] claims to have discovered evidence for birefringence in the propagation of radio waves across cosmological distances. Unfortunately, this claim is based on a flawed statistical analysis.
To search for birefringence, the authors look for correlations between
the direction and distance to a galaxy and the angle between the
polarization direction and the galaxy's major axis.
Plotting the data as shown in Fig. 1(d) of their paper, here
reproduced as Fig. 1, they use the correlation coefficient
as their statistic.
To estimate the significance of their result, the authors use mock data
samples constructed by randomly picking the angle from a uniform
distribution of allowed angles.
This is not the proper null hypothesis
for testing the dependence of
on the direction and distance
to the galaxies. Rather, for the null hypothesis, one should draw the
angles from the observed distribution, which in the case of the
high-redshift subsample, from which the primary conclusions were drawn,
clearly is not a uniform distribution. From Fig. 1, one can see by
eye that the polarization in these galaxies tends to align with the
galaxy minor axis, i.e.
prefers
and avoids 0
or
. For example,
of the points have
.
That this matters can be easily seen from the following example.
Consider the region in the x-y coordinate plane spanning -1 to 1 in
both directions. If we uniformly fill the first and third quadrants,
the correlation coefficient will be 0.75. If, however, we fix
while allowing x to span -1 to 1 uniformly as
before, then
. Collapsing the y
direction in this way allows more of the scatter to be explained by
the best-fit line.
Hence, we should expect that the tendency of the angle to
prefer
will cause
to be higher than it would be if
were uniformly distributed between 0 and
. By using
the latter as their null hypothesis, the authors find a spuriously high
statistical significance for their result. Indeed, if the underlying
galaxy population truly had a uniform intrinsic distribution of
, it would be impossible to measure the proposed birefringence
at all; one could not detect a rotation of such a distribution.
Stated another way and estimating by eye, in Fig. 1 the data
are more tightly correlated than they would be if the
values were randomly and uniformly distributed between 0 and
. However, they are not significantly more correlated than they
would be if the
values in a quadrant were shuffled among
themselves while the best-fit line was adjusted accordingly. Hence,
the claimed correlation of the angle
with the position and
distance of the galaxy is not statistically significant.
Taking the null hypothesis that the birefringence does not exist and that
the angles between the polarization directions and galaxies' major axes
are distributed as the data indicate, one is left to explain why the
particular direction in the sky turned out to yield a higher
than other directions. This most likely results from combining the
inhomogeneous sky coverage--the sample is mostly from the Northern sky
and avoids low galactic latitudes--and the propensity of the chosen
statistic
to prefer directions that place many galaxies near
the center of the spread in
where the tendency of
to prefer the center of its range can best reduce the scatter.
It is not surprising that such a direction could exist.
A second error relating to the choice of statistic and null hypothesis
is the authors' use of the slope of the best fit line in Fig. 1
as a measure of the inverse birefringence scale .
Because the null hypothesis (either the one they used or the one proposed
here) produces a non-zero slope in the absence of birefringence, this
is clearly a highly biased and inappropriate estimator.
Figure 1: Figure 1(d) reproduced from [1].
71 galaxies with redshifts above 0.3 are shown.
is the angle from the proposed birefringence direction, r is the
distance to the galaxy, and
is the angle between the galactic
major axis and the polarization direction. See [1] for more details.
Our claim is that the
are not uniformly distributed between 0
and
but rather are clumped toward
.
D.J.E. was supported by NSF PHY-9513835.