The AstroStat Slog » Bayes factor http://hea-www.harvard.edu/AstroStat/slog Weaving together Astronomy+Statistics+Computer Science+Engineering+Intrumentation, far beyond the growing borders Fri, 09 Sep 2011 17:05:33 +0000 en-US hourly 1 http://wordpress.org/?v=3.4 [ArXiv] 4th week, May 2008 http://hea-www.harvard.edu/AstroStat/slog/2008/arxiv-4th-week-may-2008/ http://hea-www.harvard.edu/AstroStat/slog/2008/arxiv-4th-week-may-2008/#comments Sun, 01 Jun 2008 03:59:15 +0000 hlee http://hea-www.harvard.edu/AstroStat/slog/?p=323 Eight astro-ph papers and two statistics paper are listed this week. One statistics paper discusses detecting filaments and the other talks about maximum likelihood estimation of satellite images (clouds).

  • [astro-ph:0805.3532] Balan and Lahav
    ExoFit: Orbital Parameters of Extra-solar Planets from Radial Velocities (MCMC)

  • [astro-ph:0805.3983] R. G. Carlberg et al.
    Clustering of supernova Ia host galaxies (Jackknife method is utilized).

  • [astro-ph:0805.4005] Kurek, Hrycyna, & Szydlowski
    From model dynamics to oscillating dark energy parametrisation (Bayes factor)

  • [astro-ph:0805.4136] C. Genovese et al.
    Inference for the Dark Energy Equation of State Using Type Ia Supernova Data

  • [math.ST:0805.4141] C. Genovese et al.
    On the path density of a gradient field (detecting filaments via kernel density estimation, KDE)

  • [astro-ph:0805.4342] C. Espaillat et al.
    Wavelet Analysis of AGN X-Ray Time Series: A QPO in 3C 273?

  • [astro-ph:0805.4414] Tegmark and Zaldarriaga
    The Fast Fourier Transform Telescope

  • [astro-ph:0805.4417] A. Georgakakis et al.
    A new method for determining the sensitivity of X-ray imaging observations and the X-ray number counts

  • [stat.AP:0805.4598] E. Anderes et al.
    Maximum Likelihood Estimation of Cloud Height from Multi-Angle Satellite Imagery
]]>
http://hea-www.harvard.edu/AstroStat/slog/2008/arxiv-4th-week-may-2008/feed/ 0
[ArXiv] Matching Sources, July 11, 2007 http://hea-www.harvard.edu/AstroStat/slog/2007/arxiv-matching-sources/ http://hea-www.harvard.edu/AstroStat/slog/2007/arxiv-matching-sources/#comments Fri, 13 Jul 2007 23:24:23 +0000 hlee http://hea-www.harvard.edu/AstroStat/slog/2007/arxiv-matching-sources-july-11-2007/ From arxiv/astro-ph: 0707.1611 Probabilistic Cross-Identification of Astronomical Sources by Budavari and Szalay

As multi-wave length studies become more popular, various source matching methodologies have been discussed. One of such methods particularly focused on Bayesian idea was introduced by Budavari and Szalay with a demand for symmetric algorithms in a unified framework.

First, astrometric precision, varying due to instrumental effects and the nature of objects in multi-bands, is explained as well as Bayes factor for testing a hypothesis that multiple observations from various catalogs are from the same source. Then, the formula of calculating the Bayes factor, acquired from the spherical normal distribution is presented. However, the matching process is not a straightforward process of calculating the Bayes factor analytically derived from the spherical normal distribution particularly when the goal is finding a new objects with unknown spectral energy distributions, where physics squeezes in. Before summarizing, practical issues such as fast and efficient computation on multiple catalogs (sequentially adding a catalog to current n catalogs) and recursion formulas for evaluating the weight of evidence are given.

I want to quote a few sentences of my liking from the paper:

  • Often Bayesian analysis is referred to as the calculus of belief; however, it should rather be thought of as the calculus of observational evidence.
  • The Bayesian analysis is inherently recursive. As soon as we obtain new measurements, and compute the posterior probability, that becomes the prior for subsequent studies.

Also one sentence that I have some questions:

… it penalizes complicated hypotheses (with smaller prior probability [hlee: is this a generally true statement? Complicated model occupies larger parameter space by the dimensions although they mentioned that the hypothesis of separate sources occupies a more restricted parameter space. Why complicated model occupies this restricted parameter space?]) over simpler ones.

In model selection, BIC penalizes complicated model more, proportional to the number of parameters (also proportional to log sample size) but this comes from Laplace approximation, not the idea that a complicated model has small prior probability.

In addition, I want to raise a question that under the spherical normal distribution, what is the difference between using Bayes factor and classical hypothesis testing to test multiple observations from different wavelengths are from the same source? To my knowledge, numerous studies on multiple hypothesis testing for biological data (our counterpart could be testing on millions of sources across catalogs) have been available recent years and such frequentist approaches seem to be eligible for cross-matching problem applications.

Finally, I wish to add some references on Cross-Matching/Coincidence Assessment with the VO given by Tom Loredo from one of the SAMSI Surveys and Population Studies working group meetings (Please, note that his talk on the subject matter and other journal papers linked at the SAMSI AstroStat websites are password protected. SAMSI is an acronym of
Statistical and Applied Mathematical Sciences Institute).

]]>
http://hea-www.harvard.edu/AstroStat/slog/2007/arxiv-matching-sources/feed/ 1