The AstroStat Slog

Archive for the ‘arXiv’ Category.

On the history and use of some standard statistical models

Jun 26th, 2008| 08:03 pm | Posted by hlee

What if R. A. Fisher was hired by the Royal Observatory in spite that his interest was biology and agriculture, or W. S. Gosset^[1] instead of brewery? An article by E.L. Lehmann made me think this what if. If so, astronomers could have handled errors better than now. Continue reading ‘On the history and use of some standard statistical models’ »

Gosset’s pen name was Student, from which the name, Student-t in t-distribution or t-test was spawned.[↩]

Tags: CLT, E. L. Lehmann, independence, normality, R.A.Fisher, W.S. Gosset
Category: arXiv, Cross-Cultural, Jargon, Stat, Uncertainty | Comment

Discontinuation of weekly [arXiv] series

Jun 21st, 2008| 11:50 pm | Posted by hlee

Now it’s time for me to write my own astrostat papers instead of spending time for sieving them from [arXiv]. It has been an irresistible temptation scanning daily [arXiv] preprints to look for astronomy and sometimes statistics papers that 1. adopt statistics, 2. contain statistically challenging problems, 3. could be improved by more rigorous statistical applications, 4. look like abusing statistics, 5. may inspire statisticians by the data sets, or 6. might be useful for astronomers’ advancement in the data analysis. The temptation grew too much to be handled. The amount of papers belong to the above selection criteria seems to grow as my understanding widens. Also the mesh gets loose and starts to show holes. Continue reading ‘Discontinuation of weekly [arXiv] series’ »

Tags: discontinuation
Category: arXiv, Misc | 1 Comment

[ArXiv] 3rd week, June 2008

Jun 21st, 2008| 11:10 pm | Posted by hlee

This is my last [ArXiv] series. Continue reading ‘[ArXiv] 3rd week, June 2008’ »

Tags: CMB, Gaia, K-S test, KMM, lensing, marginal distribution, multi-scale image
Category: arXiv, MCMC | Comment

[ArXiv] 2nd week, June 2008

Jun 16th, 2008| 10:47 am | Posted by hlee

As Prof. Speed said, PCA is prevalent in astronomy, particularly this week. Furthermore, a paper explicitly discusses R, a popular statistics package. Continue reading ‘[ArXiv] 2nd week, June 2008’ »

Tags: Bayesian evidence, Binning, broken power law, cosmology, K-S test, LF, lhs, likelihood, PCA, power spectrum, R, SFH, Sun, Tully-Fisher
Category: arXiv, MCMC | Comment

[ArXiv] 1st week, June 2008

Jun 8th, 2008| 09:45 pm | Posted by hlee

Despite no statistic related discussion, a paper comparing XSPEC and ISIS, spectral analysis open source applications might bring high energy astrophysicists’ interests this week. Continue reading ‘[ArXiv] 1st week, June 2008’ »

Tags: black box, catalog, CMB, confidence interval, EGRET, ICA, ISIS, maximum likelihood, radio, sample size, student t, XSPEC
Category: arXiv, Data Processing, gamma-ray, High-Energy, Methods, Stat | Comment

[ArXiv] 4th week, May 2008

May 31st, 2008| 11:59 pm | Posted by hlee

Eight astro-ph papers and two statistics paper are listed this week. One statistics paper discusses detecting filaments and the other talks about maximum likelihood estimation of satellite images (clouds). Continue reading ‘[ArXiv] 4th week, May 2008’ »

Tags: AGN, Bayes factor, bootstrap, confidence set, cosmological constanct, dark energy, Exofit, exoplanet, filament, jackknife, KDE, Model Selection, time series, Type Ia SNe, unbiased, wavelet
Category: arXiv, Bayesian, MCMC, Stat | Comment

[ArXiv] 3rd week, May 2008

May 26th, 2008| 02:59 pm | Posted by hlee

Not many this week, but there’s a great read. Continue reading ‘[ArXiv] 3rd week, May 2008’ »

Tags: clustering, high dimension, LF, maximum likelihood, multivariate, Poisson, Schechter, zero count
Category: arXiv, Bayesian, Fitting, MCMC, Methods, Stat | Comment

Did they, or didn’t they?

May 20th, 2008| 12:10 am | Posted by vlk

Earlier this year, Peter Edmonds showed me a press release that the Chandra folks were, at the time, considering putting out describing the possible identification of a Type Ia Supernova progenitor. What appeared to be an accreting white dwarf binary system could be discerned in 4-year old observations, coincident with the location of a supernova that went off in November 2007 (SN2007on). An amazing discovery, but there is a hitch.

And it is a statistical hitch, and involves two otherwise highly reliable and oft used methods giving contradictory answers at nearly the same significance level! Does this mean that the chances are actually 50-50? Really, we need a bona fide statistician to take a look and point out the errors of our ways.. Continue reading ‘Did they, or didn’t they?’ »

Tags: arXiv, Chandra, CXC, Optical, Peter Edmonds, positional coincidence, positional error, Power, progenitor, question for statisticians, significance, Supernova, Type Ia, White Dwarf, White Dwarf binary, X-ray
Category: arXiv, Astro, Data Processing, News, Objects, Optical, Stat, Uncertainty | 5 Comments

[ArXiv] 2nd week, May 2008

May 19th, 2008| 10:42 am | Posted by hlee

There’s no particular opening remark this week. Only I have profound curiosity about jackknife tests in [astro-ph:0805.1994]. Including this paper, a few deserve separate discussions from a statistical point of view that shall be posted. Continue reading ‘[ArXiv] 2nd week, May 2008’ »

Tags: bimodality, bootstrap, calibration uncertainty, CF, Classification, CMB, dip, exoplanet, Fisher matrix, flare, GL, jackknife, KS test, marked point, maximum likelihood, MLE, poisson point process, spatial data, XLF
Category: arXiv, Frequentist, Uncertainty, X-ray | Comment

[ArXiv] 1st week, May 2008

May 11th, 2008| 10:42 pm | Posted by hlee

I think I have to review spatial statistics in astronomy, focusing on tessellation (void structure), point process (expanding 2 (3) point correlation function), and marked point process (spatial distribution of hardness ratios of X-ray distant sources, different types of galaxies -not only morphological differences but other marks such as absolute magnitudes and existence of particular features). When? Someday…

In addition to Bayesian methodologies, like this week’s astro-ph, studies on characterizing empirical spatial distributions of voids and galaxies frequently appear, which I believe can be enriched further with the ideas from stochastic geometry and spatial statistics. Click for what was appeared in arXiv this week. Continue reading ‘[ArXiv] 1st week, May 2008’ »

Tags: Classification, covariance, FARIMA, Fisher information, GL, GRB, Levy, light curve, limb darkening, ML, Pareto distribution, quasars, solar flare, standard candle, tessellation, time series, VO, void
Category: arXiv, MCMC, Uncertainty | 1 Comment

[ArXiv] 5th week, Apr. 2008

May 5th, 2008| 03:08 am | Posted by hlee

Since I learned Hubble’s tuning fork^[1] for the first time, I wanted to do classification (semi-supervised learning seems more suitable) galaxies based on their features (colors and spectra), instead of labor intensive human eye classification. Ironically, at that time I didn’t know there is a field of computer science called machine learning nor statistics which do such studies. Upon switching to statistics with a hope of understanding statistical packages implemented in IRAF and IDL, and learning better the contents of Numerical Recipes and Bevington’s book, the ignorance was not the enemy, but the accessibility of data was. Continue reading ‘[ArXiv] 5th week, Apr. 2008’ »

Wikipedia link: Hubble sequence[↩]

Tags: ANN, automation, Classification, correlation function, denoising, FFT, gravitational wave, lensing, LISA, machine learning, missing data, mock data, morphology, PCA, power spectrum, robust, SDSS, spectrum, sunspots, wavelet, zoo
Category: arXiv, Galaxies, Imaging, MCMC, Physics, Spectral | Comment

[ArXiv] 4th week, Apr. 2008

Apr 27th, 2008| 11:29 am | Posted by hlee

The last paper in the list discusses MCMC for time series analysis, applied to sunspot data. There are six additional papers about statistics and data analysis from the week. Continue reading ‘[ArXiv] 4th week, Apr. 2008’ »

Tags: clusters, CMB, GALEX, gravitaional waves, lensing, LF, LMC, machine learning, maximum likelihood, priors, probability, SDSS, stellar populations, sunspot, time series
Category: arXiv, MCMC | Comment

The LRT is worthless for …

Apr 25th, 2008| 01:48 am | Posted by hlee

One of the speakers from the google talk series exemplified model based clustering and mentioned the likelihood ratio test (LRT) for defining the number of clusters. Since I’ve seen the examples of ill-mannerly practiced LRTs from astronomical journals, like testing two clusters vs three, or a higher number of components, I could not resist indicating that the LRT is improperly used from his illustration. As a reply, the citation regarding the LRT was different from his plot and the test was carried out to test one component vs. two, which closely observes the regularity conditions. I was relieved not to find another example of the ill-used LRT. Continue reading ‘The LRT is worthless for …’ »

Tags: LRT, mixture
Category: arXiv, Bad AstroStat, CHASC, Frequentist, Stat | Comment

Is 8-sigma significant enough for you?

Apr 24th, 2008| 02:56 pm | Posted by vlk

There is a new report from Bernabei et al. (arXiv:0804.2741) of the direct detection of the effects of Dark Matter that is causing a lot of buzz. (The Bad Astronomer has a good summary.) They find yearly modulation in their detected scintillation rate that matches what you would expect if the Earth were rushing through Galactic Dark Matter as it goes around the Sun. They have worked out the significance of the modulation to be 8.2 sigma. Significant! But significant of what? Continue reading ‘Is 8-sigma significant enough for you?’ »

Tags: arXiv, DAMA, dark matter, Earth, modulation, orbit, scintillation, systematic error, year
Category: arXiv, News | 2 Comments

[ArXiv] Ripley’s K-function

Apr 21st, 2008| 11:56 pm | Posted by hlee

Because of the extensive works by Prof. Peebles and many (observational) cosmologists (almost always I find Prof. Peeble’s book in cosmology literature), the 2 (or 3) point correlation function is much more dominant than any other mathematical and statistical methods to understand the structure of the universe. Unusually, this week finds an astro-ph paper written by a statistics professor addressing the K-function to explore the mystery of the universe.

[astro-ph:0804.3044] J.M. Loh
Estimating Third-Order Moments for an Absorber Catalog

Continue reading ‘[ArXiv] Ripley’s K-function’ »

Tags: BATSE, catalog, correlation function, cosmology, K-function, Point Process, spatial statistics
Category: arXiv, Methods, Stat | Comment