The AstroStat Slog

All models are wrong, but some are useful

Jun 30th, 2008| 11:12 pm | Posted by hlee

All models are wrong, but some are useful. –George Box

Continue reading ‘All models are wrong, but some are useful’ »

Tags: George Box, model, petabytes
Category: Misc | 3 Comments

Probability Plotting

Jun 30th, 2008| 08:50 pm | Posted by aneta

I just saw this web site with the probability plots on the probability papers. Is this real? Does somebody use this type of analysis when everything is done on the computers?

Quote from the web page:

“… probability plotting involves a physical plot of the data on specially constructed probability plotting paper. This method is easily implemented by hand, given that one can obtain the appropriate probability plotting paper.”

http://www.weibull.com/LifeDataWeb/probability_plotting.htm

Category: Misc | 5 Comments

On the history and use of some standard statistical models

Jun 26th, 2008| 08:03 pm | Posted by hlee

What if R. A. Fisher was hired by the Royal Observatory in spite that his interest was biology and agriculture, or W. S. Gosset^[1] instead of brewery? An article by E.L. Lehmann made me think this what if. If so, astronomers could have handled errors better than now. Continue reading ‘On the history and use of some standard statistical models’ »

Gosset’s pen name was Student, from which the name, Student-t in t-distribution or t-test was spawned.[↩]

Tags: CLT, E. L. Lehmann, independence, normality, R.A.Fisher, W.S. Gosset
Category: arXiv, Cross-Cultural, Jargon, Stat, Uncertainty | Comment

Workshop on Algorithms for Modern Massive Data Sets

Jun 25th, 2008| 08:57 pm | Posted by hlee

A conference that I wanted to go but never made, started today. With relief, they have presentation files from the previous workshop
http://www.stanford.edu/group/mmds and I expect the same for this year. The workshop title may not attract astronomers but the contents, tools, methodologies, and theory are modern astronomy friendly. Astronomers can motivate, initiate, and push further these researchers at the workshop, which I believe currently happening without broad recognitions (foremost interdisciplinary works tend to stay within research groups).

Tags: MMDS, workshop
Category: Algorithms, Cross-Cultural, News | 2 Comments

Open and Shut [Equation of the Week]

Jun 25th, 2008| 01:00 pm | Posted by vlk

For a discipline that relies so heavily on images, it is rather surprising how little use astronomy makes of the vast body of work on image analysis carried out by mathematicians and computer scientists. Mathematical morphology, for example, can be extremely useful in enhancing, recognizing, and extracting useful information from densely packed astronomical
images.

The building blocks of mathematical morphology are two operators, Erode[I|Y] and Dilate[I|Y], Continue reading ‘Open and Shut [Equation of the Week]’ »

Tags: close, dilate, EotW, Equation, Equation of the Week, erode, Morphological operator, open, set theory
Category: Imaging, Jargon | 1 Comment

Discontinuation of weekly [arXiv] series

Jun 21st, 2008| 11:50 pm | Posted by hlee

Now it’s time for me to write my own astrostat papers instead of spending time for sieving them from [arXiv]. It has been an irresistible temptation scanning daily [arXiv] preprints to look for astronomy and sometimes statistics papers that 1. adopt statistics, 2. contain statistically challenging problems, 3. could be improved by more rigorous statistical applications, 4. look like abusing statistics, 5. may inspire statisticians by the data sets, or 6. might be useful for astronomers’ advancement in the data analysis. The temptation grew too much to be handled. The amount of papers belong to the above selection criteria seems to grow as my understanding widens. Also the mesh gets loose and starts to show holes. Continue reading ‘Discontinuation of weekly [arXiv] series’ »

Tags: discontinuation
Category: arXiv, Misc | 1 Comment

[ArXiv] 3rd week, June 2008

Jun 21st, 2008| 11:10 pm | Posted by hlee

This is my last [ArXiv] series. Continue reading ‘[ArXiv] 3rd week, June 2008’ »

Tags: CMB, Gaia, K-S test, KMM, lensing, marginal distribution, multi-scale image
Category: arXiv, MCMC | Comment

my first AAS. VI. Normalization

Jun 20th, 2008| 11:58 pm | Posted by hlee

One realization of mine during the meeting was related to a cultural difference; therefore, there is no relation to any presentations during the 212th AAS in this post. Please, correct me if you find wrong statements. I cannot cover all perspectives from both disciplines but I think there are two distinct fashions in practicing normalization. Continue reading ‘my first AAS. VI. Normalization’ »

Tags: AAS, measure, measure theory, normalization, PDF, pmf
Category: Bad AstroStat, Cross-Cultural, Uncertainty | Comment

[Q] systematic error

Jun 20th, 2008| 11:02 pm | Posted by hlee

What is systematic error? Can it be modeled statistically? Is it random? Is it fixed? Is it a bias? Is it …? Continue reading ‘[Q] systematic error’ »

Tags: measurement error, statistical error, systematic error
Category: Cross-Cultural | 5 Comments

my first AAS. V. measurement error and EM

Jun 19th, 2008| 11:46 pm | Posted by hlee

While discussing different view points on the term, clustering, one of the conversers led me to his colleague’s poster. This poster (I don’t remember its title and abstract) was my favorite from all posters in the meeting. Continue reading ‘my first AAS. V. measurement error and EM’ »

Tags: bias, bimodality, EM algorithm, LF, likelihood, measurement error
Category: Cross-Cultural | 1 Comment

my first AAS. IV. clustering

Jun 19th, 2008| 11:42 pm | Posted by hlee

I was questioned by two attendees, acquainted before the AAS, if I can suggest them clustering methods relevant to their projects. After all, we spent quite a time to clarify the term clustering. Continue reading ‘my first AAS. IV. clustering’ »

Tags: Classification, clustering, cosmology, spatial statistics, supervised learning, test, unsupervised learning
Category: Cross-Cultural, Jargon | Comment

GLAST

Jun 19th, 2008| 03:13 pm | Posted by vlk

You all may have heard that GLAST launched on June 11, and the mission is going smoothly. Via Josh Grindlay comes news that Steve Ritz, the GLAST Project Scientist at GSFC, is keeping a weblog dedicated to it at

http://blogs.nasa.gov/cm/blog/GLAST

and intends to post status reports and related information on it.

Tags: GLAST, June, June 11, Steve Ritz
Category: Astro, High-Energy, News | Comment

Likelihood Ratio Test Statistic [Equation of the Week]

Jun 18th, 2008| 01:00 pm | Posted by vlk

From Protassov et al. (2002, ApJ, 571, 545), here is a formal expression for the Likelihood Ratio Test Statistic,

T_LRT = -2 ln R(D,Θ₀,Θ)

R(D,Θ₀,Θ) = [ sup_θεΘ₀ p(D|Θ₀) ] / [ sup_θεΘ p(D|Θ) ]

where D are an independent data sample, Θ are model parameters {θ_i, i=1,..M,M+1,..N}, and Θ₀ form a subset of the model where θ_i = θ_i⁰, i=1..M are held fixed at their nominal values. That is, Θ represents the full model and Θ₀ represents the simpler model, which is a subset of Θ. R(D,Θ₀,Θ) is the ratio of the maximal (technically, supremal) likelihoods of the simpler model to that of the full model.
Continue reading ‘Likelihood Ratio Test Statistic [Equation of the Week]’ »

Tags: EotW, Equation, Equation of the Week, F-test, likelihood, likelihood ratio test, LRT, Protassov, Rostislav Protassov
Category: Fitting, Jargon, Stat | 2 Comments

[ArXiv] 2nd week, June 2008

Jun 16th, 2008| 10:47 am | Posted by hlee

As Prof. Speed said, PCA is prevalent in astronomy, particularly this week. Furthermore, a paper explicitly discusses R, a popular statistics package. Continue reading ‘[ArXiv] 2nd week, June 2008’ »

Tags: Bayesian evidence, Binning, broken power law, cosmology, K-S test, LF, lhs, likelihood, PCA, power spectrum, R, SFH, Sun, Tully-Fisher
Category: arXiv, MCMC | Comment

my first AAS. III. ANOVA

Jun 11th, 2008| 11:59 pm | Posted by hlee

Believe it or not, I saw ANOVA (ANalysis Of VAriance) from a poster at AAS. This acronym was considered as one of very statistical jargons that one would never see in an astronomical meeting. I think you like to know the story in detail. Continue reading ‘my first AAS. III. ANOVA’ »

Tags: AAS, ANOVA, experiment design
Category: Stat | 2 Comments