Monday, March 31, 2008

How useful are normality tests?



How useful are normality tests?
source: graphpad.com


Many data analysis methods (t test, ANOVA, regression) depend on the assumption that data were sampled from a Gaussian distribution. The best way to evaluate how far your data are from Gaussian is to look at a graph and see if the distribution deviates grossly from a bell-shaped normal distribution.
How useful are statistical methods to test for normality? Less useful than you’d guess. Consider these potential problems:
  • Small samples almost always pass a normality test. Normality tests have little power to tell whether or not a small sample of data comes from a Gaussian distribution.
  • With large samples, minor deviations from normality may be flagged as statistically significant, even though small deviations from a normal distribution won’t affect the results of a t test or ANOVA.
  • Decisions about when to use parametric vs. nonparametric tests should usually be made to cover an entire series of analyses. It is rarely appropriate to make the decision based on a normality test of one data set.

I think it is usually a mistake to test every data set for normality, and use the result to decide between parametric and nonparametric statistical tests. But normality tests can help you understand your data, especially when you get similar results in many experiments.


Which normality test is best?

When we added a normality test to Prism several years ago, we selected the one that was best known, the Kolmogorov-Smirnov test. This test compares the cumulative distribution of the data with the expected cumulative Gaussian distribution, and bases its P value on the largest discrepancy.
The Kolmogorov-Smirnov test was designed to compare two experimentally-determined distributions. When testing for normality, you compare an experimental distribution against a hypothetical ideal, so its necessary to apply the Dallal-Wilkinson-Lilliefors correction. Prism versions 4.02 and 4.0b do this, but earlier versions did not.
The Kolmogorov-Smirnov test is based on a simple way to quantify the discrepancy between the observed and expected distributions. It turns out, however, that it is too simple, and doesn't do a good job of discriminating whether or not your data was sampled from a Gaussian distribution. An expert on normality tests, R.B. D’Agostino, makes a very strong statement: “The Kolmogorov-Smirnov test is only a historical curiosity. It should never be used.” (“Tests for Normal Distribution” in Goodness-of-fit Techniques, Marcel Decker, 1986).
In Prism 4.02 and 4.0b, we added two more normality tests to Prism’s column statistics analysis the Shapiro-Wilk normality test and the D’Agostino-Pearson omnibus test.
All three procedures test the same the null hypothesis – that the data are sampled from a Gaussian distribution. The P value answers this question: If the null hypothesis were true, what is the chance of randomly sampling data that deviate as much (or more) from Gaussian as the data we actually collected? The three tests differ in how they quantify the deviation of the actual distribution from a Gaussian distribution.

The Shapiro-Wilk normality test is difficult for nonmathematicians to understand, and it doesn't work well when several values in your data set are the same. In contrast, the D’Agostino-Pearson omnibus test is easy to understand. It first analyzes your data to determine
skewness (to quantify the asymmetry of the distribution) and kurtosis (to quantify the shape of the distribution). It then calculates how far each of these values differs from the value expected with a Gaussian distribution, and computes a single P value from the sum of the squares of these discrepancies. Unlike the Shapiro-Wilk test, this test is not affected if the data contains identical values.

If you to decide use normality tests, I recommend that you stop using the Kolmogorov-Smirnov test, and switch instead to the D’Agostino-Pearson omnibus test. But first rethink whether the normality tests are providing you with useful information.

Tuesday, March 25, 2008

What's new on BMJ.com?

 Internal and external validity of cluster randomised trials
Cluster randomised trials are essential for evaluating certain types of intervention, and there are often strong scientific reasons to conduct them. The extent to which investigators are designing and analysing these trials appropriately has improved, according to this systematic review of recent trials.


RAPID RESPONSES
Should we dump the metabolic syndrome? No
Should we dump the metabolic syndrome? Yes

Wednesday, March 19, 2008

Links to Health ABC sites

Links to Health ABC sites

So that you have it, here is some information about HealthABC.

The first is a link to the main NIH Health ABC site:
     http://www.nia.nih.gov/ResearchInformation/ScientificResources/HealthABCDescription.htm

This site contains a description of the study and further links to tables showing

     1.   Measurements in the study
               http://www.nia.nih.gov/ResearchInformation/ScientificResources/MeasurementsHealthABC.htm,

     2.  Laboratory analytes and repository plan
               http://www.nia.nih.gov/ResearchInformation/ScientificResources/LaboratoryAnalytes.htm

Another "main" location exists for the:

     1.  Health ABC website
               http://www.grc.nia.nih.gov/branches/ledb/healthabc/index.htm

          a.  Analysis Plan Proposal  [PDF]
                    http://www.grc.nia.nih.gov/branches/ledb/healthabc/analysisplan.pdf

          b.  Health ABC Contact Directory  [PDF]
                    http://www.grc.nia.nih.gov/branches/ledb/healthabc/directory.pdf

          c.  Ancillary Study Proposal  [PDF]
                    http://www.grc.nia.nih.gov/branches/ledb/healthabc/ancillaryform.pdf

          d.  Ancillary Studies Guidelines  [PDF]
                    http://www.grc.nia.nih.gov/branches/ledb/healthabc/ancillaryguidelines.pdf

          e.  Measurements in the Study  [PDF]
                    http://www.grc.nia.nih.gov/branches/ledb/healthabc/measurements.pdf

         f.  Publications and Presentations Guidelines  [PDF]
                    http://www.grc.nia.nih.gov/branches/ledb/healthabc/publicationguidelines.pdf

          g.  Health ABC Publications List  [PDF]
                    http://www.grc.nia.nih.gov/branches/ledb/healthabc/publicationslist.pdf


Friday, March 14, 2008

Microsoft Windows Hotkey Shortcut

Hotkeys/Shortcuts - Microsoft Windows, Office, etc.

Wednesday, March 12, 2008

A Thousand Winds

千风之歌 - A Thousand Winds


source: http://ripple1978.spaces.live.com/blog/cns!E3391D08951E33E!1377.entry
86 歲的李登輝首次在鏡頭前暢談生死觀,背景是一個人的教堂。他引用了日文歌曲《千風之歌》裡的一句,一言以蔽之概況了他對死亡的態度,「請別在我的墳前哭 泣,我不在那裡,我沒有沉睡在那裏,我已化為一千個風。」心內有所觸動,剛好前幾日在日本緯來台紅白大賽中截聽了此曲的後半段,旋律哀淡而清透,難得的催 人眩淚後繼爾能挺起胸膛,悲傷而不頹蘼。很震撼一曲詮釋死亡的音樂能帶給聽者這樣的感受。
聞報導的典故源自印弟安一對情侶,女人死後,男子欲相伴辭世循情,在清理愛侶遺物時發現她生前留下的詩篇,意勸情人好好地活下去,果然如亡者所願,男子不 再企圖自殺。但網路搜索的版本卻有更多種。真實已經不再重要。經典的東西總是在時光沉澱之後浸透出更濃郁的況味,超越人等與國度。
 
除了宗教力量,能夠告慰死亡的東西委實不多,很感激有一千個風,在「幸福後死不捨得,不幸時死不甘願;豐收後死不捨得,貧瘠時死不甘願」的生命步仗中,稍稍地卸下那份難於消彌的懼怕。
如果我的愛人先我而死,就當做他猶然活著的耳語,夜夜裏哼唱給我聽。如果我先死,這將是我的遺言,「最好你能忘記我,如果不能,那就唱這首歌。」 

评:千风之歌 李登辉生死观真情告白 凤凰视频 2008/01/08->01/05




千风之歌


请不要伫立在我坟前哭泣
我不在那里  我没有沉睡不醒 
化为千风  我已化身为千缕微风 
翱翔在无限宽广的天空里 
秋天  化身为阳光照射在田地间 
冬天  化身为白雪绽放钻石光芒 
晨曦升起时  幻化为飞鸟轻声唤醒你 
夜幕低垂时  幻化为星辰温柔守护你 
请不要伫立在我坟前哭泣 
我不在那里  我没有离开人间
化为千风  我已化身为千缕微风
翱翔在无限宽广的天空里 
化为千风  我已化身为千缕微风 
翱翔在无限宽广的天空里

化為千風 - Wikipedia
Do not stand at my grave and weep - Wikipedia

英文版──Do not stand at my grave and weep
by Mary Elizabeth Frye



I am not there, I do not sleep
I am a thousand winds that blow.
I am the diamond glints on snow.
I am the sunlight on ripened grain.
I am the gentle autumn rain.
When you awaken in the morning's hush
I am the swift uplifting rush
Of quiet birds in circled flight.
I am the soft stars that shine at night.
Do not stand at my grave and cry;
I am not there. I did not die.


日文版──千の風になって
 私のお墓の前で 泣かないでください
そこに私はいません 眠ってなんかいません
千の風に千の風になって
あの大きな空を吹き渡っています

秋には光になって
畑にふりそそぐ
冬はダイヤのように きらめく雪になる
朝は鳥になって あなたを

目覚めさせる
夜は星になって あなたを見守る私のお墓の前で
泣かないでください
そこに私はいません 死んでなんかいません
中文版──千風之歌
請不要佇立在我的墓前哭泣
因為我並不在那裡
我並沒有沉睡不醒
而是化為千風
我已化身為千縷微風
翱翔在無限寬廣的天空裡

秋天
我化身為陽光
照射在田野間
冬天
我化身為白雪
綻放鑽石般的閃耀光芒

晨曦升起之際
我幻化為飛鳥
輕聲地喚醒你
夜幕低垂之時
我幻化成星辰
溫柔地守護你

請不要佇立在我的墓前哭泣
因為我並不在那裡
我並沒有沉睡不醒
而是化為千風
我已化身為千縷微風
翱翔在無限寬廣的天空裡

I Am A Thousand Winds:
Don't stand at my grave and weep
I am not there, I do not sleep
I am the sunlight on the ripened grain
I am the gentle autumn rain


I am a thousand winds
I am a thousand winds that blow
I am the diamond glint on snow
I am a thousand winds that blow


Don't stand at my grave and cry
I am not there, I did not die
I am the swift rush of birds in flight
Soft stars that shine at night


I am a thousand winds
I am a thousand winds that blow
I am the diamond glint on snow
I am a thousand winds that blow


Don't stand at my grave and weep
I am not there, I do not sleep
I am the sunlight on the ripened grain
I am the gentle autumn rain


I am a thousand winds
I am a thousand winds that blow
I am the diamond glint on snow
I am a thousand winds that blow


I am the diamond glint on snow
I am a thousand winds that blow


  



Other version of wind s



不要站在我的墳前哭泣
我不在那,我未沉眠
我化為千風吹送著
我化為雪中閃耀的鑽石
我化為陽光灑落在成熟稻穀上
我化為綿綿的秋雨
當你在早晨的寧靜中醒來
我化為湍急的溪流
寧靜的鳥兒在上方盤旋
我化為溫柔的星星在夜晚閃耀著
不要站在我的墳前哭泣
我不在那,我還未離世

Reinventing Type 2 Diabetes: Pathogenesis, Treatment, and Prevention -- Unger 299 (10): 1185 -- JAMA

Reinventing Type 2 Diabetes: Pathogenesis, Treatment, and Prevention

JAMA has just put online the article by Professor Unger about the
lipocentric model of diabetes. Below is the direct link to the PDF.

http://elib.cdc.gov:2056/cgi/reprint/299/10/1185

Monday, March 10, 2008

Tips - Stata: -suest- for comparing regression coefficients between models

Tips - Stata: -suest- for  comparing regression coefficients between models
 
I found that 'suest' of Stata is a very useful command for comparing regression coefficients between different (separated) regression models EASILY. The most important, it can deal with complex survey data.
Enjoy!
 
 
suest is a postestimation command; see estcom and postest.

suest
combines the estimation results -- parameter estimates and associated (co)variance matrices -- stored under namelist into one parameter vector and simultaneous (co)variance matrix of the sandwich/robust type. This (co)variance matrix is appropriate even if the estimates were obtained on the same or on overlapping data.

Typical applications of suest are tests for intramodel and cross-model hypotheses using test or testnl, for example, a generalized Hausman specification test. lincom and nlcom may be used after suest to estimate linear combinations and nonlinear functions of coefficients. suest may also be used to adjust a standard VCE for clustering or survey design effects.
Different estimators are allowed, for example, a regress model and a probit model; the only requirement is that predict produce equation-level scores with the score option after an estimation command. The models may be estimated on different samples, due either to explicit if or in selection or to missing values. If weights are applied, the same weights (type and values) should be applied to all models in namelist. The estimators should be estimated without vce(robust) or vce(cluster clustvar) options. suest returns the robust VCE, allows the vce(cluster clustvar) option, and automatically works with results from the svy prefix command (only for vce(linearized)).

Since suest posts its results like a proper estimation command, its results can be stored via estimates store. Moreover, like other estimation commands, suest typed without arguments replays the results.

Friday, March 07, 2008

The Missing Semicolon!

 

New edition of The Missing Semicolon

www.sys-seminar.com/publications_tms.php.

SAS technical newsletter put together by our expert SAS consultants and trainers

 

 

 

Thursday, March 06, 2008

Metabolic Syndrome and Related Disorders Vol. 6, No. 1, Mar 2008 is now available online

 
Liebert Online Table of Contents Alert
Metabolic Syndrome and Related Disorders
Volume: 6, Number: 1 Mar 2008

The above issue is now available online from Liebert Online at:
http://www.liebertonline.com/toc/met/6/1

The table of contents for this issue is listed below. Click on the links below to view the abstract for each article, or click on the link above to read the table of contents online.

If you wish to update your preferences, please visit http://www.liebertonline.com/action/showPreferences?menuTab=Alerts

If you wish to unsubscribe from this alert, please visit http://www.liebertonline.com/action/removeAlert?ai=5aaq&sig=qHfFcznpGvU=

If you need any further help, please email us at e-toc@liebertpub.com.


  Nonalcoholic Steatohepatitis and the Metabolic Syndrome
  Joy Jiang and Natalie Torok
  Metabolic Syndrome and Related Disorders Mar 2008, Vol. 6, No. 1: 1-7.
Abstract | PDF (63 KB) | PDF Plus(102 KB)

  Metabolic Syndrome Status Changes with Fitness Level Change: A Retrospective Analysis
  Mark S. Maxwell, Brian R. Goslin, Ronald L. Gellish, Kenneth R. Hightower, Ronald E. Olson, Virinder K. Moudgil, and Gary D. Russi
  Metabolic Syndrome and Related Disorders Mar 2008, Vol. 6, No. 1: 8-14.
Abstract | PDF (72 KB) | PDF Plus(82 KB)

  Prediabetes and its Relationship with Obesity in Mexican Adults: The Mexican Diabetes Prevention (MexDiab) Study
  Fernando Guerrero-Romero, Martha Rodríguez-Morán, Ricardo Pérez-Fuentes, María C. Sánchez-Guillén, Manuel González-Ortiz, Esperanza Martínez-Abundis, Olga Brito-Zurita, Agustín Madero, Benigno Figueroa, Cristina Revilla-Monsalve, Silvia E. Flores-Martínez, Sergio Islas-Andrade, Ramón A. Rascón-Pacheco, Miguel Cruz, and José Sánchez-Corona
  Metabolic Syndrome and Related Disorders Mar 2008, Vol. 6, No. 1: 15-23.
Abstract | PDF (147 KB) | PDF Plus(126 KB)

  Genetic Variations at the CCAAT/Enhancer-binding Protein delta are Associated with Metabolic Phenotypes in the Japanese Population
  Hidesuke Kaji, Chika Fukano, Yukari Kimura, Etsuko Takiguchi, and Keiko Tanida
  Metabolic Syndrome and Related Disorders Mar 2008, Vol. 6, No. 1: 24-31.
Abstract | PDF (76 KB) | PDF Plus(85 KB)

  Unlimited Energy, Restricted Carbohydrate Diet Improves Lipid Parameters in Obese Children
  Brian S. Dunlap and James R. Bailes, Jr.
  Metabolic Syndrome and Related Disorders Mar 2008, Vol. 6, No. 1: 32-36.
Abstract | PDF (59 KB) | PDF Plus(67 KB)

  Comparisons of Metabolic Syndrome Definitions in Four Populations of the Asia-Pacific Region
  Crystal Man Ying Lee, Rachel R. Huxley, Mark Woodward, Paul Zimmet, Jonathan Shaw, Nam H. Cho, Hyung Rae Kim, Satu Viali, Makoto Tominaga, Dorte Vistisen, Knut Borch-Johnsen, and Stephen Colagiuri
  Metabolic Syndrome and Related Disorders Mar 2008, Vol. 6, No. 1: 37-46.
Abstract | PDF (326 KB) | PDF Plus(145 KB)

  Serum Uric Acid and Components of the Metabolic Syndrome in Non-Diabetic Populations in Mauritian Indians and Creoles and in Chinese in Qingdao, China
  Hairong Nan, Qing Qiao, Stefan Söderberg, Weiguo Gao, Paul Zimmet, Jonathan Shaw, George Alberti, Yanhu Dong, Ulla Uusitalo, Vassen Pauvaday, Pierrot Chitson, and Jaakko Tuomilehto
  Metabolic Syndrome and Related Disorders Mar 2008, Vol. 6, No. 1: 47-57.
Abstract | PDF (142 KB) | PDF Plus(123 KB)

  Metaboli Syndrome in Railway Employees and its Relation to Lifestyle Factors
  G.P. Parale, V.C. Patil, S.P. Patil, S.V. Sabale, C.V. Pethe, G.S. Manjunath, P.M. Kulkarni, V.N. Dhadke, and N.S. Deshpande
  Metabolic Syndrome and Related Disorders Mar 2008, Vol. 6, No. 1: 58-63.
Abstract | PDF (64 KB) | PDF Plus(70 KB)

  Insulin, hs-CRP, Leptin, and Adiponectin. An Analysis of their Relationship to the Metabolic Syndrome in an Obese Population with an Elevated Waist Circumference
  Eric Yan, Steve Chen, Kurt Hong, Woo Sung Kim, Anita Bajpai, Leo Treyzon, Luigi Gratton, Robert Elashoff, He-Jing Wang, Zhaoping Li, and David Heber
  Metabolic Syndrome and Related Disorders Mar 2008, Vol. 6, No. 1: 64-73.
Abstract | PDF (185 KB) | PDF Plus(134 KB)

  Dissertations
  Metabolic Syndrome and Related Disorders Mar 2008, Vol. 6, No. 1: 74-78.
Citation | PDF (57 KB) | PDF Plus(57 KB)

  Abstracts
  Metabolic Syndrome and Related Disorders Mar 2008, Vol. 6, No. 1: 79-83.
Citation | PDF (58 KB) | PDF Plus(58 KB)

Tuesday, March 04, 2008