Friday, January 16, 2009

Comparisons of percentage body fat, body mass inde... Flegal 2009

BACKGROUND: Body mass index (BMI), waist circumference (WC), and the waist-stature ratio (WSR) are considered to be possible proxies for adiposity. OBJECTIVE: The objective was to investigate the relations between BMI, WC, WSR, and percentage body fat (measured by dual-energy X-ray absorptiometry) in adults in a large nationally representative US population sample from the National Health and Nutrition Examination Survey (NHANES). DESIGN: BMI, WC, and WSR were compared with percentage body fat in a sample of 12,901 adults. RESULTS: WC, WSR, and BMI were significantly more correlated with each other than with percentage body fat (P < 0.0001 for all sex-age groups). Percentage body fat tended to be significantly more correlated with WC than with BMI in men but significantly more correlated with BMI than with WC in women (P < 0.0001 except in the oldest age group). WSR tended to be slightly more correlated with percentage body fat than was WC. Percentile values of BMI, WC, and WSR are shown that correspond to percentiles of percentage body fat increments of 5 percentage points. More than 90% of the sample could be categorized to within one category of percentage body fat by each measure. CONCLUSIONS: BMI, WC, and WSR perform similarly as indicators of body fatness and are more closely related to each other than with percentage body fat. These variables may be an inaccurate measure of percentage body fat for an individual, but they correspond fairly well overall with percentage body fat within sex-age groups and distinguish categories of percentage body fat.

Thursday, January 15, 2009

How should I calculate a within-subject coefficient of variation?
by Martin Bland
In the study of measurement error, we sometimes find that the within-subject variation is not uniform but is proportional to the magnitude of the measurement. It is natural to estimate it in terms of the ratio within-subject standard deviation/mean, which we call the within-subject coefficient of variation.
In our British Medical Journal Statistics Note on the subject, Measurement error proportional to the mean, Doug Altman and I described how to calculate this using a logarithmic method. We take logarithms of the data and then find the within-subject standard deviation. We take the antilog of this and subtract one to get the coefficient of variation.
Alvine Bissery, statistician at the Centre d'Investigations Cliniques, Hôpital européen Georges Pompidou, Paris, pointed out that some authors suggest a more direct approach. We find the coefficient of variation for each subject separately, square these, find their mean, and take the square root of this mean. We can call this the root mean square approach. She asked what difference there is between these two methods.
In practice, there is very little difference between these two ways of estimating within-subject coefficient of variation. They give very similar estimates.
This simulation, done in Stata, shows what happens. (The function invnorm(uniform()) gives a standard Normal random variable.)
. clear
Set sample size to 100.
. set obs 100
obs was 0, now 100
We generate true values for the variable whose measurement we are simulating.
. gen t=6+invnorm(uniform())
We generate measurements x and y, with error proportional to the true value.
. gen x = t + invnorm(uniform())*t/20
. gen y = t + invnorm(uniform())*t/20
Calculate the within-subject variance for the natural scale values. (Within-subject variance is given by difference squared over 2 when we have pairs of subjects.)
. gen s2 = (x-y)^2/2
Calculate subject mean and s squared / mean squared, i.e. CV squared.
. gen m=(x+y)/2
. gen s2m2=s2/m^2
Calculate mean of s squared / mean squared.
. sum s2m2
Variable Obs Mean Std. Dev. Min Max
---------+-----------------------------------------------------
s2m2 100 .0021519 .0030943 4.47e-07 .0166771
The within-subject CV is the square root of the mean of s squared / mean squared:
. disp sqrt(.0021519)
.04638858
Hence the within-subject CV is estimated to be 0.046 or 4.6%.
Now the log method. First we log transform.
. gen lx=log(x)
. gen ly=log(y)
Calculate the within-subject variance for the log values.
. gen s2l = (lx-ly)^2/2
. sum s2l
Variable Obs Mean Std. Dev. Min Max
---------+-----------------------------------------------------
s2l 100 .0021566 .003106 4.46e-07 .0167704
The within-subject standard deviation on the log scale is the square root of the mean within-subject variance. The CV is the antilog (exponent since we are using natural logarithms) minus one.
. disp exp(sqrt(.0021566))-1
.04753439
Hence the within-subject CV is estimated to be 0.048 or 4.8%. Compare this with the direct estimate, which was 4.6%. The two estimates are almost the same.
If we average the CV estimated for each subject, rather than their squares, we do not get the same answer.
Calculate subject CV and find the mean.
. gen cv=sqrt(s2)/m
. sum cv
Variable Obs Mean Std. Dev. Min Max
---------+-----------------------------------------------------
cv 100 .0361173 .0292567 .0006682 .1291399
This gives us the within-subject CV estimate = 0.036 or 3.6%. This is considerably smaller than the estimates by the root mean square method or the log method. The mean CV is not such a good estimate and we should avoid it.
Sometimes researchers estimate the within-subject CV using the mean and within-subject standard deviation for the whole data set. They estimate the within-subject standard deviation in the usual way, as if it were a constant. They then divide this by the mean of all the observations to give a CV. This appears to be a completely wrong approach, as it estimates a single value for a varying quantity. However, it often works remarkably well, though why it does I do not know. It works in this simulation:
. sum x y s2
Variable Obs Mean Std. Dev. Min Max
---------+-----------------------------------------------------
x 100 6.097301 1.012154 3.62283 8.696612
y 100 6.081827 1.000043 3.759932 8.447584
s2 100 .0823188 .1212132 .0000193 .605556
The within-subject standard deviation is the square root of the mean of s2 and the overall mean is the average of the X mean and the Y mean. Hence the estimate of the within-subject CV is:
. disp sqrt(.0823188)/( (6.097301 + 6.081827)/2)
.04711545
So this method gives the estimated within-subject CV as 0.047 or 4.7%. This can be compared to the estimates by the root mean squared CV and the log methods, which were 4.6% and 4.8%. Why this should be I do not know, but it works. I do not know whether it would work in all cases, so I do not recommend it.
We can find confidence intervals quite easily for estimates by either the root mean square method or the log method. For the root mean square method, this is very direct. We have the mean of the squared CV, so we use the usual confidence interval for a mean on this, then take the square root.
. sum s2m2
Variable Obs Mean Std. Dev. Min Max
---------+-----------------------------------------------------
s2m2 100 .0021519 .0030943 4.47e-07 .0166771
The standard error is the standard deviation of the CVs divided by the square root of the sample size.
. disp .0030943/sqrt(100)
.00030943
The 95% confidence interval for the squared CV can be found by the mean minus or plus 1.96 standard errors. If the sample is small we should use the t distribution here. However, the squared CVs are unlikely to be Normal, so the CI will still be very approximate.
. disp .0021519 - 1.96*.00030943
.00154542
. disp .0021519 + 1.96*.00030943
.00275838
The square roots of these limits give the 95% confidence interval for the CV.
disp sqrt(.00154542)
.03931183
. disp sqrt(.00275838)
.05252028
Hence the 95% confidence interval for the within-subject CV by the root mean square method is 0.039 to 0.053, or 3.9% to 5.3%.
For the log method, we can find a confidence interval for the within-subject standard deviation on the log scale. The standard error is sw/root(2n(m-1)), where sw is the within-subject standard deviation, n is the number of subjects, and m is the number of observations per subject.
In the simulation, sw = root(0.0021566) = 0.0464392, n = 100, and m = 2.
Hence the standard error is 0.0464392/root(2 * 100 * (2-1)) = 0.0032837.
The 95% confidence interval is 0.0464392 - 1.96*0.0032837 = 0.0400031 to 0.0464392 + 1.96*0.0032837 = 0.0528753.
Finally, we antilog these limits and subtract one to give confidence limits for the CV: exp(0.0400031)-1 = 0.040814 and exp(0.0528753)-1 = 0.05429817, so the 95% confidence interval for the within-subject CV is 0.041 to 0.053, or 4.1% to 5.3%. These are slightly narrower than the root mean square confidence limits, but very similar.
I would conclude that either the root mean square method or the log method can be used.

A List of Statistics Notes of the British Medical Journal

A List of Statistics Notes of the British Medical Journal
What Is Gestational Diabetes?

see Diabetes Care

2009 Diabetes Clinical Practice Recommendations

2009 Diabetes Clinical Practice Recommendations

http://care.diabetesjournals.org/content/vol32/Supplement_1/

Friday, January 09, 2009

In Search of a Good Doctor - NYTimes.com


Web Sites
The following is a compilation of recommendations from the physicians mentioned in this column. It is by no means exhaustive but should provide a starting point for those interested in researching their doctors or conditions.
Decision Making:
1. Foundation for Informed Medical Decision Making: This site (www.informedmedicaldecisions.org), endorsed by the Society of General Internal Medicine, offers information and tips on how patients can become more actively involved in the medical decision-making process and get the care that is right for them.
Researching on the Web:
1. Medical Library Association: The Medical Library Association has compiled a guide (www.mlanet.org/resources/userguide.html) to help individuals sort through the myriad offerings on the Web. Included is an M.L.A. “Top 10” most useful consumer health Web sites.
Researching Physicians and Hospitals:
1. State boards of medicine: State medical boards can provide information regarding a doctor’s licensure, training and history of disciplinary action.
• The New York State Board for Medicine (www.op.nysed.gov/) and, in New York City, the New York County Medical Society (www.nycms.org).
• To locate other state boards, the American Medical Association provides a listing. (www.ama-assn.org/ama/pub/category/2645.html)
2. The American Board of Medical Specialties (www.abms.org/): The A.B.M.S. has a doctor-finder function that will pull up a physician’s board certification.
3. The Agency for Healthcare Research and Quality (www.talkingquality.gov/compendium/index.html) : The A.H.R.Q. has compiled health care “report cards” that provide comparative information on the quality of health plans, hospitals, medical groups, individual physicians, nursing homes and other providers of care.
4. The United States Department of Health and Human Services, Hospital Compare (www.hospitalcompare.hhs.gov): This site provides information from participating hospitals on how well those hospitals care for patients with certain medical conditions or surgical procedures. Also included are the results from patient surveys on quality of care during hospital stays.
5. State of California Report Card (www.opa.ca.gov/report_card): This site, from the California Office of the Patient Advocate, provides report cards of various health insuranceplans, medical groups and doctors in California. Some other states have similar sites.
Researching a Disease, Condition or Procedure:
1. National professional medical societies: Below is a sampling of some of the larger national societies. Many societies have sites specifically designed for patients.
• American Academy of Pediatrics (www.aap.org)
• American Society of Anesthesiologists (www.asahq.org)
• American College of Physicians (www.acponline.org)
• Society of General Internal Medicine (www.sgim.org)
• American Society of Clinical Oncology (www.asco.org)
• American Academy of Family Physicians (www.aafp.org)
• American College of Surgeons (www.facs.org)
• American College of Obstetricians and Gynecologists (www.acog.org)
2. Medline Plus: Supported by both the National Library of Medicine and the National Institutes of Health, Medline Plus (www.medlineplus.gov) offers patients an array of information on health topics, drugs and current news. In addition, there are interactive tutorials, surgery videos, health information for older adults and links to clinical trials and health information for older adults.
3. Centers for Disease Control and Prevention: The C.D.C. (www.cdc.gov) is an excellent patient resource for communicable diseases and preventive and public health.
4. The American College of Physicians Foundation: In conjunction with the American College of Physicians, the ACP Foundation (foundation.acponline.org) offers succinct and clear health information for patients.
5. For cancer patients: There are three Web sites that can serve as excellent starting points:
• The American Society of Clinical Oncology (www.cancer.net)
6. My HealtheVet: Veterans, their advocates and employees of the Veterans Health Administration have access to extensive quality information and patient education resources through this site (www.myhealth.va.gov), from the Department of Veterans Affairs.

full text here

Thursday, January 08, 2009

AHA/CDC statement: Markers of Inflammation and Cardiovascular Disease

http://circ.ahajournals.org/cgi/content/full/107/3/499#SEC6

Markers of Inflammation and Cardiovascular Disease Application to Clinical and Public Health Practice: A Statement for Healthcare Professionals From the Centers for Disease Control and Prevention and the American Heart Association

In 1998, the American Heart Association convened Prevention Conference V to examine strategies for the identification of high-risk patients who need primary prevention. Among the strategies discussed was the measurement of markers of inflammation.1 The Conference concluded that "many of these markers (including inflammatory markers) are not yet considered applicable for routine risk assessment because of: (1) lack of measurement standardization, (2) lack of consistency in epidemiological findings from prospective studies with endpoints, and (3) lack of evidence that the novel marker adds to risk prediction over and above that already achievable through the use of established risk factors." The National Cholesterol Education Program Adult Treatment Panel III Guidelines identified these markers as emerging risk factors,1a which could be used as an optional risk factor measurement to adjust estimates of absolute risk obtained using standard risk factors. Since these publications, a large number of peer-reviewed scientific reports have been published relating inflammatory markers to cardiovascular disease (CVD). Several commercial assays for inflammatory markers have become available. As a consequence of the expanding research base and availability of assays, the number of inflammatory marker tests ordered by clinicians for CVD risk prediction has grown rapidly. Despite this, there has been no consensus from professional societies or governmental agencies as to how these assays of markers of inflammation should be used in clinical practice.

How large of SE/Variance is too large?

How large of SE/Variance is too large?

A relative standard error (RSE) greater than 30% was used to identify unreliable estimates. The RSE is defined as the ratio of the standard error of the estimate divided by the estimate multiplied by 100 [RSE = 100 x SE(b) / |b|, which is similar to Coefficient of Variation (CV) =  SD(b) / |b|, ].
Example of a MMWR article about Surveillance for Fatal and Nonfatal Injuries
National counts or estimates determined to be unstable are indicated with a footnote in the tables. Fatal injuries were identified as unstable if the number of deaths was <20 or the coefficient of variation (CV) was >30%, where CV = (SE / number of deaths) × 100. Nonfatal injuries were identified as unstable if the national estimate was <1,200, the number of sample cases used was <20, or CV was >30%, where CV = (SE / national estimate) × 100.

Why are rates based on fewer than 20 cases marked as being unreliable?
by NY State Department of Health: Data Sources and Tools - Chronic Diseases and Conditions

Example of a NHIS article (Variance Estimation and Significance Testing)
"... Standard errors are shown for all percentages in the tables (but not for the frequencies). Estimates with relative standard errors (RSE) of greater than 30% and less than or equal to 50% are considered statistically unreliable and are indicated with an asterisk (*). Estimates with a relative standard error greater than 50% are indicated with a dagger (†) and the estimates are not shown..."

More about reliability

Wednesday, January 07, 2009

Comments on : Women may get diabetes earlier than men

Comments on : Women may get diabetes earlier than men

O-o-m-m. This conclusion must base on some assumptions, such as the definition of pre-diabetes is correct, pre-diabetes must move into diabetes, the normal level of these index are exactly same between women and men (no gender difference?), incubation from pre-diabetes to clinic diabetes are exactly same between women and men, etc. In light of my poor memory, there are very solid evidences that women are living longer than men generally, women have less heart disease risk than men. I don't have any evidences to doubt their findings (actually these are very interesting findings), but I doubt the simple conclusion.

Just some thoughts.

_____________________________________________
Subject:        Chicago Health: Women may get diabetes earlier than men

January 7, 2009
Chicago Health
Women May Get Diabetes Earlier Than Men
http://www.nbcchicago.com/health/topics/Women_May_Get_Diabetes_Earlier_Than_Men.html
Women may show signs of diabetes far earlier than men, according to new research. The findings could lead to new diabetes screening procedures to help identify who is at greatest risk of developing the disease.

Researchers from the University of Buffalo studied newly identified risk factors for type 2 diabetes, a disease of metabolism in which the body produces insufficient amounts or fails to use insulin, the hormone needed for cells to process glucose. This leads to a buildup of glucose in the bloodstream. Type 2 diabetes increases the risk of developing heart disease, stroke, eye and kidney diseases and other chronic illnesses.

According to the National Institutes of Health, an estimated 20.8 million Americans7 percent of the populationhad diabetes in 2005, 6.2 million of them undiagnosed. Of the diabetic population, an estimated 90 to 95 percent had the most common form of the disease, type 2 diabetes. In addition, government estimates indicate that at least 43 million Americans have prediabetes, a condition that occurs when blood glucose levels are high but not high enough to be classified as diabetes.

Recent research has shown that levels of chronic sub-acute inflammation, blood clotting factors and dysfunction in the cells lining the inside of arteries may be indicators of diabetes risk factors when tested in the blood.

The Buffalo researchers looked at 1,455 healthy men and women who participated in the Western New York Study between 1996 and 2001. That study tracked alcohol consumption and risk factors for cardiovascular disease. Participants were disease free with no indications of diabetes. They were tested and given physical examinations at the outset and during a six-year follow-up period. The researchers re-examined the participants between 2002 and 2004 and compared the new blood tests to results from 1996-2001.

The blood tests included fasting glucose and insulin levels, C-reactive protein, proinflammatory markers and markers for dysfunction of endothelial tissue lining blood vessels. C-reactive protein (CRP) is a substance produced by the liver that increases whenever there is inflammation in the body. CRP levels rise whenever there is an immune system response or activation. Women in the study had a higher incidence of prediabetes than men. Researchers could not explain why the differences occurred and said more studies are needed.

"Because these pre-diabetic markers are not routinely assessed and because diabetes is strongly linked with coronary heart disease, the study may help explain why the decline in death rates for heart disease in diabetic women lags behind that of diabetic men," lead author Dr. Richard Donahue said in a press release.

Donahue added: "Previous research had shown that hypertension and cholesterol were elevated among women who later developed diabetes. However, current findings that these novel risk factors [markers of endothelial dysfunction, chronic sub-acute inflammation and blood clotting factors] are elevated among women even earlier than previously recognized does suggest that the 'diabetic clock' starts ticking sooner for women than for men."

He suggested that women whose blood glucose levels increase over time should perhaps be screened more intensively for cardiovascular disease.

Saturday, January 03, 2009

Epidemiology & Biostatistics