题名

Removing an Extraneous Effect in Measuring Correlation Coefficient

DOI

10.29973/JCSA.200712.0004

作者

Chong-Yau Fu;Min-Hsun Tsai

关键词

Pearson correlation coefficient ; partial correlation coefficient ; corrected correlation coefficient ; deviated value ; stratification ; regression fitted

期刊名称

中國統計學報

卷期/出版年月

45卷4期(2007 / 12 / 01)

页次

386 - 401

内容语文

英文

英文摘要

Pearson correlation coefficient is a measure for the linearity degree between two continuous variables (X, Y). When this measure is spurious from an extraneous variable effect (Z), the partial correlation coefficient is often referred to adjust. However, it is possible to obtain a biased result from only the linear effect removed in the partial correlation coefficient applied. This study combines the techniques of ”stratification” and ”regression fitting” to replace the deviated value of the formula in the Pearson correlation coefficient. The r(subscript PR), r(subscript GM) and r(subscript GR) are proposed and investigated through simulation technique. And, the results show that r(subscript GR), r(subscript PR) perform very well in simulation Ⅰ, and r(subscript GR) still performs very well in simulation Ⅱ. Meanwhile, in a fetal study, the linear association between femur length and weight is estimated to be about 0.5 (r(subscript PR), r(subscript GR)), instead of 0.91 (uncorrected Pearson correlation coefficient), which is masked from a strong linear association with gestational age. Therefore, r(subscript GR) is the most reliable estimation and r(subscript PR) provides a possible method for more than one extraneous variable adjusted. Also, noted that the performance of r(subscript PR) varies with two data structure (simulation Ⅰ & Ⅱ).

主题分类 基礎與應用科學 > 統計
参考文献
  1. Fu, C. Y.,Hung, J. H.,Liu, S. H.,Lin, H. C.(2006).The implementation of factor analysis for stratified data.Journal of the Chinese Statistical Association,44,296-315.
    連結:
  2. Cochran, W. G.(1977).Sampling Techniques.New York:Wiley.
  3. Fleiss, J. L.,Tanur, J. M.(1971).A note on the partial correlation coefficient.The American Statistician,25,43-45.
  4. Hastie, T.,Tibshirani, R.,Friedman, J.(2001).The elements of statistical learning: data mining, inference, and prediction.New York:Springer.
  5. Hung J. H.,Fu C. Y., Hung J.(2006).Combination of fetal doppler velocimetric resistance values predict academic grown-restricted neonates.American Institute of Ultrasound in Medicine,25,957-962.
  6. Selvin S.(1995).Practical Biostatistical Method.Duxbury Press.
  7. StataCorp(2005).Stat Statistical Software: Release 9.College Station, TX:Stata-Corp LP.
  8. Steinberg, Dan,Colla(1997).CART-Classification and Regression Trees.San Diego, CA:Salford Systems.