On Selecting Relevant Covariates and Correlation Structure in Longitudinal Binary Model: Analyzing Impact of Height on Type II Diabetes

  • Md. Erfanul Hoque University of Dhaka
  • Mahfuzur Rahman Khokan University of Dhaka
  • Wasimul Bari University of Dhaka


To examine the impact of height on the occurrence of Type II diabetes, a longitudinal binary data set has been analyzed.  The relevant covariates were selected by using quasi-likelihood based information criteria (QIC) and correlation information criteria (CIC) was used to select the correlation structure appropriate for the repeated binary responses.  The consistent and efficient estimates of regression parameters were obtained from the generalized estimating equations (GEE).  With the selected covariates height, education level, gender and unstructured correlation structure, it is found that there exists a statistically significant inverse relationship between height of an individual and the development of Type II diabetes. Risk Ratios for different covariates along with standard errors and confidence intervals are also given.   

Author Biographies

Md. Erfanul Hoque, University of Dhaka
Statistics, Biostatistics & Informatics, Lecturer
Mahfuzur Rahman Khokan, University of Dhaka
Statistics, Biostatistics & Informatics, Lecturer
Wasimul Bari, University of Dhaka
Statistics, Biostatistics & Informatics, Professor


1. International Diabetes Federation (1998). Diabetes around the world.
2. Janghorbani M, and Amini M. (2010). Comparison of body mass index with abdominal obesity indicators and waist-to-stature ratio for prediction of type 2 diabetes: the Isfahan diabetes prevention study. Obesity Research & Clinical Practice 4: e25-e32.
3. WHO (2000). Obesity: preventing and managing the global epidemic. Report of a WHO consultation. World Health Organization Technical Report 2000; 894: i-xii, 1-253.
4. Schulze MB, Heidemann C, Schienkiewitz A. Bergmann MM, Hoffmann K, and Boeing H. (2006). Comparison of anthropometric characteristics in predicting the incidence of type 2 diabetes in the EPIC-Potsdam Study. Diabetes Care 29: 1921-1923.
5. Sicree RA, Zimmet PZ, Dunstan DW, Cameron AJ, Wel-born TA, and Shaw JE. (2008). Differences in height explain gender differences in the response to the oral glucose tolerance test the AusDiab study. Diabetic Medicine 25(3):296-302
6. Snijder MB, Dekker JM, Visser M, Bouter LM, Stehouwer CDA, Kostense PJ, Yudkin JS, Heine RJ, Nijpels G, and Seidell JC. (2003). Association of hip and thing circumferences independent of waist circumference with the incidence of type 2 diabetes: the Hoorn study. The American Journal of Clinical Nutrition 77: 1192-1197.
7. Bozorgmanesh M, Hadaegh F, Zabetian A, and Azizi F. (2011). Impact of hip circumference and height on incident diabetes: result from 6-year follow-up in the Tehran lipid and glucose study. Diabetic Medicine 28: 1330-1336.
8. Wang SL, Pan WH, Hwu CM, Ho LT, Lo CH, Lin SL, and Jong YS. (1997). Incidence of NIDDM and the effects of gender, obesity, and hyperinsulinaemia in Taiwan. Diabetologia 40: 1431-1438.
9. Njolstad I, Amesen E, and Lund-Larsen PG. (1998). Sex-differences in risk factors for clinical diabetes mellitus in a general population: a 12-years follow-up of the Finnmark Study. American journal of Epidemiology 147: 49-58.
10. Lorenzo C, Williams K, Stern MP, and Haffner SM. (2009). Height, ethnicity and the incidence of diabetes: the San Antonio Heart Study. Metabolism 58: 1530-1535.
11. Liang, K.-Y. and Zeger, S. L. (1986). Longitudinal data analysis using generalized linear models. Biometrika 73: 13–22.
12. Akaike, H. (1973). Information theory and an extension of the maximum likelihood principle. Budapest: Akademiai Kiado.
13. Pan, W. (2001a). Akaike’s information criterion in generalized estimating equations. Biometrics 57: 120-125.
14. Pan, W. (2001b). Model selection in estimating equations. Biometrics 57: 529-534.
15. Pan, W., and Lee, C. T. (2001). Bootstrap model selection in generalized linear models. Journal of Agricultural, Biological & Environmental Statistics 6: 49-61.
16. Cantoni, E., Flemming, J. M., and Ronchetti, E. (2005). Variable selection for marginal longitudinal generalized linear models. Biometrics 61: 507-514.
17. Cantoni, E., Flemming, J. M., and Ronchetti, E. (2008). Longitudinal variable selection by cross-validation in the case of many covariates. Statistics in Medicine 26: 919–930.
18. Hin, L. Y. and Wang, Y. G. (2009). Working-correlation-structure identification in generalized estimating equations. Statistics in Medicine 28(4): 642-658.
19. Kleinbaum D. G., and Klein M. (2005). Survival Analysis: A Self-Learning Text, 2nd edition. ISBN: Springer-Verlag New York, Inc; 105-127.
20. WHO. (2007). World Health Organization. "Definition, diagnosis and classification of diabetes mellitus and its complications: Report of a WHO Consultation. Part 1. Diagnosis and classification of diabetes mellitus".
21. WHO/IDF. (2006). Definition and diagnosis of diabetes mellitus and intermediate hyperglycemia: report of a WHO/IDF consultation. Geneva: World Health Organization. p. 21. ISBN 978-92-4-159493-6.
22. Centers for Disease Control and Prevention (CDC) and National Center for Chronic Disease Prevention and Health Promotion. (2009). The Power of Prevention: Chronic Disease: The Public Health Challenge of the 21st Century. Atlanta, GA:CDC
23. Hirschhorn J. N., Lindgren C. M., Daly M. J. et al. (2001). Genomewide linkage analysis of stature in multiple populations reveals several regions with evidence of linkage to adult height. American Journal of Human Genetics 69: 106-116.
24. Park H. S., Yim K. S., and Cho S. I. (2004). Gender differences in familial aggregation of obesity-related phenotypes and dietary intake pattern in Korean families. Annals of Epidemiology 14: 486-491.
25. Li J. K., Ng M. C., So W. Y. et al. (2006). Phenotype and genetic clustering of diabetes and metabolic syndrome in Chinese families with type 2 diabetes mellitus. Diabetes/Metabolism Research and Reviews 22: 46-52.
How to Cite
Hoque, M. E., Khokan, M. R., & Bari, W. (2015). On Selecting Relevant Covariates and Correlation Structure in Longitudinal Binary Model: Analyzing Impact of Height on Type II Diabetes. Austrian Journal of Statistics, 44(3), 3-15. https://doi.org/https://doi.org/10.17713/ajs.v44i3.17