Hierarchical Models for Mitochondrial DNA Sequence Data

Authors

  • Paola Berchialla University of Torino, Italy

DOI:

https://doi.org/10.17713/ajs.v36i1.320

Abstract

We introduce a Bayesian hierarchical model for mitochondrial DNA sequence data, which is fitted via acceptance-rejection algorithms. The model incorporates parametric models of population history explicitly as well as a mutational process allowing for a simultaneous parameter estimation whose importance has become increasingly clear in many recent studies. The model is applied to a sample of DNA sequences from the Italian population.

References

Anderson, S., Bankier, A. T., Barell, B. G., De Bruijn, M. H. L., and Coulson, A. R. (1981). Sequence and organization of the human mitochondrial genome. Nature, 290, 457-465.

Bataille, M., Crainic, K., Leterreux, M., Durigon, M., and de Mazancourt, P. (1999). Multiplex amplification of mitochondrial DNA for human and species identification in forensic evaluation. Forensic Science International, 99, 165-170.

Bonneuill, N. (1998). Population paths implied by the mean number of pairwise nucleotide differences among mitochondrial DNA sequences. Annals of Human Genetics, 62, 61-73.

Cann, R. L., Stoneking, M., and Wilson, A. C. (1987). Mitochondrial DNA and human evolution. Nature, 325, 31-36.

Cavalli-Sforza, L. L., Menozzi, A., and Piazza, A. (1994). The History and Geography of Human Genes. Princeton: Princeton University Press.

Denver, D. R., Morris, K., Lynch, M., Vassilieva, L. L., and Thomas, W. K. (2000). High direct estimate of the mutation rate in the mitochondrial genome of Caenorhabditis elegans. Science, 289, 2342-2344.

Foreman, L. A., Smith, A. F. M., and Evett, I. W. (1997). Bayesian analysis of DNA profiling data in forensic identification applications. Journal of the Royal Statistical Society, A, 160, 429-469.

Fullerton, S. M., Harding, R. M., Boyce, A. J., and Clegg, J. B. (1994). Molecular and population genetics analysis of allelic sequence diversity at the human beta-clobin locus. Proceedings of the National Academy of Science of the USA, 91, 1805-1809.

Hammer, M. F. (1995). A recent common ancestry for human Y chromosomes. Nature, 378, 376-378.

Hartl, D. L., and Clark, A. G. (1997). Principles of Population Genetics (3rd ed.). Sunderland: Sinauer.

Hein, J. (2004). Human evolution: pedigrees for all humanity. Nature, 431, 562-6.

Heyer, E., Zietkiewicz, E., Rochowski, A., Yotova, V., Puymirat, J., and Labuda, D. (2001). Phylogenetic and familial estimates of mitochondrial substitution rates: Study of control region mutations in deep-rooting pedigrees. American Journal of Human Genetics, 69, 1113-1126.

Hudson, R. R. (1990). Gene genealogies and the coalescent process. In D. J. Futuyama and J. D. Antonovics (Eds.), Oxford Surveys in Evolutionary Biology (Vol. 7, p. 1-44). New York: Oxford University Press.

Jaruzelska, J., Zietkiewicz, E., Batzer, M., Cole, D. E. C., Moisan, J. P., Scozzari, R., et al. (1999). Spatial and temporal distribution of the neutral polymorphisms in the last ZFX intron: Analysis of the Haplotype structure and genealogy. Genetics, 152, 1091-1101.

Jorde, L. B., Watkins, W. S., and Bamshad, M. J. (2001). Population genomics: a bridge from evolutionary history to genetic medicine. Human Molecular Genetics, 10, 2199-2207.

Kingman, J. F. C. (1982). The coalescent. Stochastic Processes and their Applications, 13, 225-248.

Monson, K. L., Miller, K. W. P., Wilson, M. R., DiZinno, J. A., and Budowle, B. (2002). The mtDNA population database: an integrated software and database resource for forensic comparison. Forensic Science International, 4.

(http://www.fbi.gov/hq/lab/fsc/backissu/april2002/miller1.htm)

Nicholson, G., Smith, A. V., Jonsson, F., Gustaffson, O., Stefansson, K., and Donnelly, P. (2002). Assessing population differentiation and isolation from single-nucleotide polymorphism data. Journal of the Royal Statistical Society, B, 64, 695-715.

R Development Core Team. (2006). R: A language and environment for statistical computing. Vienna, Austria. (ISBN 3-900051-07-0)

Ripley, B. D. (1987). Stochastic Simulation. New York: Wiley.

Stephens, M. (2001). Inference under the coalescent. In D. J. Balding, C. Cannings, and M. Bishop (Eds.), Handbook of Statistical Genetics. Chichester: Wiley.

Stephens, M., and Donnelly, P. (2000). Inference in molecular population genects (with discussion). Journal of the Royal Statistical Society, B, 62, 605-635.

Swofford, D. L., Olsen, G. J., Waddell, P. J., and Hillis, D. M. (1995). Accomodating rate heterogeneity among sites. In D. M. Hillis, C. Moritz, and B. K. Mable (Eds.), Molecular Systematics. Sunderland: Sinauer.

Tavaré, S., Balding, D. J., Griffiths, R. C., and Donnelly, P. (1997). Inferring coalescence times from DNA sequence data. Genetics, 145, 505-518.

Wakeley, J. (1994). Substitution rate variation among sites and the estimation of transition bias. Molecular Biology and Evolution, 11, 436-442.

Weir, B. S. (1990). Genetic Data Analysis. Sunderland: Sinauer.

Wilson, I., and Balding, D. J. (1998). Genealogical inference from microsatellite data. Genetics, 150, 499-510.

Wilson, I., Weale, M. E., and Balding, D. J. (2003). Inferences from DNA data: population histories, evolutionary processes and forensic match probabilities. Journal of the Royal Statistical Society, A, 166, 1-33.

Yang, Z. (1996). Statistical properties of a DNA sample under the finite-site model. Genetics, 144, 1941-1950.

Downloads

Published

2016-04-03

Issue

Section

Articles

How to Cite

Hierarchical Models for Mitochondrial DNA Sequence Data. (2016). Austrian Journal of Statistics, 36(1), 53–64. https://doi.org/10.17713/ajs.v36i1.320