ObStruct: A method to objectively analyse factors driving population structure using Bayesian ancestry profiles

Gayevskiy, Velimir, Klaere, Steffen, Knight, Sarah and Goddard, Matthew R. (2014) ObStruct: A method to objectively analyse factors driving population structure using Bayesian ancestry profiles. PLoS ONE, 9 (1). e85196. ISSN 1932-6203

Full content URL: http://journals.plos.org/plosone/article?id=10.137...

Documents
ObStruct: A method to objectively analyse factors driving population structure using Bayesian ancestry profiles
[img]
[Download]
[img]
Preview
PDF
ObStruct.pdf - Whole Document
Available under License Creative Commons Attribution.

700kB
Item Type:Article
Item Status:Live Archive

Abstract

Bayesian inference methods are extensively used to detect the presence of population structure given genetic data. The primary output of software implementing these methods are ancestry profiles of sampled individuals. While these profiles robustly partition the data into subgroups, currently there is no objective method to determine whether the fixed factor of interest (e.g. geographic origin) correlates with inferred subgroups or not, and if so, which populations are driving this correlation. We present ObStruct, a novel tool to objectively analyse the nature of structure revealed in Bayesian ancestry profiles using established statistical methods. ObStruct evaluates the extent of structural similarity between sampled and inferred populations, tests the significance of population differentiation, provides information on the contribution of sampled and inferred populations to the observed structure and crucially determines whether the predetermined factor of interest correlates with inferred population structure. Analyses of simulated and experimental data highlight ObStruct's ability to objectively assess the nature of structure in populations. We show the method is capable of capturing an increase in the level of structure with increasing time since divergence between simulated populations. Further, we applied the method to a highly structured dataset of 1,484 humans from seven continents and a less structured dataset of 179 Saccharomyces cerevisiae from three regions in New Zealand. Our results show that ObStruct provides an objective metric to classify the degree, drivers and significance of inferred structure, as well as providing novel insights into the relationships between sampled populations, and adds a final step to the pipeline for population structure analyses. © 2014 Gayevskiy et al.

Additional Information:This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Keywords:article, Bayes theorem, computer program, genetic database, genetic variability, human, microsatellite marker, New Zealand, nonhuman, partition coefficient, population differentiation, population structure, Saccharomyces cerevisiae, structure analysis, Bayes Theorem, Continental Population Groups, Genetic Variation, Humans, Microsatellite Repeats, Models, Genetic, Phylogeography, Population Dynamics, Software, JCOpen
Subjects:C Biological Sciences > C400 Genetics
C Biological Sciences > C170 Population Biology
Divisions:College of Science > School of Life Sciences
Related URLs:
ID Code:17145
Deposited On:15 May 2015 09:52

Repository Staff Only: item control page