The search for a method that utilizes biological information to predict humans’ place of origin has occupied scientists for millennia. Over the past four decades, scientists have employed genetic data in an effort to achieve this goal but with limited success. While biogeographical algorithms using next-generation sequencing data have achieved an accuracy of 700 km in Europe, they were inaccurate elsewhere. Here we describe the Geographic Population Structure (GPS) algorithm and demonstrate its accuracy with three data sets using 40,000–130,000 SNPs. GPS placed 83% of worldwide individuals in their country of origin. Applied to over 200 Sardinians villagers, GPS placed a quarter of them in their villages and most of the rest within 50 km of their villages. GPS’s accuracy and power to infer the biogeography of worldwide individuals down to their country or, in some cases, village, of origin, underscores the promise of admixture-based methods for biogeography and has ramifications for genetic ancestry testing.

Geographic population structure analysis of worldwide human populations infers their biogeographical origins

TOFANELLI, SERGIO;
2014-01-01

Abstract

The search for a method that utilizes biological information to predict humans’ place of origin has occupied scientists for millennia. Over the past four decades, scientists have employed genetic data in an effort to achieve this goal but with limited success. While biogeographical algorithms using next-generation sequencing data have achieved an accuracy of 700 km in Europe, they were inaccurate elsewhere. Here we describe the Geographic Population Structure (GPS) algorithm and demonstrate its accuracy with three data sets using 40,000–130,000 SNPs. GPS placed 83% of worldwide individuals in their country of origin. Applied to over 200 Sardinians villagers, GPS placed a quarter of them in their villages and most of the rest within 50 km of their villages. GPS’s accuracy and power to infer the biogeography of worldwide individuals down to their country or, in some cases, village, of origin, underscores the promise of admixture-based methods for biogeography and has ramifications for genetic ancestry testing.
2014
Eran, Elhaik; Tatiana, Tatarinova; Dmitri, Chebotarev; Ignazio S., Piras; Carla Maria, Calò; Antonella De, Montis; Manuela, Atzori; Monica, Marini; Tofanelli, Sergio; Paolo, Francalacci; Luca, Pagani; Chris Tyler, Smith; Yali, Xue; Francesco, Cucca; Theodore G., Schurr; Jill B., Gaieski; Carlalynne, Melendez; Miguel G., Vilar; Amanda C., Owings; Rocío, Gómez; Ricardo, Fujita; Fabrício R., Santos; David, Comas; Oleg, Balanovsky; Elena, Balanovska; Pierre, Zalloua; Himla, Soodyall; Ramasamy, Pitchappan; Arunkumar, Ganeshprasad; Michael, Hammer; Lisa Matisoo, Smith; R., Spencer Wells; Oscar, Acosta; Syama, Adhikarla; Christina J., Adler; Jaume, Bertranpetit; Andrew C., Clarke; Alan, Cooper; Clio S. I., Der Sarkissian; Wolfgang, Haak; Marc, Haber; Li, Jin; Matthew E., Kaplan; Hui, Li; Shilin, Li; Begoña Martínez, Cruz; Nirav C., Merchant; John R., Mitchell; Laxmi, Parida; Daniel E., Platt; Lluis Quintana, Murci; Colin, Renfrew; Daniela R., Lacerda; Ajay K., Royyuru; Jose Raul, Sandoval; Arun Varatharajan, Santhakumari; David F., Soria Hernanz; Pandikumar, Swamikrishnan; Janet S., Ziegle
File in questo prodotto:
File Dimensione Formato  
Elhaik et al NATURE COMMUNICATIONS 2014.pdf

accesso aperto

Tipologia: Versione finale editoriale
Licenza: Creative commons
Dimensione 4.57 MB
Formato Adobe PDF
4.57 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11568/433469
Citazioni
  • ???jsp.display-item.citation.pmc??? 55
  • Scopus 97
  • ???jsp.display-item.citation.isi??? 94
social impact