Predictive DNA analysis for biogeographical ancestry

Elaine Y.Y. Cheung, Michelle Elizabeth Gahan, Dennis McNevin

Research output: Contribution to journalArticle

Abstract

Establishment of national DNA databases in Australia and overseas has increased the number of criminal convictions, yet a high volume of serious crime cases remain with no suspect profile nor any DNA database matches. In these circumstances prediction of biogeographical ancestry (BGA) and externally visible characteristics can assist by providing forensic intelligence in conjunction with, or in place of, eyewitness testimonies. To predict the BGA of an individual requires: genetic markers selected for their ability to differentiate between BGAs; representative BGA reference populations; and a prediction algorithm (‘classifier’) that predicts the BGA of an unknown individual based on genetic markers in the reference populations. The human genome contains autosomal ancestry informative markers that are easily harvested from publicly accessible collections of genotypes with associated ancestry information. A number of classification methods are available including Bayesian approaches and distance-based algorithms. BGA is likely to be continuous rather than discrete and some methods are inappropriate for the prediction of admixed BGA. As predictive services become available to the public and private sectors, there is a risk of results being misinterpreted if an inappropriate tool is applied. Understanding the underlying marker sets, reference populations and classification algorithms is required to prevent ill-informed predictions.

Original languageEnglish
Pages (from-to)651-658
Number of pages8
JournalAustralian Journal of Forensic Sciences
Volume50
Issue number6
Early online date12 Jan 2018
DOIs
Publication statusPublished - 2 Nov 2018

Fingerprint

Nucleic Acid Databases
Genetic Markers
DNA
Population
Aptitude
Bayes Theorem
Private Sector
Public Sector
Human Genome
Crime
Intelligence
Genotype

Cite this

Cheung, Elaine Y.Y. ; Gahan, Michelle Elizabeth ; McNevin, Dennis. / Predictive DNA analysis for biogeographical ancestry. In: Australian Journal of Forensic Sciences. 2018 ; Vol. 50, No. 6. pp. 651-658.
@article{6508574067b243799b428963a9e73b31,
title = "Predictive DNA analysis for biogeographical ancestry",
abstract = "Establishment of national DNA databases in Australia and overseas has increased the number of criminal convictions, yet a high volume of serious crime cases remain with no suspect profile nor any DNA database matches. In these circumstances prediction of biogeographical ancestry (BGA) and externally visible characteristics can assist by providing forensic intelligence in conjunction with, or in place of, eyewitness testimonies. To predict the BGA of an individual requires: genetic markers selected for their ability to differentiate between BGAs; representative BGA reference populations; and a prediction algorithm (‘classifier’) that predicts the BGA of an unknown individual based on genetic markers in the reference populations. The human genome contains autosomal ancestry informative markers that are easily harvested from publicly accessible collections of genotypes with associated ancestry information. A number of classification methods are available including Bayesian approaches and distance-based algorithms. BGA is likely to be continuous rather than discrete and some methods are inappropriate for the prediction of admixed BGA. As predictive services become available to the public and private sectors, there is a risk of results being misinterpreted if an inappropriate tool is applied. Understanding the underlying marker sets, reference populations and classification algorithms is required to prevent ill-informed predictions.",
keywords = "ancestry informative markers, Biogeographical ancestry, classification, genetic distance, multidimensional scaling, STRUCTURE",
author = "Cheung, {Elaine Y.Y.} and Gahan, {Michelle Elizabeth} and Dennis McNevin",
year = "2018",
month = "11",
day = "2",
doi = "10.1080/00450618.2017.1422021",
language = "English",
volume = "50",
pages = "651--658",
journal = "Australian Journal of Forensic Sciences",
issn = "0045-0618",
publisher = "Taylor and Francis Ltd.",
number = "6",

}

Predictive DNA analysis for biogeographical ancestry. / Cheung, Elaine Y.Y.; Gahan, Michelle Elizabeth; McNevin, Dennis.

In: Australian Journal of Forensic Sciences, Vol. 50, No. 6, 02.11.2018, p. 651-658.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Predictive DNA analysis for biogeographical ancestry

AU - Cheung, Elaine Y.Y.

AU - Gahan, Michelle Elizabeth

AU - McNevin, Dennis

PY - 2018/11/2

Y1 - 2018/11/2

N2 - Establishment of national DNA databases in Australia and overseas has increased the number of criminal convictions, yet a high volume of serious crime cases remain with no suspect profile nor any DNA database matches. In these circumstances prediction of biogeographical ancestry (BGA) and externally visible characteristics can assist by providing forensic intelligence in conjunction with, or in place of, eyewitness testimonies. To predict the BGA of an individual requires: genetic markers selected for their ability to differentiate between BGAs; representative BGA reference populations; and a prediction algorithm (‘classifier’) that predicts the BGA of an unknown individual based on genetic markers in the reference populations. The human genome contains autosomal ancestry informative markers that are easily harvested from publicly accessible collections of genotypes with associated ancestry information. A number of classification methods are available including Bayesian approaches and distance-based algorithms. BGA is likely to be continuous rather than discrete and some methods are inappropriate for the prediction of admixed BGA. As predictive services become available to the public and private sectors, there is a risk of results being misinterpreted if an inappropriate tool is applied. Understanding the underlying marker sets, reference populations and classification algorithms is required to prevent ill-informed predictions.

AB - Establishment of national DNA databases in Australia and overseas has increased the number of criminal convictions, yet a high volume of serious crime cases remain with no suspect profile nor any DNA database matches. In these circumstances prediction of biogeographical ancestry (BGA) and externally visible characteristics can assist by providing forensic intelligence in conjunction with, or in place of, eyewitness testimonies. To predict the BGA of an individual requires: genetic markers selected for their ability to differentiate between BGAs; representative BGA reference populations; and a prediction algorithm (‘classifier’) that predicts the BGA of an unknown individual based on genetic markers in the reference populations. The human genome contains autosomal ancestry informative markers that are easily harvested from publicly accessible collections of genotypes with associated ancestry information. A number of classification methods are available including Bayesian approaches and distance-based algorithms. BGA is likely to be continuous rather than discrete and some methods are inappropriate for the prediction of admixed BGA. As predictive services become available to the public and private sectors, there is a risk of results being misinterpreted if an inappropriate tool is applied. Understanding the underlying marker sets, reference populations and classification algorithms is required to prevent ill-informed predictions.

KW - ancestry informative markers

KW - Biogeographical ancestry

KW - classification

KW - genetic distance

KW - multidimensional scaling

KW - STRUCTURE

UR - http://www.scopus.com/inward/record.url?scp=85040998035&partnerID=8YFLogxK

UR - http://www.mendeley.com/research/predictive-dna-analysis-biogeographical-ancestry

U2 - 10.1080/00450618.2017.1422021

DO - 10.1080/00450618.2017.1422021

M3 - Article

VL - 50

SP - 651

EP - 658

JO - Australian Journal of Forensic Sciences

JF - Australian Journal of Forensic Sciences

SN - 0045-0618

IS - 6

ER -