Ontology type: schema:ScholarlyArticle
Open Access: True
2008-12-16
AUTHORS: Jukka Corander, Pekka Marttinen, Jukka Sirén, Jing Tang
ABSTRACT
Background: During the most recent decade many Bayesian statistical models and software for answering questions related to the genetic structure underlying population samples have appeared in the scientific literature. Most of these methods utilize molecular markers for the inferences, while some are also capable of handling DNA sequence data. In a number of earlier works, we have introduced an array of statistical methods for population genetic inference that are implemented in the software BAPS. However, the complexity of biological problems related to genetic structure analysis keeps increasing such that in many cases the current methods may provide either inappropriate or insufficient solutions.
Results: We discuss the necessity of enhancing the statistical approaches to face the challenges posed by the ever-increasing amounts of molecular data generated by scientists over a wide range of research areas and introduce an array of new statistical tools implemented in the most recent version of BAPS. With these methods it is possible, e.g., to fit genetic mixture models using user-specified numbers of clusters and to estimate levels of admixture under a genetic linkage model. Also, alleles representing a different ancestry compared to the average observed genomic positions can be tracked for the sampled individuals, and a priori specified hypotheses about genetic population structure can be directly compared using Bayes' theorem. In general, we have improved further the computational characteristics of the algorithms behind the methods implemented in BAPS facilitating the analyses of large and complex datasets. In particular, analysis of a single dataset can now be spread over multiple computers using a script interface to the software.
Conclusion: The Bayesian modelling methods introduced in this article represent an array of enhanced tools for learning the genetic structure of populations. Their implementations in the BAPS software are designed to meet the increasing need for analyzing large-scale population genetics data. The software is freely downloadable for Windows, Linux and Mac OS X systems at http://web.abo.fi/fak/mnf//mate/jc/software/baps.html.
PAGES: 539
http://scigraph.springernature.com/pub.10.1186/1471-2105-9-539
DOI: http://dx.doi.org/10.1186/1471-2105-9-539
DIMENSIONS: https://app.dimensions.ai/details/publication/pub.1049532772
PUBMED: https://www.ncbi.nlm.nih.gov/pubmed/19087322
JSON-LD is the canonical representation for SciGraph data.
[
{
"@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json",
"about": [
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/01",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Mathematical Sciences",
"type": "DefinedTerm"
},
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/06",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Biological Sciences",
"type": "DefinedTerm"
},
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/08",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Information and Computing Sciences",
"type": "DefinedTerm"
},
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0104",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Statistics",
"type": "DefinedTerm"
},
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0604",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Genetics",
"type": "DefinedTerm"
},
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0801",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Artificial Intelligence and Image Processing",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Algorithms",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Alleles",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Bayes Theorem",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Cluster Analysis",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Computational Biology",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Databases, Genetic",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Genetic Linkage",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Genetic Structures",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Genetics, Population",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Humans",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Models, Genetic",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Population",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Sequence Analysis, DNA",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Software",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Stochastic Processes",
"type": "DefinedTerm"
}
],
"author": [
{
"affiliation": {
"alternateName": "Department of Mathematics, F\u00e4nriksgatan 3B, \u00c5bo Akademi University, Fin-20500, \u00c5bo, Finland",
"id": "http://www.grid.ac/institutes/grid.13797.3b",
"name": [
"Department of Mathematics, F\u00e4nriksgatan 3B, \u00c5bo Akademi University, Fin-20500, \u00c5bo, Finland"
],
"type": "Organization"
},
"familyName": "Corander",
"givenName": "Jukka",
"id": "sg:person.01125514227.61",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01125514227.61"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Department of Mathematics and Statistics, University of Helsinki, P.O. Box 68, Fin-00014, Finland",
"id": "http://www.grid.ac/institutes/grid.7737.4",
"name": [
"Department of Mathematics and Statistics, University of Helsinki, P.O. Box 68, Fin-00014, Finland"
],
"type": "Organization"
},
"familyName": "Marttinen",
"givenName": "Pekka",
"id": "sg:person.0753733617.28",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0753733617.28"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Department of Mathematics and Statistics, University of Helsinki, P.O. Box 68, Fin-00014, Finland",
"id": "http://www.grid.ac/institutes/grid.7737.4",
"name": [
"Department of Mathematics and Statistics, University of Helsinki, P.O. Box 68, Fin-00014, Finland"
],
"type": "Organization"
},
"familyName": "Sir\u00e9n",
"givenName": "Jukka",
"id": "sg:person.0752104153.37",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0752104153.37"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Department of Mathematics and Statistics, University of Helsinki, P.O. Box 68, Fin-00014, Finland",
"id": "http://www.grid.ac/institutes/grid.7737.4",
"name": [
"Department of Mathematics and Statistics, University of Helsinki, P.O. Box 68, Fin-00014, Finland"
],
"type": "Organization"
},
"familyName": "Tang",
"givenName": "Jing",
"id": "sg:person.01364107431.03",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01364107431.03"
],
"type": "Person"
}
],
"citation": [
{
"id": "sg:pub.10.1007/s11538-006-9161-1",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1051348821",
"https://doi.org/10.1007/s11538-006-9161-1"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/s10592-005-9098-1",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1049775591",
"https://doi.org/10.1007/s10592-005-9098-1"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/s11222-006-9391-y",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1042853180",
"https://doi.org/10.1007/s11222-006-9391-y"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nrg1318",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1045116249",
"https://doi.org/10.1038/nrg1318"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nrg1904",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1008780244",
"https://doi.org/10.1038/nrg1904"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-9-421",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1008209350",
"https://doi.org/10.1186/1471-2105-9-421"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/s00180-007-0072-x",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1016990478",
"https://doi.org/10.1007/s00180-007-0072-x"
],
"type": "CreativeWork"
}
],
"datePublished": "2008-12-16",
"datePublishedReg": "2008-12-16",
"description": "BackgroundDuring the most recent decade many Bayesian statistical models and software for answering questions related to the genetic structure underlying population samples have appeared in the scientific literature. Most of these methods utilize molecular markers for the inferences, while some are also capable of handling DNA sequence data. In a number of earlier works, we have introduced an array of statistical methods for population genetic inference that are implemented in the software BAPS. However, the complexity of biological problems related to genetic structure analysis keeps increasing such that in many cases the current methods may provide either inappropriate or insufficient solutions.ResultsWe discuss the necessity of enhancing the statistical approaches to face the challenges posed by the ever-increasing amounts of molecular data generated by scientists over a wide range of research areas and introduce an array of new statistical tools implemented in the most recent version of BAPS. With these methods it is possible, e.g., to fit genetic mixture models using user-specified numbers of clusters and to estimate levels of admixture under a genetic linkage model. Also, alleles representing a different ancestry compared to the average observed genomic positions can be tracked for the sampled individuals, and a priori specified hypotheses about genetic population structure can be directly compared using Bayes' theorem. In general, we have improved further the computational characteristics of the algorithms behind the methods implemented in BAPS facilitating the analyses of large and complex datasets. In particular, analysis of a single dataset can now be spread over multiple computers using a script interface to the software.ConclusionThe Bayesian modelling methods introduced in this article represent an array of enhanced tools for learning the genetic structure of populations.\nTheir implementations in the BAPS software are designed to meet the increasing need for analyzing large-scale population genetics data. The software is freely downloadable for Windows, Linux and Mac OS X systems at http://web.abo.fi/fak/mnf//mate/jc/software/baps.html.",
"genre": "article",
"id": "sg:pub.10.1186/1471-2105-9-539",
"inLanguage": "en",
"isAccessibleForFree": true,
"isFundedItemOf": [
{
"id": "sg:grant.4247660",
"type": "MonetaryGrant"
}
],
"isPartOf": [
{
"id": "sg:journal.1023786",
"issn": [
"1471-2105"
],
"name": "BMC Bioinformatics",
"publisher": "Springer Nature",
"type": "Periodical"
},
{
"issueNumber": "1",
"type": "PublicationIssue"
},
{
"type": "PublicationVolume",
"volumeNumber": "9"
}
],
"keywords": [
"genetic structure",
"BAPS software",
"genetic linkage model",
"Bayesian modelling methods",
"Bayesian statistical model",
"new statistical tools",
"genetic mixture models",
"genetic population structure",
"genetic structure analysis",
"DNA sequence data",
"population genetic inference",
"levels of admixture",
"population genetic data",
"statistical model",
"genomic positions",
"Bayesian modelling",
"genetic inferences",
"statistical methods",
"molecular data",
"population structure",
"statistical approach",
"sequence data",
"statistical tools",
"computational characteristics",
"genetic data",
"mixture model",
"molecular markers",
"Bayes' theorem",
"sampled individuals",
"modelling method",
"theorem",
"user-specified number",
"different ancestries",
"biological problems",
"Mac OS X systems",
"complex datasets",
"inference",
"earlier work",
"structure analysis",
"BAPS",
"model",
"multiple computers",
"recent version",
"current methods",
"alleles",
"ancestry",
"linkage model",
"algorithm",
"wide range",
"population",
"X system",
"modelling",
"research area",
"array",
"structure",
"problem",
"BAP",
"solution",
"enhanced tools",
"population sample",
"single dataset",
"complexity",
"software",
"number",
"markers",
"computer",
"version",
"tool",
"recent decades",
"analysis",
"dataset",
"hypothesis",
"approach",
"system",
"clusters",
"admixture",
"data",
"implementation",
"levels",
"cases",
"work",
"individuals",
"position",
"insufficient solution",
"scientific literature",
"range",
"ResultsWe",
"amount",
"decades",
"window",
"scientists",
"interface",
"literature",
"characteristics",
"area",
"necessity",
"article",
"questions",
"Linux",
"samples",
"method",
"challenges",
"need"
],
"name": "Enhanced Bayesian modelling in BAPS software for learning genetic structures of populations",
"pagination": "539",
"productId": [
{
"name": "dimensions_id",
"type": "PropertyValue",
"value": [
"pub.1049532772"
]
},
{
"name": "doi",
"type": "PropertyValue",
"value": [
"10.1186/1471-2105-9-539"
]
},
{
"name": "pubmed_id",
"type": "PropertyValue",
"value": [
"19087322"
]
}
],
"sameAs": [
"https://doi.org/10.1186/1471-2105-9-539",
"https://app.dimensions.ai/details/publication/pub.1049532772"
],
"sdDataset": "articles",
"sdDatePublished": "2022-05-20T07:24",
"sdLicense": "https://scigraph.springernature.com/explorer/license/",
"sdPublisher": {
"name": "Springer Nature - SN SciGraph project",
"type": "Organization"
},
"sdSource": "s3://com-springernature-scigraph/baseset/20220519/entities/gbq_results/article/article_463.jsonl",
"type": "ScholarlyArticle",
"url": "https://doi.org/10.1186/1471-2105-9-539"
}
]
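The record above is an ordinary JSON array holding one article object, so it can be read with any JSON parser. The following is a minimal sketch of pulling out the title, the author names, and the identifiers listed under "productId". The embedded sample is a hand-trimmed excerpt of the record (most fields omitted), and the function name summarize is hypothetical, not part of SciGraph.

```python
import json

# Hand-trimmed excerpt of the record shown above; most fields are omitted.
RECORD_JSON = r"""
[
  {
    "author": [
      {"familyName": "Corander", "givenName": "Jukka", "type": "Person"},
      {"familyName": "Marttinen", "givenName": "Pekka", "type": "Person"},
      {"familyName": "Sir\u00e9n", "givenName": "Jukka", "type": "Person"},
      {"familyName": "Tang", "givenName": "Jing", "type": "Person"}
    ],
    "name": "Enhanced Bayesian modelling in BAPS software for learning genetic structures of populations",
    "productId": [
      {"name": "dimensions_id", "type": "PropertyValue", "value": ["pub.1049532772"]},
      {"name": "doi", "type": "PropertyValue", "value": ["10.1186/1471-2105-9-539"]},
      {"name": "pubmed_id", "type": "PropertyValue", "value": ["19087322"]}
    ],
    "type": "ScholarlyArticle"
  }
]
"""


def summarize(record_json):
    """Return title, author names, and identifiers from a record shaped like the one above."""
    records = json.loads(record_json)
    article = records[0]  # the page shows a one-element array
    authors = [
        f"{person['givenName']} {person['familyName']}"
        for person in article.get("author", [])
    ]
    # Each productId entry carries a name and a one-element list of values.
    ids = {entry["name"]: entry["value"][0] for entry in article.get("productId", [])}
    return {"title": article["name"], "authors": authors, "ids": ids}


summary = summarize(RECORD_JSON)
print(summary["title"])
print(", ".join(summary["authors"]))
print(summary["ids"]["doi"])
```

The sketch treats the record as plain JSON and does not resolve the "@context"; a JSON-LD processor would be needed to expand the keys into full RDF terms.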
JSON-LD is a popular format for linked data which is fully compatible with JSON.
curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-9-539'
N-Triples is a line-based linked data format ideal for batch operations.
curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-9-539'
Turtle is a human-readable linked data format.
curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-9-539'
RDF/XML is a standard XML format for linked data.
curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-9-539'
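The four curl commands above differ only in the Accept header, which is how the serialization is selected. As a rough Python equivalent, the sketch below builds the same requests with the standard library without sending them. The short format keys and the helper name build_request are assumptions made for illustration; the URL and media types are the ones listed above.

```python
from urllib.request import Request

RECORD_URL = "https://scigraph.springernature.com/pub.10.1186/1471-2105-9-539"

# Media types taken from the curl commands above; the short keys are illustrative.
ACCEPT_TYPES = {
    "json-ld": "application/ld+json",
    "n-triples": "application/n-triples",
    "turtle": "text/turtle",
    "rdf-xml": "application/rdf+xml",
}


def build_request(fmt, url=RECORD_URL):
    """Build (but do not send) a GET request asking for one serialization."""
    return Request(url, headers={"Accept": ACCEPT_TYPES[fmt]})


for fmt in ACCEPT_TYPES:
    req = build_request(fmt)
    print(fmt, "->", req.get_header("Accept"))
```

Passing the result to urllib.request.urlopen would perform the download; whether the endpoint still answers is not something this sketch checks.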
This table displays all metadata directly associated with this object as RDF triples.
294 TRIPLES
22 PREDICATES
154 URIs
135 LITERALS
22 BLANK NODES
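Counts of this kind can be approximated from an N-Triples download, since that format puts one triple on each line. The sketch below is an assumption-laden illustration: the regular expression handles only simple N-Triples lines, the sample triples are invented (example.org), and how the page itself counts URIs and literals (distinct values or occurrences) is not stated, so distinct values are used here.

```python
import re

# One RDF term: <IRI>, _:blank, or "literal" with optional datatype or language tag.
TERM = r'(<[^>]*>|_:[A-Za-z0-9]+|"(?:[^"\\]|\\.)*"(?:\^\^<[^>]*>|@[A-Za-z0-9-]+)?)'
TRIPLE = re.compile(rf"^\s*{TERM}\s+{TERM}\s+{TERM}\s*\.\s*$")

# Invented sample data, not taken from the SciGraph record.
SAMPLE = """
<https://example.org/article> <http://schema.org/name> "Example title" .
<https://example.org/article> <http://schema.org/author> _:b0 .
_:b0 <http://schema.org/familyName> "Corander" .
"""


def tally(ntriples_text):
    """Count triples and distinct predicates, URIs, literals, and blank nodes."""
    triples = 0
    predicates, uris, literals, blanks = set(), set(), set(), set()
    for line in ntriples_text.splitlines():
        if not line.strip() or line.lstrip().startswith("#"):
            continue  # skip empty lines and comments
        match = TRIPLE.match(line)
        if match is None:
            raise ValueError(f"unsupported N-Triples line: {line!r}")
        triples += 1
        subject, predicate, obj = match.groups()
        predicates.add(predicate)
        for term in (subject, predicate, obj):
            if term.startswith("<"):
                uris.add(term)
            elif term.startswith("_:"):
                blanks.add(term)
            else:
                literals.add(term)
    return {
        "triples": triples,
        "predicates": len(predicates),
        "uris": len(uris),
        "literals": len(literals),
        "blank_nodes": len(blanks),
    }


print(tally(SAMPLE))
```

For real data, a full RDF parser is a safer choice than a regular expression, since N-Triples allows escapes and blank-node labels that this pattern does not cover.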