Ontology type: schema:ScholarlyArticle Open Access: True
2017-01-25
AUTHORSKelly M. Robinson, Jonathan Crabtree, John S. A. Mattick, Kathleen E. Anderson, Julie C. Dunning Hotopp
ABSTRACTBackgroundA variety of bacteria are known to influence carcinogenesis. Therefore, we sought to investigate if publicly available whole genome and whole transcriptome sequencing data generated by large public cancer genome efforts, like The Cancer Genome Atlas (TCGA), could be used to identify bacteria associated with cancer. The Burrows-Wheeler aligner (BWA) was used to align a subset of Illumina paired-end sequencing data from TCGA to the human reference genome and all complete bacterial genomes in the RefSeq database in an effort to identify bacterial read pairs from the microbiome.ResultsThrough careful consideration of all of the bacterial taxa present in the cancer types investigated, their relative abundance, and batch effects, we were able to identify some read pairs from certain taxa as likely resulting from contamination. In particular, the presence of Mycobacterium tuberculosis complex in the ovarian serous cystadenocarcinoma (OV) and glioblastoma multiforme (GBM) samples was correlated with the sequencing center of the samples. Additionally, there was a correlation between the presence of Ralstonia spp. and two specific plates of acute myeloid leukemia (AML) samples. At the end, associations remained between Pseudomonas-like and Acinetobacter-like read pairs in AML, and Pseudomonas-like read pairs in stomach adenocarcinoma (STAD) that could not be explained through batch effects or systematic contamination as seen in other samples.ConclusionsThis approach suggests that it is possible to identify bacteria that may be present in human tumor samples from public genome sequencing data that can be examined further experimentally. More weight should be given to this approach in the future when bacterial associations with diseases are suspected. More... »
PAGES9
http://scigraph.springernature.com/pub.10.1186/s40168-016-0224-8
DOIhttp://dx.doi.org/10.1186/s40168-016-0224-8
DIMENSIONShttps://app.dimensions.ai/details/publication/pub.1074206649
PUBMEDhttps://www.ncbi.nlm.nih.gov/pubmed/28118849
JSON-LD is the canonical representation for SciGraph data.
TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT
[
{
"@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json",
"about": [
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/11",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Medical and Health Sciences",
"type": "DefinedTerm"
},
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/1112",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Oncology and Carcinogenesis",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Acinetobacter",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Bacteria",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Base Sequence",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Carcinoma",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Carcinoma, Ovarian Epithelial",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Chromosome Mapping",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Cystadenocarcinoma, Serous",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Databases, Genetic",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Genome, Bacterial",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Genome, Human",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Glioblastoma",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "High-Throughput Nucleotide Sequencing",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Humans",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Leukemia, Myeloid, Acute",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Microbiota",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Mycobacterium tuberculosis",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Neoplasms, Glandular and Epithelial",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Ovarian Neoplasms",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Pseudomonas",
"type": "DefinedTerm"
}
],
"author": [
{
"affiliation": {
"alternateName": "Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, USA",
"id": "http://www.grid.ac/institutes/grid.411024.2",
"name": [
"Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, USA"
],
"type": "Organization"
},
"familyName": "Robinson",
"givenName": "Kelly M.",
"id": "sg:person.01215361553.27",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01215361553.27"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, USA",
"id": "http://www.grid.ac/institutes/grid.411024.2",
"name": [
"Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, USA"
],
"type": "Organization"
},
"familyName": "Crabtree",
"givenName": "Jonathan",
"id": "sg:person.01064750006.82",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01064750006.82"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, USA",
"id": "http://www.grid.ac/institutes/grid.411024.2",
"name": [
"Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, USA"
],
"type": "Organization"
},
"familyName": "Mattick",
"givenName": "John S. A.",
"id": "sg:person.016454136004.09",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.016454136004.09"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, USA",
"id": "http://www.grid.ac/institutes/grid.411024.2",
"name": [
"Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, USA"
],
"type": "Organization"
},
"familyName": "Anderson",
"givenName": "Kathleen E.",
"id": "sg:person.013073434143.70",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.013073434143.70"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Greenebaum Cancer Center, University of Maryland School of Medicine, Baltimore, MD, USA",
"id": "http://www.grid.ac/institutes/grid.411024.2",
"name": [
"Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, USA",
"Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD, USA",
"Greenebaum Cancer Center, University of Maryland School of Medicine, Baltimore, MD, USA"
],
"type": "Organization"
},
"familyName": "Dunning Hotopp",
"givenName": "Julie C.",
"id": "sg:person.01322263734.83",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01322263734.83"
],
"type": "Person"
}
],
"citation": [
{
"id": "sg:pub.10.1038/nrc3610",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1038185390",
"https://doi.org/10.1038/nrc3610"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nrc1433",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1052782069",
"https://doi.org/10.1038/nrc1433"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-12-385",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1037780208",
"https://doi.org/10.1186/1471-2105-12-385"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nrc703",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1029485597",
"https://doi.org/10.1038/nrc703"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nature13480",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1035707124",
"https://doi.org/10.1038/nature13480"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nbt.1868",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1046017009",
"https://doi.org/10.1038/nbt.1868"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/ncomms3513",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1036430565",
"https://doi.org/10.1038/ncomms3513"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-11-s9-s6",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1039890050",
"https://doi.org/10.1186/1471-2105-11-s9-s6"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/bjc.2015.465",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1029382454",
"https://doi.org/10.1038/bjc.2015.465"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/s12915-014-0087-z",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1027737035",
"https://doi.org/10.1186/s12915-014-0087-z"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1741-7015-11-236",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1018718292",
"https://doi.org/10.1186/1741-7015-11-236"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2164-15-262",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1023921293",
"https://doi.org/10.1186/1471-2164-15-262"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1111/j.1572-0241.2000.01860.x",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1006204926",
"https://doi.org/10.1111/j.1572-0241.2000.01860.x"
],
"type": "CreativeWork"
}
],
"datePublished": "2017-01-25",
"datePublishedReg": "2017-01-25",
"description": "BackgroundA variety of bacteria are known to influence carcinogenesis. Therefore, we sought to investigate if publicly available whole genome and whole transcriptome sequencing data generated by large public cancer genome efforts, like The Cancer Genome Atlas (TCGA), could be used to identify bacteria associated with cancer. The Burrows-Wheeler aligner (BWA) was used to align a subset of Illumina paired-end sequencing data from TCGA to the human reference genome and all complete bacterial genomes in the RefSeq database in an effort to identify bacterial read pairs from the microbiome.ResultsThrough careful consideration of all of the bacterial taxa present in the cancer types investigated, their relative abundance, and batch effects, we were able to identify some read pairs from certain taxa as likely resulting from contamination. In particular, the presence of Mycobacterium tuberculosis complex in the ovarian serous cystadenocarcinoma (OV) and glioblastoma multiforme (GBM) samples was correlated with the sequencing center of the samples. Additionally, there was a correlation between the presence of Ralstonia spp. and two specific plates of acute myeloid leukemia (AML) samples. At the end, associations remained between Pseudomonas-like and Acinetobacter-like read pairs in AML, and Pseudomonas-like read pairs in stomach adenocarcinoma (STAD) that could not be explained through batch effects or systematic contamination as seen in other samples.ConclusionsThis approach suggests that it is possible to identify bacteria that may be present in human tumor samples from public genome sequencing data that can be examined further experimentally. More weight should be given to this approach in the future when bacterial associations with diseases are suspected.",
"genre": "article",
"id": "sg:pub.10.1186/s40168-016-0224-8",
"inLanguage": "en",
"isAccessibleForFree": true,
"isFundedItemOf": [
{
"id": "sg:grant.3110393",
"type": "MonetaryGrant"
},
{
"id": "sg:grant.2355386",
"type": "MonetaryGrant"
},
{
"id": "sg:grant.4455956",
"type": "MonetaryGrant"
}
],
"isPartOf": [
{
"id": "sg:journal.1048878",
"issn": [
"2049-2618"
],
"name": "Microbiome",
"publisher": "Springer Nature",
"type": "Periodical"
},
{
"issueNumber": "1",
"type": "PublicationIssue"
},
{
"type": "PublicationVolume",
"volumeNumber": "5"
}
],
"keywords": [
"ovarian serous cystadenocarcinoma",
"stomach adenocarcinoma",
"acute myeloid leukemia samples",
"human tumor samples",
"Cancer Genome Atlas",
"Mycobacterium tuberculosis complex",
"serous cystadenocarcinoma",
"secondary data analysis",
"tumor samples",
"cancer types",
"glioblastoma multiforme samples",
"BackgroundA variety",
"leukemia samples",
"Genome Atlas",
"tuberculosis complex",
"association",
"more weight",
"Burrows-Wheeler Aligner",
"ConclusionsThis approach",
"cystadenocarcinoma",
"adenocarcinoma",
"sequencing data",
"AML",
"cancer",
"careful consideration",
"transcriptome sequencing data",
"disease",
"carcinogenesis",
"TCGA",
"whole transcriptome sequencing data",
"bacterial associations",
"bacterial taxa",
"Ralstonia spp",
"bacteria",
"microbiome",
"genome sequencing data",
"effect",
"samples",
"data",
"Acinetobacter",
"presence",
"Pseudomonas",
"subset",
"data analysis",
"atlas",
"database",
"weight",
"center",
"correlation",
"spp",
"batch effects",
"whole genome",
"available whole genome",
"efforts",
"contamination",
"specific plate",
"relative abundance",
"types",
"end",
"genome",
"genome efforts",
"analysis",
"variety",
"human reference genome",
"approach",
"consideration",
"future",
"aligners",
"genome sequence data",
"pairs",
"sequencing centers",
"RefSeq database",
"complexes",
"plate",
"abundance",
"bacterial genomes",
"sequence data",
"certain taxa",
"reference genome",
"systematic contamination",
"complete bacterial genomes",
"Illumina paired-end sequencing data",
"taxa",
"paired-end sequencing data"
],
"name": "Distinguishing potential bacteria-tumor associations from contamination in a secondary data analysis of public cancer genome sequence data",
"pagination": "9",
"productId": [
{
"name": "dimensions_id",
"type": "PropertyValue",
"value": [
"pub.1074206649"
]
},
{
"name": "doi",
"type": "PropertyValue",
"value": [
"10.1186/s40168-016-0224-8"
]
},
{
"name": "pubmed_id",
"type": "PropertyValue",
"value": [
"28118849"
]
}
],
"sameAs": [
"https://doi.org/10.1186/s40168-016-0224-8",
"https://app.dimensions.ai/details/publication/pub.1074206649"
],
"sdDataset": "articles",
"sdDatePublished": "2022-06-01T22:15",
"sdLicense": "https://scigraph.springernature.com/explorer/license/",
"sdPublisher": {
"name": "Springer Nature - SN SciGraph project",
"type": "Organization"
},
"sdSource": "s3://com-springernature-scigraph/baseset/20220601/entities/gbq_results/article/article_728.jsonl",
"type": "ScholarlyArticle",
"url": "https://doi.org/10.1186/s40168-016-0224-8"
}
]
Download the RDF metadata as: json-ld nt turtle xml License info
JSON-LD is a popular format for linked data which is fully compatible with JSON.
curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1186/s40168-016-0224-8'
N-Triples is a line-based linked data format ideal for batch operations.
curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1186/s40168-016-0224-8'
Turtle is a human-readable linked data format.
curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1186/s40168-016-0224-8'
RDF/XML is a standard XML format for linked data.
curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1186/s40168-016-0224-8'
This table displays all metadata directly associated to this object as RDF triples.
310 TRIPLES
22 PREDICATES
142 URIs
121 LITERALS
26 BLANK NODES