Ontology type: schema:ScholarlyArticle
Open Access: True
2008-12
AUTHORS: Zhenqiu Liu, Ronald B Gartenhaus, Ming Tan, Feng Jiang, Xiaoli Jiao
ABSTRACT
BACKGROUND: Identifying genes and pathways associated with diseases such as cancer has been a subject of considerable research in recent years in the area of bioinformatics and computational biology. It has been demonstrated that the magnitude of differential expression does not necessarily indicate biological significance. Even a very small change in the expression of particular gene may have dramatic physiological consequences if the protein encoded by this gene plays a catalytic role in a specific cell function. Moreover, highly correlated genes may function together on the same pathway biologically. Finally, in sparse logistic regression with Lp (p < 1) penalty, the degree of the sparsity obtained is determined by the value of the regularization parameter. Usually this parameter must be carefully tuned through cross-validation, which is time consuming.
RESULTS: In this paper, we proposed a simple Bayesian approach to integrate the regularization parameter out analytically using a new prior. Therefore, there is no longer a need for parameter selection, as it is eliminated entirely from the model. The proposed algorithm (BLpLog) is typically two or three orders of magnitude faster than the original algorithm and free from bias in performance estimation. We also define a novel similarity measure and develop an integrated algorithm to hunt the regulatory genes with low expression changes but having high correlation with the selected genes. Pathways of those correlated genes were identified with DAVID http://david.abcc.ncifcrf.gov/.
CONCLUSION: Experimental results with gene expression data demonstrate that the proposed methods can be utilized to identify important genes and pathways that are related to cancer and build a parsimonious model for future patient predictions.
PAGES: 412
http://scigraph.springernature.com/pub.10.1186/1471-2105-9-412
DOI: http://dx.doi.org/10.1186/1471-2105-9-412
DIMENSIONS: https://app.dimensions.ai/details/publication/pub.1035056507
PUBMED: https://www.ncbi.nlm.nih.gov/pubmed/18834526
JSON-LD is the canonical representation for SciGraph data.
[
{
"@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json",
"about": [
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0604",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Genetics",
"type": "DefinedTerm"
},
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/06",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Biological Sciences",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Algorithms",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Bayes Theorem",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Computer Simulation",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Gene Expression Profiling",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Logistic Models",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Models, Biological",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Proteome",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Regression Analysis",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Signal Transduction",
"type": "DefinedTerm"
}
],
"author": [
{
"affiliation": {
"name": [
"Division of Biostatistics, University of Maryland Greenebaum Cancer Center, 22 South Greene Street, 21201, Baltimore, MD, USA"
],
"type": "Organization"
},
"familyName": "Liu",
"givenName": "Zhenqiu",
"id": "sg:person.010555727257.30",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.010555727257.30"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "University of Maryland, Baltimore",
"id": "https://www.grid.ac/institutes/grid.411024.2",
"name": [
"Department of Medicine and Greenebaum Cancer Center, The University of Maryland School of Medicine, 21201, Baltimore, MD, USA"
],
"type": "Organization"
},
"familyName": "Gartenhaus",
"givenName": "Ronald B",
"id": "sg:person.012130702547.18",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.012130702547.18"
],
"type": "Person"
},
{
"affiliation": {
"name": [
"Division of Biostatistics, University of Maryland Greenebaum Cancer Center, 22 South Greene Street, 21201, Baltimore, MD, USA"
],
"type": "Organization"
},
"familyName": "Tan",
"givenName": "Ming",
"id": "sg:person.01007114400.76",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01007114400.76"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "University of Maryland, Baltimore",
"id": "https://www.grid.ac/institutes/grid.411024.2",
"name": [
"Department of Pathology, The University of Maryland School of Medicine, 21201, Baltimore, MD, USA"
],
"type": "Organization"
},
"familyName": "Jiang",
"givenName": "Feng",
"id": "sg:person.01160026175.40",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01160026175.40"
],
"type": "Person"
},
{
"affiliation": {
"name": [
"Division of Biostatistics, University of Maryland Greenebaum Cancer Center, 22 South Greene Street, 21201, Baltimore, MD, USA"
],
"type": "Organization"
},
"familyName": "Jiao",
"givenName": "Xiaoli",
"type": "Person"
}
],
"citation": [
{
"id": "sg:pub.10.1038/415484a",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1000172143",
"https://doi.org/10.1038/415484a"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1162/neco.1995.7.1.117",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1010218959"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1162/089976699300016331",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1011160299"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.2202/1544-6115.1189",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1018113049"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-8-35",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1019129425",
"https://doi.org/10.1186/1471-2105-8-35"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/bioinformatics/18.10.1332",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1026055739"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/bioinformatics/btg308",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1028669560"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1016/s0140-6736(03)12775-4",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1033430834"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.2202/1544-6115.1248",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1033452644"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/bioinformatics/bti736",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1036795811"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1073/pnas.0506580102",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1037705714"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1214/009053604000000067",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1038945634"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/gb-2002-3-4-research0017",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1039756082",
"https://doi.org/10.1186/gb-2002-3-4-research0017"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/bioinformatics/btl386",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1039998987"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/415530a",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1043001094",
"https://doi.org/10.1038/415530a"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1089/106652703322756177",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1059204991"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1214/009053607000000127",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1064389040"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1214/aos/1015957397",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1064405888"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1152/physiolgenomics.2001.5.2.99",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1074780766"
],
"type": "CreativeWork"
},
{
"id": "https://app.dimensions.ai/details/publication/pub.1075262336",
"type": "CreativeWork"
}
],
"datePublished": "2008-12",
"datePublishedReg": "2008-12-01",
"description": "BACKGROUND: Identifying genes and pathways associated with diseases such as cancer has been a subject of considerable research in recent years in the area of bioinformatics and computational biology. It has been demonstrated that the magnitude of differential expression does not necessarily indicate biological significance. Even a very small change in the expression of particular gene may have dramatic physiological consequences if the protein encoded by this gene plays a catalytic role in a specific cell function. Moreover, highly correlated genes may function together on the same pathway biologically. Finally, in sparse logistic regression with Lp (p < 1) penalty, the degree of the sparsity obtained is determined by the value of the regularization parameter. Usually this parameter must be carefully tuned through cross-validation, which is time consuming.\nRESULTS: In this paper, we proposed a simple Bayesian approach to integrate the regularization parameter out analytically using a new prior. Therefore, there is no longer a need for parameter selection, as it is eliminated entirely from the model. The proposed algorithm (BLpLog) is typically two or three orders of magnitude faster than the original algorithm and free from bias in performance estimation. We also define a novel similarity measure and develop an integrated algorithm to hunt the regulatory genes with low expression changes but having high correlation with the selected genes. Pathways of those correlated genes were identified with DAVID http://david.abcc.ncifcrf.gov/.\nCONCLUSION: Experimental results with gene expression data demonstrate that the proposed methods can be utilized to identify important genes and pathways that are related to cancer and build a parsimonious model for future patient predictions.",
"genre": "research_article",
"id": "sg:pub.10.1186/1471-2105-9-412",
"inLanguage": [
"en"
],
"isAccessibleForFree": true,
"isFundedItemOf": [
{
"id": "sg:grant.2569355",
"type": "MonetaryGrant"
}
],
"isPartOf": [
{
"id": "sg:journal.1023786",
"issn": [
"1471-2105"
],
"name": "BMC Bioinformatics",
"type": "Periodical"
},
{
"issueNumber": "1",
"type": "PublicationIssue"
},
{
"type": "PublicationVolume",
"volumeNumber": "9"
}
],
"name": "Gene and pathway identification with Lp penalized Bayesian logistic regression",
"pagination": "412",
"productId": [
{
"name": "readcube_id",
"type": "PropertyValue",
"value": [
"a6fefd6de2668f75f3d9d7db86acad24cdbe7e4101c99ace48650dbf2bb0af5c"
]
},
{
"name": "pubmed_id",
"type": "PropertyValue",
"value": [
"18834526"
]
},
{
"name": "nlm_unique_id",
"type": "PropertyValue",
"value": [
"100965194"
]
},
{
"name": "doi",
"type": "PropertyValue",
"value": [
"10.1186/1471-2105-9-412"
]
},
{
"name": "dimensions_id",
"type": "PropertyValue",
"value": [
"pub.1035056507"
]
}
],
"sameAs": [
"https://doi.org/10.1186/1471-2105-9-412",
"https://app.dimensions.ai/details/publication/pub.1035056507"
],
"sdDataset": "articles",
"sdDatePublished": "2019-04-10T22:30",
"sdLicense": "https://scigraph.springernature.com/explorer/license/",
"sdPublisher": {
"name": "Springer Nature - SN SciGraph project",
"type": "Organization"
},
"sdSource": "s3://com-uberresearch-data-dimensions-target-20181106-alternative/cleanup/v134/2549eaecd7973599484d7c17b260dba0a4ecb94b/merge/v9/a6c9fde33151104705d4d7ff012ea9563521a3ce/jats-lookup/v90/0000000001_0000000264/records_8690_00000506.jsonl",
"type": "ScholarlyArticle",
"url": "http://link.springer.com/10.1186%2F1471-2105-9-412"
}
]
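Because the record is plain JSON, its fields can be read with any JSON parser. The following is a minimal Python sketch, assuming the array above has been saved or pasted as a string. The `RECORD` excerpt is trimmed for brevity and the `summarize` helper is a hypothetical name for illustration, not part of SciGraph.

```python
import json

# Trimmed excerpt of the record above (illustrative; most fields omitted).
RECORD = """
[
  {
    "author": [
      {"familyName": "Liu", "givenName": "Zhenqiu", "type": "Person"},
      {"familyName": "Gartenhaus", "givenName": "Ronald B", "type": "Person"}
    ],
    "datePublished": "2008-12",
    "isPartOf": [
      {"id": "sg:journal.1023786", "issn": ["1471-2105"],
       "name": "BMC Bioinformatics", "type": "Periodical"},
      {"issueNumber": "1", "type": "PublicationIssue"},
      {"type": "PublicationVolume", "volumeNumber": "9"}
    ],
    "productId": [
      {"name": "pubmed_id", "type": "PropertyValue", "value": ["18834526"]},
      {"name": "doi", "type": "PropertyValue", "value": ["10.1186/1471-2105-9-412"]}
    ],
    "type": "ScholarlyArticle"
  }
]
"""


def summarize(raw):
    """Hypothetical helper: pull a few fields out of a SciGraph JSON-LD array."""
    article = json.loads(raw)[0]  # the record is a one-element array
    authors = [f"{a['givenName']} {a['familyName']}" for a in article.get("author", [])]
    # productId is a list of name/value pairs; flatten it into a dict.
    ids = {p["name"]: p["value"][0] for p in article.get("productId", [])}
    # isPartOf mixes journal, issue, and volume entries; pick the Periodical.
    journal = next(
        (part["name"] for part in article.get("isPartOf", [])
         if part.get("type") == "Periodical"),
        None,
    )
    return {
        "authors": authors,
        "doi": ids.get("doi"),
        "pubmed_id": ids.get("pubmed_id"),
        "journal": journal,
        "date": article.get("datePublished"),
    }


summary = summarize(RECORD)
print(summary)
```

The helper uses `.get` with defaults throughout because not every entry carries every key; for example, the last author in the record above has no `id` or `sameAs`.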
JSON-LD is a popular format for linked data which is fully compatible with JSON.
curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-9-412'
N-Triples is a line-based linked data format ideal for batch operations.
curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-9-412'
Turtle is a human-readable linked data format.
curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-9-412'
RDF/XML is a standard XML format for linked data.
curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-9-412'
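The four curl commands differ only in the `Accept` header, which is ordinary HTTP content negotiation. As a rough Python equivalent using only the standard library, the sketch below builds the same requests. The names `MEDIA_TYPES`, `build_request`, and `fetch` are illustrative, and the sketch does not verify that the endpoint still responds.

```python
import urllib.request

RECORD_URL = "https://scigraph.springernature.com/pub.10.1186/1471-2105-9-412"

# Media types copied from the curl commands above; the short keys are arbitrary labels.
MEDIA_TYPES = {
    "json-ld": "application/ld+json",
    "nt": "application/n-triples",
    "turtle": "text/turtle",
    "xml": "application/rdf+xml",
}


def build_request(fmt, url=RECORD_URL):
    """Build (but do not send) a GET request asking for the given serialization."""
    return urllib.request.Request(url, headers={"Accept": MEDIA_TYPES[fmt]})


def fetch(fmt, url=RECORD_URL, timeout=30):
    """Send the request and return the body as text; requires network access."""
    with urllib.request.urlopen(build_request(fmt, url), timeout=timeout) as resp:
        return resp.read().decode("utf-8")


# Inspect the request without touching the network.
req = build_request("turtle")
print(req.full_url, req.get_header("Accept"))
```

Keeping request construction separate from `fetch` makes the header logic checkable offline; calling `fetch("turtle")` would perform the actual download.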
This table displays all metadata directly associated with this object as RDF triples.
203 TRIPLES
21 PREDICATES
58 URIs
30 LITERALS
18 BLANK NODES