Ontology type: schema:ScholarlyArticle Open Access: True
2008-01-23
AUTHORSRekin's Janky, Jacques van Helden
ABSTRACTBackgroundThe detection of conserved motifs in promoters of orthologous genes (phylogenetic footprints) has become a common strategy to predict cis-acting regulatory elements. Several software tools are routinely used to raise hypotheses about regulation. However, these tools are generally used as black boxes, with default parameters. A systematic evaluation of optimal parameters for a footprint discovery strategy can bring a sizeable improvement to the predictions.ResultsWe evaluate the performances of a footprint discovery approach based on the detection of over-represented spaced motifs. This method is particularly suitable for (but not restricted to) Bacteria, since such motifs are typically bound by factors containing a Helix-Turn-Helix domain. We evaluated footprint discovery in 368 Escherichia coli K12 genes with annotated sites, under 40 different combinations of parameters (taxonomical level, background model, organism-specific filtering, operon inference). Motifs are assessed both at the levels of correctness and significance. We further report a detailed analysis of 181 bacterial orthologs of the LexA repressor. Distinct motifs are detected at various taxonomical levels, including the 7 previously characterized taxon-specific motifs. In addition, we highlight a significantly stronger conservation of half-motifs in Actinobacteria, relative to Firmicutes, suggesting an intermediate state in specificity switching between the two Gram-positive phyla, and thereby revealing the on-going evolution of LexA auto-regulation.ConclusionThe footprint discovery method proposed here shows excellent results with E. coli and can readily be extended to predict cis-acting regulatory signals and propose testable hypotheses in bacterial genomes for which nothing is known about regulation. More... »
PAGES37
http://scigraph.springernature.com/pub.10.1186/1471-2105-9-37
DOIhttp://dx.doi.org/10.1186/1471-2105-9-37
DIMENSIONShttps://app.dimensions.ai/details/publication/pub.1008642915
PUBMEDhttps://www.ncbi.nlm.nih.gov/pubmed/18215291
JSON-LD is the canonical representation for SciGraph data.
TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT
[
{
"@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json",
"about": [
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/06",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Biological Sciences",
"type": "DefinedTerm"
},
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0604",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Genetics",
"type": "DefinedTerm"
},
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0605",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Microbiology",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Actinobacteria",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Algorithms",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Amino Acid Motifs",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Bacterial Proteins",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Conserved Sequence",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "DNA Footprinting",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Escherichia coli K12",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Evolution, Molecular",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Genome, Bacterial",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Gram-Positive Endospore-Forming Bacteria",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Phylogeny",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Promoter Regions, Genetic",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Sequence Homology, Nucleic Acid",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Serine Endopeptidases",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Software",
"type": "DefinedTerm"
}
],
"author": [
{
"affiliation": {
"alternateName": "Laboratoire de Bioinformatique des G\u00e9nomes et des R\u00e9seaux, Universit\u00e9 Libre de Bruxelles (ULB), Campus Plaine, CP 263, Boulevard du Triomphe, 1050, Bruxelles, Belgium",
"id": "http://www.grid.ac/institutes/grid.4989.c",
"name": [
"Laboratoire de Bioinformatique des G\u00e9nomes et des R\u00e9seaux, Universit\u00e9 Libre de Bruxelles (ULB), Campus Plaine, CP 263, Boulevard du Triomphe, 1050, Bruxelles, Belgium"
],
"type": "Organization"
},
"familyName": "Janky",
"givenName": "Rekin's",
"id": "sg:person.01161224330.14",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01161224330.14"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Laboratoire de Bioinformatique des G\u00e9nomes et des R\u00e9seaux, Universit\u00e9 Libre de Bruxelles (ULB), Campus Plaine, CP 263, Boulevard du Triomphe, 1050, Bruxelles, Belgium",
"id": "http://www.grid.ac/institutes/grid.4989.c",
"name": [
"Laboratoire de Bioinformatique des G\u00e9nomes et des R\u00e9seaux, Universit\u00e9 Libre de Bruxelles (ULB), Campus Plaine, CP 263, Boulevard du Triomphe, 1050, Bruxelles, Belgium"
],
"type": "Organization"
},
"familyName": "van Helden",
"givenName": "Jacques",
"id": "sg:person.0626672543.46",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0626672543.46"
],
"type": "Person"
}
],
"citation": [
{
"id": "sg:pub.10.1038/nbt1098-939",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1049107556",
"https://doi.org/10.1038/nbt1098-939"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nature01644",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1010517605",
"https://doi.org/10.1038/nature01644"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/10343",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1009819816",
"https://doi.org/10.1038/10343"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/s004380051066",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1034470159",
"https://doi.org/10.1007/s004380051066"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-7-488",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1007199178",
"https://doi.org/10.1186/1471-2105-7-488"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/978-1-59745-514-5_18",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1037981308",
"https://doi.org/10.1007/978-1-59745-514-5_18"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-5-170",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1000329510",
"https://doi.org/10.1186/1471-2105-5-170"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2164-7-147",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1016837348",
"https://doi.org/10.1186/1471-2164-7-147"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/s00438-003-0952-x",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1050711091",
"https://doi.org/10.1007/s00438-003-0952-x"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-5-6",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1007965784",
"https://doi.org/10.1186/1471-2105-5-6"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/nbt1053",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1030939237",
"https://doi.org/10.1038/nbt1053"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1038/79965",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1027354871",
"https://doi.org/10.1038/79965"
],
"type": "CreativeWork"
}
],
"datePublished": "2008-01-23",
"datePublishedReg": "2008-01-23",
"description": "BackgroundThe detection of conserved motifs in promoters of orthologous genes (phylogenetic footprints) has become a common strategy to predict cis-acting regulatory elements. Several software tools are routinely used to raise hypotheses about regulation. However, these tools are generally used as black boxes, with default parameters. A systematic evaluation of optimal parameters for a footprint discovery strategy can bring a sizeable improvement to the predictions.ResultsWe evaluate the performances of a footprint discovery approach based on the detection of over-represented spaced motifs. This method is particularly suitable for (but not restricted to) Bacteria, since such motifs are typically bound by factors containing a Helix-Turn-Helix domain. We evaluated footprint discovery in 368 Escherichia coli K12 genes with annotated sites, under 40 different combinations of parameters (taxonomical level, background model, organism-specific filtering, operon inference). Motifs are assessed both at the levels of correctness and significance. We further report a detailed analysis of 181 bacterial orthologs of the LexA repressor. Distinct motifs are detected at various taxonomical levels, including the 7 previously characterized taxon-specific motifs. In addition, we highlight a significantly stronger conservation of half-motifs in Actinobacteria, relative to Firmicutes, suggesting an intermediate state in specificity switching between the two Gram-positive phyla, and thereby revealing the on-going evolution of LexA auto-regulation.ConclusionThe footprint discovery method proposed here shows excellent results with E. coli and can readily be extended to predict cis-acting regulatory signals and propose testable hypotheses in bacterial genomes for which nothing is known about regulation.",
"genre": "article",
"id": "sg:pub.10.1186/1471-2105-9-37",
"inLanguage": "en",
"isAccessibleForFree": true,
"isFundedItemOf": [
{
"id": "sg:grant.6770571",
"type": "MonetaryGrant"
}
],
"isPartOf": [
{
"id": "sg:journal.1023786",
"issn": [
"1471-2105"
],
"name": "BMC Bioinformatics",
"publisher": "Springer Nature",
"type": "Periodical"
},
{
"issueNumber": "1",
"type": "PublicationIssue"
},
{
"type": "PublicationVolume",
"volumeNumber": "9"
}
],
"keywords": [
"Escherichia coli K12 gene",
"cis-acting regulatory elements",
"cis-acting regulatory signals",
"cis-regulatory elements",
"Gram-positive phylum",
"specificity switching",
"orthologous genes",
"bacterial orthologs",
"strong conservation",
"helix domain",
"bacterial genomes",
"K12 gene",
"LexA repressor",
"regulatory elements",
"regulatory signals",
"taxonomical levels",
"distinct motifs",
"helix turn",
"such motifs",
"motif",
"E. coli",
"genes",
"testable hypotheses",
"discovery strategies",
"spaced motifs",
"regulation",
"orthologs",
"discovery approach",
"LexA",
"repressor",
"genome",
"phyla",
"Actinobacteria",
"discovery",
"Firmicutes",
"promoter",
"common strategy",
"coli",
"conservation",
"bacteria",
"evolution",
"intermediate state",
"discovery methods",
"hypothesis",
"domain",
"sites",
"different combinations",
"detailed analysis",
"levels",
"elements",
"box",
"strategies",
"signals",
"tool",
"default parameters",
"factors",
"addition",
"significance",
"analysis",
"software tools",
"combination",
"systematic evaluation",
"detection",
"black box",
"results",
"prediction",
"switching",
"approach",
"state",
"method",
"parameters",
"level of correctness",
"sizeable improvements",
"evaluation",
"improvement",
"performance",
"optimal parameters",
"correctness",
"excellent results"
],
"name": "Evaluation of phylogenetic footprint discovery for predicting bacterial cis-regulatory elements and revealing their evolution",
"pagination": "37",
"productId": [
{
"name": "dimensions_id",
"type": "PropertyValue",
"value": [
"pub.1008642915"
]
},
{
"name": "doi",
"type": "PropertyValue",
"value": [
"10.1186/1471-2105-9-37"
]
},
{
"name": "pubmed_id",
"type": "PropertyValue",
"value": [
"18215291"
]
}
],
"sameAs": [
"https://doi.org/10.1186/1471-2105-9-37",
"https://app.dimensions.ai/details/publication/pub.1008642915"
],
"sdDataset": "articles",
"sdDatePublished": "2022-05-20T07:24",
"sdLicense": "https://scigraph.springernature.com/explorer/license/",
"sdPublisher": {
"name": "Springer Nature - SN SciGraph project",
"type": "Organization"
},
"sdSource": "s3://com-springernature-scigraph/baseset/20220519/entities/gbq_results/article/article_451.jsonl",
"type": "ScholarlyArticle",
"url": "https://doi.org/10.1186/1471-2105-9-37"
}
]
Download the RDF metadata as: json-ld nt turtle xml License info
JSON-LD is a popular format for linked data which is fully compatible with JSON.
curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-9-37'
N-Triples is a line-based linked data format ideal for batch operations.
curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-9-37'
Turtle is a human-readable linked data format.
curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-9-37'
RDF/XML is a standard XML format for linked data.
curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-9-37'
This table displays all metadata directly associated to this object as RDF triples.
261 TRIPLES
22 PREDICATES
133 URIs
112 LITERALS
22 BLANK NODES