2001-10
AUTHORS Leo Breiman
ABSTRACT Random forests are a combination of tree predictors such that each tree depends on the values of a random vector sampled independently and with the same distribution for all trees in the forest. The generalization error for forests converges a.s. to a limit as the number of trees in the forest becomes large. The generalization error of a forest of tree classifiers depends on the strength of the individual trees in the forest and the correlation between them. Using a random selection of features to split each node yields error rates that compare favorably to Adaboost (Y. Freund & R. Schapire, Machine Learning: Proceedings of the Thirteenth International conference, ***, 148–156), but are more robust with respect to noise. Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the splitting. Internal estimates are also used to measure variable importance. These ideas are also applicable to regression.
PAGES 5-32
http://scigraph.springernature.com/pub.10.1023/a:1010933404324
DOI http://dx.doi.org/10.1023/a:1010933404324
DIMENSIONS https://app.dimensions.ai/details/publication/pub.1024739340
JSON-LD is the canonical representation for SciGraph data.
[
{
"@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json",
"about": [
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0705",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Forestry Sciences",
"type": "DefinedTerm"
},
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/07",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Agricultural and Veterinary Sciences",
"type": "DefinedTerm"
}
],
"author": [
{
"affiliation": {
"alternateName": "University of California, Berkeley",
"id": "https://www.grid.ac/institutes/grid.47840.3f",
"name": [
"Statistics Department, University of California, 94720, Berkeley, CA"
],
"type": "Organization"
},
"familyName": "Breiman",
"givenName": "Leo",
"id": "sg:person.01275565034.02",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01275565034.02"
],
"type": "Person"
}
],
"citation": [
{
"id": "sg:pub.10.1007/bf00058655",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1002929950",
"https://doi.org/10.1007/bf00058655"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1023/a:1007515423169",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1017116781",
"https://doi.org/10.1023/a:1007515423169"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1214/aos/1024691352",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1035391848"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1162/neco.1997.9.7.1545",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1045836958"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1109/34.709601",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1061156844"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1109/34.857004",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1061157096"
],
"type": "CreativeWork"
}
],
"datePublished": "2001-10",
"datePublishedReg": "2001-10-01",
"description": "Random forests are a combination of tree predictors such that each tree depends on the values of a random vector sampled independently and with the same distribution for all trees in the forest. The generalization error for forests converges a.s. to a limit as the number of trees in the forest becomes large. The generalization error of a forest of tree classifiers depends on the strength of the individual trees in the forest and the correlation between them. Using a random selection of features to split each node yields error rates that compare favorably to Adaboost (Y. Freund & R. Schapire, Machine Learning: Proceedings of the Thirteenth International conference, ***, 148\u2013156), but are more robust with respect to noise. Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the splitting. Internal estimates are also used to measure variable importance. These ideas are also applicable to regression.",
"genre": "research_article",
"id": "sg:pub.10.1023/a:1010933404324",
"inLanguage": [
"en"
],
"isAccessibleForFree": true,
"isPartOf": [
{
"id": "sg:journal.1125588",
"issn": [
"0885-6125",
"1573-0565"
],
"name": "Machine Learning",
"type": "Periodical"
},
{
"issueNumber": "1",
"type": "PublicationIssue"
},
{
"type": "PublicationVolume",
"volumeNumber": "45"
}
],
"name": "Random Forests",
"pagination": "5-32",
"productId": [
{
"name": "readcube_id",
"type": "PropertyValue",
"value": [
"75dc04bd80c4a93a35ef04b6eb647b3ce9a56ed6a742cc19ee5557f930ad5cbe"
]
},
{
"name": "doi",
"type": "PropertyValue",
"value": [
"10.1023/a:1010933404324"
]
},
{
"name": "dimensions_id",
"type": "PropertyValue",
"value": [
"pub.1024739340"
]
}
],
"sameAs": [
"https://doi.org/10.1023/a:1010933404324",
"https://app.dimensions.ai/details/publication/pub.1024739340"
],
"sdDataset": "articles",
"sdDatePublished": "2019-04-10T18:17",
"sdLicense": "https://scigraph.springernature.com/explorer/license/",
"sdPublisher": {
"name": "Springer Nature - SN SciGraph project",
"type": "Organization"
},
"sdSource": "s3://com-uberresearch-data-dimensions-target-20181106-alternative/cleanup/v134/2549eaecd7973599484d7c17b260dba0a4ecb94b/merge/v9/a6c9fde33151104705d4d7ff012ea9563521a3ce/jats-lookup/v90/0000000001_0000000264/records_8675_00000499.jsonl",
"type": "ScholarlyArticle",
"url": "http://link.springer.com/10.1023/A:1010933404324"
}
]
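As a rough illustration of how a record shaped like the one above might be consumed, the following Python sketch pulls the title, DOI, and journal details out of a trimmed copy of the JSON-LD. The helper names `find_product_id` and `summarize` are hypothetical, and the embedded record is an abbreviated copy of the fields shown above, not the full record.

```python
import json

# Abbreviated copy of the record above; only the fields used below are kept.
RECORD_JSON = """
[
  {
    "name": "Random Forests",
    "pagination": "5-32",
    "datePublished": "2001-10",
    "isPartOf": [
      {"id": "sg:journal.1125588", "name": "Machine Learning", "type": "Periodical"},
      {"issueNumber": "1", "type": "PublicationIssue"},
      {"type": "PublicationVolume", "volumeNumber": "45"}
    ],
    "productId": [
      {"name": "doi", "type": "PropertyValue", "value": ["10.1023/a:1010933404324"]},
      {"name": "dimensions_id", "type": "PropertyValue", "value": ["pub.1024739340"]}
    ]
  }
]
"""


def find_product_id(record, name):
    """Return the first value of the productId entry called `name`, or None."""
    for entry in record.get("productId", []):
        if entry.get("name") == name and entry.get("value"):
            return entry["value"][0]
    return None


def summarize(record):
    """Collect a few bibliographic fields from one SciGraph-style record."""
    summary = {
        "title": record.get("name"),
        "doi": find_product_id(record, "doi"),
        "pages": record.get("pagination"),
    }
    # isPartOf mixes the journal, issue, and volume; tell them apart by "type".
    for part in record.get("isPartOf", []):
        if part.get("type") == "Periodical":
            summary["journal"] = part.get("name")
        elif part.get("type") == "PublicationIssue":
            summary["issue"] = part.get("issueNumber")
        elif part.get("type") == "PublicationVolume":
            summary["volume"] = part.get("volumeNumber")
    return summary


records = json.loads(RECORD_JSON)  # the top level is a list holding one record
summary = summarize(records[0])
print(summary)
```

Matching on the `type` field rather than on list position is a deliberate choice here, since nothing in the record guarantees the order of the `isPartOf` entries.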
JSON-LD is a popular format for linked data which is fully compatible with JSON.
curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1023/a:1010933404324'
N-Triples is a line-based linked data format ideal for batch operations.
curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1023/a:1010933404324'
Turtle is a human-readable linked data format.
curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1023/a:1010933404324'
RDF/XML is a standard XML format for linked data.
curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1023/a:1010933404324'
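The four curl requests differ only in the `Accept` header. A minimal Python sketch of that content negotiation, using only the standard library, might look as follows. The function name `build_request` and the short format labels in `FORMATS` are illustrative assumptions, and the request is only constructed, not sent, so nothing is assumed about the service's current availability.

```python
from urllib.request import Request

# Record URL and media types copied from the curl commands above.
RECORD_URL = "https://scigraph.springernature.com/pub.10.1023/a:1010933404324"
FORMATS = {
    "json-ld": "application/ld+json",
    "nt": "application/n-triples",
    "turtle": "text/turtle",
    "xml": "application/rdf+xml",
}


def build_request(fmt, url=RECORD_URL):
    """Build (but do not send) a GET request asking for the given RDF format."""
    if fmt not in FORMATS:
        raise ValueError(f"unknown format: {fmt!r}")
    return Request(url, headers={"Accept": FORMATS[fmt]})


for fmt in FORMATS:
    req = build_request(fmt)
    print(fmt, "->", req.get_header("Accept"))
```

Passing one of these objects to `urllib.request.urlopen` would perform the actual fetch.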
This table displays all metadata directly associated with this object as RDF triples.
81 TRIPLES
21 PREDICATES
33 URIs
19 LITERALS
7 BLANK NODES