Random Forests View Full Text


Ontology type: schema:ScholarlyArticle      Open Access: True


Article Info

DATE

2001-10

AUTHORS

Leo Breiman

ABSTRACT

Random forests are a combination of tree predictors such that each tree depends on the values of a random vector sampled independently and with the same distribution for all trees in the forest. The generalization error for forests converges a.s. to a limit as the number of trees in the forest becomes large. The generalization error of a forest of tree classifiers depends on the strength of the individual trees in the forest and the correlation between them. Using a random selection of features to split each node yields error rates that compare favorably to Adaboost (Y. Freund & R. Schapire, Machine Learning: Proceedings of the Thirteenth International conference, ***, 148–156), but are more robust with respect to noise. Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the splitting. Internal estimates are also used to measure variable importance. These ideas are also applicable to regression. More... »

PAGES

5-32

References to SciGraph publications

Journal

TITLE

Machine Learning

ISSUE

1

VOLUME

45

Related Patents

  • Analyzing The Expression Of Biomarkers In Cells With Moments
  • Analyzing The Expression Of Biomarkers In Cells With Clusters
  • 9-Oxo-Octadecadienoic Acid (9-Oxo-Hode)As As Biomarker For Healthy Ageing
  • Method For Generating A Super-Resolved Image And Apparatus For Generating A Super-Resolved Image
  • Methods For Predicting Acute Severe Colitis Treatment Response
  • System And Method For Seismic Facies Identification Using Machine Learning
  • Analyzing The Expression Of Biomarkers In Cells With Moments
  • Blood Plasma Biomarkers For Bevacizumab Combination Therapies For Treatment Of Breast Cancer
  • Methods For Classification And Visualization Of Cellular Populations On A Single Cell Level Based On Microscopy Images
  • Method And Device For Associating Frames In A Video Of An Activity Of A Person With An Event
  • Combined Feature Ensamble Nutual Information Image Registration
  • Image Processing Method, Image Processing Apparatus, And Recording Medium
  • Method And Device For Processing An Image Of Pixels, Corresponding Computer Program Product And Computer-Readable Medium
  • Biometric Recognition System
  • Method For Indicating The Presence Or Non-Presence Of Prostate Cancer
  • Method And Device For Calculating A Camera Or Object Pose
  • Method And Apparatus For Learning-Enhanced Altas-Based Auto-Segmentation
  • Image Data Pre-Processing
  • Method And Apparatus For Associating Of Images In A Video An Activity Of A Person With A Event
  • System For Predicting Efficacy Of A Target-Directed Drug To Treat A Disease
  • Compositions And Methods For Detecting Predisposition To Cardiovascular Disease
  • Predicting Customer Churn In A Telecommunications Network Environment
  • Synergistic Combination Of Biomarkers For Detecting And Assessing Hepatic Fibrosis
  • Detecting Anomalous Behavior Via User Authentication Graphs
  • Automatic Spatial Context Based Multi-Object Segmentation In 3d Images
  • Method For Indicating A Presence Or Non-Presence Of Aggressive Prostate Cancer
  • Phenylacetylglutamine As Biomarker For Healthy Ageing
  • P-Cresol Sulphate As Biomarker For Healthy Ageing
  • Generate Percentage Of Positive Cells For Biomarkers By Normalizing And Autothresholding The Image Intensity Produced By Immunohistochemistry Technique
  • Combined Feature Ensamble Nutual Information Image Registration
  • Identifiers

    URI

    http://scigraph.springernature.com/pub.10.1023/a:1010933404324

    DOI

    http://dx.doi.org/10.1023/a:1010933404324

    DIMENSIONS

    https://app.dimensions.ai/details/publication/pub.1024739340


    Indexing Status Check whether this publication has been indexed by Scopus and Web Of Science using the SN Indexing Status Tool
    Incoming Citations Browse incoming citations for this publication using opencitations.net

    JSON-LD is the canonical representation for SciGraph data.

    TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT

    [
      {
        "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
        "about": [
          {
            "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0705", 
            "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
            "name": "Forestry Sciences", 
            "type": "DefinedTerm"
          }, 
          {
            "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/07", 
            "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
            "name": "Agricultural and Veterinary Sciences", 
            "type": "DefinedTerm"
          }
        ], 
        "author": [
          {
            "affiliation": {
              "alternateName": "University of California, Berkeley", 
              "id": "https://www.grid.ac/institutes/grid.47840.3f", 
              "name": [
                "Statistics Department, University of California, 94720, Berkeley, CA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Breiman", 
            "givenName": "Leo", 
            "id": "sg:person.01275565034.02", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01275565034.02"
            ], 
            "type": "Person"
          }
        ], 
        "citation": [
          {
            "id": "sg:pub.10.1007/bf00058655", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1002929950", 
              "https://doi.org/10.1007/bf00058655"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1023/a:1007515423169", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1017116781", 
              "https://doi.org/10.1023/a:1007515423169"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1214/aos/1024691352", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1035391848"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1162/neco.1997.9.7.1545", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1045836958"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1109/34.709601", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1061156844"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1109/34.857004", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1061157096"
            ], 
            "type": "CreativeWork"
          }
        ], 
        "datePublished": "2001-10", 
        "datePublishedReg": "2001-10-01", 
        "description": "Random forests are a combination of tree predictors such that each tree depends on the values of a random vector sampled independently and with the same distribution for all trees in the forest. The generalization error for forests converges a.s. to a limit as the number of trees in the forest becomes large. The generalization error of a forest of tree classifiers depends on the strength of the individual trees in the forest and the correlation between them. Using a random selection of features to split each node yields error rates that compare favorably to Adaboost (Y. Freund & R. Schapire, Machine Learning: Proceedings of the Thirteenth International conference, ***, 148\u2013156), but are more robust with respect to noise. Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the splitting. Internal estimates are also used to measure variable importance. These ideas are also applicable to regression.", 
        "genre": "research_article", 
        "id": "sg:pub.10.1023/a:1010933404324", 
        "inLanguage": [
          "en"
        ], 
        "isAccessibleForFree": true, 
        "isPartOf": [
          {
            "id": "sg:journal.1125588", 
            "issn": [
              "0885-6125", 
              "1573-0565"
            ], 
            "name": "Machine Learning", 
            "type": "Periodical"
          }, 
          {
            "issueNumber": "1", 
            "type": "PublicationIssue"
          }, 
          {
            "type": "PublicationVolume", 
            "volumeNumber": "45"
          }
        ], 
        "name": "Random Forests", 
        "pagination": "5-32", 
        "productId": [
          {
            "name": "readcube_id", 
            "type": "PropertyValue", 
            "value": [
              "75dc04bd80c4a93a35ef04b6eb647b3ce9a56ed6a742cc19ee5557f930ad5cbe"
            ]
          }, 
          {
            "name": "doi", 
            "type": "PropertyValue", 
            "value": [
              "10.1023/a:1010933404324"
            ]
          }, 
          {
            "name": "dimensions_id", 
            "type": "PropertyValue", 
            "value": [
              "pub.1024739340"
            ]
          }
        ], 
        "sameAs": [
          "https://doi.org/10.1023/a:1010933404324", 
          "https://app.dimensions.ai/details/publication/pub.1024739340"
        ], 
        "sdDataset": "articles", 
        "sdDatePublished": "2019-04-10T18:17", 
        "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
        "sdPublisher": {
          "name": "Springer Nature - SN SciGraph project", 
          "type": "Organization"
        }, 
        "sdSource": "s3://com-uberresearch-data-dimensions-target-20181106-alternative/cleanup/v134/2549eaecd7973599484d7c17b260dba0a4ecb94b/merge/v9/a6c9fde33151104705d4d7ff012ea9563521a3ce/jats-lookup/v90/0000000001_0000000264/records_8675_00000499.jsonl", 
        "type": "ScholarlyArticle", 
        "url": "http://link.springer.com/10.1023/A:1010933404324"
      }
    ]
     

    Download the RDF metadata as:  json-ld nt turtle xml License info

    HOW TO GET THIS DATA PROGRAMMATICALLY:

    JSON-LD is a popular format for linked data which is fully compatible with JSON.

    curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1023/a:1010933404324'

    N-Triples is a line-based linked data format ideal for batch operations.

    curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1023/a:1010933404324'

    Turtle is a human-readable linked data format.

    curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1023/a:1010933404324'

    RDF/XML is a standard XML format for linked data.

    curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1023/a:1010933404324'


     

    This table displays all metadata directly associated to this object as RDF triples.

    81 TRIPLES      21 PREDICATES      33 URIs      19 LITERALS      7 BLANK NODES

    Subject Predicate Object
    1 sg:pub.10.1023/a:1010933404324 schema:about anzsrc-for:07
    2 anzsrc-for:0705
    3 schema:author Nd99b196e7dd549be9ded08bca9f184d9
    4 schema:citation sg:pub.10.1007/bf00058655
    5 sg:pub.10.1023/a:1007515423169
    6 https://doi.org/10.1109/34.709601
    7 https://doi.org/10.1109/34.857004
    8 https://doi.org/10.1162/neco.1997.9.7.1545
    9 https://doi.org/10.1214/aos/1024691352
    10 schema:datePublished 2001-10
    11 schema:datePublishedReg 2001-10-01
    12 schema:description Random forests are a combination of tree predictors such that each tree depends on the values of a random vector sampled independently and with the same distribution for all trees in the forest. The generalization error for forests converges a.s. to a limit as the number of trees in the forest becomes large. The generalization error of a forest of tree classifiers depends on the strength of the individual trees in the forest and the correlation between them. Using a random selection of features to split each node yields error rates that compare favorably to Adaboost (Y. Freund & R. Schapire, Machine Learning: Proceedings of the Thirteenth International conference, ***, 148–156), but are more robust with respect to noise. Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the splitting. Internal estimates are also used to measure variable importance. These ideas are also applicable to regression.
    13 schema:genre research_article
    14 schema:inLanguage en
    15 schema:isAccessibleForFree true
    16 schema:isPartOf Nd69bfb0f96b8419a83e7a82b87a9fa68
    17 Ne0e54a6831d746fab3f0beba9a296726
    18 sg:journal.1125588
    19 schema:name Random Forests
    20 schema:pagination 5-32
    21 schema:productId N1f230a140cfd455a8946bb7211dc5daf
    22 Na31cec8923b741d3b9f72c7330fae61a
    23 Nd31ad6cabb424cdfa530ca8253262358
    24 schema:sameAs https://app.dimensions.ai/details/publication/pub.1024739340
    25 https://doi.org/10.1023/a:1010933404324
    26 schema:sdDatePublished 2019-04-10T18:17
    27 schema:sdLicense https://scigraph.springernature.com/explorer/license/
    28 schema:sdPublisher N0aa36625c37d4e859330b44d91698a15
    29 schema:url http://link.springer.com/10.1023/A:1010933404324
    30 sgo:license sg:explorer/license/
    31 sgo:sdDataset articles
    32 rdf:type schema:ScholarlyArticle
    33 N0aa36625c37d4e859330b44d91698a15 schema:name Springer Nature - SN SciGraph project
    34 rdf:type schema:Organization
    35 N1f230a140cfd455a8946bb7211dc5daf schema:name readcube_id
    36 schema:value 75dc04bd80c4a93a35ef04b6eb647b3ce9a56ed6a742cc19ee5557f930ad5cbe
    37 rdf:type schema:PropertyValue
    38 Na31cec8923b741d3b9f72c7330fae61a schema:name doi
    39 schema:value 10.1023/a:1010933404324
    40 rdf:type schema:PropertyValue
    41 Nd31ad6cabb424cdfa530ca8253262358 schema:name dimensions_id
    42 schema:value pub.1024739340
    43 rdf:type schema:PropertyValue
    44 Nd69bfb0f96b8419a83e7a82b87a9fa68 schema:volumeNumber 45
    45 rdf:type schema:PublicationVolume
    46 Nd99b196e7dd549be9ded08bca9f184d9 rdf:first sg:person.01275565034.02
    47 rdf:rest rdf:nil
    48 Ne0e54a6831d746fab3f0beba9a296726 schema:issueNumber 1
    49 rdf:type schema:PublicationIssue
    50 anzsrc-for:07 schema:inDefinedTermSet anzsrc-for:
    51 schema:name Agricultural and Veterinary Sciences
    52 rdf:type schema:DefinedTerm
    53 anzsrc-for:0705 schema:inDefinedTermSet anzsrc-for:
    54 schema:name Forestry Sciences
    55 rdf:type schema:DefinedTerm
    56 sg:journal.1125588 schema:issn 0885-6125
    57 1573-0565
    58 schema:name Machine Learning
    59 rdf:type schema:Periodical
    60 sg:person.01275565034.02 schema:affiliation https://www.grid.ac/institutes/grid.47840.3f
    61 schema:familyName Breiman
    62 schema:givenName Leo
    63 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01275565034.02
    64 rdf:type schema:Person
    65 sg:pub.10.1007/bf00058655 schema:sameAs https://app.dimensions.ai/details/publication/pub.1002929950
    66 https://doi.org/10.1007/bf00058655
    67 rdf:type schema:CreativeWork
    68 sg:pub.10.1023/a:1007515423169 schema:sameAs https://app.dimensions.ai/details/publication/pub.1017116781
    69 https://doi.org/10.1023/a:1007515423169
    70 rdf:type schema:CreativeWork
    71 https://doi.org/10.1109/34.709601 schema:sameAs https://app.dimensions.ai/details/publication/pub.1061156844
    72 rdf:type schema:CreativeWork
    73 https://doi.org/10.1109/34.857004 schema:sameAs https://app.dimensions.ai/details/publication/pub.1061157096
    74 rdf:type schema:CreativeWork
    75 https://doi.org/10.1162/neco.1997.9.7.1545 schema:sameAs https://app.dimensions.ai/details/publication/pub.1045836958
    76 rdf:type schema:CreativeWork
    77 https://doi.org/10.1214/aos/1024691352 schema:sameAs https://app.dimensions.ai/details/publication/pub.1035391848
    78 rdf:type schema:CreativeWork
    79 https://www.grid.ac/institutes/grid.47840.3f schema:alternateName University of California, Berkeley
    80 schema:name Statistics Department, University of California, 94720, Berkeley, CA
    81 rdf:type schema:Organization
     




    Preview window. Press ESC to close (or click here)


    ...