Random Forests


Ontology type: schema:ScholarlyArticle      Open Access: True


Article Info

DATE

2001-10

AUTHORS

Leo Breiman

ABSTRACT

Random forests are a combination of tree predictors such that each tree depends on the values of a random vector sampled independently and with the same distribution for all trees in the forest. The generalization error for forests converges a.s. to a limit as the number of trees in the forest becomes large. The generalization error of a forest of tree classifiers depends on the strength of the individual trees in the forest and the correlation between them. Using a random selection of features to split each node yields error rates that compare favorably to Adaboost (Y. Freund & R. Schapire, Machine Learning: Proceedings of the Thirteenth International conference, ***, 148–156), but are more robust with respect to noise. Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the splitting. Internal estimates are also used to measure variable importance. These ideas are also applicable to regression.

PAGES

5-32

References to SciGraph publications

Journal

TITLE

Machine Learning

ISSUE

1

VOLUME

45

Related Patents

  • Compact Briefcase Oct System For Point-Of-Care Imaging
  • Model-Driven Evaluator Bias Detection
  • Methods Of Identifying Subjects Having Or At Risk Of Having A Coagulation Related Disorder
  • Combination Of Biomarkers For Detecting And Evaluating A Hepatic Fibrosis
  • Method And Computer Server System For Receiving And Presenting Information To A User In A Computer Network
  • Method For Generating A Super-Resolved Image And Apparatus For Generating A Super-Resolved Image
  • Analyzing The Expression Of Biomarkers In Cells With Clusters
  • System And Method For Seismic Facies Identification Using Machine Learning
  • 9-Oxo-Octadecadienoic Acid (9-Oxo-Hode) As Biomarker For Healthy Ageing
  • Methods For Predicting Acute Severe Colitis Treatment Response
  • Method For Annotating Images
  • Method, Device And Computer Program For Virtual Adapting Of A Spectacle Frame
  • System And Method For Achieving Improved Accuracy From Efficient Computer Architectures
  • Analyzing The Expression Of Biomarkers In Cells With Moments
  • Salivary Biomarker For Cancer, Method And Device For Assaying Same, And Method For Determining Salivary Biomarker For Cancer
  • Detecting Anomalous Behavior Via User Authentication Graphs
  • Salivary Biomarkers For Breast Cancer
  • Biomarker Panel For Diagnosis And Prognosis Of Cancer
  • Diagnostic Biomarkers, Clinical Variables, And Techniques For Selecting And Using Them
  • Method And Device For Associating Frames In A Video Of An Activity Of A Person With An Event
  • System And Method Of Otoscopy Image Analysis To Diagnose Ear Pathology
  • Biomarker For Outcome In Aml Patients
  • Methods Of Predicting And Preventing Cancer In Patients Having Premalignant Lesions
  • System And Method For Automated Seismic Interpretation
  • Use Of Gdf15 As A Marker For Exercise Management
  • Long Non-Coding Rnas (Lncrnas) For The Diagnosis And Therapeutics Of Brain Disorders, In Particular Cognitive Disorders
  • Use Of Sk1 As Biomarker For Predicting Response To Immunecheckpoint Inhibitors
  • Methods For Classification And Visualization Of Cellular Populations On A Single Cell Level Based On Microscopy Images
  • Classification Of Medical Diagnostic Images
  • Method And State Machine System For Detecting An Operation Status For A Sensor
  • Method For Analysing Sedimentary Samples With Automatic Recognition Of Nanofossils
  • Methods And Systems For Pre-Symptomatic Detection Of Exposure To An Agent
  • Method And Apparatus For Automated Delineation Of Structure Shape For Image Guided Treatment Planning
  • Compositions And Methods To Detect Head And Neck Cancer
  • Patient Classification And Prognostic Method
  • Methods And Compositions For Treating Melanoma
  • Detecting Neoplasm
  • Blood Plasma Biomarkers For Bevacizumab Combination Therapies For Treatment Of Breast Cancer
  • Method And System For Augmented Imaging In Open Treatment Using Multispectral Information
  • Image Processing Method, Image Processing Apparatus, And Recording Medium
  • Methods And Compositions For Identifying Whether A Subject Suffering From A Cancer Will Achieve A Response With An Immune-Checkpoint Inhibitor
  • Molecular Diagnostic Test For Cancer
  • Gene Signatures Predictive Of Metastatic Disease
  • New Gene Signatures For Predicting Survival Time In Patients Suffering From Renal Cell Carcinoma
  • Methods For Predicting Outcome And Treatment Of Patients Suffering From Prostate Cancer Or Breast Cancer
  • Histological Differentiation Grade Prediction Of Hepatocellular Carcinoma In Computed Tomography Images
  • Method Of Detecting Infection With Pathogens Causing Tuberculosis
  • Combined Feature Ensemble Mutual Information Image Registration
  • Method And System For Regression And Classification In Subsurface Models To Support Decision Making For Hydrocarbon Operations
  • Salivary Biomarkers For Oral Cancer
  • Early-Stage Alzheimer's Disease Autoantibody Biomarkers, Target Antigens And Diagnostic Uses Thereof
  • Method For Identifying Gene Expression Signatures
  • Use Of Amniotic Fluid Peptides For Predicting Postnatal Renal Function In Congenital Anomalies Of The Kidney And The Urinary Tract
  • Molecular Biomarker For Prognosis Of Sepsis Patients
  • Device, Method And Computer Program For Capturing Optical Image Data And For Determining A Position Of A Lateral Boundary Of A Patient Supporting Device
  • Method And System For Augmented Imaging Using Multispectral Information
  • Methods For Assessing Pregnancy Outcome
  • Calcium Level Analysis In Cardiomyocyte
  • Methods For Diagnosis, Prognosis And Methods Of Treatment
  • Algorithm And An In Vitro Method Based On Rna Editing To Select Particular Effect Induced By Active Compounds
  • New Biomarkers And Biotargets In Renal Cell Carcinoma
  • Method And Device For Calculating A Camera Or Object Pose
  • Biometric Recognition System
  • Method And Device For Processing An Image Of Pixels, Corresponding Computer Program Product And Computer-Readable Medium
  • Method And Apparatus For Learning-Enhanced Atlas-Based Auto-Segmentation
  • Systems And Methods For Full Body Measurements Extraction Using Multiple Deep Learning Networks For Body Feature Measurements
  • Methods For Predicting Metastatic Potential In Patients Suffering From Sdhb-Mutated Paraganglioma
  • Blood-Based Screen For Detecting Neurological Disease In Primary Care Settings
  • Method To Predict The Need For Therapy Of Patients Suffering From A Cancer
  • Method For Indicating The Presence Or Non-Presence Of Prostate Cancer
  • Hydroxy-Sphingomyelin 22:1 As Biomarker For Healthy Ageing
  • Non-Coding Rnas (Ncrna) For The Diagnosis Of Cognitive Disorders
  • Detection Of T Cell Exhaustion Or Lack Of T Cell Costimulation And Uses Thereof
  • Compositions And Methods For Detecting Predisposition To Cardiovascular Disease
  • Performance Model Adverse Impact Correction
  • Synergistic Combination Of Biomarkers For Detecting And Assessing Hepatic Fibrosis
  • Method For Indicating A Presence Or Non-Presence Of Aggressive Prostate Cancer
  • Shoes For Ball Sports
  • Method To Predict The Need For Therapy Of Patients Suffering From Chronic Lymphocytic Leukemia
  • Automatic Spatial Context Based Multi-Object Segmentation In 3d Images
  • Methods And Systems For Automatic Generation Of Massive Training Data Sets From 3d Models For Training Deep Learning Networks
  • Image Data Pre-Processing
  • Method And System For Evaluating Variability In Subsurface Models To Support Decision Making For Hydrocarbon Operations
  • Predicting Program, Predicting Method, And Predicting Apparatus
  • Method And Apparatus For Associating Images In A Video Of A Person's Activity With An Event
  • Predicting Customer Churn In A Telecommunications Network Environment
  • System For Predicting Efficacy Of A Target-Directed Drug To Treat A Disease
  • Generate Percentage Of Positive Cells For Biomarkers By Normalizing And Autothresholding The Image Intensity Produced By Immunohistochemistry Technique
  • Systems And Methods For Full Body Measurements Extraction
  • Phenylacetylglutamine As Biomarker For Healthy Ageing
  • P-Cresol Sulphate As Biomarker For Healthy Ageing

Identifiers

    URI

    http://scigraph.springernature.com/pub.10.1023/a:1010933404324

    DOI

    http://dx.doi.org/10.1023/a:1010933404324

    DIMENSIONS

    https://app.dimensions.ai/details/publication/pub.1024739340



    JSON-LD is the canonical representation for SciGraph data.

    [
      {
        "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
        "about": [
          {
            "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/08", 
            "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
            "name": "Information and Computing Sciences", 
            "type": "DefinedTerm"
          }, 
          {
            "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/17", 
            "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
            "name": "Psychology and Cognitive Sciences", 
            "type": "DefinedTerm"
          }, 
          {
            "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0801", 
            "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
            "name": "Artificial Intelligence and Image Processing", 
            "type": "DefinedTerm"
          }, 
          {
            "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0806", 
            "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
            "name": "Information Systems", 
            "type": "DefinedTerm"
          }, 
          {
            "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/1702", 
            "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
            "name": "Cognitive Sciences", 
            "type": "DefinedTerm"
          }
        ], 
        "author": [
          {
            "affiliation": {
              "alternateName": "Statistics Department, University of California, 94720, Berkeley, CA", 
              "id": "http://www.grid.ac/institutes/grid.47840.3f", 
              "name": [
                "Statistics Department, University of California, 94720, Berkeley, CA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Breiman", 
            "givenName": "Leo", 
            "id": "sg:person.01275565034.02", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01275565034.02"
            ], 
            "type": "Person"
          }
        ], 
        "citation": [
          {
            "id": "sg:pub.10.1023/a:1007515423169", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1017116781", 
              "https://doi.org/10.1023/a:1007515423169"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1007/bf00058655", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1002929950", 
              "https://doi.org/10.1007/bf00058655"
            ], 
            "type": "CreativeWork"
          }
        ], 
        "datePublished": "2001-10", 
        "datePublishedReg": "2001-10-01", 
        "description": "Random forests are a combination of tree predictors such that each tree depends on the values of a random vector sampled independently and with the same distribution for all trees in the forest. The generalization error for forests converges a.s. to a limit as the number of trees in the forest becomes large. The generalization error of a forest of tree classifiers depends on the strength of the individual trees in the forest and the correlation between them. Using a random selection of features to split each node yields error rates that compare favorably to Adaboost (Y. Freund & R. Schapire, Machine Learning: Proceedings of the Thirteenth International conference, ***, 148\u2013156), but are more robust with respect to noise. Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the splitting. Internal estimates are also used to measure variable importance. These ideas are also applicable to regression.", 
        "genre": "article", 
        "id": "sg:pub.10.1023/a:1010933404324", 
        "inLanguage": "en", 
        "isAccessibleForFree": true, 
        "isPartOf": [
          {
            "id": "sg:journal.1125588", 
            "issn": [
              "0885-6125", 
              "1573-0565"
            ], 
            "name": "Machine Learning", 
            "publisher": "Springer Nature", 
            "type": "Periodical"
          }, 
          {
            "issueNumber": "1", 
            "type": "PublicationIssue"
          }, 
          {
            "type": "PublicationVolume", 
            "volumeNumber": "45"
          }
        ], 
        "keywords": [
          "individual trees", 
          "number of trees", 
          "forest", 
          "tree predictor", 
          "trees", 
          "selection", 
          "vector", 
          "A.", 
          "number", 
          "response", 
          "random forest", 
          "importance", 
          "same distribution", 
          "number of features", 
          "variable importance", 
          "features", 
          "distribution", 
          "internal estimate", 
          "correlation", 
          "random selection", 
          "combination", 
          "estimates", 
          "rate", 
          "respect", 
          "idea", 
          "values", 
          "strength", 
          "limit", 
          "error rate", 
          "regression", 
          "predictors", 
          "error", 
          "splitting", 
          "tree classifier", 
          "classifier", 
          "noise", 
          "generalization error", 
          "AdaBoost", 
          "random vectors", 
          "converges a.", 
          "forests converges a.", 
          "node yields error rates", 
          "yields error rates"
        ], 
        "name": "Random Forests", 
        "pagination": "5-32", 
        "productId": [
          {
            "name": "dimensions_id", 
            "type": "PropertyValue", 
            "value": [
              "pub.1024739340"
            ]
          }, 
          {
            "name": "doi", 
            "type": "PropertyValue", 
            "value": [
              "10.1023/a:1010933404324"
            ]
          }
        ], 
        "sameAs": [
          "https://doi.org/10.1023/a:1010933404324", 
          "https://app.dimensions.ai/details/publication/pub.1024739340"
        ], 
        "sdDataset": "articles", 
        "sdDatePublished": "2021-12-01T19:11", 
        "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
        "sdPublisher": {
          "name": "Springer Nature - SN SciGraph project", 
          "type": "Organization"
        }, 
        "sdSource": "s3://com-springernature-scigraph/baseset/20211201/entities/gbq_results/article/article_307.jsonl", 
        "type": "ScholarlyArticle", 
        "url": "https://doi.org/10.1023/a:1010933404324"
      }
    ]
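
As a minimal sketch of reading a record shaped like the one above, the following Python uses only the standard `json` module. The `RECORD` string is an abbreviated excerpt of the JSON-LD shown above (most fields are omitted), and `summarize` is a hypothetical helper name chosen for illustration, not part of any SciGraph tooling.

```python
import json

# Abbreviated excerpt of the JSON-LD record above; most fields are omitted.
RECORD = """
[
  {
    "author": [
      {"familyName": "Breiman", "givenName": "Leo", "type": "Person"}
    ],
    "datePublished": "2001-10",
    "isPartOf": [
      {"name": "Machine Learning", "type": "Periodical"},
      {"issueNumber": "1", "type": "PublicationIssue"},
      {"type": "PublicationVolume", "volumeNumber": "45"}
    ],
    "name": "Random Forests",
    "pagination": "5-32",
    "productId": [
      {"name": "dimensions_id", "type": "PropertyValue", "value": ["pub.1024739340"]},
      {"name": "doi", "type": "PropertyValue", "value": ["10.1023/a:1010933404324"]}
    ],
    "type": "ScholarlyArticle"
  }
]
"""


def summarize(record_text):
    """Return a flat dict of bibliographic fields from a SciGraph-style record."""
    article = json.loads(record_text)[0]
    # isPartOf mixes journal, issue, and volume entries; index them by "type".
    parts = {part["type"]: part for part in article.get("isPartOf", [])}
    # productId is a list of name/value pairs; each value is itself a list.
    ids = {p["name"]: p["value"][0] for p in article.get("productId", [])}
    return {
        "title": article["name"],
        "authors": [f'{a["givenName"]} {a["familyName"]}' for a in article["author"]],
        "journal": parts.get("Periodical", {}).get("name"),
        "volume": parts.get("PublicationVolume", {}).get("volumeNumber"),
        "issue": parts.get("PublicationIssue", {}).get("issueNumber"),
        "pages": article.get("pagination"),
        "doi": ids.get("doi"),
    }


if __name__ == "__main__":
    print(summarize(RECORD))
```

Indexing `isPartOf` by `type` is one way to cope with the journal, issue, and volume arriving as separate objects; it assumes each type occurs at most once, which holds for this record.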
     

    HOW TO GET THIS DATA PROGRAMMATICALLY:

    JSON-LD is a popular format for linked data which is fully compatible with JSON.

    curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1023/a:1010933404324'

    N-Triples is a line-based linked data format ideal for batch operations.

    curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1023/a:1010933404324'

    Turtle is a human-readable linked data format.

    curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1023/a:1010933404324'

    RDF/XML is a standard XML format for linked data.

    curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1023/a:1010933404324'
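
The four curl commands above differ only in the `Accept` header. A rough Python equivalent, using only the standard library, might look like the sketch below. The URL and media types are copied from the commands above; the short format labels and the names `build_request` and `fetch` are illustrative assumptions, and whether the endpoint still responds is not verified here.

```python
import urllib.request

RECORD_URL = "https://scigraph.springernature.com/pub.10.1023/a:1010933404324"

# Media types taken from the curl commands above; the keys are illustrative labels.
ACCEPT = {
    "json-ld": "application/ld+json",
    "nt": "application/n-triples",
    "turtle": "text/turtle",
    "xml": "application/rdf+xml",
}


def build_request(fmt, url=RECORD_URL):
    """Build (but do not send) a GET request asking for one RDF serialization."""
    if fmt not in ACCEPT:
        raise ValueError(f"unknown format: {fmt!r}")
    return urllib.request.Request(url, headers={"Accept": ACCEPT[fmt]})


def fetch(fmt, url=RECORD_URL, timeout=30):
    """Send the request and return the response body as text (needs network access)."""
    with urllib.request.urlopen(build_request(fmt, url), timeout=timeout) as resp:
        return resp.read().decode("utf-8")
```

Calling `fetch("turtle")` would then correspond to the third curl command. Keeping request construction separate from sending makes the header logic easy to check without network access.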


     

    This table displays all metadata directly associated to this object as RDF triples.

    121 TRIPLES      22 PREDICATES      74 URIs      61 LITERALS      6 BLANK NODES

    Subject Predicate Object
    1 sg:pub.10.1023/a:1010933404324 schema:about anzsrc-for:08
    2 anzsrc-for:0801
    3 anzsrc-for:0806
    4 anzsrc-for:17
    5 anzsrc-for:1702
    6 schema:author N45c90a45201d473b9ade2d8f93c8ea17
    7 schema:citation sg:pub.10.1007/bf00058655
    8 sg:pub.10.1023/a:1007515423169
    9 schema:datePublished 2001-10
    10 schema:datePublishedReg 2001-10-01
    11 schema:description Random forests are a combination of tree predictors such that each tree depends on the values of a random vector sampled independently and with the same distribution for all trees in the forest. The generalization error for forests converges a.s. to a limit as the number of trees in the forest becomes large. The generalization error of a forest of tree classifiers depends on the strength of the individual trees in the forest and the correlation between them. Using a random selection of features to split each node yields error rates that compare favorably to Adaboost (Y. Freund & R. Schapire, Machine Learning: Proceedings of the Thirteenth International conference, ***, 148–156), but are more robust with respect to noise. Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the splitting. Internal estimates are also used to measure variable importance. These ideas are also applicable to regression.
    12 schema:genre article
    13 schema:inLanguage en
    14 schema:isAccessibleForFree true
    15 schema:isPartOf N0ebe6da314274b1e948910e19d9255cc
    16 N8a77ebb7056a41cc920aaee87e20b6f4
    17 sg:journal.1125588
    18 schema:keywords A.
    19 AdaBoost
    20 classifier
    21 combination
    22 converges a.
    23 correlation
    24 distribution
    25 error
    26 error rate
    27 estimates
    28 features
    29 forest
    30 forests converges a.
    31 generalization error
    32 idea
    33 importance
    34 individual trees
    35 internal estimate
    36 limit
    37 node yields error rates
    38 noise
    39 number
    40 number of features
    41 number of trees
    42 predictors
    43 random forest
    44 random selection
    45 random vectors
    46 rate
    47 regression
    48 respect
    49 response
    50 same distribution
    51 selection
    52 splitting
    53 strength
    54 tree classifier
    55 tree predictor
    56 trees
    57 values
    58 variable importance
    59 vector
    60 yields error rates
    61 schema:name Random Forests
    62 schema:pagination 5-32
    63 schema:productId Ne73e4970f8cb43a4a5706cfa62313068
    64 Nfd487a1fdf2e4a35ae5572c610538a96
    65 schema:sameAs https://app.dimensions.ai/details/publication/pub.1024739340
    66 https://doi.org/10.1023/a:1010933404324
    67 schema:sdDatePublished 2021-12-01T19:11
    68 schema:sdLicense https://scigraph.springernature.com/explorer/license/
    69 schema:sdPublisher N989c003062c84faca1428d5739506b97
    70 schema:url https://doi.org/10.1023/a:1010933404324
    71 sgo:license sg:explorer/license/
    72 sgo:sdDataset articles
    73 rdf:type schema:ScholarlyArticle
    74 N0ebe6da314274b1e948910e19d9255cc schema:volumeNumber 45
    75 rdf:type schema:PublicationVolume
    76 N45c90a45201d473b9ade2d8f93c8ea17 rdf:first sg:person.01275565034.02
    77 rdf:rest rdf:nil
    78 N8a77ebb7056a41cc920aaee87e20b6f4 schema:issueNumber 1
    79 rdf:type schema:PublicationIssue
    80 N989c003062c84faca1428d5739506b97 schema:name Springer Nature - SN SciGraph project
    81 rdf:type schema:Organization
    82 Ne73e4970f8cb43a4a5706cfa62313068 schema:name dimensions_id
    83 schema:value pub.1024739340
    84 rdf:type schema:PropertyValue
    85 Nfd487a1fdf2e4a35ae5572c610538a96 schema:name doi
    86 schema:value 10.1023/a:1010933404324
    87 rdf:type schema:PropertyValue
    88 anzsrc-for:08 schema:inDefinedTermSet anzsrc-for:
    89 schema:name Information and Computing Sciences
    90 rdf:type schema:DefinedTerm
    91 anzsrc-for:0801 schema:inDefinedTermSet anzsrc-for:
    92 schema:name Artificial Intelligence and Image Processing
    93 rdf:type schema:DefinedTerm
    94 anzsrc-for:0806 schema:inDefinedTermSet anzsrc-for:
    95 schema:name Information Systems
    96 rdf:type schema:DefinedTerm
    97 anzsrc-for:17 schema:inDefinedTermSet anzsrc-for:
    98 schema:name Psychology and Cognitive Sciences
    99 rdf:type schema:DefinedTerm
    100 anzsrc-for:1702 schema:inDefinedTermSet anzsrc-for:
    101 schema:name Cognitive Sciences
    102 rdf:type schema:DefinedTerm
    103 sg:journal.1125588 schema:issn 0885-6125
    104 1573-0565
    105 schema:name Machine Learning
    106 schema:publisher Springer Nature
    107 rdf:type schema:Periodical
    108 sg:person.01275565034.02 schema:affiliation grid-institutes:grid.47840.3f
    109 schema:familyName Breiman
    110 schema:givenName Leo
    111 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01275565034.02
    112 rdf:type schema:Person
    113 sg:pub.10.1007/bf00058655 schema:sameAs https://app.dimensions.ai/details/publication/pub.1002929950
    114 https://doi.org/10.1007/bf00058655
    115 rdf:type schema:CreativeWork
    116 sg:pub.10.1023/a:1007515423169 schema:sameAs https://app.dimensions.ai/details/publication/pub.1017116781
    117 https://doi.org/10.1023/a:1007515423169
    118 rdf:type schema:CreativeWork
    119 grid-institutes:grid.47840.3f schema:alternateName Statistics Department, University of California, 94720, Berkeley, CA
    120 schema:name Statistics Department, University of California, 94720, Berkeley, CA
    121 rdf:type schema:Organization
     



