A Caution Regarding Rules of Thumb for Variance Inflation Factors View Full Text


Ontology type: schema:ScholarlyArticle     


Article Info

DATE

2007-03-13

AUTHORS

Robert M. O’brien

ABSTRACT

The Variance Inflation Factor (VIF) and tolerance are both widely used measures of the degree of multi-collinearity of the ith independent variable with the other independent variables in a regression model. Unfortunately, several rules of thumb – most commonly the rule of 10 – associated with VIF are regarded by many practitioners as a sign of severe or serious multi-collinearity (this rule appears in both scholarly articles and advanced statistical textbooks). When VIF reaches these threshold values researchers often attempt to reduce the collinearity by eliminating one or more variables from their analysis; using Ridge Regression to analyze their data; or combining two or more independent variables into a single index. These techniques for curing problems associated with multi-collinearity can create problems more serious than those they solve. Because of this, we examine these rules of thumb and find that threshold values of the VIF (and tolerance) need to be evaluated in the context of several other factors that influence the variance of regression coefficients. Values of the VIF of 10, 20, 40, or even higher do not, by themselves, discount the results of regression analyses, call for the elimination of one or more independent variables from the analysis, suggest the use of ridge regression, or require combining of independent variable into a single index. More... »

PAGES

673-690

Identifiers

URI

http://scigraph.springernature.com/pub.10.1007/s11135-006-9018-6

DOI

http://dx.doi.org/10.1007/s11135-006-9018-6

DIMENSIONS

https://app.dimensions.ai/details/publication/pub.1005771510


Indexing Status Check whether this publication has been indexed by Scopus and Web Of Science using the SN Indexing Status Tool
Incoming Citations Browse incoming citations for this publication using opencitations.net

JSON-LD is the canonical representation for SciGraph data.

TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT

[
  {
    "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
    "about": [
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/01", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Mathematical Sciences", 
        "type": "DefinedTerm"
      }, 
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/16", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Studies in Human Society", 
        "type": "DefinedTerm"
      }, 
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/17", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Psychology and Cognitive Sciences", 
        "type": "DefinedTerm"
      }, 
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0104", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Statistics", 
        "type": "DefinedTerm"
      }, 
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/1608", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Sociology", 
        "type": "DefinedTerm"
      }, 
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/1701", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Psychology", 
        "type": "DefinedTerm"
      }
    ], 
    "author": [
      {
        "affiliation": {
          "alternateName": "Department of Sociology, University of Oregon, 97408, Eugene, OR, USA", 
          "id": "http://www.grid.ac/institutes/grid.170202.6", 
          "name": [
            "Department of Sociology, University of Oregon, 97408, Eugene, OR, USA"
          ], 
          "type": "Organization"
        }, 
        "familyName": "O\u2019brien", 
        "givenName": "Robert M.", 
        "id": "sg:person.01116167567.15", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01116167567.15"
        ], 
        "type": "Person"
      }
    ], 
    "datePublished": "2007-03-13", 
    "datePublishedReg": "2007-03-13", 
    "description": "The Variance Inflation Factor (VIF) and tolerance are both widely used measures of the degree of multi-collinearity of the ith independent variable with the other independent variables in a regression model. Unfortunately, several rules of thumb \u2013 most commonly the rule of 10 \u2013 associated with VIF are regarded by many practitioners as a sign of severe or serious multi-collinearity (this rule appears in both scholarly articles and advanced statistical textbooks). When VIF reaches these threshold values researchers often attempt to reduce the collinearity by eliminating one or more variables from their analysis; using Ridge Regression to analyze their data; or combining two or more independent variables into a single index. These techniques for curing problems associated with multi-collinearity can create problems more serious than those they solve. Because of this, we examine these rules of thumb and find that threshold values of the VIF (and tolerance) need to be evaluated in the context of several other factors that influence the variance of regression coefficients. Values of the VIF of 10, 20, 40, or even higher do not, by themselves, discount the results of regression analyses, call for the elimination of one or more independent variables from the analysis, suggest the use of ridge regression, or require combining of independent variable into a single index.", 
    "genre": "article", 
    "id": "sg:pub.10.1007/s11135-006-9018-6", 
    "isAccessibleForFree": false, 
    "isPartOf": [
      {
        "id": "sg:journal.1053888", 
        "issn": [
          "0033-5177", 
          "1573-7845"
        ], 
        "name": "Quality & Quantity", 
        "publisher": "Springer Nature", 
        "type": "Periodical"
      }, 
      {
        "issueNumber": "5", 
        "type": "PublicationIssue"
      }, 
      {
        "type": "PublicationVolume", 
        "volumeNumber": "41"
      }
    ], 
    "keywords": [
      "variance inflation factor", 
      "rule of thumb", 
      "inflation factor", 
      "single index", 
      "independent variables", 
      "ridge regression", 
      "more variables", 
      "variables", 
      "regression models", 
      "value Researchers", 
      "rules", 
      "regression coefficients", 
      "more independent variables", 
      "regression", 
      "regression analysis", 
      "thumb", 
      "index", 
      "measures", 
      "collinearity", 
      "analysis", 
      "model", 
      "threshold value", 
      "factors", 
      "values", 
      "context", 
      "problem", 
      "variance", 
      "researchers", 
      "data", 
      "degree", 
      "practitioners", 
      "caution", 
      "results", 
      "use", 
      "coefficient", 
      "signs", 
      "elimination", 
      "technique", 
      "tolerance"
    ], 
    "name": "A Caution Regarding Rules of Thumb for Variance Inflation Factors", 
    "pagination": "673-690", 
    "productId": [
      {
        "name": "dimensions_id", 
        "type": "PropertyValue", 
        "value": [
          "pub.1005771510"
        ]
      }, 
      {
        "name": "doi", 
        "type": "PropertyValue", 
        "value": [
          "10.1007/s11135-006-9018-6"
        ]
      }
    ], 
    "sameAs": [
      "https://doi.org/10.1007/s11135-006-9018-6", 
      "https://app.dimensions.ai/details/publication/pub.1005771510"
    ], 
    "sdDataset": "articles", 
    "sdDatePublished": "2022-08-04T16:57", 
    "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
    "sdPublisher": {
      "name": "Springer Nature - SN SciGraph project", 
      "type": "Organization"
    }, 
    "sdSource": "s3://com-springernature-scigraph/baseset/20220804/entities/gbq_results/article/article_447.jsonl", 
    "type": "ScholarlyArticle", 
    "url": "https://doi.org/10.1007/s11135-006-9018-6"
  }
]
 

Download the RDF metadata as:  json-ld nt turtle xml License info

HOW TO GET THIS DATA PROGRAMMATICALLY:

JSON-LD is a popular format for linked data which is fully compatible with JSON.

curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1007/s11135-006-9018-6'

N-Triples is a line-based linked data format ideal for batch operations.

curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1007/s11135-006-9018-6'

Turtle is a human-readable linked data format.

curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1007/s11135-006-9018-6'

RDF/XML is a standard XML format for linked data.

curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1007/s11135-006-9018-6'


 

This table displays all metadata directly associated to this object as RDF triples.

112 TRIPLES      20 PREDICATES      67 URIs      55 LITERALS      6 BLANK NODES

Subject Predicate Object
1 sg:pub.10.1007/s11135-006-9018-6 schema:about anzsrc-for:01
2 anzsrc-for:0104
3 anzsrc-for:16
4 anzsrc-for:1608
5 anzsrc-for:17
6 anzsrc-for:1701
7 schema:author Nc9124ba0b6da41e5b78d40be54cc0836
8 schema:datePublished 2007-03-13
9 schema:datePublishedReg 2007-03-13
10 schema:description The Variance Inflation Factor (VIF) and tolerance are both widely used measures of the degree of multi-collinearity of the ith independent variable with the other independent variables in a regression model. Unfortunately, several rules of thumb – most commonly the rule of 10 – associated with VIF are regarded by many practitioners as a sign of severe or serious multi-collinearity (this rule appears in both scholarly articles and advanced statistical textbooks). When VIF reaches these threshold values researchers often attempt to reduce the collinearity by eliminating one or more variables from their analysis; using Ridge Regression to analyze their data; or combining two or more independent variables into a single index. These techniques for curing problems associated with multi-collinearity can create problems more serious than those they solve. Because of this, we examine these rules of thumb and find that threshold values of the VIF (and tolerance) need to be evaluated in the context of several other factors that influence the variance of regression coefficients. Values of the VIF of 10, 20, 40, or even higher do not, by themselves, discount the results of regression analyses, call for the elimination of one or more independent variables from the analysis, suggest the use of ridge regression, or require combining of independent variable into a single index.
11 schema:genre article
12 schema:isAccessibleForFree false
13 schema:isPartOf N14fe3161c44c45d09bf82d66cceca874
14 N82eed6c7b5504f20916cfab5effc40f7
15 sg:journal.1053888
16 schema:keywords analysis
17 caution
18 coefficient
19 collinearity
20 context
21 data
22 degree
23 elimination
24 factors
25 independent variables
26 index
27 inflation factor
28 measures
29 model
30 more independent variables
31 more variables
32 practitioners
33 problem
34 regression
35 regression analysis
36 regression coefficients
37 regression models
38 researchers
39 results
40 ridge regression
41 rule of thumb
42 rules
43 signs
44 single index
45 technique
46 threshold value
47 thumb
48 tolerance
49 use
50 value Researchers
51 values
52 variables
53 variance
54 variance inflation factor
55 schema:name A Caution Regarding Rules of Thumb for Variance Inflation Factors
56 schema:pagination 673-690
57 schema:productId N42bcb5baead94fff98e0ba81405765e4
58 N7a8d3220cdbc42aaa5a97ad4c46da660
59 schema:sameAs https://app.dimensions.ai/details/publication/pub.1005771510
60 https://doi.org/10.1007/s11135-006-9018-6
61 schema:sdDatePublished 2022-08-04T16:57
62 schema:sdLicense https://scigraph.springernature.com/explorer/license/
63 schema:sdPublisher N4deb544277a9445a886e24fb394436c8
64 schema:url https://doi.org/10.1007/s11135-006-9018-6
65 sgo:license sg:explorer/license/
66 sgo:sdDataset articles
67 rdf:type schema:ScholarlyArticle
68 N14fe3161c44c45d09bf82d66cceca874 schema:issueNumber 5
69 rdf:type schema:PublicationIssue
70 N42bcb5baead94fff98e0ba81405765e4 schema:name doi
71 schema:value 10.1007/s11135-006-9018-6
72 rdf:type schema:PropertyValue
73 N4deb544277a9445a886e24fb394436c8 schema:name Springer Nature - SN SciGraph project
74 rdf:type schema:Organization
75 N7a8d3220cdbc42aaa5a97ad4c46da660 schema:name dimensions_id
76 schema:value pub.1005771510
77 rdf:type schema:PropertyValue
78 N82eed6c7b5504f20916cfab5effc40f7 schema:volumeNumber 41
79 rdf:type schema:PublicationVolume
80 Nc9124ba0b6da41e5b78d40be54cc0836 rdf:first sg:person.01116167567.15
81 rdf:rest rdf:nil
82 anzsrc-for:01 schema:inDefinedTermSet anzsrc-for:
83 schema:name Mathematical Sciences
84 rdf:type schema:DefinedTerm
85 anzsrc-for:0104 schema:inDefinedTermSet anzsrc-for:
86 schema:name Statistics
87 rdf:type schema:DefinedTerm
88 anzsrc-for:16 schema:inDefinedTermSet anzsrc-for:
89 schema:name Studies in Human Society
90 rdf:type schema:DefinedTerm
91 anzsrc-for:1608 schema:inDefinedTermSet anzsrc-for:
92 schema:name Sociology
93 rdf:type schema:DefinedTerm
94 anzsrc-for:17 schema:inDefinedTermSet anzsrc-for:
95 schema:name Psychology and Cognitive Sciences
96 rdf:type schema:DefinedTerm
97 anzsrc-for:1701 schema:inDefinedTermSet anzsrc-for:
98 schema:name Psychology
99 rdf:type schema:DefinedTerm
100 sg:journal.1053888 schema:issn 0033-5177
101 1573-7845
102 schema:name Quality & Quantity
103 schema:publisher Springer Nature
104 rdf:type schema:Periodical
105 sg:person.01116167567.15 schema:affiliation grid-institutes:grid.170202.6
106 schema:familyName O’brien
107 schema:givenName Robert M.
108 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01116167567.15
109 rdf:type schema:Person
110 grid-institutes:grid.170202.6 schema:alternateName Department of Sociology, University of Oregon, 97408, Eugene, OR, USA
111 schema:name Department of Sociology, University of Oregon, 97408, Eugene, OR, USA
112 rdf:type schema:Organization
 




Preview window. Press ESC to close (or click here)


...