Free R value: a novel statistical quantity for assessing the accuracy of crystal structures View Full Text


Ontology type: schema:ScholarlyArticle     


Article Info

DATE

1992-01

AUTHORS

Axel T. Brünger

ABSTRACT

THE determination of macromolecular structure by crystallography involves fitting atomic models to the observed diffraction data1. The traditional measure of the quality of this fit, and presumably the accuracy of the model, is theR value. Despite stereochemical restraints2, it is possible to overfit or 'misfit' the diffraction data: an incorrect model can be refined to fairly good R values as several recent examples have shown3. Here I propose a reliable and unbiased indicator of the accuracy of such models. By analogy with the cross-validation method4,5 of testing statistical models I define a statistical quantity (RfreeT) that measures the agreement between observed and computed structure factor amplitudes for a 'test' set of reflections that is omitted in the modelling and refinement process. As examples show, there is a high correlation between RfreeT and the accuracy of the atomic model phases. This is useful because experimental phase information is usually inaccurate, incomplete or unavailable. I expect that RfreeT will provide a measure of the information content of recently proposed models of thermal motion and disorder6–8, time-averaging9 and bulk solvent10. More... »

PAGES

472-475

Journal

TITLE

Nature

ISSUE

6359

VOLUME

355

Related Patents

  • Stereoselective Synthesis Of 9-Cis.13,14-Dihydroretinoic Acid And Its Ethyl Esters
  • Engineered Multifunctional Enzymes And Methods Of Use
  • Crystal Structure Of Hepatocyte Growth Factor Activator Complexed With Kunitz Domain Inhibitor
  • Active-Site Engineering Of Nucleotidylyltransferases And General Enzymatic Methods For The Synthesis Of Natural And “Unnatural” Udp- And Tdp-Nucleotide Sugars
  • Crystals Of Angiotensin-Converting Enzyme (Ace)
  • Methods And Processes For Non-Invasive Assessment Of Genetic Variations
  • Compounds Modulating C-Kit Activity
  • Inhibitors Of Protein Tyrosine Phosphatases
  • Circularly Permuted Biotin Binding Proteins
  • Glucoamylase Variants With Altered Properties
  • Interleukin-13 Binding Proteins
  • Interleukin-13 Binding Proteins
  • Interleukin-13 Binding Proteins
  • Methods And Processes For Non-Invasive Assessment Of Genetic Variations
  • Methods And Processes For Non-Invasive Assessment Of Genetic Variations
  • Methods And Processes For Non-Invasive Assessment Of Genetic Variations
  • Compounds And Methods For Kinase Modulation, And Indications Therefor
  • Compounds And Methods For Kinase Modulation, And Indications Therefor
  • Compounds And Methods For Kinase Modulation, And Indications Therefor
  • Compositions And Uses Thereof
  • De Novo Designed Hemagglutinin Binding Proteins
  • Crystal Structure Of An Angiotensin-Converting Enzyme (Ace) And Uses Thereof
  • Peptide Mimetic Ligands Of Polo-Like Kinase 1 Polo Box Domain And Methods Of Use
  • Methods And Processes For Non-Invasive Assessment Of Genetic Variations
  • Glucoamylase Variants With Altered Properties
  • Compounds Modulating C-Fms And/Or C-Kit Activity And Uses Therefor
  • Engineered Multifunctional Enzymes And Methods Of Use
  • Compounds Modulating C-Fms And/Or C-Kit Activity And Uses Therefor
  • Compounds Modulating C-Fms And/Or C-Kit Activity
  • Glucoamylase Variants
  • Compounds And Methods For Kinase Modulation, And Indications Therefor
  • Compounds And Methods For Kinase Modulation, And Indications Therefor
  • Compounds And Methods For Kinase Modulation, And Indications Therefor
  • Pde4b Inhibitors And Uses Therefor
  • Synthesis Of Heterocyclic Compounds
  • Methods And Processes For Non-Invasive Assessment Of Genetic Variations
  • Compounds And Methods For Kinase Modulation, And Indications Therefor
  • Compounds And Methods For Kinase Modulation, And Indications Therefor
  • Glucoamylase Variants With Altered Properties
  • Molecular Sleds Comprising A Positively-Charged Amino Acid Sequence And A Molecular Cargo And Uses Thereof
  • Compounds And Methods For Development Of Ret Modulators
  • Pde4b Inhibitors
  • Compounds Modulating C-Kit Activity
  • Compounds Modulating C-Fms And/Or C-Kit Activity And Uses Therefor
  • Compounds Modulating C-Fms And/Or C-Kit Activity And Uses Therefor
  • Compounds And Methods For Development Of Ret Modulators
  • Method Of Identifying Inhibitors Of Lck
  • Compounds Modulating C-Kit And C-Fms Activity And Uses Therefor
  • Mutein Of A Bone Morphogenetic Protein And Use Thereof
  • Crystal Structure Of The N-Terminal Domain Of Angiotensin Converting Enzyme And Uses Thereof
  • Methods And Compositions For Analyzing Nucleic Acid
  • Compositions And Uses Thereof
  • Crystal Structure Of An Angiotensin-Converting Enzyme (Ace) And Uses Thereof
  • Method Of Changing Conformation Of A Matrix Metalloproteinase
  • Methods And Processes For Non-Invasive Assessment Of Genetic Variations
  • Crystal
  • Crystallization And Structure Determination Of Staphylococcus Aureus Udp-N-Acetylenolpyruvylglucosamine Reductase (S. Aureus Murb)
  • Synthesis Of Heterocyclic Compounds
  • Methods And Processes For Non-Invasive Assessment Of Genetic Variations
  • Reducing Sequence Read Count Error In Assessment Of Complex Genetic Variations
  • Pde4b Inhibitors And Uses Therefor
  • Methods And Processes For Non-Invasive Assessment Of Genetic Variations
  • Methods And Processes For Non-Invasive Assessment Of Genetic Variations
  • Human Anti Tshr Antibodies
  • Human Anti Tshr Antibodies
  • Compounds And Methods For Kinase Modulation, And Indications Therefor
  • Methods And Processes For Non-Invasive Assessment Of Genetic Variations
  • Compounds And Methods For Kinase Modulation, And Indications Therefor
  • Compounds Modulating C-Fms And/Or C-Kit Activity And Uses Therefor
  • Glucoamylase Variants
  • Crystals Of The Tyrosine Kinase Domain Of Non-Insulin Receptor Tyrosine Kinases
  • Solid Forms Of A Pharmaceutically Active Substance
  • Compounds And Methods For Kinase Modulation, And Indications Therefor
  • Identifiers

    URI

    http://scigraph.springernature.com/pub.10.1038/355472a0

    DOI

    http://dx.doi.org/10.1038/355472a0

    DIMENSIONS

    https://app.dimensions.ai/details/publication/pub.1048571716

    PUBMED

    https://www.ncbi.nlm.nih.gov/pubmed/18481394


    Indexing Status Check whether this publication has been indexed by Scopus and Web Of Science using the SN Indexing Status Tool
    Incoming Citations Browse incoming citations for this publication using opencitations.net

    JSON-LD is the canonical representation for SciGraph data.

    TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT

    [
      {
        "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
        "about": [
          {
            "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/01", 
            "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
            "name": "Mathematical Sciences", 
            "type": "DefinedTerm"
          }, 
          {
            "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0104", 
            "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
            "name": "Statistics", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Crystallography, X-Ray", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Models, Statistical", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Molecular Structure", 
            "type": "DefinedTerm"
          }, 
          {
            "inDefinedTermSet": "https://www.nlm.nih.gov/mesh/", 
            "name": "Scattering, Radiation", 
            "type": "DefinedTerm"
          }
        ], 
        "author": [
          {
            "affiliation": {
              "alternateName": "The Howard Hughes Medical Institute and Department of Molecular Biophysics and Biochemistry, Yale University, 06511, New Haven, USA", 
              "id": "http://www.grid.ac/institutes/grid.47100.32", 
              "name": [
                "The Howard Hughes Medical Institute and Department of Molecular Biophysics and Biochemistry, Yale University, 06511, New Haven, USA"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Br\u00fcnger", 
            "givenName": "Axel T.", 
            "id": "sg:person.0740476750.50", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0740476750.50"
            ], 
            "type": "Person"
          }
        ], 
        "citation": [
          {
            "id": "sg:pub.10.1038/343687a0", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1035234503", 
              "https://doi.org/10.1038/343687a0"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1038/266140a0", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1044663807", 
              "https://doi.org/10.1038/266140a0"
            ], 
            "type": "CreativeWork"
          }
        ], 
        "datePublished": "1992-01", 
        "datePublishedReg": "1992-01-01", 
        "description": "THE determination of macromolecular structure by crystallography involves fitting atomic models to the observed diffraction data1. The traditional measure of the quality of this fit, and presumably the accuracy of the model, is theR value. Despite stereochemical restraints2, it is possible to overfit or 'misfit' the diffraction data: an incorrect model can be refined to fairly good R values as several recent examples have shown3. Here I propose a reliable and unbiased indicator of the accuracy of such models. By analogy with the cross-validation method4,5 of testing statistical models I define a statistical quantity (RfreeT) that measures the agreement between observed and computed structure factor amplitudes for a 'test' set of reflections that is omitted in the modelling and refinement process. As examples show, there is a high correlation between RfreeT and the accuracy of the atomic model phases. This is useful because experimental phase information is usually inaccurate, incomplete or unavailable. I expect that RfreeT will provide a measure of the information content of recently proposed models of thermal motion and disorder6\u20138, time-averaging9 and bulk solvent10.", 
        "genre": "article", 
        "id": "sg:pub.10.1038/355472a0", 
        "inLanguage": "en", 
        "isAccessibleForFree": false, 
        "isPartOf": [
          {
            "id": "sg:journal.1018957", 
            "issn": [
              "0028-0836", 
              "1476-4687"
            ], 
            "name": "Nature", 
            "publisher": "Springer Nature", 
            "type": "Periodical"
          }, 
          {
            "issueNumber": "6359", 
            "type": "PublicationIssue"
          }, 
          {
            "type": "PublicationVolume", 
            "volumeNumber": "355"
          }
        ], 
        "keywords": [
          "statistical quantities", 
          "novel statistical quantity", 
          "incorrect model", 
          "structure-factor amplitudes", 
          "such models", 
          "experimental phase information", 
          "thermal motion", 
          "model I", 
          "atomic model", 
          "factor amplitudes", 
          "accuracy", 
          "model", 
          "diffraction data", 
          "phase information", 
          "motion", 
          "set of reflections", 
          "theR value", 
          "model phases", 
          "information content", 
          "modelling", 
          "quantity", 
          "shown3", 
          "analogy", 
          "fit", 
          "structure", 
          "misfit", 
          "set", 
          "amplitude", 
          "data1", 
          "refinement process", 
          "recent examples", 
          "values", 
          "agreement", 
          "crystal structure", 
          "macromolecular structure", 
          "reflection", 
          "measures", 
          "phase", 
          "information", 
          "data", 
          "process", 
          "determination", 
          "correlation", 
          "high correlation", 
          "traditional measures", 
          "unbiased indicator", 
          "quality", 
          "test", 
          "crystallography", 
          "indicators", 
          "example", 
          "content", 
          "observed diffraction data1", 
          "diffraction data1", 
          "stereochemical restraints2", 
          "restraints2", 
          "statistical models I", 
          "RfreeT", 
          "atomic model phases", 
          "disorder6\u20138", 
          "bulk solvent10", 
          "solvent10"
        ], 
        "name": "Free R value: a novel statistical quantity for assessing the accuracy of crystal structures", 
        "pagination": "472-475", 
        "productId": [
          {
            "name": "dimensions_id", 
            "type": "PropertyValue", 
            "value": [
              "pub.1048571716"
            ]
          }, 
          {
            "name": "doi", 
            "type": "PropertyValue", 
            "value": [
              "10.1038/355472a0"
            ]
          }, 
          {
            "name": "pubmed_id", 
            "type": "PropertyValue", 
            "value": [
              "18481394"
            ]
          }
        ], 
        "sameAs": [
          "https://doi.org/10.1038/355472a0", 
          "https://app.dimensions.ai/details/publication/pub.1048571716"
        ], 
        "sdDataset": "articles", 
        "sdDatePublished": "2022-01-01T18:05", 
        "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
        "sdPublisher": {
          "name": "Springer Nature - SN SciGraph project", 
          "type": "Organization"
        }, 
        "sdSource": "s3://com-springernature-scigraph/baseset/20220101/entities/gbq_results/article/article_221.jsonl", 
        "type": "ScholarlyArticle", 
        "url": "https://doi.org/10.1038/355472a0"
      }
    ]
     

    Download the RDF metadata as:  json-ld nt turtle xml License info

    HOW TO GET THIS DATA PROGRAMMATICALLY:

    JSON-LD is a popular format for linked data which is fully compatible with JSON.

    curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1038/355472a0'

    N-Triples is a line-based linked data format ideal for batch operations.

    curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1038/355472a0'

    Turtle is a human-readable linked data format.

    curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1038/355472a0'

    RDF/XML is a standard XML format for linked data.

    curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1038/355472a0'


     

    This table displays all metadata directly associated to this object as RDF triples.

    148 TRIPLES      22 PREDICATES      95 URIs      85 LITERALS      11 BLANK NODES

    Subject Predicate Object
    1 sg:pub.10.1038/355472a0 schema:about N7d1ceed7e7ab400086d97adc987ee3ed
    2 Na7c25400e6da46369835858105d54eeb
    3 Nd5d11cfba9984015b6755b7b49437658
    4 Nf6f20481f7e04ba98a22a29e73c90c97
    5 anzsrc-for:01
    6 anzsrc-for:0104
    7 schema:author N89e7bef16d1e4f698d656456a78a2b30
    8 schema:citation sg:pub.10.1038/266140a0
    9 sg:pub.10.1038/343687a0
    10 schema:datePublished 1992-01
    11 schema:datePublishedReg 1992-01-01
    12 schema:description THE determination of macromolecular structure by crystallography involves fitting atomic models to the observed diffraction data1. The traditional measure of the quality of this fit, and presumably the accuracy of the model, is theR value. Despite stereochemical restraints2, it is possible to overfit or 'misfit' the diffraction data: an incorrect model can be refined to fairly good R values as several recent examples have shown3. Here I propose a reliable and unbiased indicator of the accuracy of such models. By analogy with the cross-validation method4,5 of testing statistical models I define a statistical quantity (RfreeT) that measures the agreement between observed and computed structure factor amplitudes for a 'test' set of reflections that is omitted in the modelling and refinement process. As examples show, there is a high correlation between RfreeT and the accuracy of the atomic model phases. This is useful because experimental phase information is usually inaccurate, incomplete or unavailable. I expect that RfreeT will provide a measure of the information content of recently proposed models of thermal motion and disorder6–8, time-averaging9 and bulk solvent10.
    13 schema:genre article
    14 schema:inLanguage en
    15 schema:isAccessibleForFree false
    16 schema:isPartOf N63b4d2b47c974703a0df12d444ef743a
    17 Nb0f86e72f2c64a62958faeffc9893583
    18 sg:journal.1018957
    19 schema:keywords RfreeT
    20 accuracy
    21 agreement
    22 amplitude
    23 analogy
    24 atomic model
    25 atomic model phases
    26 bulk solvent10
    27 content
    28 correlation
    29 crystal structure
    30 crystallography
    31 data
    32 data1
    33 determination
    34 diffraction data
    35 diffraction data1
    36 disorder6–8
    37 example
    38 experimental phase information
    39 factor amplitudes
    40 fit
    41 high correlation
    42 incorrect model
    43 indicators
    44 information
    45 information content
    46 macromolecular structure
    47 measures
    48 misfit
    49 model
    50 model I
    51 model phases
    52 modelling
    53 motion
    54 novel statistical quantity
    55 observed diffraction data1
    56 phase
    57 phase information
    58 process
    59 quality
    60 quantity
    61 recent examples
    62 refinement process
    63 reflection
    64 restraints2
    65 set
    66 set of reflections
    67 shown3
    68 solvent10
    69 statistical models I
    70 statistical quantities
    71 stereochemical restraints2
    72 structure
    73 structure-factor amplitudes
    74 such models
    75 test
    76 theR value
    77 thermal motion
    78 traditional measures
    79 unbiased indicator
    80 values
    81 schema:name Free R value: a novel statistical quantity for assessing the accuracy of crystal structures
    82 schema:pagination 472-475
    83 schema:productId N08314ea9a30b476cbd93d2c6b126764f
    84 N27af302239cb4d50a2af78970c33a06d
    85 Nf68df671ea584aed996149e7b6f90f32
    86 schema:sameAs https://app.dimensions.ai/details/publication/pub.1048571716
    87 https://doi.org/10.1038/355472a0
    88 schema:sdDatePublished 2022-01-01T18:05
    89 schema:sdLicense https://scigraph.springernature.com/explorer/license/
    90 schema:sdPublisher Nac53f2323e494cc6b2ee1768a1746947
    91 schema:url https://doi.org/10.1038/355472a0
    92 sgo:license sg:explorer/license/
    93 sgo:sdDataset articles
    94 rdf:type schema:ScholarlyArticle
    95 N08314ea9a30b476cbd93d2c6b126764f schema:name doi
    96 schema:value 10.1038/355472a0
    97 rdf:type schema:PropertyValue
    98 N27af302239cb4d50a2af78970c33a06d schema:name pubmed_id
    99 schema:value 18481394
    100 rdf:type schema:PropertyValue
    101 N63b4d2b47c974703a0df12d444ef743a schema:volumeNumber 355
    102 rdf:type schema:PublicationVolume
    103 N7d1ceed7e7ab400086d97adc987ee3ed schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    104 schema:name Models, Statistical
    105 rdf:type schema:DefinedTerm
    106 N89e7bef16d1e4f698d656456a78a2b30 rdf:first sg:person.0740476750.50
    107 rdf:rest rdf:nil
    108 Na7c25400e6da46369835858105d54eeb schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    109 schema:name Scattering, Radiation
    110 rdf:type schema:DefinedTerm
    111 Nac53f2323e494cc6b2ee1768a1746947 schema:name Springer Nature - SN SciGraph project
    112 rdf:type schema:Organization
    113 Nb0f86e72f2c64a62958faeffc9893583 schema:issueNumber 6359
    114 rdf:type schema:PublicationIssue
    115 Nd5d11cfba9984015b6755b7b49437658 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    116 schema:name Molecular Structure
    117 rdf:type schema:DefinedTerm
    118 Nf68df671ea584aed996149e7b6f90f32 schema:name dimensions_id
    119 schema:value pub.1048571716
    120 rdf:type schema:PropertyValue
    121 Nf6f20481f7e04ba98a22a29e73c90c97 schema:inDefinedTermSet https://www.nlm.nih.gov/mesh/
    122 schema:name Crystallography, X-Ray
    123 rdf:type schema:DefinedTerm
    124 anzsrc-for:01 schema:inDefinedTermSet anzsrc-for:
    125 schema:name Mathematical Sciences
    126 rdf:type schema:DefinedTerm
    127 anzsrc-for:0104 schema:inDefinedTermSet anzsrc-for:
    128 schema:name Statistics
    129 rdf:type schema:DefinedTerm
    130 sg:journal.1018957 schema:issn 0028-0836
    131 1476-4687
    132 schema:name Nature
    133 schema:publisher Springer Nature
    134 rdf:type schema:Periodical
    135 sg:person.0740476750.50 schema:affiliation grid-institutes:grid.47100.32
    136 schema:familyName Brünger
    137 schema:givenName Axel T.
    138 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0740476750.50
    139 rdf:type schema:Person
    140 sg:pub.10.1038/266140a0 schema:sameAs https://app.dimensions.ai/details/publication/pub.1044663807
    141 https://doi.org/10.1038/266140a0
    142 rdf:type schema:CreativeWork
    143 sg:pub.10.1038/343687a0 schema:sameAs https://app.dimensions.ai/details/publication/pub.1035234503
    144 https://doi.org/10.1038/343687a0
    145 rdf:type schema:CreativeWork
    146 grid-institutes:grid.47100.32 schema:alternateName The Howard Hughes Medical Institute and Department of Molecular Biophysics and Biochemistry, Yale University, 06511, New Haven, USA
    147 schema:name The Howard Hughes Medical Institute and Department of Molecular Biophysics and Biochemistry, Yale University, 06511, New Haven, USA
    148 rdf:type schema:Organization
     




    Preview window. Press ESC to close (or click here)


    ...