Latent Argumentative Pruning for Compact MEDLINE Indexing


Ontology type: schema:Chapter     


Chapter Info

DATE

2005

AUTHORS

Patrick Ruch , Robert Baud , Johann Marty , Antoine Geissbühler , Imad Tbahriti , Anne-Lise Veuthey

ABSTRACT

PURPOSE: We evaluate how argumentation in scientific articles can be used to propose an original index pruning strategy, which significantly reduces the size of the engine’s indexes while having a limited impact on retrieval effectiveness. METHODS: A Bayesian classifier trained on explicitly structured MEDLINE abstracts generates the argumentative categories. The categories are used to generate four different argumentative indexes. A fifth index contains the complete abstract, together with the title and the list of Medical Subject Headings (MeSH) terms. This last index is used as a baseline to compare results obtained when only a specific argumentative index is queried. RESULTS and CONCLUSION: When titles and Medical Subject Headings are also stored in the respective indexes, querying the PURPOSE and CONCLUSION indexes achieves 78.4% and 74.3% of the baseline, respectively, while the size of the index is divided by two. It is concluded that argumentation can be a powerful index pruning strategy, complementing more traditional approaches.
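
The METHODS step can be illustrated with a small sketch. It is not the authors' implementation: the tokenizer, the add-one smoothing, the toy training sentences, and the names `NaiveBayes` and `pruned_text` are all assumptions made for illustration. It trains a naive Bayes classifier on labelled sentences, then builds the text for a pruned index that keeps only sentences of one argumentative category together with the title and MeSH terms.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase a sentence and split on whitespace (illustrative only)."""
    return text.lower().split()


class NaiveBayes:
    """Multinomial naive Bayes with add-one smoothing (an assumed variant)."""

    def __init__(self):
        self.word_counts = defaultdict(Counter)  # category -> word -> count
        self.doc_counts = Counter()              # category -> sentence count
        self.vocab = set()

    def train(self, labelled_sentences):
        for category, sentence in labelled_sentences:
            words = tokenize(sentence)
            self.word_counts[category].update(words)
            self.doc_counts[category] += 1
            self.vocab.update(words)

    def classify(self, sentence):
        total = sum(self.doc_counts.values())
        best, best_score = None, -math.inf
        for category in self.doc_counts:
            # Log prior plus smoothed log likelihood of each token.
            score = math.log(self.doc_counts[category] / total)
            denom = sum(self.word_counts[category].values()) + len(self.vocab)
            for word in tokenize(sentence):
                score += math.log((self.word_counts[category][word] + 1) / denom)
            if score > best_score:
                best, best_score = category, score
        return best


def pruned_text(title, mesh_terms, sentences, classifier, keep):
    """Keep only sentences classified as `keep`, plus title and MeSH terms."""
    kept = [s for s in sentences if classifier.classify(s) == keep]
    return " ".join([title, *mesh_terms, *kept])


# Toy training data standing in for explicitly structured abstracts.
training = [
    ("PURPOSE", "we evaluate whether the drug reduces pain"),
    ("METHODS", "patients were randomly assigned to two groups"),
    ("RESULTS", "pain scores decreased by thirty percent"),
    ("CONCLUSION", "we conclude that the drug is effective"),
]
clf = NaiveBayes()
clf.train(training)

abstract = [
    "we evaluate whether exercise reduces fatigue",
    "patients were assigned to exercise groups",
    "we conclude that exercise is effective",
]
text = pruned_text("Exercise and fatigue", ["Exercise", "Fatigue"],
                   abstract, clf, keep="PURPOSE")
print(text)
```

Only the text returned by `pruned_text` would be handed to the indexer, which is where the reduction in index size would come from in this sketch.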

PAGES

246-250

References to SciGraph publications

  • 2004. Report on CLEF-2003 Monolingual Tracks: Fusion of Probabilistic Models for Effective Monolingual Retrieval in COMPARATIVE EVALUATION OF MULTILINGUAL INFORMATION ACCESS SYSTEMS
  • Book

    TITLE

    Artificial Intelligence in Medicine

    ISBN

    978-3-540-27831-3
    978-3-540-31884-2

    Identifiers

    URI

    http://scigraph.springernature.com/pub.10.1007/11527770_36

    DOI

    http://dx.doi.org/10.1007/11527770_36

    DIMENSIONS

    https://app.dimensions.ai/details/publication/pub.1048777026


    JSON-LD is the canonical representation for SciGraph data.

    [
      {
        "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
        "about": [
          {
            "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/1403", 
            "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
            "name": "Econometrics", 
            "type": "DefinedTerm"
          }, 
          {
            "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/14", 
            "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
            "name": "Economics", 
            "type": "DefinedTerm"
          }
        ], 
        "author": [
          {
            "affiliation": {
              "alternateName": "Swiss Institute of Bioinformatics", 
              "id": "https://www.grid.ac/institutes/grid.419765.8", 
              "name": [
                "Medical Informatics Service and Swiss-Prot Group, University Hospital of Geneva and Swiss Institute of Bioinformatics, 1205, Geneva, Switzerland"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Ruch", 
            "givenName": "Patrick", 
            "id": "sg:person.01060361377.20", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01060361377.20"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "Swiss Institute of Bioinformatics", 
              "id": "https://www.grid.ac/institutes/grid.419765.8", 
              "name": [
                "Medical Informatics Service and Swiss-Prot Group, University Hospital of Geneva and Swiss Institute of Bioinformatics, 1205, Geneva, Switzerland"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Baud", 
            "givenName": "Robert", 
            "id": "sg:person.01065216455.04", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01065216455.04"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "Swiss Institute of Bioinformatics", 
              "id": "https://www.grid.ac/institutes/grid.419765.8", 
              "name": [
                "Medical Informatics Service and Swiss-Prot Group, University Hospital of Geneva and Swiss Institute of Bioinformatics, 1205, Geneva, Switzerland"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Marty", 
            "givenName": "Johann", 
            "id": "sg:person.01366505505.13", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01366505505.13"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "Swiss Institute of Bioinformatics", 
              "id": "https://www.grid.ac/institutes/grid.419765.8", 
              "name": [
                "Medical Informatics Service and Swiss-Prot Group, University Hospital of Geneva and Swiss Institute of Bioinformatics, 1205, Geneva, Switzerland"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Geissb\u00fchler", 
            "givenName": "Antoine", 
            "id": "sg:person.0600360343.20", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0600360343.20"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "Swiss Institute of Bioinformatics", 
              "id": "https://www.grid.ac/institutes/grid.419765.8", 
              "name": [
                "Medical Informatics Service and Swiss-Prot Group, University Hospital of Geneva and Swiss Institute of Bioinformatics, 1205, Geneva, Switzerland"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Tbahriti", 
            "givenName": "Imad", 
            "id": "sg:person.0616733145.15", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0616733145.15"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "Swiss Institute of Bioinformatics", 
              "id": "https://www.grid.ac/institutes/grid.419765.8", 
              "name": [
                "Medical Informatics Service and Swiss-Prot Group, University Hospital of Geneva and Swiss Institute of Bioinformatics, 1205, Geneva, Switzerland"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Veuthey", 
            "givenName": "Anne-Lise", 
            "id": "sg:person.01311163727.60", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01311163727.60"
            ], 
            "type": "Person"
          }
        ], 
        "citation": [
          {
            "id": "https://doi.org/10.1145/383952.383958", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1010688760"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.3115/1072228.1072337", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1021228232"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1007/978-3-540-30222-3_31", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1028546264", 
              "https://doi.org/10.1007/978-3-540-30222-3_31"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1145/243199.243206", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1032416718"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1016/j.ijmedinf.2003.09.004", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1039323169"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1093/bioinformatics/bth291", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1041160894"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1145/582415.582416", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1041903840"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1016/s1386-5056(02)00057-6", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1051808877"
            ], 
            "type": "CreativeWork"
          }
        ], 
        "datePublished": "2005", 
        "datePublishedReg": "2005-01-01", 
        "description": "PURPOSE: We evaluate how argumentation in scientific articles can be used to propose an original index pruning strategy, which significantly reduces the size of the engine\u2019s indexes while having a limited impact on retrieval effectiveness. METHODS: A Bayesian classifier trained on explicitly structured MEDLINE abstracts generates the argumentative categories. The categories are used to generate four different argumentative indexes. A fifth index contains the complete abstract, together with the title and the list of Medical Subject Headings (MeSH) terms. This last index is used as a baseline to compare results obtained when only a specific argumentative index is queried. RESULTS and CONCLUSION: When titles and Medical Subject Headings are also stored in the respective indexes, querying the PURPOSE and CONCLUSION indexes achieves 78.4% and 74.3% of the baseline, respectively, while the size of the index is divided by two. It is concluded that argumentation can be a powerful index pruning strategy, complementing more traditional approaches.", 
        "editor": [
          {
            "familyName": "Miksch", 
            "givenName": "Silvia", 
            "type": "Person"
          }, 
          {
            "familyName": "Hunter", 
            "givenName": "Jim", 
            "type": "Person"
          }, 
          {
            "familyName": "Keravnou", 
            "givenName": "Elpida T.", 
            "type": "Person"
          }
        ], 
        "genre": "chapter", 
        "id": "sg:pub.10.1007/11527770_36", 
        "inLanguage": [
          "en"
        ], 
        "isAccessibleForFree": false, 
        "isPartOf": {
          "isbn": [
            "978-3-540-27831-3", 
            "978-3-540-31884-2"
          ], 
          "name": "Artificial Intelligence in Medicine", 
          "type": "Book"
        }, 
        "name": "Latent Argumentative Pruning for Compact MEDLINE Indexing", 
        "pagination": "246-250", 
        "productId": [
          {
            "name": "dimensions_id", 
            "type": "PropertyValue", 
            "value": [
              "pub.1048777026"
            ]
          }, 
          {
            "name": "doi", 
            "type": "PropertyValue", 
            "value": [
              "10.1007/11527770_36"
            ]
          }, 
          {
            "name": "readcube_id", 
            "type": "PropertyValue", 
            "value": [
              "e85d3e7e8f2ea30e19be10f8849bcca2e45a5595d00ab97a738e41da6c58ee6e"
            ]
          }
        ], 
        "publisher": {
          "location": "Berlin, Heidelberg", 
          "name": "Springer Berlin Heidelberg", 
          "type": "Organisation"
        }, 
        "sameAs": [
          "https://doi.org/10.1007/11527770_36", 
          "https://app.dimensions.ai/details/publication/pub.1048777026"
        ], 
        "sdDataset": "chapters", 
        "sdDatePublished": "2019-04-16T08:32", 
        "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
        "sdPublisher": {
          "name": "Springer Nature - SN SciGraph project", 
          "type": "Organization"
        }, 
        "sdSource": "s3://com-uberresearch-data-dimensions-target-20181106-alternative/cleanup/v134/2549eaecd7973599484d7c17b260dba0a4ecb94b/merge/v9/a6c9fde33151104705d4d7ff012ea9563521a3ce/jats-lookup/v90/0000000364_0000000364/records_72859_00000000.jsonl", 
        "type": "Chapter", 
        "url": "https://link.springer.com/10.1007%2F11527770_36"
      }
    ]
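
    Because the record is plain JSON, it can be read with any JSON parser. The following is a minimal sketch that works on a trimmed, hand-copied excerpt of the record; the helper names `product_ids` and `author_names` are hypothetical and are not part of any SciGraph tooling.

```python
import json

# A trimmed excerpt of the record (assumption: only the fields used below).
record_json = """
[
  {
    "name": "Latent Argumentative Pruning for Compact MEDLINE Indexing",
    "pagination": "246-250",
    "author": [
      {"familyName": "Ruch", "givenName": "Patrick"},
      {"familyName": "Baud", "givenName": "Robert"}
    ],
    "productId": [
      {"name": "dimensions_id", "value": ["pub.1048777026"]},
      {"name": "doi", "value": ["10.1007/11527770_36"]}
    ]
  }
]
"""


def product_ids(record):
    """Map each productId name to its first value."""
    return {p["name"]: p["value"][0] for p in record.get("productId", [])}


def author_names(record):
    """Return 'Given Family' strings in the order the record lists them."""
    return [f'{a["givenName"]} {a["familyName"]}' for a in record.get("author", [])]


# The top level of the document is a one-element list.
record = json.loads(record_json)[0]
ids = product_ids(record)
authors = author_names(record)
print(ids["doi"], authors)
```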
     

    HOW TO GET THIS DATA PROGRAMMATICALLY:

    JSON-LD is a popular format for linked data which is fully compatible with JSON.

    curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1007/11527770_36'

    N-Triples is a line-based linked data format ideal for batch operations.

    curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1007/11527770_36'

    Turtle is a human-readable linked data format.

    curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1007/11527770_36'

    RDF/XML is a standard XML format for linked data.

    curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1007/11527770_36'
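
    The same content negotiation can be sketched in Python with the standard library. This assumes the endpoint behaves as the curl commands above suggest; the short format keys and the function names `build_request` and `fetch` are illustrative choices, and the example only builds a request without sending it.

```python
import urllib.request

BASE = "https://scigraph.springernature.com/"

# Accept header values taken from the curl commands above; the short keys
# on the left are illustrative labels.
MEDIA_TYPES = {
    "json-ld": "application/ld+json",
    "nt": "application/n-triples",
    "turtle": "text/turtle",
    "xml": "application/rdf+xml",
}


def build_request(record_id, fmt="json-ld"):
    """Build a GET request that asks for the given serialization."""
    return urllib.request.Request(
        BASE + record_id, headers={"Accept": MEDIA_TYPES[fmt]}
    )


def fetch(record_id, fmt="json-ld", timeout=30):
    """Send the request and return the response body as text."""
    with urllib.request.urlopen(build_request(record_id, fmt), timeout=timeout) as resp:
        return resp.read().decode("utf-8")


# Building the request needs no network access; call fetch() to send it.
req = build_request("pub.10.1007/11527770_36", "turtle")
print(req.full_url, req.get_header("Accept"))
```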


     

    This table displays all metadata directly associated with this object as RDF triples.

    135 TRIPLES      23 PREDICATES      35 URIs      20 LITERALS      8 BLANK NODES

    Subject Predicate Object
    1 sg:pub.10.1007/11527770_36 schema:about anzsrc-for:14
    2 anzsrc-for:1403
    3 schema:author N10900444c10a4a818f33019e78c36bb4
    4 schema:citation sg:pub.10.1007/978-3-540-30222-3_31
    5 https://doi.org/10.1016/j.ijmedinf.2003.09.004
    6 https://doi.org/10.1016/s1386-5056(02)00057-6
    7 https://doi.org/10.1093/bioinformatics/bth291
    8 https://doi.org/10.1145/243199.243206
    9 https://doi.org/10.1145/383952.383958
    10 https://doi.org/10.1145/582415.582416
    11 https://doi.org/10.3115/1072228.1072337
    12 schema:datePublished 2005
    13 schema:datePublishedReg 2005-01-01
    14 schema:description PURPOSE: We evaluate how argumentation in scientific articles can be used to propose an original index pruning strategy, which significantly reduces the size of the engine’s indexes while having a limited impact on retrieval effectiveness. METHODS: A Bayesian classifier trained on explicitly structured MEDLINE abstracts generates the argumentative categories. The categories are used to generate four different argumentative indexes. A fifth index contains the complete abstract, together with the title and the list of Medical Subject Headings (MeSH) terms. This last index is used as a baseline to compare results obtained when only a specific argumentative index is queried. RESULTS and CONCLUSION: When titles and Medical Subject Headings are also stored in the respective indexes, querying the PURPOSE and CONCLUSION indexes achieves 78.4% and 74.3% of the baseline, respectively, while the size of the index is divided by two. It is concluded that argumentation can be a powerful index pruning strategy, complementing more traditional approaches.
    15 schema:editor N03f683b53c19425bb20da4e158cbb704
    16 schema:genre chapter
    17 schema:inLanguage en
    18 schema:isAccessibleForFree false
    19 schema:isPartOf Nc221231d25c547c685c596f361f954b4
    20 schema:name Latent Argumentative Pruning for Compact MEDLINE Indexing
    21 schema:pagination 246-250
    22 schema:productId N0131cfc28b384678b85c1b6f20fd01a9
    23 N2ae13a7fa7d4486da810035c7cb53a3c
    24 N2caceccd776b47b29f73d4c94cabdc7c
    25 schema:publisher Nad44d8bfa9b84ce392ea981ae09369f5
    26 schema:sameAs https://app.dimensions.ai/details/publication/pub.1048777026
    27 https://doi.org/10.1007/11527770_36
    28 schema:sdDatePublished 2019-04-16T08:32
    29 schema:sdLicense https://scigraph.springernature.com/explorer/license/
    30 schema:sdPublisher Na75533c7a568453e96b6da233798a709
    31 schema:url https://link.springer.com/10.1007%2F11527770_36
    32 sgo:license sg:explorer/license/
    33 sgo:sdDataset chapters
    34 rdf:type schema:Chapter
    35 N0131cfc28b384678b85c1b6f20fd01a9 schema:name readcube_id
    36 schema:value e85d3e7e8f2ea30e19be10f8849bcca2e45a5595d00ab97a738e41da6c58ee6e
    37 rdf:type schema:PropertyValue
    38 N03f683b53c19425bb20da4e158cbb704 rdf:first N19f46346a39646cb9b285de5bce3b65c
    39 rdf:rest Na46fce3a528143abb69acd6a3cae1274
    40 N09e57e61630640d186bb1a3b635593e9 rdf:first sg:person.01311163727.60
    41 rdf:rest rdf:nil
    42 N10900444c10a4a818f33019e78c36bb4 rdf:first sg:person.01060361377.20
    43 rdf:rest N7235249f03374bcb9047522f5c35667c
    44 N13a1b8e29e5548708847d5cdf20b961d rdf:first sg:person.01366505505.13
    45 rdf:rest N2298642289234d58a869016a458ad683
    46 N19f46346a39646cb9b285de5bce3b65c schema:familyName Miksch
    47 schema:givenName Silvia
    48 rdf:type schema:Person
    49 N2298642289234d58a869016a458ad683 rdf:first sg:person.0600360343.20
    50 rdf:rest Nc364f5ffbf154b7baf15ca3e58163921
    51 N2577c897309f4a58959a877fa4218e8e schema:familyName Keravnou
    52 schema:givenName Elpida T.
    53 rdf:type schema:Person
    54 N2ae13a7fa7d4486da810035c7cb53a3c schema:name dimensions_id
    55 schema:value pub.1048777026
    56 rdf:type schema:PropertyValue
    57 N2caceccd776b47b29f73d4c94cabdc7c schema:name doi
    58 schema:value 10.1007/11527770_36
    59 rdf:type schema:PropertyValue
    60 N422f8de13c024306a43013e510ad5397 schema:familyName Hunter
    61 schema:givenName Jim
    62 rdf:type schema:Person
    63 N7235249f03374bcb9047522f5c35667c rdf:first sg:person.01065216455.04
    64 rdf:rest N13a1b8e29e5548708847d5cdf20b961d
    65 N9653a034c35e4325b09b692ea48673c9 rdf:first N2577c897309f4a58959a877fa4218e8e
    66 rdf:rest rdf:nil
    67 Na46fce3a528143abb69acd6a3cae1274 rdf:first N422f8de13c024306a43013e510ad5397
    68 rdf:rest N9653a034c35e4325b09b692ea48673c9
    69 Na75533c7a568453e96b6da233798a709 schema:name Springer Nature - SN SciGraph project
    70 rdf:type schema:Organization
    71 Nad44d8bfa9b84ce392ea981ae09369f5 schema:location Berlin, Heidelberg
    72 schema:name Springer Berlin Heidelberg
    73 rdf:type schema:Organisation
    74 Nc221231d25c547c685c596f361f954b4 schema:isbn 978-3-540-27831-3
    75 978-3-540-31884-2
    76 schema:name Artificial Intelligence in Medicine
    77 rdf:type schema:Book
    78 Nc364f5ffbf154b7baf15ca3e58163921 rdf:first sg:person.0616733145.15
    79 rdf:rest N09e57e61630640d186bb1a3b635593e9
    80 anzsrc-for:14 schema:inDefinedTermSet anzsrc-for:
    81 schema:name Economics
    82 rdf:type schema:DefinedTerm
    83 anzsrc-for:1403 schema:inDefinedTermSet anzsrc-for:
    84 schema:name Econometrics
    85 rdf:type schema:DefinedTerm
    86 sg:person.01060361377.20 schema:affiliation https://www.grid.ac/institutes/grid.419765.8
    87 schema:familyName Ruch
    88 schema:givenName Patrick
    89 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01060361377.20
    90 rdf:type schema:Person
    91 sg:person.01065216455.04 schema:affiliation https://www.grid.ac/institutes/grid.419765.8
    92 schema:familyName Baud
    93 schema:givenName Robert
    94 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01065216455.04
    95 rdf:type schema:Person
    96 sg:person.01311163727.60 schema:affiliation https://www.grid.ac/institutes/grid.419765.8
    97 schema:familyName Veuthey
    98 schema:givenName Anne-Lise
    99 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01311163727.60
    100 rdf:type schema:Person
    101 sg:person.01366505505.13 schema:affiliation https://www.grid.ac/institutes/grid.419765.8
    102 schema:familyName Marty
    103 schema:givenName Johann
    104 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01366505505.13
    105 rdf:type schema:Person
    106 sg:person.0600360343.20 schema:affiliation https://www.grid.ac/institutes/grid.419765.8
    107 schema:familyName Geissbühler
    108 schema:givenName Antoine
    109 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0600360343.20
    110 rdf:type schema:Person
    111 sg:person.0616733145.15 schema:affiliation https://www.grid.ac/institutes/grid.419765.8
    112 schema:familyName Tbahriti
    113 schema:givenName Imad
    114 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0616733145.15
    115 rdf:type schema:Person
    116 sg:pub.10.1007/978-3-540-30222-3_31 schema:sameAs https://app.dimensions.ai/details/publication/pub.1028546264
    117 https://doi.org/10.1007/978-3-540-30222-3_31
    118 rdf:type schema:CreativeWork
    119 https://doi.org/10.1016/j.ijmedinf.2003.09.004 schema:sameAs https://app.dimensions.ai/details/publication/pub.1039323169
    120 rdf:type schema:CreativeWork
    121 https://doi.org/10.1016/s1386-5056(02)00057-6 schema:sameAs https://app.dimensions.ai/details/publication/pub.1051808877
    122 rdf:type schema:CreativeWork
    123 https://doi.org/10.1093/bioinformatics/bth291 schema:sameAs https://app.dimensions.ai/details/publication/pub.1041160894
    124 rdf:type schema:CreativeWork
    125 https://doi.org/10.1145/243199.243206 schema:sameAs https://app.dimensions.ai/details/publication/pub.1032416718
    126 rdf:type schema:CreativeWork
    127 https://doi.org/10.1145/383952.383958 schema:sameAs https://app.dimensions.ai/details/publication/pub.1010688760
    128 rdf:type schema:CreativeWork
    129 https://doi.org/10.1145/582415.582416 schema:sameAs https://app.dimensions.ai/details/publication/pub.1041903840
    130 rdf:type schema:CreativeWork
    131 https://doi.org/10.3115/1072228.1072337 schema:sameAs https://app.dimensions.ai/details/publication/pub.1021228232
    132 rdf:type schema:CreativeWork
    133 https://www.grid.ac/institutes/grid.419765.8 schema:alternateName Swiss Institute of Bioinformatics
    134 schema:name Medical Informatics Service and Swiss-Prot Group, University Hospital of Geneva and Swiss Institute of Bioinformatics, 1205, Geneva, Switzerland
    135 rdf:type schema:Organization
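
    In the table, the ordered author list is stored as an RDF collection: `schema:author` points to a blank node, and each blank node carries one `rdf:first` (the item) and one `rdf:rest` (the next node, or `rdf:nil` at the end). A minimal sketch of recovering the author order follows. The triples are copied by hand from the rows above, and the function name `walk_list` is an illustrative choice.

```python
RDF_NIL = "rdf:nil"

# (subject, predicate, object) rows copied from the table above.
triples = [
    ("sg:pub.10.1007/11527770_36", "schema:author", "N10900444c10a4a818f33019e78c36bb4"),
    ("N10900444c10a4a818f33019e78c36bb4", "rdf:first", "sg:person.01060361377.20"),
    ("N10900444c10a4a818f33019e78c36bb4", "rdf:rest", "N7235249f03374bcb9047522f5c35667c"),
    ("N7235249f03374bcb9047522f5c35667c", "rdf:first", "sg:person.01065216455.04"),
    ("N7235249f03374bcb9047522f5c35667c", "rdf:rest", "N13a1b8e29e5548708847d5cdf20b961d"),
    ("N13a1b8e29e5548708847d5cdf20b961d", "rdf:first", "sg:person.01366505505.13"),
    ("N13a1b8e29e5548708847d5cdf20b961d", "rdf:rest", "N2298642289234d58a869016a458ad683"),
    ("N2298642289234d58a869016a458ad683", "rdf:first", "sg:person.0600360343.20"),
    ("N2298642289234d58a869016a458ad683", "rdf:rest", "Nc364f5ffbf154b7baf15ca3e58163921"),
    ("Nc364f5ffbf154b7baf15ca3e58163921", "rdf:first", "sg:person.0616733145.15"),
    ("Nc364f5ffbf154b7baf15ca3e58163921", "rdf:rest", "N09e57e61630640d186bb1a3b635593e9"),
    ("N09e57e61630640d186bb1a3b635593e9", "rdf:first", "sg:person.01311163727.60"),
    ("N09e57e61630640d186bb1a3b635593e9", "rdf:rest", RDF_NIL),
    ("sg:person.01060361377.20", "schema:familyName", "Ruch"),
    ("sg:person.01065216455.04", "schema:familyName", "Baud"),
    ("sg:person.01366505505.13", "schema:familyName", "Marty"),
    ("sg:person.0600360343.20", "schema:familyName", "Geissbühler"),
    ("sg:person.0616733145.15", "schema:familyName", "Tbahriti"),
    ("sg:person.01311163727.60", "schema:familyName", "Veuthey"),
]


def walk_list(head, triples):
    """Follow rdf:first / rdf:rest from `head` until rdf:nil."""
    first = {s: o for s, p, o in triples if p == "rdf:first"}
    rest = {s: o for s, p, o in triples if p == "rdf:rest"}
    items, node, seen = [], head, set()
    while node != RDF_NIL:
        if node in seen:  # guard against malformed, cyclic lists
            raise ValueError("cycle in RDF list")
        seen.add(node)
        items.append(first[node])
        node = rest[node]
    return items


head = next(o for s, p, o in triples if p == "schema:author")
family = {s: o for s, p, o in triples if p == "schema:familyName"}
ordered_authors = [family[person] for person in walk_list(head, triples)]
print(ordered_authors)
```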
     



