Using the Co-occurrence of Words for Retrieval Weighting View Full Text


Ontology type: schema:ScholarlyArticle     


Article Info

DATE

2000-10

AUTHORS

Elke Mittendorf, Bojidar Mateev, Peter Schäuble

ABSTRACT

We have applied the well-known Robertson-Sparck Jones weighting to sets of indexing features that are different from word-based features. Our features describe the co-occurrences of words in a window range of predefined size. The experiments have been designed to analyse the value of features that are beyond word-based features but all used retrieval methods can be motivated strictly in the probabilistic framework. Among the several implications of our experiments for weighted retrieval is the surprising result that features that describe the co-occurrences of words in sentence-size or paragraph-size windows are significantly better descriptors than purely word-based indexing features. More... »

PAGES

243-251

Identifiers

URI

http://scigraph.springernature.com/pub.10.1023/a:1026520926673

DOI

http://dx.doi.org/10.1023/a:1026520926673

DIMENSIONS

https://app.dimensions.ai/details/publication/pub.1044629971


Indexing Status Check whether this publication has been indexed by Scopus and Web Of Science using the SN Indexing Status Tool
Incoming Citations Browse incoming citations for this publication using opencitations.net

JSON-LD is the canonical representation for SciGraph data.

TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT

[
  {
    "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
    "about": [
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0801", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Artificial Intelligence and Image Processing", 
        "type": "DefinedTerm"
      }, 
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/08", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Information and Computing Sciences", 
        "type": "DefinedTerm"
      }
    ], 
    "author": [
      {
        "affiliation": {
          "name": [
            "Systor A6, CH-8048, Z\u00fcrich, Switzerland"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Mittendorf", 
        "givenName": "Elke", 
        "id": "sg:person.010437463675.80", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.010437463675.80"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Eurospider Information Technology (Switzerland)", 
          "id": "https://www.grid.ac/institutes/grid.433769.c", 
          "name": [
            "Eurospider Information Technology AG, CH-8006, Z\u00fcrich, Switzerland"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Mateev", 
        "givenName": "Bojidar", 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Eurospider Information Technology (Switzerland)", 
          "id": "https://www.grid.ac/institutes/grid.433769.c", 
          "name": [
            "Eurospider Information Technology AG, CH-8006, Z\u00fcrich, Switzerland"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Sch\u00e4uble", 
        "givenName": "Peter", 
        "id": "sg:person.0670254567.14", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0670254567.14"
        ], 
        "type": "Person"
      }
    ], 
    "citation": [
      {
        "id": "https://doi.org/10.1108/eb026647", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1000000284"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1093/comjnl/35.3.243", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1003892145"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1145/42005.42016", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1005618682"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1108/eb026637", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1009667911"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1016/0306-4573(94)90074-4", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1013285041"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1108/eum0000000007193", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1018513875"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1145/122860.122864", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1025261916"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1007/978-1-4471-2099-5_24", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1032195050", 
          "https://doi.org/10.1007/978-1-4471-2099-5_24"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1145/243199.243206", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1032416718"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1145/195705.195735", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1052319650"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1126/science.264.5164.1421", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1062548290"
        ], 
        "type": "CreativeWork"
      }
    ], 
    "datePublished": "2000-10", 
    "datePublishedReg": "2000-10-01", 
    "description": "We have applied the well-known Robertson-Sparck Jones weighting to sets of indexing features that are different from word-based features. Our features describe the co-occurrences of words in a window range of predefined size. The experiments have been designed to analyse the value of features that are beyond word-based features but all used retrieval methods can be motivated strictly in the probabilistic framework. Among the several implications of our experiments for weighted retrieval is the surprising result that features that describe the co-occurrences of words in sentence-size or paragraph-size windows are significantly better descriptors than purely word-based indexing features.", 
    "genre": "research_article", 
    "id": "sg:pub.10.1023/a:1026520926673", 
    "inLanguage": [
      "en"
    ], 
    "isAccessibleForFree": false, 
    "isPartOf": [
      {
        "id": "sg:journal.1023664", 
        "issn": [
          "1386-4564", 
          "1573-7659"
        ], 
        "name": "Information Retrieval Journal", 
        "type": "Periodical"
      }, 
      {
        "issueNumber": "3", 
        "type": "PublicationIssue"
      }, 
      {
        "type": "PublicationVolume", 
        "volumeNumber": "3"
      }
    ], 
    "name": "Using the Co-occurrence of Words for Retrieval Weighting", 
    "pagination": "243-251", 
    "productId": [
      {
        "name": "readcube_id", 
        "type": "PropertyValue", 
        "value": [
          "d497102fe486c1c130da962f0d31f7ba4ab4e5ce904e09d674a8d15b0edf49be"
        ]
      }, 
      {
        "name": "doi", 
        "type": "PropertyValue", 
        "value": [
          "10.1023/a:1026520926673"
        ]
      }, 
      {
        "name": "dimensions_id", 
        "type": "PropertyValue", 
        "value": [
          "pub.1044629971"
        ]
      }
    ], 
    "sameAs": [
      "https://doi.org/10.1023/a:1026520926673", 
      "https://app.dimensions.ai/details/publication/pub.1044629971"
    ], 
    "sdDataset": "articles", 
    "sdDatePublished": "2019-04-10T14:15", 
    "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
    "sdPublisher": {
      "name": "Springer Nature - SN SciGraph project", 
      "type": "Organization"
    }, 
    "sdSource": "s3://com-uberresearch-data-dimensions-target-20181106-alternative/cleanup/v134/2549eaecd7973599484d7c17b260dba0a4ecb94b/merge/v9/a6c9fde33151104705d4d7ff012ea9563521a3ce/jats-lookup/v90/0000000001_0000000264/records_8660_00000537.jsonl", 
    "type": "ScholarlyArticle", 
    "url": "http://link.springer.com/10.1023%2FA%3A1026520926673"
  }
]
 

Download the RDF metadata as:  json-ld nt turtle xml License info

HOW TO GET THIS DATA PROGRAMMATICALLY:

JSON-LD is a popular format for linked data which is fully compatible with JSON.

curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1023/a:1026520926673'

N-Triples is a line-based linked data format ideal for batch operations.

curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1023/a:1026520926673'

Turtle is a human-readable linked data format.

curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1023/a:1026520926673'

RDF/XML is a standard XML format for linked data.

curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1023/a:1026520926673'


 

This table displays all metadata directly associated to this object as RDF triples.

110 TRIPLES      21 PREDICATES      38 URIs      19 LITERALS      7 BLANK NODES

Subject Predicate Object
1 sg:pub.10.1023/a:1026520926673 schema:about anzsrc-for:08
2 anzsrc-for:0801
3 schema:author N5e2c42a18f51473c8df54f585b084353
4 schema:citation sg:pub.10.1007/978-1-4471-2099-5_24
5 https://doi.org/10.1016/0306-4573(94)90074-4
6 https://doi.org/10.1093/comjnl/35.3.243
7 https://doi.org/10.1108/eb026637
8 https://doi.org/10.1108/eb026647
9 https://doi.org/10.1108/eum0000000007193
10 https://doi.org/10.1126/science.264.5164.1421
11 https://doi.org/10.1145/122860.122864
12 https://doi.org/10.1145/195705.195735
13 https://doi.org/10.1145/243199.243206
14 https://doi.org/10.1145/42005.42016
15 schema:datePublished 2000-10
16 schema:datePublishedReg 2000-10-01
17 schema:description We have applied the well-known Robertson-Sparck Jones weighting to sets of indexing features that are different from word-based features. Our features describe the co-occurrences of words in a window range of predefined size. The experiments have been designed to analyse the value of features that are beyond word-based features but all used retrieval methods can be motivated strictly in the probabilistic framework. Among the several implications of our experiments for weighted retrieval is the surprising result that features that describe the co-occurrences of words in sentence-size or paragraph-size windows are significantly better descriptors than purely word-based indexing features.
18 schema:genre research_article
19 schema:inLanguage en
20 schema:isAccessibleForFree false
21 schema:isPartOf N029f0d3337e64c21a0ed599af249ad36
22 N44b481fc5349496ea26db523185a6c0f
23 sg:journal.1023664
24 schema:name Using the Co-occurrence of Words for Retrieval Weighting
25 schema:pagination 243-251
26 schema:productId N00e45f0f432045418abec600a734be5b
27 N621f4c44847548c4b67692d319c68de6
28 Nd3bc9ba2207645dd8cfc6feda7372438
29 schema:sameAs https://app.dimensions.ai/details/publication/pub.1044629971
30 https://doi.org/10.1023/a:1026520926673
31 schema:sdDatePublished 2019-04-10T14:15
32 schema:sdLicense https://scigraph.springernature.com/explorer/license/
33 schema:sdPublisher Ne0343c1a02d74d568cc492400214b348
34 schema:url http://link.springer.com/10.1023%2FA%3A1026520926673
35 sgo:license sg:explorer/license/
36 sgo:sdDataset articles
37 rdf:type schema:ScholarlyArticle
38 N00e45f0f432045418abec600a734be5b schema:name doi
39 schema:value 10.1023/a:1026520926673
40 rdf:type schema:PropertyValue
41 N029f0d3337e64c21a0ed599af249ad36 schema:volumeNumber 3
42 rdf:type schema:PublicationVolume
43 N44b481fc5349496ea26db523185a6c0f schema:issueNumber 3
44 rdf:type schema:PublicationIssue
45 N5e2c42a18f51473c8df54f585b084353 rdf:first sg:person.010437463675.80
46 rdf:rest Ne307f04a084f4492a2f7a78be48fde82
47 N621f4c44847548c4b67692d319c68de6 schema:name readcube_id
48 schema:value d497102fe486c1c130da962f0d31f7ba4ab4e5ce904e09d674a8d15b0edf49be
49 rdf:type schema:PropertyValue
50 Nb2d1973d975545c6b6c3e6f73f746609 schema:name Systor A6, CH-8048, Zürich, Switzerland
51 rdf:type schema:Organization
52 Nd3bc9ba2207645dd8cfc6feda7372438 schema:name dimensions_id
53 schema:value pub.1044629971
54 rdf:type schema:PropertyValue
55 Ne0343c1a02d74d568cc492400214b348 schema:name Springer Nature - SN SciGraph project
56 rdf:type schema:Organization
57 Ne307f04a084f4492a2f7a78be48fde82 rdf:first Ne7cdf8ee2bfb4eb99353dcfc6b416e73
58 rdf:rest Nf001131571144ed28ebb3dd931603f51
59 Ne7cdf8ee2bfb4eb99353dcfc6b416e73 schema:affiliation https://www.grid.ac/institutes/grid.433769.c
60 schema:familyName Mateev
61 schema:givenName Bojidar
62 rdf:type schema:Person
63 Nf001131571144ed28ebb3dd931603f51 rdf:first sg:person.0670254567.14
64 rdf:rest rdf:nil
65 anzsrc-for:08 schema:inDefinedTermSet anzsrc-for:
66 schema:name Information and Computing Sciences
67 rdf:type schema:DefinedTerm
68 anzsrc-for:0801 schema:inDefinedTermSet anzsrc-for:
69 schema:name Artificial Intelligence and Image Processing
70 rdf:type schema:DefinedTerm
71 sg:journal.1023664 schema:issn 1386-4564
72 1573-7659
73 schema:name Information Retrieval Journal
74 rdf:type schema:Periodical
75 sg:person.010437463675.80 schema:affiliation Nb2d1973d975545c6b6c3e6f73f746609
76 schema:familyName Mittendorf
77 schema:givenName Elke
78 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.010437463675.80
79 rdf:type schema:Person
80 sg:person.0670254567.14 schema:affiliation https://www.grid.ac/institutes/grid.433769.c
81 schema:familyName Schäuble
82 schema:givenName Peter
83 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0670254567.14
84 rdf:type schema:Person
85 sg:pub.10.1007/978-1-4471-2099-5_24 schema:sameAs https://app.dimensions.ai/details/publication/pub.1032195050
86 https://doi.org/10.1007/978-1-4471-2099-5_24
87 rdf:type schema:CreativeWork
88 https://doi.org/10.1016/0306-4573(94)90074-4 schema:sameAs https://app.dimensions.ai/details/publication/pub.1013285041
89 rdf:type schema:CreativeWork
90 https://doi.org/10.1093/comjnl/35.3.243 schema:sameAs https://app.dimensions.ai/details/publication/pub.1003892145
91 rdf:type schema:CreativeWork
92 https://doi.org/10.1108/eb026637 schema:sameAs https://app.dimensions.ai/details/publication/pub.1009667911
93 rdf:type schema:CreativeWork
94 https://doi.org/10.1108/eb026647 schema:sameAs https://app.dimensions.ai/details/publication/pub.1000000284
95 rdf:type schema:CreativeWork
96 https://doi.org/10.1108/eum0000000007193 schema:sameAs https://app.dimensions.ai/details/publication/pub.1018513875
97 rdf:type schema:CreativeWork
98 https://doi.org/10.1126/science.264.5164.1421 schema:sameAs https://app.dimensions.ai/details/publication/pub.1062548290
99 rdf:type schema:CreativeWork
100 https://doi.org/10.1145/122860.122864 schema:sameAs https://app.dimensions.ai/details/publication/pub.1025261916
101 rdf:type schema:CreativeWork
102 https://doi.org/10.1145/195705.195735 schema:sameAs https://app.dimensions.ai/details/publication/pub.1052319650
103 rdf:type schema:CreativeWork
104 https://doi.org/10.1145/243199.243206 schema:sameAs https://app.dimensions.ai/details/publication/pub.1032416718
105 rdf:type schema:CreativeWork
106 https://doi.org/10.1145/42005.42016 schema:sameAs https://app.dimensions.ai/details/publication/pub.1005618682
107 rdf:type schema:CreativeWork
108 https://www.grid.ac/institutes/grid.433769.c schema:alternateName Eurospider Information Technology (Switzerland)
109 schema:name Eurospider Information Technology AG, CH-8006, Zürich, Switzerland
110 rdf:type schema:Organization
 




Preview window. Press ESC to close (or click here)


...