A New Multiword Expression Metric and Its Applications View Full Text


Ontology type: schema:ScholarlyArticle     


Article Info

DATE

2011-01

AUTHORS

Fan Bu, Xiao-Yan Zhu, Ming Li

ABSTRACT

Multiword Expressions (MWEs) appear frequently and ungrammatically in natural languages. Identifying MWEs in free texts is a very challenging problem. This paper proposes a knowledge-free, unsupervised, and language-independent Multiword Expression Distance (MED). The new metric is derived from an accepted physical principle, measures the distance from an n-gram to its semantics, and outperforms other state-of-the-art methods on MWEs in two applications: question answering and named entity extraction. More... »

PAGES

3-13

References to SciGraph publications

Identifiers

URI

http://scigraph.springernature.com/pub.10.1007/s11390-011-9410-0

DOI

http://dx.doi.org/10.1007/s11390-011-9410-0

DIMENSIONS

https://app.dimensions.ai/details/publication/pub.1032488355


Indexing Status Check whether this publication has been indexed by Scopus and Web Of Science using the SN Indexing Status Tool
Incoming Citations Browse incoming citations for this publication using opencitations.net

JSON-LD is the canonical representation for SciGraph data.

TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT

[
  {
    "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
    "about": [
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/2004", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Linguistics", 
        "type": "DefinedTerm"
      }, 
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/20", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Language, Communication and Culture", 
        "type": "DefinedTerm"
      }
    ], 
    "author": [
      {
        "affiliation": {
          "alternateName": "Tsinghua University", 
          "id": "https://www.grid.ac/institutes/grid.12527.33", 
          "name": [
            "State Key Laboratory of Intelligent Technology and Systems, Tsinghua National Laboratory for Information Science and Technology, Department of Computer Science and Technology, Tsinghua University, 100084, Beijing, China"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Bu", 
        "givenName": "Fan", 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Tsinghua University", 
          "id": "https://www.grid.ac/institutes/grid.12527.33", 
          "name": [
            "State Key Laboratory of Intelligent Technology and Systems, Tsinghua National Laboratory for Information Science and Technology, Department of Computer Science and Technology, Tsinghua University, 100084, Beijing, China"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Zhu", 
        "givenName": "Xiao-Yan", 
        "id": "sg:person.010317723315.12", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.010317723315.12"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "University of Waterloo", 
          "id": "https://www.grid.ac/institutes/grid.46078.3d", 
          "name": [
            "David R. Cheriton School of Computer Science, University of Waterloo, N2L 3G1, Waterloo, Canada"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Li", 
        "givenName": "Ming", 
        "id": "sg:person.0621576316.79", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0621576316.79"
        ], 
        "type": "Person"
      }
    ], 
    "citation": [
      {
        "id": "https://doi.org/10.1145/1014052.1014077", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1000193805"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1093/bioinformatics/17.2.149", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1009738168"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1016/j.eswa.2009.02.026", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1016378716"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.3115/1119176.1119206", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1017842044"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.3115/1072228.1072370", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1021337474"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1007/s11390-008-9152-9", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1022686529", 
          "https://doi.org/10.1007/s11390-008-9152-9"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.3115/980451.980857", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1024862199"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://app.dimensions.ai/details/publication/pub.1040973888", 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1007/978-0-387-49820-1", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1040973888", 
          "https://doi.org/10.1007/978-0-387-49820-1"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1007/978-0-387-49820-1", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1040973888", 
          "https://doi.org/10.1007/978-0-387-49820-1"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.3115/981623.981633", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1053495826"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1017/s1351324900000048", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1054922841"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1038/scientificamerican0603-76", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1056544393", 
          "https://doi.org/10.1038/scientificamerican0603-76"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1109/18.681318", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1061100692"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1109/tit.2004.830793", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1061650154"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1109/tit.2004.838101", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1061650298"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1109/tkde.2007.48", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1061661815"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.3115/1219840.1219885", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1099221891"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.3115/1219840.1219885", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1099221891"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.3115/1034678.1034730", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1099239488"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.3115/1034678.1034730", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1099239488"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.3115/1073083.1073154", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1099239643"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.3115/1073083.1073154", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1099239643"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.3115/1613692.1613700", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1099244520"
        ], 
        "type": "CreativeWork"
      }
    ], 
    "datePublished": "2011-01", 
    "datePublishedReg": "2011-01-01", 
    "description": "Multiword Expressions (MWEs) appear frequently and ungrammatically in natural languages. Identifying MWEs in free texts is a very challenging problem. This paper proposes a knowledge-free, unsupervised, and language-independent Multiword Expression Distance (MED). The new metric is derived from an accepted physical principle, measures the distance from an n-gram to its semantics, and outperforms other state-of-the-art methods on MWEs in two applications: question answering and named entity extraction.", 
    "genre": "research_article", 
    "id": "sg:pub.10.1007/s11390-011-9410-0", 
    "inLanguage": [
      "en"
    ], 
    "isAccessibleForFree": false, 
    "isPartOf": [
      {
        "id": "sg:journal.1320078", 
        "issn": [
          "1666-6046", 
          "1666-6038"
        ], 
        "name": "Journal of Computer Science and Technology", 
        "type": "Periodical"
      }, 
      {
        "issueNumber": "1", 
        "type": "PublicationIssue"
      }, 
      {
        "type": "PublicationVolume", 
        "volumeNumber": "26"
      }
    ], 
    "name": "A New Multiword Expression Metric and Its Applications", 
    "pagination": "3-13", 
    "productId": [
      {
        "name": "readcube_id", 
        "type": "PropertyValue", 
        "value": [
          "9ac59a3f30bfa5ed45d9e6d5d183fbd2c5fd40a7093b1b61ab7b04d38b9f9551"
        ]
      }, 
      {
        "name": "doi", 
        "type": "PropertyValue", 
        "value": [
          "10.1007/s11390-011-9410-0"
        ]
      }, 
      {
        "name": "dimensions_id", 
        "type": "PropertyValue", 
        "value": [
          "pub.1032488355"
        ]
      }
    ], 
    "sameAs": [
      "https://doi.org/10.1007/s11390-011-9410-0", 
      "https://app.dimensions.ai/details/publication/pub.1032488355"
    ], 
    "sdDataset": "articles", 
    "sdDatePublished": "2019-04-11T01:09", 
    "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
    "sdPublisher": {
      "name": "Springer Nature - SN SciGraph project", 
      "type": "Organization"
    }, 
    "sdSource": "s3://com-uberresearch-data-dimensions-target-20181106-alternative/cleanup/v134/2549eaecd7973599484d7c17b260dba0a4ecb94b/merge/v9/a6c9fde33151104705d4d7ff012ea9563521a3ce/jats-lookup/v90/0000000001_0000000264/records_8697_00000522.jsonl", 
    "type": "ScholarlyArticle", 
    "url": "http://link.springer.com/10.1007%2Fs11390-011-9410-0"
  }
]
 

Download the RDF metadata as:  json-ld nt turtle xml License info

HOW TO GET THIS DATA PROGRAMMATICALLY:

JSON-LD is a popular format for linked data which is fully compatible with JSON.

curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1007/s11390-011-9410-0'

N-Triples is a line-based linked data format ideal for batch operations.

curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1007/s11390-011-9410-0'

Turtle is a human-readable linked data format.

curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1007/s11390-011-9410-0'

RDF/XML is a standard XML format for linked data.

curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1007/s11390-011-9410-0'


 

This table displays all metadata directly associated to this object as RDF triples.

139 TRIPLES      21 PREDICATES      47 URIs      19 LITERALS      7 BLANK NODES

Subject Predicate Object
1 sg:pub.10.1007/s11390-011-9410-0 schema:about anzsrc-for:20
2 anzsrc-for:2004
3 schema:author Nd93ac13781b54171bde6f874468efa3a
4 schema:citation sg:pub.10.1007/978-0-387-49820-1
5 sg:pub.10.1007/s11390-008-9152-9
6 sg:pub.10.1038/scientificamerican0603-76
7 https://app.dimensions.ai/details/publication/pub.1040973888
8 https://doi.org/10.1016/j.eswa.2009.02.026
9 https://doi.org/10.1017/s1351324900000048
10 https://doi.org/10.1093/bioinformatics/17.2.149
11 https://doi.org/10.1109/18.681318
12 https://doi.org/10.1109/tit.2004.830793
13 https://doi.org/10.1109/tit.2004.838101
14 https://doi.org/10.1109/tkde.2007.48
15 https://doi.org/10.1145/1014052.1014077
16 https://doi.org/10.3115/1034678.1034730
17 https://doi.org/10.3115/1072228.1072370
18 https://doi.org/10.3115/1073083.1073154
19 https://doi.org/10.3115/1119176.1119206
20 https://doi.org/10.3115/1219840.1219885
21 https://doi.org/10.3115/1613692.1613700
22 https://doi.org/10.3115/980451.980857
23 https://doi.org/10.3115/981623.981633
24 schema:datePublished 2011-01
25 schema:datePublishedReg 2011-01-01
26 schema:description Multiword Expressions (MWEs) appear frequently and ungrammatically in natural languages. Identifying MWEs in free texts is a very challenging problem. This paper proposes a knowledge-free, unsupervised, and language-independent Multiword Expression Distance (MED). The new metric is derived from an accepted physical principle, measures the distance from an n-gram to its semantics, and outperforms other state-of-the-art methods on MWEs in two applications: question answering and named entity extraction.
27 schema:genre research_article
28 schema:inLanguage en
29 schema:isAccessibleForFree false
30 schema:isPartOf N33e759476e414ccf8acf8f09f2ee2352
31 Nebbcf4cf831f4728a042e8a2694f6f91
32 sg:journal.1320078
33 schema:name A New Multiword Expression Metric and Its Applications
34 schema:pagination 3-13
35 schema:productId N085742fe17f7438abfb75cf7536dc75a
36 N38c54ee9af064ba9a7fdf05715b7df26
37 N92ab986ce0944299b888b96c1abe6292
38 schema:sameAs https://app.dimensions.ai/details/publication/pub.1032488355
39 https://doi.org/10.1007/s11390-011-9410-0
40 schema:sdDatePublished 2019-04-11T01:09
41 schema:sdLicense https://scigraph.springernature.com/explorer/license/
42 schema:sdPublisher N50fa8397f72f43fdb0cab672a245a805
43 schema:url http://link.springer.com/10.1007%2Fs11390-011-9410-0
44 sgo:license sg:explorer/license/
45 sgo:sdDataset articles
46 rdf:type schema:ScholarlyArticle
47 N085742fe17f7438abfb75cf7536dc75a schema:name dimensions_id
48 schema:value pub.1032488355
49 rdf:type schema:PropertyValue
50 N0cdf6fe7dac3488bb3a0c42070bd430a rdf:first sg:person.010317723315.12
51 rdf:rest N36e2b10c32684511b6b3b63bef653668
52 N33e759476e414ccf8acf8f09f2ee2352 schema:volumeNumber 26
53 rdf:type schema:PublicationVolume
54 N36e2b10c32684511b6b3b63bef653668 rdf:first sg:person.0621576316.79
55 rdf:rest rdf:nil
56 N38c54ee9af064ba9a7fdf05715b7df26 schema:name doi
57 schema:value 10.1007/s11390-011-9410-0
58 rdf:type schema:PropertyValue
59 N50fa8397f72f43fdb0cab672a245a805 schema:name Springer Nature - SN SciGraph project
60 rdf:type schema:Organization
61 N92ab986ce0944299b888b96c1abe6292 schema:name readcube_id
62 schema:value 9ac59a3f30bfa5ed45d9e6d5d183fbd2c5fd40a7093b1b61ab7b04d38b9f9551
63 rdf:type schema:PropertyValue
64 Nd93ac13781b54171bde6f874468efa3a rdf:first Ndc73ef74b0f2422583eb108cbcf68b5b
65 rdf:rest N0cdf6fe7dac3488bb3a0c42070bd430a
66 Ndc73ef74b0f2422583eb108cbcf68b5b schema:affiliation https://www.grid.ac/institutes/grid.12527.33
67 schema:familyName Bu
68 schema:givenName Fan
69 rdf:type schema:Person
70 Nebbcf4cf831f4728a042e8a2694f6f91 schema:issueNumber 1
71 rdf:type schema:PublicationIssue
72 anzsrc-for:20 schema:inDefinedTermSet anzsrc-for:
73 schema:name Language, Communication and Culture
74 rdf:type schema:DefinedTerm
75 anzsrc-for:2004 schema:inDefinedTermSet anzsrc-for:
76 schema:name Linguistics
77 rdf:type schema:DefinedTerm
78 sg:journal.1320078 schema:issn 1666-6038
79 1666-6046
80 schema:name Journal of Computer Science and Technology
81 rdf:type schema:Periodical
82 sg:person.010317723315.12 schema:affiliation https://www.grid.ac/institutes/grid.12527.33
83 schema:familyName Zhu
84 schema:givenName Xiao-Yan
85 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.010317723315.12
86 rdf:type schema:Person
87 sg:person.0621576316.79 schema:affiliation https://www.grid.ac/institutes/grid.46078.3d
88 schema:familyName Li
89 schema:givenName Ming
90 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0621576316.79
91 rdf:type schema:Person
92 sg:pub.10.1007/978-0-387-49820-1 schema:sameAs https://app.dimensions.ai/details/publication/pub.1040973888
93 https://doi.org/10.1007/978-0-387-49820-1
94 rdf:type schema:CreativeWork
95 sg:pub.10.1007/s11390-008-9152-9 schema:sameAs https://app.dimensions.ai/details/publication/pub.1022686529
96 https://doi.org/10.1007/s11390-008-9152-9
97 rdf:type schema:CreativeWork
98 sg:pub.10.1038/scientificamerican0603-76 schema:sameAs https://app.dimensions.ai/details/publication/pub.1056544393
99 https://doi.org/10.1038/scientificamerican0603-76
100 rdf:type schema:CreativeWork
101 https://app.dimensions.ai/details/publication/pub.1040973888 schema:CreativeWork
102 https://doi.org/10.1016/j.eswa.2009.02.026 schema:sameAs https://app.dimensions.ai/details/publication/pub.1016378716
103 rdf:type schema:CreativeWork
104 https://doi.org/10.1017/s1351324900000048 schema:sameAs https://app.dimensions.ai/details/publication/pub.1054922841
105 rdf:type schema:CreativeWork
106 https://doi.org/10.1093/bioinformatics/17.2.149 schema:sameAs https://app.dimensions.ai/details/publication/pub.1009738168
107 rdf:type schema:CreativeWork
108 https://doi.org/10.1109/18.681318 schema:sameAs https://app.dimensions.ai/details/publication/pub.1061100692
109 rdf:type schema:CreativeWork
110 https://doi.org/10.1109/tit.2004.830793 schema:sameAs https://app.dimensions.ai/details/publication/pub.1061650154
111 rdf:type schema:CreativeWork
112 https://doi.org/10.1109/tit.2004.838101 schema:sameAs https://app.dimensions.ai/details/publication/pub.1061650298
113 rdf:type schema:CreativeWork
114 https://doi.org/10.1109/tkde.2007.48 schema:sameAs https://app.dimensions.ai/details/publication/pub.1061661815
115 rdf:type schema:CreativeWork
116 https://doi.org/10.1145/1014052.1014077 schema:sameAs https://app.dimensions.ai/details/publication/pub.1000193805
117 rdf:type schema:CreativeWork
118 https://doi.org/10.3115/1034678.1034730 schema:sameAs https://app.dimensions.ai/details/publication/pub.1099239488
119 rdf:type schema:CreativeWork
120 https://doi.org/10.3115/1072228.1072370 schema:sameAs https://app.dimensions.ai/details/publication/pub.1021337474
121 rdf:type schema:CreativeWork
122 https://doi.org/10.3115/1073083.1073154 schema:sameAs https://app.dimensions.ai/details/publication/pub.1099239643
123 rdf:type schema:CreativeWork
124 https://doi.org/10.3115/1119176.1119206 schema:sameAs https://app.dimensions.ai/details/publication/pub.1017842044
125 rdf:type schema:CreativeWork
126 https://doi.org/10.3115/1219840.1219885 schema:sameAs https://app.dimensions.ai/details/publication/pub.1099221891
127 rdf:type schema:CreativeWork
128 https://doi.org/10.3115/1613692.1613700 schema:sameAs https://app.dimensions.ai/details/publication/pub.1099244520
129 rdf:type schema:CreativeWork
130 https://doi.org/10.3115/980451.980857 schema:sameAs https://app.dimensions.ai/details/publication/pub.1024862199
131 rdf:type schema:CreativeWork
132 https://doi.org/10.3115/981623.981633 schema:sameAs https://app.dimensions.ai/details/publication/pub.1053495826
133 rdf:type schema:CreativeWork
134 https://www.grid.ac/institutes/grid.12527.33 schema:alternateName Tsinghua University
135 schema:name State Key Laboratory of Intelligent Technology and Systems, Tsinghua National Laboratory for Information Science and Technology, Department of Computer Science and Technology, Tsinghua University, 100084, Beijing, China
136 rdf:type schema:Organization
137 https://www.grid.ac/institutes/grid.46078.3d schema:alternateName University of Waterloo
138 schema:name David R. Cheriton School of Computer Science, University of Waterloo, N2L 3G1, Waterloo, Canada
139 rdf:type schema:Organization
 




Preview window. Press ESC to close (or click here)


...