Summarizing Short Texts Through a Discourse-Centered Approach in a Multilingual Context View Full Text


Ontology type: schema:Chapter     


Chapter Info

DATE

2013-06-03

AUTHORS

Daniel Alexandru Anechitei , Dan Cristea , Ioannidis Dimosthenis , Eugen Ignat , Diman Karagiozov , Svetla Koeva , Mateusz Kopeć , Cristina Vertan

ABSTRACT

The chapter presents the architecture of a system targeting summaries of short texts in six languages. At the core of a summary, which comprises clauses and sentences extracted from the original text, is the structure of the discourse and its relationship with its coreferential links. The approach shows a uniform design for all languages, while language specificity is attributed to the resources that fuel the component modules. The design described here includes a number of feedback loops used to fine-tune the parameters by comparing the output of the modules against annotated corpora. “Average” summaries over some human-produced ones are used to evaluate the accuracy of each of the monolingual systems. The study also presents some quantitative data on the corpora used, showing a comparison among languages and results that, mostly, prove to be above the state of the art. More... »

PAGES

109-135

Identifiers

URI

http://scigraph.springernature.com/pub.10.1007/978-1-4614-6934-6_6

DOI

http://dx.doi.org/10.1007/978-1-4614-6934-6_6

DIMENSIONS

https://app.dimensions.ai/details/publication/pub.1047826160


Indexing Status Check whether this publication has been indexed by Scopus and Web Of Science using the SN Indexing Status Tool
Incoming Citations Browse incoming citations for this publication using opencitations.net

JSON-LD is the canonical representation for SciGraph data.

TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT

[
  {
    "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
    "about": [
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/20", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Language, Communication and Culture", 
        "type": "DefinedTerm"
      }, 
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/2004", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Linguistics", 
        "type": "DefinedTerm"
      }
    ], 
    "author": [
      {
        "affiliation": {
          "alternateName": "Department of Computer Science, \u201cAlexandru Ioan Cuza\u201d University of Ia\u015fi, Ia\u015fi, Romania", 
          "id": "http://www.grid.ac/institutes/grid.8168.7", 
          "name": [
            "Department of Computer Science, \u201cAlexandru Ioan Cuza\u201d University of Ia\u015fi, Ia\u015fi, Romania"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Anechitei", 
        "givenName": "Daniel Alexandru", 
        "id": "sg:person.010415743505.20", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.010415743505.20"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Institute for Computer Science, Romanian Academy, Ia\u015fi Branch, Ia\u015fi, Romania", 
          "id": "http://www.grid.ac/institutes/grid.418333.e", 
          "name": [
            "Department of Computer Science, \u201cAlexandru Ioan Cuza\u201d University of Ia\u015fi, Ia\u015fi, Romania", 
            "Institute for Computer Science, Romanian Academy, Ia\u015fi Branch, Ia\u015fi, Romania"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Cristea", 
        "givenName": "Dan", 
        "id": "sg:person.015666404207.49", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.015666404207.49"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Atlantis Consulting SA, Thessaloniki, Greece", 
          "id": "http://www.grid.ac/institutes/grid.431913.d", 
          "name": [
            "Atlantis Consulting SA, Thessaloniki, Greece"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Dimosthenis", 
        "givenName": "Ioannidis", 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Department of Computer Science, \u201cAlexandru Ioan Cuza\u201d University of Ia\u015fi, Ia\u015fi, Romania", 
          "id": "http://www.grid.ac/institutes/grid.8168.7", 
          "name": [
            "Department of Computer Science, \u201cAlexandru Ioan Cuza\u201d University of Ia\u015fi, Ia\u015fi, Romania"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Ignat", 
        "givenName": "Eugen", 
        "id": "sg:person.013316234115.41", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.013316234115.41"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Tetracom Interactive Solutions Ltd., Sofia, Bulgaria", 
          "id": "http://www.grid.ac/institutes/None", 
          "name": [
            "Tetracom Interactive Solutions Ltd., Sofia, Bulgaria"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Karagiozov", 
        "givenName": "Diman", 
        "id": "sg:person.013605677554.48", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.013605677554.48"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Institute for Bulgarian Language, Bulgarian Academy of Sciences, Sofia, Bulgaria", 
          "id": "http://www.grid.ac/institutes/grid.493346.f", 
          "name": [
            "Institute for Bulgarian Language, Bulgarian Academy of Sciences, Sofia, Bulgaria"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Koeva", 
        "givenName": "Svetla", 
        "id": "sg:person.013773166334.54", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.013773166334.54"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Institute of Computer Science, Polish Academy of Sciences, Warsaw, Poland", 
          "id": "http://www.grid.ac/institutes/grid.425308.8", 
          "name": [
            "Institute of Computer Science, Polish Academy of Sciences, Warsaw, Poland"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Kope\u0107", 
        "givenName": "Mateusz", 
        "id": "sg:person.015752120247.87", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.015752120247.87"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Department of Linguistics, University of Hamburg, Hamburg, Germany", 
          "id": "http://www.grid.ac/institutes/grid.9026.d", 
          "name": [
            "Department of Linguistics, University of Hamburg, Hamburg, Germany"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Vertan", 
        "givenName": "Cristina", 
        "id": "sg:person.010045351446.06", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.010045351446.06"
        ], 
        "type": "Person"
      }
    ], 
    "datePublished": "2013-06-03", 
    "datePublishedReg": "2013-06-03", 
    "description": "The chapter presents the architecture of a system targeting summaries of short texts in six languages. At the core of a summary, which comprises clauses and sentences extracted from the original text, is the structure of the discourse and its relationship with its coreferential links. The approach shows a uniform design for all languages, while language specificity is attributed to the resources that fuel the component modules. The design described here includes a number of feedback loops used to fine-tune the parameters by comparing the output of the modules against annotated corpora. \u201cAverage\u201d summaries over some human-produced ones are used to evaluate the accuracy of each of the monolingual systems. The study also presents some quantitative data on the corpora used, showing a comparison among languages and results that, mostly, prove to be above the state of the art.", 
    "editor": [
      {
        "familyName": "Neustein", 
        "givenName": "Amy", 
        "type": "Person"
      }, 
      {
        "familyName": "Markowitz", 
        "givenName": "Judith A.", 
        "type": "Person"
      }
    ], 
    "genre": "chapter", 
    "id": "sg:pub.10.1007/978-1-4614-6934-6_6", 
    "inLanguage": "en", 
    "isAccessibleForFree": false, 
    "isPartOf": {
      "isbn": [
        "978-1-4614-6933-9", 
        "978-1-4614-6934-6"
      ], 
      "name": "Where Humans Meet Machines", 
      "type": "Book"
    }, 
    "keywords": [
      "short texts", 
      "discourse-centered approach", 
      "multilingual contexts", 
      "language specificity", 
      "monolingual systems", 
      "original text", 
      "language", 
      "text", 
      "corpus", 
      "discourse", 
      "sentences", 
      "clauses", 
      "art", 
      "context", 
      "component modules", 
      "chapter", 
      "quantitative data", 
      "relationship", 
      "link", 
      "resources", 
      "one", 
      "approach", 
      "summary", 
      "study", 
      "state", 
      "core", 
      "structure", 
      "system", 
      "module", 
      "number", 
      "uniform design", 
      "output", 
      "design", 
      "specificity", 
      "comparison", 
      "data", 
      "architecture", 
      "results", 
      "feedback loop", 
      "accuracy", 
      "loop", 
      "parameters", 
      "coreferential links", 
      "human-produced ones"
    ], 
    "name": "Summarizing Short Texts Through a Discourse-Centered Approach in a Multilingual Context", 
    "pagination": "109-135", 
    "productId": [
      {
        "name": "dimensions_id", 
        "type": "PropertyValue", 
        "value": [
          "pub.1047826160"
        ]
      }, 
      {
        "name": "doi", 
        "type": "PropertyValue", 
        "value": [
          "10.1007/978-1-4614-6934-6_6"
        ]
      }
    ], 
    "publisher": {
      "name": "Springer Nature", 
      "type": "Organisation"
    }, 
    "sameAs": [
      "https://doi.org/10.1007/978-1-4614-6934-6_6", 
      "https://app.dimensions.ai/details/publication/pub.1047826160"
    ], 
    "sdDataset": "chapters", 
    "sdDatePublished": "2021-12-01T20:09", 
    "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
    "sdPublisher": {
      "name": "Springer Nature - SN SciGraph project", 
      "type": "Organization"
    }, 
    "sdSource": "s3://com-springernature-scigraph/baseset/20211201/entities/gbq_results/chapter/chapter_401.jsonl", 
    "type": "Chapter", 
    "url": "https://doi.org/10.1007/978-1-4614-6934-6_6"
  }
]
 

Download the RDF metadata as:  json-ld nt turtle xml License info

HOW TO GET THIS DATA PROGRAMMATICALLY:

JSON-LD is a popular format for linked data which is fully compatible with JSON.

curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1007/978-1-4614-6934-6_6'

N-Triples is a line-based linked data format ideal for batch operations.

curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1007/978-1-4614-6934-6_6'

Turtle is a human-readable linked data format.

curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1007/978-1-4614-6934-6_6'

RDF/XML is a standard XML format for linked data.

curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1007/978-1-4614-6934-6_6'


 

This table displays all metadata directly associated to this object as RDF triples.

176 TRIPLES      23 PREDICATES      68 URIs      61 LITERALS      7 BLANK NODES

Subject Predicate Object
1 sg:pub.10.1007/978-1-4614-6934-6_6 schema:about anzsrc-for:20
2 anzsrc-for:2004
3 schema:author N64e34b47b6814f1db08b71f542f7fec2
4 schema:datePublished 2013-06-03
5 schema:datePublishedReg 2013-06-03
6 schema:description The chapter presents the architecture of a system targeting summaries of short texts in six languages. At the core of a summary, which comprises clauses and sentences extracted from the original text, is the structure of the discourse and its relationship with its coreferential links. The approach shows a uniform design for all languages, while language specificity is attributed to the resources that fuel the component modules. The design described here includes a number of feedback loops used to fine-tune the parameters by comparing the output of the modules against annotated corpora. “Average” summaries over some human-produced ones are used to evaluate the accuracy of each of the monolingual systems. The study also presents some quantitative data on the corpora used, showing a comparison among languages and results that, mostly, prove to be above the state of the art.
7 schema:editor Nf82f8581c9c746a5af1afb36320971e3
8 schema:genre chapter
9 schema:inLanguage en
10 schema:isAccessibleForFree false
11 schema:isPartOf Ne29ef4eb04b94226a38f5661f9c23a17
12 schema:keywords accuracy
13 approach
14 architecture
15 art
16 chapter
17 clauses
18 comparison
19 component modules
20 context
21 core
22 coreferential links
23 corpus
24 data
25 design
26 discourse
27 discourse-centered approach
28 feedback loop
29 human-produced ones
30 language
31 language specificity
32 link
33 loop
34 module
35 monolingual systems
36 multilingual contexts
37 number
38 one
39 original text
40 output
41 parameters
42 quantitative data
43 relationship
44 resources
45 results
46 sentences
47 short texts
48 specificity
49 state
50 structure
51 study
52 summary
53 system
54 text
55 uniform design
56 schema:name Summarizing Short Texts Through a Discourse-Centered Approach in a Multilingual Context
57 schema:pagination 109-135
58 schema:productId N37a423e60f42402eaac52558caf99f83
59 Nabd8d226c01144c39191111c986b4472
60 schema:publisher Na730cccee33e4ba9a05cdb1d29914662
61 schema:sameAs https://app.dimensions.ai/details/publication/pub.1047826160
62 https://doi.org/10.1007/978-1-4614-6934-6_6
63 schema:sdDatePublished 2021-12-01T20:09
64 schema:sdLicense https://scigraph.springernature.com/explorer/license/
65 schema:sdPublisher N11ecb15d444b48dba1a4e566e378f813
66 schema:url https://doi.org/10.1007/978-1-4614-6934-6_6
67 sgo:license sg:explorer/license/
68 sgo:sdDataset chapters
69 rdf:type schema:Chapter
70 N0b54b4a5dcd0444989abb162324346a5 rdf:first sg:person.015752120247.87
71 rdf:rest N2613a72e38644e979ccd6f67aacd2011
72 N11408ce90cb54bbbbdf8acdedb892be5 rdf:first sg:person.013605677554.48
73 rdf:rest N673c1791ffea4eb6b8305e1859efdb42
74 N11ecb15d444b48dba1a4e566e378f813 schema:name Springer Nature - SN SciGraph project
75 rdf:type schema:Organization
76 N2613a72e38644e979ccd6f67aacd2011 rdf:first sg:person.010045351446.06
77 rdf:rest rdf:nil
78 N37a423e60f42402eaac52558caf99f83 schema:name dimensions_id
79 schema:value pub.1047826160
80 rdf:type schema:PropertyValue
81 N64e34b47b6814f1db08b71f542f7fec2 rdf:first sg:person.010415743505.20
82 rdf:rest N8403ecccede24f42a8f8cbc874c7ae9e
83 N650e3ab91fa343c49fd3ad53172eb0af schema:familyName Markowitz
84 schema:givenName Judith A.
85 rdf:type schema:Person
86 N673c1791ffea4eb6b8305e1859efdb42 rdf:first sg:person.013773166334.54
87 rdf:rest N0b54b4a5dcd0444989abb162324346a5
88 N6e873ab4b9ec46e1aaec7705eea51b48 rdf:first sg:person.013316234115.41
89 rdf:rest N11408ce90cb54bbbbdf8acdedb892be5
90 N8403ecccede24f42a8f8cbc874c7ae9e rdf:first sg:person.015666404207.49
91 rdf:rest Nc5367ecc6d8f496fb71c7eecfd8d14be
92 N9c0aa3dbb70d412da93affc225356cc1 rdf:first N650e3ab91fa343c49fd3ad53172eb0af
93 rdf:rest rdf:nil
94 Na730cccee33e4ba9a05cdb1d29914662 schema:name Springer Nature
95 rdf:type schema:Organisation
96 Nabd8d226c01144c39191111c986b4472 schema:name doi
97 schema:value 10.1007/978-1-4614-6934-6_6
98 rdf:type schema:PropertyValue
99 Nc5367ecc6d8f496fb71c7eecfd8d14be rdf:first Nc66fb53286474a729c5dc2eb25713f06
100 rdf:rest N6e873ab4b9ec46e1aaec7705eea51b48
101 Nc66fb53286474a729c5dc2eb25713f06 schema:affiliation grid-institutes:grid.431913.d
102 schema:familyName Dimosthenis
103 schema:givenName Ioannidis
104 rdf:type schema:Person
105 Ne29ef4eb04b94226a38f5661f9c23a17 schema:isbn 978-1-4614-6933-9
106 978-1-4614-6934-6
107 schema:name Where Humans Meet Machines
108 rdf:type schema:Book
109 Ne3eae810ddb647f38d1cb87b651b2f12 schema:familyName Neustein
110 schema:givenName Amy
111 rdf:type schema:Person
112 Nf82f8581c9c746a5af1afb36320971e3 rdf:first Ne3eae810ddb647f38d1cb87b651b2f12
113 rdf:rest N9c0aa3dbb70d412da93affc225356cc1
114 anzsrc-for:20 schema:inDefinedTermSet anzsrc-for:
115 schema:name Language, Communication and Culture
116 rdf:type schema:DefinedTerm
117 anzsrc-for:2004 schema:inDefinedTermSet anzsrc-for:
118 schema:name Linguistics
119 rdf:type schema:DefinedTerm
120 sg:person.010045351446.06 schema:affiliation grid-institutes:grid.9026.d
121 schema:familyName Vertan
122 schema:givenName Cristina
123 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.010045351446.06
124 rdf:type schema:Person
125 sg:person.010415743505.20 schema:affiliation grid-institutes:grid.8168.7
126 schema:familyName Anechitei
127 schema:givenName Daniel Alexandru
128 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.010415743505.20
129 rdf:type schema:Person
130 sg:person.013316234115.41 schema:affiliation grid-institutes:grid.8168.7
131 schema:familyName Ignat
132 schema:givenName Eugen
133 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.013316234115.41
134 rdf:type schema:Person
135 sg:person.013605677554.48 schema:affiliation grid-institutes:None
136 schema:familyName Karagiozov
137 schema:givenName Diman
138 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.013605677554.48
139 rdf:type schema:Person
140 sg:person.013773166334.54 schema:affiliation grid-institutes:grid.493346.f
141 schema:familyName Koeva
142 schema:givenName Svetla
143 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.013773166334.54
144 rdf:type schema:Person
145 sg:person.015666404207.49 schema:affiliation grid-institutes:grid.418333.e
146 schema:familyName Cristea
147 schema:givenName Dan
148 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.015666404207.49
149 rdf:type schema:Person
150 sg:person.015752120247.87 schema:affiliation grid-institutes:grid.425308.8
151 schema:familyName Kopeć
152 schema:givenName Mateusz
153 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.015752120247.87
154 rdf:type schema:Person
155 grid-institutes:None schema:alternateName Tetracom Interactive Solutions Ltd., Sofia, Bulgaria
156 schema:name Tetracom Interactive Solutions Ltd., Sofia, Bulgaria
157 rdf:type schema:Organization
158 grid-institutes:grid.418333.e schema:alternateName Institute for Computer Science, Romanian Academy, Iaşi Branch, Iaşi, Romania
159 schema:name Department of Computer Science, “Alexandru Ioan Cuza” University of Iaşi, Iaşi, Romania
160 Institute for Computer Science, Romanian Academy, Iaşi Branch, Iaşi, Romania
161 rdf:type schema:Organization
162 grid-institutes:grid.425308.8 schema:alternateName Institute of Computer Science, Polish Academy of Sciences, Warsaw, Poland
163 schema:name Institute of Computer Science, Polish Academy of Sciences, Warsaw, Poland
164 rdf:type schema:Organization
165 grid-institutes:grid.431913.d schema:alternateName Atlantis Consulting SA, Thessaloniki, Greece
166 schema:name Atlantis Consulting SA, Thessaloniki, Greece
167 rdf:type schema:Organization
168 grid-institutes:grid.493346.f schema:alternateName Institute for Bulgarian Language, Bulgarian Academy of Sciences, Sofia, Bulgaria
169 schema:name Institute for Bulgarian Language, Bulgarian Academy of Sciences, Sofia, Bulgaria
170 rdf:type schema:Organization
171 grid-institutes:grid.8168.7 schema:alternateName Department of Computer Science, “Alexandru Ioan Cuza” University of Iaşi, Iaşi, Romania
172 schema:name Department of Computer Science, “Alexandru Ioan Cuza” University of Iaşi, Iaşi, Romania
173 rdf:type schema:Organization
174 grid-institutes:grid.9026.d schema:alternateName Department of Linguistics, University of Hamburg, Hamburg, Germany
175 schema:name Department of Linguistics, University of Hamburg, Hamburg, Germany
176 rdf:type schema:Organization
 




Preview window. Press ESC to close (or click here)


...