Geoparsing of Czech RSS News and Evaluation of Its Spatial Distribution View Full Text


Ontology type: schema:Chapter     


Chapter Info

DATE

2011

AUTHORS

Jiří Horák , Pavel Belaj , Igor Ivan , Peter Nemec , Jiří Ardielli , Jan Růžička

ABSTRACT

Geoparsing assigns geographic identifiers to textual words and phrases in documents. The specific problem is how to apply geoparsing in languages where changes of word termination occur. An appropriate method requires a flexible solution reflecting different strategies and priorities. Sixteen Czech RSS news channels were evaluated according to ten criteria. Three selected RSS channels were monitored for more than two years. The applied geoparsing included successive steps of different filters’ application and utilized the generation of different grammatical cases for recognized entities. Various problems with geographical names are classified and documented. The quality assessment shows satisfactory results namely for identification of names in domiciles (94%). The pessimistic strategy is applied to analyze a geographical balance of news distribution. The results show significant differences between distribution of news in monitored channels and document a high concentration of cultural and national news in several locations. More... »

PAGES

353-367

Book

TITLE

Semantic Methods for Knowledge Management and Communication

ISBN

978-3-642-23417-0
978-3-642-23418-7

Identifiers

URI

http://scigraph.springernature.com/pub.10.1007/978-3-642-23418-7_31

DOI

http://dx.doi.org/10.1007/978-3-642-23418-7_31

DIMENSIONS

https://app.dimensions.ai/details/publication/pub.1049549984


Indexing Status Check whether this publication has been indexed by Scopus and Web Of Science using the SN Indexing Status Tool
Incoming Citations Browse incoming citations for this publication using opencitations.net

JSON-LD is the canonical representation for SciGraph data.

TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT

[
  {
    "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
    "about": [
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/2004", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Linguistics", 
        "type": "DefinedTerm"
      }, 
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/20", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Language, Communication and Culture", 
        "type": "DefinedTerm"
      }
    ], 
    "author": [
      {
        "affiliation": {
          "alternateName": "Technical University of Ostrava", 
          "id": "https://www.grid.ac/institutes/grid.440850.d", 
          "name": [
            "Institute of Geoinformatics, VSB Technical University of Ostrava, 17. listopadu 15, 70833, Ostrava, Poruba, Czech Republic"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Hor\u00e1k", 
        "givenName": "Ji\u0159\u00ed", 
        "id": "sg:person.015061123613.29", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.015061123613.29"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Technical University of Ostrava", 
          "id": "https://www.grid.ac/institutes/grid.440850.d", 
          "name": [
            "Institute of Geoinformatics, VSB Technical University of Ostrava, 17. listopadu 15, 70833, Ostrava, Poruba, Czech Republic"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Belaj", 
        "givenName": "Pavel", 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Technical University of Ostrava", 
          "id": "https://www.grid.ac/institutes/grid.440850.d", 
          "name": [
            "Institute of Geoinformatics, VSB Technical University of Ostrava, 17. listopadu 15, 70833, Ostrava, Poruba, Czech Republic"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Ivan", 
        "givenName": "Igor", 
        "id": "sg:person.014167141246.41", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.014167141246.41"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Software602 (Czechia)", 
          "id": "https://www.grid.ac/institutes/grid.448299.a", 
          "name": [
            "Software602 a. s., Hornokr\u010dsk\u00e1 15, 140 00, Praha 4, Czech Republic"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Nemec", 
        "givenName": "Peter", 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Technical University of Ostrava", 
          "id": "https://www.grid.ac/institutes/grid.440850.d", 
          "name": [
            "Institute of Geoinformatics, VSB Technical University of Ostrava, 17. listopadu 15, 70833, Ostrava, Poruba, Czech Republic"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Ardielli", 
        "givenName": "Ji\u0159\u00ed", 
        "id": "sg:person.010174430607.04", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.010174430607.04"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Technical University of Ostrava", 
          "id": "https://www.grid.ac/institutes/grid.440850.d", 
          "name": [
            "Institute of Geoinformatics, VSB Technical University of Ostrava, 17. listopadu 15, 70833, Ostrava, Poruba, Czech Republic"
          ], 
          "type": "Organization"
        }, 
        "familyName": "R\u016f\u017ei\u010dka", 
        "givenName": "Jan", 
        "id": "sg:person.013705736707.39", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.013705736707.39"
        ], 
        "type": "Person"
      }
    ], 
    "citation": [
      {
        "id": "https://app.dimensions.ai/details/publication/pub.1017234296", 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1007/978-1-4615-1665-1", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1017234296", 
          "https://doi.org/10.1007/978-1-4615-1665-1"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1007/978-1-4615-1665-1", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1017234296", 
          "https://doi.org/10.1007/978-1-4615-1665-1"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1002/aris.1440370103", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1025267224"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1007/bf01231602", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1042058432", 
          "https://doi.org/10.1007/bf01231602"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1007/bf01231602", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1042058432", 
          "https://doi.org/10.1007/bf01231602"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1007/11562214_58", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1044572227", 
          "https://doi.org/10.1007/11562214_58"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1007/11562214_58", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1044572227", 
          "https://doi.org/10.1007/11562214_58"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1145/1135777.1135799", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1048871055"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1109/tkde.2007.1041", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1061661647"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.7751/telopea20035604", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1074025928"
        ], 
        "type": "CreativeWork"
      }
    ], 
    "datePublished": "2011", 
    "datePublishedReg": "2011-01-01", 
    "description": "Geoparsing assigns geographic identifiers to textual words and phrases in documents. The specific problem is how to apply geoparsing in languages where changes of word termination occur. An appropriate method requires a flexible solution reflecting different strategies and priorities. Sixteen Czech RSS news channels were evaluated according to ten criteria. Three selected RSS channels were monitored for more than two years. The applied geoparsing included successive steps of different filters\u2019 application and utilized the generation of different grammatical cases for recognized entities. Various problems with geographical names are classified and documented. The quality assessment shows satisfactory results namely for identification of names in domiciles (94%). The pessimistic strategy is applied to analyze a geographical balance of news distribution. The results show significant differences between distribution of news in monitored channels and document a high concentration of cultural and national news in several locations.", 
    "editor": [
      {
        "familyName": "Katarzyniak", 
        "givenName": "Rados\u0142aw", 
        "type": "Person"
      }, 
      {
        "familyName": "Chiu", 
        "givenName": "Tzu-Fu", 
        "type": "Person"
      }, 
      {
        "familyName": "Hong", 
        "givenName": "Chao-Fu", 
        "type": "Person"
      }, 
      {
        "familyName": "Nguyen", 
        "givenName": "Ngoc Thanh", 
        "type": "Person"
      }
    ], 
    "genre": "chapter", 
    "id": "sg:pub.10.1007/978-3-642-23418-7_31", 
    "inLanguage": [
      "en"
    ], 
    "isAccessibleForFree": false, 
    "isPartOf": {
      "isbn": [
        "978-3-642-23417-0", 
        "978-3-642-23418-7"
      ], 
      "name": "Semantic Methods for Knowledge Management and Communication", 
      "type": "Book"
    }, 
    "name": "Geoparsing of Czech RSS News and Evaluation of Its Spatial Distribution", 
    "pagination": "353-367", 
    "productId": [
      {
        "name": "dimensions_id", 
        "type": "PropertyValue", 
        "value": [
          "pub.1049549984"
        ]
      }, 
      {
        "name": "doi", 
        "type": "PropertyValue", 
        "value": [
          "10.1007/978-3-642-23418-7_31"
        ]
      }, 
      {
        "name": "readcube_id", 
        "type": "PropertyValue", 
        "value": [
          "ec0b1e97cb6a52be25bfad426e3f8752ce59a0763614d16b1343dd7e8f91fc2d"
        ]
      }
    ], 
    "publisher": {
      "location": "Berlin, Heidelberg", 
      "name": "Springer Berlin Heidelberg", 
      "type": "Organisation"
    }, 
    "sameAs": [
      "https://doi.org/10.1007/978-3-642-23418-7_31", 
      "https://app.dimensions.ai/details/publication/pub.1049549984"
    ], 
    "sdDataset": "chapters", 
    "sdDatePublished": "2019-04-16T09:04", 
    "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
    "sdPublisher": {
      "name": "Springer Nature - SN SciGraph project", 
      "type": "Organization"
    }, 
    "sdSource": "s3://com-uberresearch-data-dimensions-target-20181106-alternative/cleanup/v134/2549eaecd7973599484d7c17b260dba0a4ecb94b/merge/v9/a6c9fde33151104705d4d7ff012ea9563521a3ce/jats-lookup/v90/0000000370_0000000370/records_46751_00000002.jsonl", 
    "type": "Chapter", 
    "url": "https://link.springer.com/10.1007%2F978-3-642-23418-7_31"
  }
]
 

Download the RDF metadata as:  json-ld nt turtle xml License info

HOW TO GET THIS DATA PROGRAMMATICALLY:

JSON-LD is a popular format for linked data which is fully compatible with JSON.

curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1007/978-3-642-23418-7_31'

N-Triples is a line-based linked data format ideal for batch operations.

curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1007/978-3-642-23418-7_31'

Turtle is a human-readable linked data format.

curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1007/978-3-642-23418-7_31'

RDF/XML is a standard XML format for linked data.

curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1007/978-3-642-23418-7_31'


 

This table displays all metadata directly associated to this object as RDF triples.

142 TRIPLES      23 PREDICATES      35 URIs      20 LITERALS      8 BLANK NODES

Subject Predicate Object
1 sg:pub.10.1007/978-3-642-23418-7_31 schema:about anzsrc-for:20
2 anzsrc-for:2004
3 schema:author N7be94c624cc54c64984fd2ad16dc4c6c
4 schema:citation sg:pub.10.1007/11562214_58
5 sg:pub.10.1007/978-1-4615-1665-1
6 sg:pub.10.1007/bf01231602
7 https://app.dimensions.ai/details/publication/pub.1017234296
8 https://doi.org/10.1002/aris.1440370103
9 https://doi.org/10.1109/tkde.2007.1041
10 https://doi.org/10.1145/1135777.1135799
11 https://doi.org/10.7751/telopea20035604
12 schema:datePublished 2011
13 schema:datePublishedReg 2011-01-01
14 schema:description Geoparsing assigns geographic identifiers to textual words and phrases in documents. The specific problem is how to apply geoparsing in languages where changes of word termination occur. An appropriate method requires a flexible solution reflecting different strategies and priorities. Sixteen Czech RSS news channels were evaluated according to ten criteria. Three selected RSS channels were monitored for more than two years. The applied geoparsing included successive steps of different filters’ application and utilized the generation of different grammatical cases for recognized entities. Various problems with geographical names are classified and documented. The quality assessment shows satisfactory results namely for identification of names in domiciles (94%). The pessimistic strategy is applied to analyze a geographical balance of news distribution. The results show significant differences between distribution of news in monitored channels and document a high concentration of cultural and national news in several locations.
15 schema:editor Na0847c668836452bb3e7854eed8f8ce4
16 schema:genre chapter
17 schema:inLanguage en
18 schema:isAccessibleForFree false
19 schema:isPartOf Nfac52c639b9c4cfabddfac4773462bcc
20 schema:name Geoparsing of Czech RSS News and Evaluation of Its Spatial Distribution
21 schema:pagination 353-367
22 schema:productId N2003e97366fc4c349b07ebca54447889
23 N5802db9bba22419b9fe3125a3f6ab5de
24 Nf36dc8e41a3c4ceda21bae9c2904033f
25 schema:publisher N81f63b90856a48c891cf12fe011c5576
26 schema:sameAs https://app.dimensions.ai/details/publication/pub.1049549984
27 https://doi.org/10.1007/978-3-642-23418-7_31
28 schema:sdDatePublished 2019-04-16T09:04
29 schema:sdLicense https://scigraph.springernature.com/explorer/license/
30 schema:sdPublisher N7cb290b638b248ef9f279146b689166b
31 schema:url https://link.springer.com/10.1007%2F978-3-642-23418-7_31
32 sgo:license sg:explorer/license/
33 sgo:sdDataset chapters
34 rdf:type schema:Chapter
35 N03c67430293c41b4b116682026cff5c9 schema:affiliation https://www.grid.ac/institutes/grid.440850.d
36 schema:familyName Belaj
37 schema:givenName Pavel
38 rdf:type schema:Person
39 N2003e97366fc4c349b07ebca54447889 schema:name doi
40 schema:value 10.1007/978-3-642-23418-7_31
41 rdf:type schema:PropertyValue
42 N299e2d681cc943b8bbb85c399b974990 rdf:first Ne3d18422c60348b49fce9f7f984e0cc7
43 rdf:rest Ndd1793c15e8642c7a9193160e7ef5f52
44 N2ec09d9913f845cc9258fd0bf175a0ca rdf:first sg:person.013705736707.39
45 rdf:rest rdf:nil
46 N3ca486d04bee4bd5bfd75baea54a4c72 rdf:first sg:person.014167141246.41
47 rdf:rest Nc6c4f47280474ca69a9afe114ee9a64a
48 N5802db9bba22419b9fe3125a3f6ab5de schema:name dimensions_id
49 schema:value pub.1049549984
50 rdf:type schema:PropertyValue
51 N59ef22db51944e599d99e9dc964d7ae0 schema:affiliation https://www.grid.ac/institutes/grid.448299.a
52 schema:familyName Nemec
53 schema:givenName Peter
54 rdf:type schema:Person
55 N7be94c624cc54c64984fd2ad16dc4c6c rdf:first sg:person.015061123613.29
56 rdf:rest Nddab8ff755424cb19568c3d9dc427288
57 N7cb290b638b248ef9f279146b689166b schema:name Springer Nature - SN SciGraph project
58 rdf:type schema:Organization
59 N81f63b90856a48c891cf12fe011c5576 schema:location Berlin, Heidelberg
60 schema:name Springer Berlin Heidelberg
61 rdf:type schema:Organisation
62 Na0847c668836452bb3e7854eed8f8ce4 rdf:first Nf6cb071c12484f4fb153b27a244642d0
63 rdf:rest Naa2fcdcef5e141c9bcb3e150895e047c
64 Naa2fcdcef5e141c9bcb3e150895e047c rdf:first Nf5e6adae0ff04dd19c1d00dd2e0e9489
65 rdf:rest N299e2d681cc943b8bbb85c399b974990
66 Naf509d84176f4b1fa63e876b82df6a44 schema:familyName Nguyen
67 schema:givenName Ngoc Thanh
68 rdf:type schema:Person
69 Nc6c4f47280474ca69a9afe114ee9a64a rdf:first N59ef22db51944e599d99e9dc964d7ae0
70 rdf:rest Neabf86546f9d4fab8cf33a1504250100
71 Ndd1793c15e8642c7a9193160e7ef5f52 rdf:first Naf509d84176f4b1fa63e876b82df6a44
72 rdf:rest rdf:nil
73 Nddab8ff755424cb19568c3d9dc427288 rdf:first N03c67430293c41b4b116682026cff5c9
74 rdf:rest N3ca486d04bee4bd5bfd75baea54a4c72
75 Ne3d18422c60348b49fce9f7f984e0cc7 schema:familyName Hong
76 schema:givenName Chao-Fu
77 rdf:type schema:Person
78 Neabf86546f9d4fab8cf33a1504250100 rdf:first sg:person.010174430607.04
79 rdf:rest N2ec09d9913f845cc9258fd0bf175a0ca
80 Nf36dc8e41a3c4ceda21bae9c2904033f schema:name readcube_id
81 schema:value ec0b1e97cb6a52be25bfad426e3f8752ce59a0763614d16b1343dd7e8f91fc2d
82 rdf:type schema:PropertyValue
83 Nf5e6adae0ff04dd19c1d00dd2e0e9489 schema:familyName Chiu
84 schema:givenName Tzu-Fu
85 rdf:type schema:Person
86 Nf6cb071c12484f4fb153b27a244642d0 schema:familyName Katarzyniak
87 schema:givenName Radosław
88 rdf:type schema:Person
89 Nfac52c639b9c4cfabddfac4773462bcc schema:isbn 978-3-642-23417-0
90 978-3-642-23418-7
91 schema:name Semantic Methods for Knowledge Management and Communication
92 rdf:type schema:Book
93 anzsrc-for:20 schema:inDefinedTermSet anzsrc-for:
94 schema:name Language, Communication and Culture
95 rdf:type schema:DefinedTerm
96 anzsrc-for:2004 schema:inDefinedTermSet anzsrc-for:
97 schema:name Linguistics
98 rdf:type schema:DefinedTerm
99 sg:person.010174430607.04 schema:affiliation https://www.grid.ac/institutes/grid.440850.d
100 schema:familyName Ardielli
101 schema:givenName Jiří
102 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.010174430607.04
103 rdf:type schema:Person
104 sg:person.013705736707.39 schema:affiliation https://www.grid.ac/institutes/grid.440850.d
105 schema:familyName Růžička
106 schema:givenName Jan
107 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.013705736707.39
108 rdf:type schema:Person
109 sg:person.014167141246.41 schema:affiliation https://www.grid.ac/institutes/grid.440850.d
110 schema:familyName Ivan
111 schema:givenName Igor
112 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.014167141246.41
113 rdf:type schema:Person
114 sg:person.015061123613.29 schema:affiliation https://www.grid.ac/institutes/grid.440850.d
115 schema:familyName Horák
116 schema:givenName Jiří
117 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.015061123613.29
118 rdf:type schema:Person
119 sg:pub.10.1007/11562214_58 schema:sameAs https://app.dimensions.ai/details/publication/pub.1044572227
120 https://doi.org/10.1007/11562214_58
121 rdf:type schema:CreativeWork
122 sg:pub.10.1007/978-1-4615-1665-1 schema:sameAs https://app.dimensions.ai/details/publication/pub.1017234296
123 https://doi.org/10.1007/978-1-4615-1665-1
124 rdf:type schema:CreativeWork
125 sg:pub.10.1007/bf01231602 schema:sameAs https://app.dimensions.ai/details/publication/pub.1042058432
126 https://doi.org/10.1007/bf01231602
127 rdf:type schema:CreativeWork
128 https://app.dimensions.ai/details/publication/pub.1017234296 schema:CreativeWork
129 https://doi.org/10.1002/aris.1440370103 schema:sameAs https://app.dimensions.ai/details/publication/pub.1025267224
130 rdf:type schema:CreativeWork
131 https://doi.org/10.1109/tkde.2007.1041 schema:sameAs https://app.dimensions.ai/details/publication/pub.1061661647
132 rdf:type schema:CreativeWork
133 https://doi.org/10.1145/1135777.1135799 schema:sameAs https://app.dimensions.ai/details/publication/pub.1048871055
134 rdf:type schema:CreativeWork
135 https://doi.org/10.7751/telopea20035604 schema:sameAs https://app.dimensions.ai/details/publication/pub.1074025928
136 rdf:type schema:CreativeWork
137 https://www.grid.ac/institutes/grid.440850.d schema:alternateName Technical University of Ostrava
138 schema:name Institute of Geoinformatics, VSB Technical University of Ostrava, 17. listopadu 15, 70833, Ostrava, Poruba, Czech Republic
139 rdf:type schema:Organization
140 https://www.grid.ac/institutes/grid.448299.a schema:alternateName Software602 (Czechia)
141 schema:name Software602 a. s., Hornokrčská 15, 140 00, Praha 4, Czech Republic
142 rdf:type schema:Organization
 




Preview window. Press ESC to close (or click here)


...