Complexity and Universality in the Long-Range Order of Words


Ontology type: schema:Chapter      Open Access: True


Chapter Info

DATE

2016-05-19

AUTHORS

Marcelo A. Montemurro, Damián H. Zanette

ABSTRACT

As is the case of many signals produced by complex systems, language presents a statistical structure that is balanced between order and disorder. Here we review and extend recent results from quantitative characterisations of the degree of order in linguistic sequences that give insights into two relevant aspects of language: the presence of statistical universals in word ordering, and the link between semantic information and the statistical linguistic structure. We first analyse a measure of relative entropy that assesses how much the ordering of words contributes to the overall statistical structure of language. This measure presents an almost constant value close to 3.5 bits/word across several linguistic families. Then, we show that a direct application of information theory leads to an entropy measure that can quantify semantic structures and extract keywords from linguistic samples, even without prior knowledge of the underlying language.

PAGES

27-41

Identifiers

URI

http://scigraph.springernature.com/pub.10.1007/978-3-319-24403-7_3

DOI

http://dx.doi.org/10.1007/978-3-319-24403-7_3

DIMENSIONS

https://app.dimensions.ai/details/publication/pub.1010489878



JSON-LD is the canonical representation for SciGraph data.


[
  {
    "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
    "about": [
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/01", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Mathematical Sciences", 
        "type": "DefinedTerm"
      }, 
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/20", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Language, Communication and Culture", 
        "type": "DefinedTerm"
      }, 
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0104", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Statistics", 
        "type": "DefinedTerm"
      }, 
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/2004", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Linguistics", 
        "type": "DefinedTerm"
      }
    ], 
    "author": [
      {
        "affiliation": {
          "alternateName": "Faculty of Life Sciences, The University of Manchester, Manchester, UK", 
          "id": "http://www.grid.ac/institutes/grid.5379.8", 
          "name": [
            "Faculty of Life Sciences, The University of Manchester, Manchester, UK"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Montemurro", 
        "givenName": "Marcelo A.", 
        "id": "sg:person.01241314317.99", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01241314317.99"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Centro At\u00f3mico Bariloche e Insituto Balseiro, San Carlos de Bariloche, R\u00edo Negro, Argentina", 
          "id": "http://www.grid.ac/institutes/grid.418211.f", 
          "name": [
            "Centro At\u00f3mico Bariloche e Insituto Balseiro, San Carlos de Bariloche, R\u00edo Negro, Argentina"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Zanette", 
        "givenName": "Dami\u00e1n H.", 
        "id": "sg:person.0673125037.10", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0673125037.10"
        ], 
        "type": "Person"
      }
    ], 
    "datePublished": "2016-05-19", 
    "datePublishedReg": "2016-05-19", 
    "description": "As is the case of many signals produced by complex systems, language presents a statistical structure that is balanced between order and disorder. Here we review and extend recent results from quantitative characterisations of the degree of order in linguistic sequences that give insights into two relevant aspects of language: the presence of statistical universals in word ordering, and the link between semantic information and the statistical linguistic structure. We first analyse a measure of relative entropy that assesses how much the ordering of words contributes to the overall statistical structure of language. This measure presents an almost constant value close to 3.5 bits/word across several linguistic families. Then, we show that a direct application of information theory leads to an entropy measure that can quantify semantic structures and extract keywords from linguistic samples, even without prior knowledge of the underlying language.", 
    "editor": [
      {
        "familyName": "Degli Esposti", 
        "givenName": "Mirko", 
        "type": "Person"
      }, 
      {
        "familyName": "Altmann", 
        "givenName": "Eduardo G.", 
        "type": "Person"
      }, 
      {
        "familyName": "Pachet", 
        "givenName": "Fran\u00e7ois", 
        "type": "Person"
      }
    ], 
    "genre": "chapter", 
    "id": "sg:pub.10.1007/978-3-319-24403-7_3", 
    "inLanguage": "en", 
    "isAccessibleForFree": true, 
    "isPartOf": {
      "isbn": [
        "978-3-319-24401-3", 
        "978-3-319-24403-7"
      ], 
      "name": "Creativity and Universality in Language", 
      "type": "Book"
    }, 
    "keywords": [
      "overall statistical structure", 
      "bits/word", 
      "statistical structure", 
      "ordering of words", 
      "linguistic structure", 
      "linguistic samples", 
      "linguistic family", 
      "statistical universals", 
      "word ordering", 
      "linguistic sequences", 
      "language", 
      "semantic structure", 
      "underlying language", 
      "words", 
      "information theory", 
      "relative entropy", 
      "complex systems", 
      "entropy measure", 
      "semantic information", 
      "recent results", 
      "direct application", 
      "universals", 
      "prior knowledge", 
      "universality", 
      "relevant aspects", 
      "entropy", 
      "theory", 
      "aspects", 
      "complexity", 
      "keywords", 
      "knowledge", 
      "order", 
      "applications", 
      "quantitative characterisation", 
      "system", 
      "ordering", 
      "insights", 
      "link", 
      "structure", 
      "family", 
      "measures", 
      "signals", 
      "constant value", 
      "long-range order", 
      "cases", 
      "results", 
      "sequence", 
      "information", 
      "degree of order", 
      "values", 
      "degree", 
      "characterisation", 
      "presence", 
      "samples", 
      "disorders", 
      "statistical linguistic structure"
    ], 
    "name": "Complexity and Universality in the Long-Range Order of Words", 
    "pagination": "27-41", 
    "productId": [
      {
        "name": "dimensions_id", 
        "type": "PropertyValue", 
        "value": [
          "pub.1010489878"
        ]
      }, 
      {
        "name": "doi", 
        "type": "PropertyValue", 
        "value": [
          "10.1007/978-3-319-24403-7_3"
        ]
      }
    ], 
    "publisher": {
      "name": "Springer Nature", 
      "type": "Organisation"
    }, 
    "sameAs": [
      "https://doi.org/10.1007/978-3-319-24403-7_3", 
      "https://app.dimensions.ai/details/publication/pub.1010489878"
    ], 
    "sdDataset": "chapters", 
    "sdDatePublished": "2022-01-01T19:20", 
    "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
    "sdPublisher": {
      "name": "Springer Nature - SN SciGraph project", 
      "type": "Organization"
    }, 
    "sdSource": "s3://com-springernature-scigraph/baseset/20220101/entities/gbq_results/chapter/chapter_355.jsonl", 
    "type": "Chapter", 
    "url": "https://doi.org/10.1007/978-3-319-24403-7_3"
  }
]
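Because the record is plain JSON, its fields can be read with any JSON parser. The following is a minimal sketch, assuming an abridged copy of the record above that keeps only the fields being read; the helper name `summarise_record` is hypothetical and not part of SciGraph.

```python
import json

# Abridged copy of the record above (assumption: only the fields read below are kept).
# A raw string keeps the JSON escape \u00e1 intact so that json.loads decodes it.
RECORD_JSON = r"""
[
  {
    "author": [
      {"familyName": "Montemurro", "givenName": "Marcelo A."},
      {"familyName": "Zanette", "givenName": "Dami\u00e1n H."}
    ],
    "name": "Complexity and Universality in the Long-Range Order of Words",
    "pagination": "27-41",
    "productId": [
      {"name": "dimensions_id", "type": "PropertyValue", "value": ["pub.1010489878"]},
      {"name": "doi", "type": "PropertyValue", "value": ["10.1007/978-3-319-24403-7_3"]}
    ]
  }
]
"""


def summarise_record(record):
    """Return title, author names, and DOI from one record (hypothetical helper)."""
    authors = [f"{a['givenName']} {a['familyName']}" for a in record["author"]]
    # productId is a list of name/value pairs; pick the entry named "doi".
    doi = next(
        (p["value"][0] for p in record.get("productId", []) if p["name"] == "doi"),
        None,
    )
    return {"title": record["name"], "authors": authors, "doi": doi}


records = json.loads(RECORD_JSON)  # the top level is a JSON array of records
summary = summarise_record(records[0])
print(summary)
```

The sketch treats the record as ordinary JSON and ignores the `@context`; a full JSON-LD processor would be needed to expand the keys into URIs.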
 


HOW TO GET THIS DATA PROGRAMMATICALLY:

JSON-LD is a popular format for linked data which is fully compatible with JSON.

curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1007/978-3-319-24403-7_3'

N-Triples is a line-based linked data format ideal for batch operations.

curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1007/978-3-319-24403-7_3'

Turtle is a human-readable linked data format.

curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1007/978-3-319-24403-7_3'

RDF/XML is a standard XML format for linked data.

curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1007/978-3-319-24403-7_3'
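The four curl commands above differ only in the `Accept` header, which is how the server is asked for a particular serialisation. A minimal Python sketch of the same content negotiation follows, using only the standard library. The function names `build_request` and `fetch` and the short format labels are illustrative assumptions; whether the endpoint still responds has not been verified here.

```python
import urllib.request

BASE_URL = "https://scigraph.springernature.com/pub.10.1007/978-3-319-24403-7_3"

# Illustrative format labels mapped to the MIME types used in the curl commands above.
ACCEPT_TYPES = {
    "json-ld": "application/ld+json",
    "nt": "application/n-triples",
    "turtle": "text/turtle",
    "xml": "application/rdf+xml",
}


def build_request(fmt, url=BASE_URL):
    """Build a GET request whose Accept header selects the serialisation."""
    return urllib.request.Request(url, headers={"Accept": ACCEPT_TYPES[fmt]})


def fetch(fmt, url=BASE_URL, timeout=30):
    """Send the request and return the response body as text (needs network access)."""
    with urllib.request.urlopen(build_request(fmt, url), timeout=timeout) as resp:
        return resp.read().decode("utf-8")


# Building the request does not touch the network, so it is safe to inspect offline.
req = build_request("turtle")
print(req.full_url, req.get_header("Accept"))
```

Calling `fetch("json-ld")` would be the counterpart of the first curl command; it is left uncalled here so that the sketch runs without a connection.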


 

This table displays all metadata directly associated with this object as RDF triples.

144 TRIPLES      23 PREDICATES      83 URIs      74 LITERALS      7 BLANK NODES

Row Subject Predicate Object
(Rows that omit the subject or predicate repeat the value from the row above.)
1 sg:pub.10.1007/978-3-319-24403-7_3 schema:about anzsrc-for:01
2 anzsrc-for:0104
3 anzsrc-for:20
4 anzsrc-for:2004
5 schema:author N85f45ea48200483ba1f7a9c1fc686e8c
6 schema:datePublished 2016-05-19
7 schema:datePublishedReg 2016-05-19
8 schema:description As is the case of many signals produced by complex systems, language presents a statistical structure that is balanced between order and disorder. Here we review and extend recent results from quantitative characterisations of the degree of order in linguistic sequences that give insights into two relevant aspects of language: the presence of statistical universals in word ordering, and the link between semantic information and the statistical linguistic structure. We first analyse a measure of relative entropy that assesses how much the ordering of words contributes to the overall statistical structure of language. This measure presents an almost constant value close to 3.5 bits/word across several linguistic families. Then, we show that a direct application of information theory leads to an entropy measure that can quantify semantic structures and extract keywords from linguistic samples, even without prior knowledge of the underlying language.
9 schema:editor N3a8e9dfe5f954c71b51b4432b05c7606
10 schema:genre chapter
11 schema:inLanguage en
12 schema:isAccessibleForFree true
13 schema:isPartOf Ne6f8d56afc6b434e9d3465e95949830a
14 schema:keywords applications
15 aspects
16 bits/word
17 cases
18 characterisation
19 complex systems
20 complexity
21 constant value
22 degree
23 degree of order
24 direct application
25 disorders
26 entropy
27 entropy measure
28 family
29 information
30 information theory
31 insights
32 keywords
33 knowledge
34 language
35 linguistic family
36 linguistic samples
37 linguistic sequences
38 linguistic structure
39 link
40 long-range order
41 measures
42 order
43 ordering
44 ordering of words
45 overall statistical structure
46 presence
47 prior knowledge
48 quantitative characterisation
49 recent results
50 relative entropy
51 relevant aspects
52 results
53 samples
54 semantic information
55 semantic structure
56 sequence
57 signals
58 statistical linguistic structure
59 statistical structure
60 statistical universals
61 structure
62 system
63 theory
64 underlying language
65 universality
66 universals
67 values
68 word ordering
69 words
70 schema:name Complexity and Universality in the Long-Range Order of Words
71 schema:pagination 27-41
72 schema:productId N38578441d76142c1a345fcbe256f6b85
73 Ndc051257f8fc4343a8949acec2e77c13
74 schema:publisher N72153c1bd1b9481b9dca2e419b59579e
75 schema:sameAs https://app.dimensions.ai/details/publication/pub.1010489878
76 https://doi.org/10.1007/978-3-319-24403-7_3
77 schema:sdDatePublished 2022-01-01T19:20
78 schema:sdLicense https://scigraph.springernature.com/explorer/license/
79 schema:sdPublisher N2b046481e9774319aacb603e0f880485
80 schema:url https://doi.org/10.1007/978-3-319-24403-7_3
81 sgo:license sg:explorer/license/
82 sgo:sdDataset chapters
83 rdf:type schema:Chapter
84 N2b046481e9774319aacb603e0f880485 schema:name Springer Nature - SN SciGraph project
85 rdf:type schema:Organization
86 N38578441d76142c1a345fcbe256f6b85 schema:name dimensions_id
87 schema:value pub.1010489878
88 rdf:type schema:PropertyValue
89 N3a8e9dfe5f954c71b51b4432b05c7606 rdf:first N7427adddabc94437a569111015ac9b76
90 rdf:rest N479ba78589974c05866bb1e50132e3dc
91 N479ba78589974c05866bb1e50132e3dc rdf:first Nab6995c46e42466fb18dfed8b06f72f6
92 rdf:rest Nf559d0bb18024e2a9c583bd22c7f38d9
93 N72153c1bd1b9481b9dca2e419b59579e schema:name Springer Nature
94 rdf:type schema:Organisation
95 N7427adddabc94437a569111015ac9b76 schema:familyName Degli Esposti
96 schema:givenName Mirko
97 rdf:type schema:Person
98 N85f45ea48200483ba1f7a9c1fc686e8c rdf:first sg:person.01241314317.99
99 rdf:rest N9c929953c979484cb1187ecfecbf65f8
100 N9c929953c979484cb1187ecfecbf65f8 rdf:first sg:person.0673125037.10
101 rdf:rest rdf:nil
102 Nab6995c46e42466fb18dfed8b06f72f6 schema:familyName Altmann
103 schema:givenName Eduardo G.
104 rdf:type schema:Person
105 Nc40a713c97904e52889319b6022560ff schema:familyName Pachet
106 schema:givenName François
107 rdf:type schema:Person
108 Ndc051257f8fc4343a8949acec2e77c13 schema:name doi
109 schema:value 10.1007/978-3-319-24403-7_3
110 rdf:type schema:PropertyValue
111 Ne6f8d56afc6b434e9d3465e95949830a schema:isbn 978-3-319-24401-3
112 978-3-319-24403-7
113 schema:name Creativity and Universality in Language
114 rdf:type schema:Book
115 Nf559d0bb18024e2a9c583bd22c7f38d9 rdf:first Nc40a713c97904e52889319b6022560ff
116 rdf:rest rdf:nil
117 anzsrc-for:01 schema:inDefinedTermSet anzsrc-for:
118 schema:name Mathematical Sciences
119 rdf:type schema:DefinedTerm
120 anzsrc-for:0104 schema:inDefinedTermSet anzsrc-for:
121 schema:name Statistics
122 rdf:type schema:DefinedTerm
123 anzsrc-for:20 schema:inDefinedTermSet anzsrc-for:
124 schema:name Language, Communication and Culture
125 rdf:type schema:DefinedTerm
126 anzsrc-for:2004 schema:inDefinedTermSet anzsrc-for:
127 schema:name Linguistics
128 rdf:type schema:DefinedTerm
129 sg:person.01241314317.99 schema:affiliation grid-institutes:grid.5379.8
130 schema:familyName Montemurro
131 schema:givenName Marcelo A.
132 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01241314317.99
133 rdf:type schema:Person
134 sg:person.0673125037.10 schema:affiliation grid-institutes:grid.418211.f
135 schema:familyName Zanette
136 schema:givenName Damián H.
137 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0673125037.10
138 rdf:type schema:Person
139 grid-institutes:grid.418211.f schema:alternateName Centro Atómico Bariloche e Insituto Balseiro, San Carlos de Bariloche, Río Negro, Argentina
140 schema:name Centro Atómico Bariloche e Insituto Balseiro, San Carlos de Bariloche, Río Negro, Argentina
141 rdf:type schema:Organization
142 grid-institutes:grid.5379.8 schema:alternateName Faculty of Life Sciences, The University of Manchester, Manchester, UK
143 schema:name Faculty of Life Sciences, The University of Manchester, Manchester, UK
144 rdf:type schema:Organization
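In the table above, the ordered author and editor lists are encoded as RDF collections: each blank node carries one item in `rdf:first` and points to the next node in `rdf:rest`, ending at `rdf:nil`. The sketch below walks the author list to recover its order. The triples are copied from the table by hand, and the function name `walk_rdf_list` is a hypothetical helper, not a SciGraph or RDF library API.

```python
RDF_FIRST, RDF_REST, RDF_NIL = "rdf:first", "rdf:rest", "rdf:nil"

# (subject, predicate, object) rows for the schema:author list, copied from the table above.
TRIPLES = [
    ("N85f45ea48200483ba1f7a9c1fc686e8c", RDF_FIRST, "sg:person.01241314317.99"),
    ("N85f45ea48200483ba1f7a9c1fc686e8c", RDF_REST, "N9c929953c979484cb1187ecfecbf65f8"),
    ("N9c929953c979484cb1187ecfecbf65f8", RDF_FIRST, "sg:person.0673125037.10"),
    ("N9c929953c979484cb1187ecfecbf65f8", RDF_REST, RDF_NIL),
]


def walk_rdf_list(head, triples):
    """Follow rdf:first / rdf:rest links from head and return the items in order."""
    index = {(s, p): o for s, p, o in triples}
    items, seen, node = [], set(), head
    while node != RDF_NIL:
        if node in seen:  # guard against a malformed, cyclic list
            raise ValueError(f"cycle detected at {node}")
        seen.add(node)
        items.append(index[(node, RDF_FIRST)])
        node = index[(node, RDF_REST)]
    return items


# The head node is the object of the schema:author triple for this publication.
authors = walk_rdf_list("N85f45ea48200483ba1f7a9c1fc686e8c", TRIPLES)
print(authors)
```

The same traversal applies to the `schema:editor` list, starting from its own head node.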
 



