Semantic classification method for network Tibetan corpus View Full Text


Ontology type: schema:ScholarlyArticle     


Article Info

DATE

2017-03

AUTHORS

Gui-Xian Xu, Chang-Zhi Wang, Li-Hui Wang, Yu-Hong Zhou, Wei-Kang Li, Hao Xu, Qing Huang

ABSTRACT

Tibetan web pages appear enormously. It is meaningful that the information processing technology is utilized to find the useful knowledge from the Tibetan web information. Tibetan semantic ontology can enrich the Tibetan digital resource and is helpful to improve the information processing performance. In this paper, semantic classification of Tibetan network corpus is studied. Firstly Tibetan web pages are collected. Secondly preprocessing is conducted to extract the useful information from Web pages. Thirdly the word segmentation and text representation are introduced. Finally the text similarity classification algorithm is proposed to classify the text. During the experiment, the comparison between semantic classification and non semantic classification is conducted. The results show that the semantic classification performance is obviously superior to non semantic classification. This means that making full use of ontology semantic relationship can greatly enhance the classification accuracy. The research is useful and helpful to the study of Tibetan semantic information processing. More... »

PAGES

155-165

References to SciGraph publications

Identifiers

URI

http://scigraph.springernature.com/pub.10.1007/s10586-017-0742-6

DOI

http://dx.doi.org/10.1007/s10586-017-0742-6

DIMENSIONS

https://app.dimensions.ai/details/publication/pub.1053814668


Indexing Status Check whether this publication has been indexed by Scopus and Web Of Science using the SN Indexing Status Tool
Incoming Citations Browse incoming citations for this publication using opencitations.net

JSON-LD is the canonical representation for SciGraph data.

TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT

[
  {
    "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
    "about": [
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0801", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Artificial Intelligence and Image Processing", 
        "type": "DefinedTerm"
      }, 
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/08", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Information and Computing Sciences", 
        "type": "DefinedTerm"
      }
    ], 
    "author": [
      {
        "affiliation": {
          "alternateName": "Minzu University of China", 
          "id": "https://www.grid.ac/institutes/grid.411077.4", 
          "name": [
            "Information Engineering College, Minzu University of China, Beijing, China"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Xu", 
        "givenName": "Gui-Xian", 
        "id": "sg:person.011747311776.65", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.011747311776.65"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Minzu University of China", 
          "id": "https://www.grid.ac/institutes/grid.411077.4", 
          "name": [
            "Information Engineering College, Minzu University of China, Beijing, China"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Wang", 
        "givenName": "Chang-Zhi", 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Peking University", 
          "id": "https://www.grid.ac/institutes/grid.11135.37", 
          "name": [
            "School of Electronic Engineering and Computer Science, Peking University, Beijing, China"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Wang", 
        "givenName": "Li-Hui", 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Zhejiang University", 
          "id": "https://www.grid.ac/institutes/grid.13402.34", 
          "name": [
            "College of Software Engineering, Zhejiang University, Hangzhou, Zhejiang Province, China"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Zhou", 
        "givenName": "Yu-Hong", 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Minzu University of China", 
          "id": "https://www.grid.ac/institutes/grid.411077.4", 
          "name": [
            "Information Engineering College, Minzu University of China, Beijing, China"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Li", 
        "givenName": "Wei-Kang", 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Minzu University of China", 
          "id": "https://www.grid.ac/institutes/grid.411077.4", 
          "name": [
            "Information Engineering College, Minzu University of China, Beijing, China"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Xu", 
        "givenName": "Hao", 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Minzu University of China", 
          "id": "https://www.grid.ac/institutes/grid.411077.4", 
          "name": [
            "Information Engineering College, Minzu University of China, Beijing, China"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Huang", 
        "givenName": "Qing", 
        "type": "Person"
      }
    ], 
    "citation": [
      {
        "id": "https://doi.org/10.1016/j.patcog.2006.09.017", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1004116882"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1016/j.datak.2009.04.002", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1006279484"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1186/s13638-015-0498-8", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1023474408", 
          "https://doi.org/10.1186/s13638-015-0498-8"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1186/s13638-015-0498-8", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1023474408", 
          "https://doi.org/10.1186/s13638-015-0498-8"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1007/s10618-011-0238-6", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1028086968", 
          "https://doi.org/10.1007/s10618-011-0238-6"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1145/509907.509965", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1030416589"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1016/s1389-1286(99)00047-x", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1034663825"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1016/j.ijleo.2016.02.074", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1037141832"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.3724/sp.j.1016.2011.00856", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1071326143"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.3724/sp.j.1077.2011.00097", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1071331235"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://app.dimensions.ai/details/publication/pub.1078348468", 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1109/icinis.2011.7", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1094376948"
        ], 
        "type": "CreativeWork"
      }
    ], 
    "datePublished": "2017-03", 
    "datePublishedReg": "2017-03-01", 
    "description": "Tibetan web pages appear enormously. It is meaningful that the information processing technology is utilized to find the useful knowledge from the Tibetan web information. Tibetan semantic ontology can enrich the Tibetan digital resource and is helpful to improve the information processing performance. In this paper, semantic classification of Tibetan network corpus is studied. Firstly Tibetan web pages are collected. Secondly preprocessing is conducted to extract the useful information from Web pages. Thirdly the word segmentation and text representation are introduced. Finally the text similarity classification algorithm is proposed to classify the text. During the experiment, the comparison between semantic classification and non semantic classification is conducted. The results show that the semantic classification performance is obviously superior to non semantic classification. This means that making full use of ontology semantic relationship can greatly enhance the classification accuracy. The research is useful and helpful to the study of Tibetan semantic information processing.", 
    "genre": "research_article", 
    "id": "sg:pub.10.1007/s10586-017-0742-6", 
    "inLanguage": [
      "en"
    ], 
    "isAccessibleForFree": false, 
    "isPartOf": [
      {
        "id": "sg:journal.1046649", 
        "issn": [
          "1386-7857", 
          "1573-7543"
        ], 
        "name": "Cluster Computing", 
        "type": "Periodical"
      }, 
      {
        "issueNumber": "1", 
        "type": "PublicationIssue"
      }, 
      {
        "type": "PublicationVolume", 
        "volumeNumber": "20"
      }
    ], 
    "name": "Semantic classification method for network Tibetan corpus", 
    "pagination": "155-165", 
    "productId": [
      {
        "name": "readcube_id", 
        "type": "PropertyValue", 
        "value": [
          "32dfd1ae697d041b58e0732978e74e5ebcb53c21a67eba3c50d423c18e7abeb6"
        ]
      }, 
      {
        "name": "doi", 
        "type": "PropertyValue", 
        "value": [
          "10.1007/s10586-017-0742-6"
        ]
      }, 
      {
        "name": "dimensions_id", 
        "type": "PropertyValue", 
        "value": [
          "pub.1053814668"
        ]
      }
    ], 
    "sameAs": [
      "https://doi.org/10.1007/s10586-017-0742-6", 
      "https://app.dimensions.ai/details/publication/pub.1053814668"
    ], 
    "sdDataset": "articles", 
    "sdDatePublished": "2019-04-11T09:31", 
    "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
    "sdPublisher": {
      "name": "Springer Nature - SN SciGraph project", 
      "type": "Organization"
    }, 
    "sdSource": "s3://com-uberresearch-data-dimensions-target-20181106-alternative/cleanup/v134/2549eaecd7973599484d7c17b260dba0a4ecb94b/merge/v9/a6c9fde33151104705d4d7ff012ea9563521a3ce/jats-lookup/v90/0000000346_0000000346/records_99803_00000003.jsonl", 
    "type": "ScholarlyArticle", 
    "url": "https://link.springer.com/10.1007%2Fs10586-017-0742-6"
  }
]
 

Download the RDF metadata as:  json-ld nt turtle xml License info

HOW TO GET THIS DATA PROGRAMMATICALLY:

JSON-LD is a popular format for linked data which is fully compatible with JSON.

curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1007/s10586-017-0742-6'

N-Triples is a line-based linked data format ideal for batch operations.

curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1007/s10586-017-0742-6'

Turtle is a human-readable linked data format.

curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1007/s10586-017-0742-6'

RDF/XML is a standard XML format for linked data.

curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1007/s10586-017-0742-6'


 

This table displays all metadata directly associated to this object as RDF triples.

137 TRIPLES      21 PREDICATES      38 URIs      19 LITERALS      7 BLANK NODES

Subject Predicate Object
1 sg:pub.10.1007/s10586-017-0742-6 schema:about anzsrc-for:08
2 anzsrc-for:0801
3 schema:author N475cd38658814ebd8cca7fb4a7a60cae
4 schema:citation sg:pub.10.1007/s10618-011-0238-6
5 sg:pub.10.1186/s13638-015-0498-8
6 https://app.dimensions.ai/details/publication/pub.1078348468
7 https://doi.org/10.1016/j.datak.2009.04.002
8 https://doi.org/10.1016/j.ijleo.2016.02.074
9 https://doi.org/10.1016/j.patcog.2006.09.017
10 https://doi.org/10.1016/s1389-1286(99)00047-x
11 https://doi.org/10.1109/icinis.2011.7
12 https://doi.org/10.1145/509907.509965
13 https://doi.org/10.3724/sp.j.1016.2011.00856
14 https://doi.org/10.3724/sp.j.1077.2011.00097
15 schema:datePublished 2017-03
16 schema:datePublishedReg 2017-03-01
17 schema:description Tibetan web pages appear enormously. It is meaningful that the information processing technology is utilized to find the useful knowledge from the Tibetan web information. Tibetan semantic ontology can enrich the Tibetan digital resource and is helpful to improve the information processing performance. In this paper, semantic classification of Tibetan network corpus is studied. Firstly Tibetan web pages are collected. Secondly preprocessing is conducted to extract the useful information from Web pages. Thirdly the word segmentation and text representation are introduced. Finally the text similarity classification algorithm is proposed to classify the text. During the experiment, the comparison between semantic classification and non semantic classification is conducted. The results show that the semantic classification performance is obviously superior to non semantic classification. This means that making full use of ontology semantic relationship can greatly enhance the classification accuracy. The research is useful and helpful to the study of Tibetan semantic information processing.
18 schema:genre research_article
19 schema:inLanguage en
20 schema:isAccessibleForFree false
21 schema:isPartOf N884d68dbb331441c977000ae87e407bf
22 Na6603d50aa3b47f986feecf03498a767
23 sg:journal.1046649
24 schema:name Semantic classification method for network Tibetan corpus
25 schema:pagination 155-165
26 schema:productId N2e38cea259fb453f869c56abb16c6f8b
27 N8b5467a1c7694e40961a086ea12ff363
28 N8f78bb7460274b4291f4fe27a1e04bb8
29 schema:sameAs https://app.dimensions.ai/details/publication/pub.1053814668
30 https://doi.org/10.1007/s10586-017-0742-6
31 schema:sdDatePublished 2019-04-11T09:31
32 schema:sdLicense https://scigraph.springernature.com/explorer/license/
33 schema:sdPublisher N58740e6d6ac14d8fb06cb742a34717a6
34 schema:url https://link.springer.com/10.1007%2Fs10586-017-0742-6
35 sgo:license sg:explorer/license/
36 sgo:sdDataset articles
37 rdf:type schema:ScholarlyArticle
38 N195292ddcb584308854e616530af982b schema:affiliation https://www.grid.ac/institutes/grid.411077.4
39 schema:familyName Li
40 schema:givenName Wei-Kang
41 rdf:type schema:Person
42 N24215857e9034d129cc683ff1a9923ee schema:affiliation https://www.grid.ac/institutes/grid.13402.34
43 schema:familyName Zhou
44 schema:givenName Yu-Hong
45 rdf:type schema:Person
46 N2e38cea259fb453f869c56abb16c6f8b schema:name dimensions_id
47 schema:value pub.1053814668
48 rdf:type schema:PropertyValue
49 N3190e1d0ed5e4f2daca5041ee9940a72 rdf:first N195292ddcb584308854e616530af982b
50 rdf:rest Nf0c87799c27e430d9ffb91d8ca222b1b
51 N475cd38658814ebd8cca7fb4a7a60cae rdf:first sg:person.011747311776.65
52 rdf:rest Necade5eb3f5b4506b42ef400d4606c1a
53 N4a68ddcdff4b4c04a394d7463537f716 schema:affiliation https://www.grid.ac/institutes/grid.411077.4
54 schema:familyName Huang
55 schema:givenName Qing
56 rdf:type schema:Person
57 N58740e6d6ac14d8fb06cb742a34717a6 schema:name Springer Nature - SN SciGraph project
58 rdf:type schema:Organization
59 N5b1269b8eee942a68bc4305b8d538766 rdf:first Nece23a7e768d4fd5b061a9241e6e314b
60 rdf:rest Na7d6c26c4be44debbb74970a44a09126
61 N5c4237f31bff49a0af4350554771aedb rdf:first N4a68ddcdff4b4c04a394d7463537f716
62 rdf:rest rdf:nil
63 N884d68dbb331441c977000ae87e407bf schema:volumeNumber 20
64 rdf:type schema:PublicationVolume
65 N8b5467a1c7694e40961a086ea12ff363 schema:name readcube_id
66 schema:value 32dfd1ae697d041b58e0732978e74e5ebcb53c21a67eba3c50d423c18e7abeb6
67 rdf:type schema:PropertyValue
68 N8f78bb7460274b4291f4fe27a1e04bb8 schema:name doi
69 schema:value 10.1007/s10586-017-0742-6
70 rdf:type schema:PropertyValue
71 Na6603d50aa3b47f986feecf03498a767 schema:issueNumber 1
72 rdf:type schema:PublicationIssue
73 Na7d6c26c4be44debbb74970a44a09126 rdf:first N24215857e9034d129cc683ff1a9923ee
74 rdf:rest N3190e1d0ed5e4f2daca5041ee9940a72
75 Nba9a4fb841444f5b8708cb81c5823c98 schema:affiliation https://www.grid.ac/institutes/grid.411077.4
76 schema:familyName Xu
77 schema:givenName Hao
78 rdf:type schema:Person
79 Nbc2c92aaa00c423f8b7471d99252b4fd schema:affiliation https://www.grid.ac/institutes/grid.411077.4
80 schema:familyName Wang
81 schema:givenName Chang-Zhi
82 rdf:type schema:Person
83 Necade5eb3f5b4506b42ef400d4606c1a rdf:first Nbc2c92aaa00c423f8b7471d99252b4fd
84 rdf:rest N5b1269b8eee942a68bc4305b8d538766
85 Nece23a7e768d4fd5b061a9241e6e314b schema:affiliation https://www.grid.ac/institutes/grid.11135.37
86 schema:familyName Wang
87 schema:givenName Li-Hui
88 rdf:type schema:Person
89 Nf0c87799c27e430d9ffb91d8ca222b1b rdf:first Nba9a4fb841444f5b8708cb81c5823c98
90 rdf:rest N5c4237f31bff49a0af4350554771aedb
91 anzsrc-for:08 schema:inDefinedTermSet anzsrc-for:
92 schema:name Information and Computing Sciences
93 rdf:type schema:DefinedTerm
94 anzsrc-for:0801 schema:inDefinedTermSet anzsrc-for:
95 schema:name Artificial Intelligence and Image Processing
96 rdf:type schema:DefinedTerm
97 sg:journal.1046649 schema:issn 1386-7857
98 1573-7543
99 schema:name Cluster Computing
100 rdf:type schema:Periodical
101 sg:person.011747311776.65 schema:affiliation https://www.grid.ac/institutes/grid.411077.4
102 schema:familyName Xu
103 schema:givenName Gui-Xian
104 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.011747311776.65
105 rdf:type schema:Person
106 sg:pub.10.1007/s10618-011-0238-6 schema:sameAs https://app.dimensions.ai/details/publication/pub.1028086968
107 https://doi.org/10.1007/s10618-011-0238-6
108 rdf:type schema:CreativeWork
109 sg:pub.10.1186/s13638-015-0498-8 schema:sameAs https://app.dimensions.ai/details/publication/pub.1023474408
110 https://doi.org/10.1186/s13638-015-0498-8
111 rdf:type schema:CreativeWork
112 https://app.dimensions.ai/details/publication/pub.1078348468 schema:CreativeWork
113 https://doi.org/10.1016/j.datak.2009.04.002 schema:sameAs https://app.dimensions.ai/details/publication/pub.1006279484
114 rdf:type schema:CreativeWork
115 https://doi.org/10.1016/j.ijleo.2016.02.074 schema:sameAs https://app.dimensions.ai/details/publication/pub.1037141832
116 rdf:type schema:CreativeWork
117 https://doi.org/10.1016/j.patcog.2006.09.017 schema:sameAs https://app.dimensions.ai/details/publication/pub.1004116882
118 rdf:type schema:CreativeWork
119 https://doi.org/10.1016/s1389-1286(99)00047-x schema:sameAs https://app.dimensions.ai/details/publication/pub.1034663825
120 rdf:type schema:CreativeWork
121 https://doi.org/10.1109/icinis.2011.7 schema:sameAs https://app.dimensions.ai/details/publication/pub.1094376948
122 rdf:type schema:CreativeWork
123 https://doi.org/10.1145/509907.509965 schema:sameAs https://app.dimensions.ai/details/publication/pub.1030416589
124 rdf:type schema:CreativeWork
125 https://doi.org/10.3724/sp.j.1016.2011.00856 schema:sameAs https://app.dimensions.ai/details/publication/pub.1071326143
126 rdf:type schema:CreativeWork
127 https://doi.org/10.3724/sp.j.1077.2011.00097 schema:sameAs https://app.dimensions.ai/details/publication/pub.1071331235
128 rdf:type schema:CreativeWork
129 https://www.grid.ac/institutes/grid.11135.37 schema:alternateName Peking University
130 schema:name School of Electronic Engineering and Computer Science, Peking University, Beijing, China
131 rdf:type schema:Organization
132 https://www.grid.ac/institutes/grid.13402.34 schema:alternateName Zhejiang University
133 schema:name College of Software Engineering, Zhejiang University, Hangzhou, Zhejiang Province, China
134 rdf:type schema:Organization
135 https://www.grid.ac/institutes/grid.411077.4 schema:alternateName Minzu University of China
136 schema:name Information Engineering College, Minzu University of China, Beijing, China
137 rdf:type schema:Organization
 




Preview window. Press ESC to close (or click here)


...