Collecting Data for Automatic Speech Recognition Systems in Dialectal Arabic Using Games with a Purpose


Ontology type: schema:Chapter     


Chapter Info

DATE

2015

AUTHORS

Dayna El-Sakhawy , Slim Abdennadher , Injy Hamed

ABSTRACT

Building Automatic Speech Recognition (ASR) systems for spoken languages usually suffers from the problem of limited available transcriptions. ASR systems require large speech corpora that contain speech and its corresponding transcriptions for training acoustic models. In this paper, we target Egyptian dialectal Arabic. Like other spoken languages, it is mainly used for speaking rather than writing. Transcriptions are usually collected manually by experts. However, this has proved to be a time-consuming and expensive process. In this paper, we introduce Games With a Purpose as a cheap and fast approach to gathering transcriptions for Egyptian dialectal Arabic. Furthermore, Arabic orthographic transcriptions lack diacritization, which leads to ambiguity. On the other hand, transcriptions written in Arabic Chat Alphabet are widely used and include the pronunciation effects given by diacritics. In this work, we present the game (pronounced as makhamekho) that aims at collecting transcriptions in Arabic orthography as well as in Arabic Chat Alphabet. It also gathers mappings of words from Arabic orthography to Arabic Chat Alphabet.

PAGES

99-108

Book

TITLE

Multimodal Analyses enabling Artificial Agents in Human-Machine Interaction

ISBN

978-3-319-15556-2
978-3-319-15557-9

Author Affiliations

Identifiers

URI

http://scigraph.springernature.com/pub.10.1007/978-3-319-15557-9_10

DOI

http://dx.doi.org/10.1007/978-3-319-15557-9_10

DIMENSIONS

https://app.dimensions.ai/details/publication/pub.1053559331



JSON-LD is the canonical representation for SciGraph data.

TIP: You can open this SciGraph record using an external JSON-LD service such as JSON-LD Playground or Google SDTT.

[
  {
    "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
    "about": [
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0801", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Artificial Intelligence and Image Processing", 
        "type": "DefinedTerm"
      }, 
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/08", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Information and Computing Sciences", 
        "type": "DefinedTerm"
      }
    ], 
    "author": [
      {
        "affiliation": {
          "alternateName": "German University in Cairo", 
          "id": "https://www.grid.ac/institutes/grid.187323.c", 
          "name": [
            "Media Engineering and Technology Faculty, German University in Cairo, New Cairo, Egypt"
          ], 
          "type": "Organization"
        }, 
        "familyName": "El-Sakhawy", 
        "givenName": "Dayna", 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "German University in Cairo", 
          "id": "https://www.grid.ac/institutes/grid.187323.c", 
          "name": [
            "Media Engineering and Technology Faculty, German University in Cairo, New Cairo, Egypt"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Abdennadher", 
        "givenName": "Slim", 
        "id": "sg:person.010445445574.13", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.010445445574.13"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "German University in Cairo", 
          "id": "https://www.grid.ac/institutes/grid.187323.c", 
          "name": [
            "Media Engineering and Technology Faculty, German University in Cairo, New Cairo, Egypt"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Hamed", 
        "givenName": "Injy", 
        "id": "sg:person.016542541344.73", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.016542541344.73"
        ], 
        "type": "Person"
      }
    ], 
    "citation": [
      {
        "id": "https://doi.org/10.3115/1034678.1034680", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1009916577"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1109/slt.2010.5700870", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1093301973"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1109/icassp.2011.5947463", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1093438248"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.3115/1613715.1613751", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1099150814"
        ], 
        "type": "CreativeWork"
      }
    ], 
    "datePublished": "2015", 
    "datePublishedReg": "2015-01-01", 
    "description": "Building Automatic Speech Recognition (ASR) systems for spoken languages usually suffer from the problem of limited available transcriptions. Automatic Speech Recognition (ASR) systems require large speech corpora that contain speech and their corresponding transcriptions for training acoustic models. In this paper, we target the Egyptian dialectal Arabic. As other spoken languages, it is mainly used for spoken rather than writing purposes. Transcriptions are usually collected manually by experts. However, this proved to be a time-consuming and expensive process. In this paper, we introduce Games With a Purpose as a cheap and fast approach to gather transcriptions for Egyptian dialectal Arabic. Furthermore, Arabic orthographic transcriptions lack diacritizations, which leads to ambiguity. On the other hand, transcriptions written in Arabic Chat Alphabet are widely used, and include the pronunciation effects given by diacritics. In this work, we present the game (pronouced as makhamekho) that aims at collecting transcriptions in Arabic orthography, as well as in Arabic Chat Alphabet. It also gathers mappings of words from Arabic orthography to Arabic Chat Alphabet.", 
    "editor": [
      {
        "familyName": "B\u00f6ck", 
        "givenName": "Ronald", 
        "type": "Person"
      }, 
      {
        "familyName": "Bonin", 
        "givenName": "Francesca", 
        "type": "Person"
      }, 
      {
        "familyName": "Campbell", 
        "givenName": "Nick", 
        "type": "Person"
      }, 
      {
        "familyName": "Poppe", 
        "givenName": "Ronald", 
        "type": "Person"
      }
    ], 
    "genre": "chapter", 
    "id": "sg:pub.10.1007/978-3-319-15557-9_10", 
    "inLanguage": [
      "en"
    ], 
    "isAccessibleForFree": false, 
    "isPartOf": {
      "isbn": [
        "978-3-319-15556-2", 
        "978-3-319-15557-9"
      ], 
      "name": "Multimodal Analyses enabling Artificial Agents in Human-Machine Interaction", 
      "type": "Book"
    }, 
    "name": "Collecting Data for Automatic Speech Recognition Systems in Dialectal Arabic Using Games with a Purpose", 
    "pagination": "99-108", 
    "productId": [
      {
        "name": "doi", 
        "type": "PropertyValue", 
        "value": [
          "10.1007/978-3-319-15557-9_10"
        ]
      }, 
      {
        "name": "readcube_id", 
        "type": "PropertyValue", 
        "value": [
          "ff1bee4bc14b3ee37ddf83036cec9be71ae6cd83db68fa76d21fd786d3f36f44"
        ]
      }, 
      {
        "name": "dimensions_id", 
        "type": "PropertyValue", 
        "value": [
          "pub.1053559331"
        ]
      }
    ], 
    "publisher": {
      "location": "Cham", 
      "name": "Springer International Publishing", 
      "type": "Organisation"
    }, 
    "sameAs": [
      "https://doi.org/10.1007/978-3-319-15557-9_10", 
      "https://app.dimensions.ai/details/publication/pub.1053559331"
    ], 
    "sdDataset": "chapters", 
    "sdDatePublished": "2019-04-15T16:07", 
    "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
    "sdPublisher": {
      "name": "Springer Nature - SN SciGraph project", 
      "type": "Organization"
    }, 
    "sdSource": "s3://com-uberresearch-data-dimensions-target-20181106-alternative/cleanup/v134/2549eaecd7973599484d7c17b260dba0a4ecb94b/merge/v9/a6c9fde33151104705d4d7ff012ea9563521a3ce/jats-lookup/v90/0000000001_0000000264/records_8675_00000092.jsonl", 
    "type": "Chapter", 
    "url": "http://link.springer.com/10.1007/978-3-319-15557-9_10"
  }
]
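
As a minimal sketch of how a record like the one above can be consumed, the following Python snippet loads a trimmed excerpt of the JSON-LD and pulls out the title, author names, pagination and identifiers. The excerpt keeps only the fields it uses, and the helper name `summarize` is hypothetical (it is not part of SciGraph).

```python
import json

# Trimmed excerpt of the record above; only the fields used below are kept.
RECORD = """
[
  {
    "author": [
      {"familyName": "El-Sakhawy", "givenName": "Dayna", "type": "Person"},
      {"familyName": "Abdennadher", "givenName": "Slim", "type": "Person"},
      {"familyName": "Hamed", "givenName": "Injy", "type": "Person"}
    ],
    "name": "Collecting Data for Automatic Speech Recognition Systems in Dialectal Arabic Using Games with a Purpose",
    "pagination": "99-108",
    "productId": [
      {"name": "doi", "type": "PropertyValue", "value": ["10.1007/978-3-319-15557-9_10"]},
      {"name": "dimensions_id", "type": "PropertyValue", "value": ["pub.1053559331"]}
    ]
  }
]
"""


def summarize(record_json):
    """Hypothetical helper: extract a few fields from a SciGraph JSON-LD string."""
    node = json.loads(record_json)[0]  # the record is a one-element list
    authors = [f"{a['givenName']} {a['familyName']}" for a in node.get("author", [])]
    # Each productId entry holds a name and a one-element list of values.
    ids = {p["name"]: p["value"][0] for p in node.get("productId", [])}
    return {
        "title": node["name"],
        "authors": authors,
        "pages": node.get("pagination"),
        "ids": ids,
    }


summary = summarize(RECORD)
print(summary["authors"])
print(summary["ids"]["doi"])
```

Because JSON-LD is plain JSON, the standard `json` module is enough here; a full JSON-LD processor would only be needed to expand the `@context`.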
 


HOW TO GET THIS DATA PROGRAMMATICALLY:

JSON-LD is a popular format for linked data which is fully compatible with JSON.

curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1007/978-3-319-15557-9_10'

N-Triples is a line-based linked data format ideal for batch operations.

curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1007/978-3-319-15557-9_10'

Turtle is a human-readable linked data format.

curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1007/978-3-319-15557-9_10'

RDF/XML is a standard XML format for linked data.

curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1007/978-3-319-15557-9_10'
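
The four curl commands above differ only in the Accept header, so the same content negotiation can be sketched in Python with the standard library. The helper names `build_request` and `fetch` are hypothetical, the base URL and MIME types are copied from the commands above, and it is an assumption that the endpoint still answers such requests.

```python
import urllib.request

BASE = "https://scigraph.springernature.com/"

# Accept header for each serialization, as used in the curl commands above.
ACCEPT = {
    "json-ld": "application/ld+json",
    "nt": "application/n-triples",
    "turtle": "text/turtle",
    "xml": "application/rdf+xml",
}


def build_request(record_id, fmt):
    """Hypothetical helper: prepare a GET request asking for one RDF format."""
    return urllib.request.Request(BASE + record_id, headers={"Accept": ACCEPT[fmt]})


def fetch(record_id, fmt, timeout=30):
    """Hypothetical helper: send the request and return the body as text.

    Needs network access; it is not called in this sketch.
    """
    with urllib.request.urlopen(build_request(record_id, fmt), timeout=timeout) as resp:
        return resp.read().decode("utf-8")


req = build_request("pub.10.1007/978-3-319-15557-9_10", "turtle")
print(req.full_url)
print(req.get_header("Accept"))
```

Calling `fetch("pub.10.1007/978-3-319-15557-9_10", "nt")` would be the counterpart of the N-Triples curl command, provided the service is reachable.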


 

This table displays all metadata directly associated with this object as RDF triples.

105 TRIPLES      23 PREDICATES      31 URIs      20 LITERALS      8 BLANK NODES

Subject Predicate Object
1 sg:pub.10.1007/978-3-319-15557-9_10 schema:about anzsrc-for:08
2 anzsrc-for:0801
3 schema:author Nb1b7d12cce19415fb446831767d4353c
4 schema:citation https://doi.org/10.1109/icassp.2011.5947463
5 https://doi.org/10.1109/slt.2010.5700870
6 https://doi.org/10.3115/1034678.1034680
7 https://doi.org/10.3115/1613715.1613751
8 schema:datePublished 2015
9 schema:datePublishedReg 2015-01-01
10 schema:description Building Automatic Speech Recognition (ASR) systems for spoken languages usually suffer from the problem of limited available transcriptions. Automatic Speech Recognition (ASR) systems require large speech corpora that contain speech and their corresponding transcriptions for training acoustic models. In this paper, we target the Egyptian dialectal Arabic. As other spoken languages, it is mainly used for spoken rather than writing purposes. Transcriptions are usually collected manually by experts. However, this proved to be a time-consuming and expensive process. In this paper, we introduce Games With a Purpose as a cheap and fast approach to gather transcriptions for Egyptian dialectal Arabic. Furthermore, Arabic orthographic transcriptions lack diacritizations, which leads to ambiguity. On the other hand, transcriptions written in Arabic Chat Alphabet are widely used, and include the pronunciation effects given by diacritics. In this work, we present the game (pronouced as makhamekho) that aims at collecting transcriptions in Arabic orthography, as well as in Arabic Chat Alphabet. It also gathers mappings of words from Arabic orthography to Arabic Chat Alphabet.
11 schema:editor N7963d99016744ad682ef3b59badafba7
12 schema:genre chapter
13 schema:inLanguage en
14 schema:isAccessibleForFree false
15 schema:isPartOf N6babfda0dd914794873974e6ac2a4008
16 schema:name Collecting Data for Automatic Speech Recognition Systems in Dialectal Arabic Using Games with a Purpose
17 schema:pagination 99-108
18 schema:productId N04b39d08c14f42dfaf8bdc786c28ab44
19 N58ee5d2aabe2455faca37d35d6177647
20 N7236c7fceab34702a1ef30e231dc452a
21 schema:publisher N286d3cc711e24f3abfc0ee92e06c963d
22 schema:sameAs https://app.dimensions.ai/details/publication/pub.1053559331
23 https://doi.org/10.1007/978-3-319-15557-9_10
24 schema:sdDatePublished 2019-04-15T16:07
25 schema:sdLicense https://scigraph.springernature.com/explorer/license/
26 schema:sdPublisher Nbc3cccffff5c4d0f93720eb05e921514
27 schema:url http://link.springer.com/10.1007/978-3-319-15557-9_10
28 sgo:license sg:explorer/license/
29 sgo:sdDataset chapters
30 rdf:type schema:Chapter
31 N04b39d08c14f42dfaf8bdc786c28ab44 schema:name dimensions_id
32 schema:value pub.1053559331
33 rdf:type schema:PropertyValue
34 N0c54412fb3ac421882ba8baa286e89bd schema:familyName Poppe
35 schema:givenName Ronald
36 rdf:type schema:Person
37 N286d3cc711e24f3abfc0ee92e06c963d schema:location Cham
38 schema:name Springer International Publishing
39 rdf:type schema:Organisation
40 N58ee5d2aabe2455faca37d35d6177647 schema:name readcube_id
41 schema:value ff1bee4bc14b3ee37ddf83036cec9be71ae6cd83db68fa76d21fd786d3f36f44
42 rdf:type schema:PropertyValue
43 N65f265827caa4d32bfaa73fba9ce3421 rdf:first Nedf2f9519b1648449fcb3554e9805094
44 rdf:rest N9aa1daaa58f943e2b16a6ced58ded73e
45 N6babfda0dd914794873974e6ac2a4008 schema:isbn 978-3-319-15556-2
46 978-3-319-15557-9
47 schema:name Multimodal Analyses enabling Artificial Agents in Human-Machine Interaction
48 rdf:type schema:Book
49 N7236c7fceab34702a1ef30e231dc452a schema:name doi
50 schema:value 10.1007/978-3-319-15557-9_10
51 rdf:type schema:PropertyValue
52 N7963d99016744ad682ef3b59badafba7 rdf:first Ne92fd3eaf5aa4c80a2ed72fbfe323c1c
53 rdf:rest N65f265827caa4d32bfaa73fba9ce3421
54 N9aa1daaa58f943e2b16a6ced58ded73e rdf:first Ned9a93897a414579a91cf493e3ad0c4a
55 rdf:rest Na99f7aabe65b487b80e064eb37d2f68d
56 Na99f7aabe65b487b80e064eb37d2f68d rdf:first N0c54412fb3ac421882ba8baa286e89bd
57 rdf:rest rdf:nil
58 Nb1b7d12cce19415fb446831767d4353c rdf:first Nff8813ecef9f49a49b2368106c99f025
59 rdf:rest Nfebef0e9c2344857ac110cb71560dc0b
60 Nbc3cccffff5c4d0f93720eb05e921514 schema:name Springer Nature - SN SciGraph project
61 rdf:type schema:Organization
62 Nda97774af07a46d4af8a051bd55e6c8d rdf:first sg:person.016542541344.73
63 rdf:rest rdf:nil
64 Ne92fd3eaf5aa4c80a2ed72fbfe323c1c schema:familyName Böck
65 schema:givenName Ronald
66 rdf:type schema:Person
67 Ned9a93897a414579a91cf493e3ad0c4a schema:familyName Campbell
68 schema:givenName Nick
69 rdf:type schema:Person
70 Nedf2f9519b1648449fcb3554e9805094 schema:familyName Bonin
71 schema:givenName Francesca
72 rdf:type schema:Person
73 Nfebef0e9c2344857ac110cb71560dc0b rdf:first sg:person.010445445574.13
74 rdf:rest Nda97774af07a46d4af8a051bd55e6c8d
75 Nff8813ecef9f49a49b2368106c99f025 schema:affiliation https://www.grid.ac/institutes/grid.187323.c
76 schema:familyName El-Sakhawy
77 schema:givenName Dayna
78 rdf:type schema:Person
79 anzsrc-for:08 schema:inDefinedTermSet anzsrc-for:
80 schema:name Information and Computing Sciences
81 rdf:type schema:DefinedTerm
82 anzsrc-for:0801 schema:inDefinedTermSet anzsrc-for:
83 schema:name Artificial Intelligence and Image Processing
84 rdf:type schema:DefinedTerm
85 sg:person.010445445574.13 schema:affiliation https://www.grid.ac/institutes/grid.187323.c
86 schema:familyName Abdennadher
87 schema:givenName Slim
88 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.010445445574.13
89 rdf:type schema:Person
90 sg:person.016542541344.73 schema:affiliation https://www.grid.ac/institutes/grid.187323.c
91 schema:familyName Hamed
92 schema:givenName Injy
93 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.016542541344.73
94 rdf:type schema:Person
95 https://doi.org/10.1109/icassp.2011.5947463 schema:sameAs https://app.dimensions.ai/details/publication/pub.1093438248
96 rdf:type schema:CreativeWork
97 https://doi.org/10.1109/slt.2010.5700870 schema:sameAs https://app.dimensions.ai/details/publication/pub.1093301973
98 rdf:type schema:CreativeWork
99 https://doi.org/10.3115/1034678.1034680 schema:sameAs https://app.dimensions.ai/details/publication/pub.1009916577
100 rdf:type schema:CreativeWork
101 https://doi.org/10.3115/1613715.1613751 schema:sameAs https://app.dimensions.ai/details/publication/pub.1099150814
102 rdf:type schema:CreativeWork
103 https://www.grid.ac/institutes/grid.187323.c schema:alternateName German University in Cairo
104 schema:name Media Engineering and Technology Faculty, German University in Cairo, New Cairo, Egypt
105 rdf:type schema:Organization
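
In the table, the ordered editor and author lists are stored as RDF collections: each blank node carries an rdf:first (the item) and an rdf:rest (the next node), ending at rdf:nil. The sketch below walks the editor list that starts at the schema:editor object in row 11. The node identifiers are copied from rows 34, 43-44, 52-57, 64, 67 and 70; the function name `walk_rdf_list` and the dictionary layout are assumptions made for illustration.

```python
RDF_NIL = "rdf:nil"

# Blank node -> (rdf:first, rdf:rest), copied from rows 43-44 and 52-57.
EDITOR_CELLS = {
    "N7963d99016744ad682ef3b59badafba7": (
        "Ne92fd3eaf5aa4c80a2ed72fbfe323c1c", "N65f265827caa4d32bfaa73fba9ce3421"),
    "N65f265827caa4d32bfaa73fba9ce3421": (
        "Nedf2f9519b1648449fcb3554e9805094", "N9aa1daaa58f943e2b16a6ced58ded73e"),
    "N9aa1daaa58f943e2b16a6ced58ded73e": (
        "Ned9a93897a414579a91cf493e3ad0c4a", "Na99f7aabe65b487b80e064eb37d2f68d"),
    "Na99f7aabe65b487b80e064eb37d2f68d": (
        "N0c54412fb3ac421882ba8baa286e89bd", RDF_NIL),
}

# Person node -> schema:familyName, copied from rows 34, 64, 67 and 70.
FAMILY_NAME = {
    "Ne92fd3eaf5aa4c80a2ed72fbfe323c1c": "Böck",
    "Nedf2f9519b1648449fcb3554e9805094": "Bonin",
    "Ned9a93897a414579a91cf493e3ad0c4a": "Campbell",
    "N0c54412fb3ac421882ba8baa286e89bd": "Poppe",
}


def walk_rdf_list(head, cells):
    """Follow rdf:rest links from head and collect each rdf:first item in order."""
    items, node, seen = [], head, set()
    while node != RDF_NIL:
        if node in seen:  # guard against a malformed, cyclic list
            raise ValueError(f"cycle detected at {node}")
        seen.add(node)
        first, rest = cells[node]
        items.append(first)
        node = rest
    return items


# Row 11: schema:editor points at the head of the list.
editors = [FAMILY_NAME[n] for n in walk_rdf_list("N7963d99016744ad682ef3b59badafba7", EDITOR_CELLS)]
print(editors)
```

The resulting order matches the "editor" array in the JSON-LD record; the author list that starts at the schema:author object in row 3 can be walked the same way.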
 



