Proper Name Pronunciations for Speech Technology Applications View Full Text


Ontology type: schema:ScholarlyArticle     


Article Info

DATE

2003-10

AUTHORS

Murray F. Spiegel

ABSTRACT

This paper describes a 15-year research effort to improve the automatic pronunciation of proper names and details the issues involved in applying those pronunciations to speech synthesis and speech recognition. Our approach consists primarily of a large hand-tuned rule component, supplemented by a comparatively small pronunciation dictionary, both guided by extensive survey and polling data. Compared to other state-of-the-art programs, we use language-class identification to smaller degree. We utilize alternate pronunciations, obtained from the polling data, for both synthesis and recognition purposes. While our approach yields comparatively high accuracies, a comprehensive database of names and their pronunciations verified and authenticated through customer interactions (such as auto-attendants and automated directory assistance) will likely be the best future resource defining the ultimate in accuracy. More... »

PAGES

419-427

Identifiers

URI

http://scigraph.springernature.com/pub.10.1023/a:1025721319650

DOI

http://dx.doi.org/10.1023/a:1025721319650

DIMENSIONS

https://app.dimensions.ai/details/publication/pub.1022660869


Indexing Status Check whether this publication has been indexed by Scopus and Web Of Science using the SN Indexing Status Tool
Incoming Citations Browse incoming citations for this publication using opencitations.net

JSON-LD is the canonical representation for SciGraph data.

TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT

[
  {
    "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
    "about": [
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0801", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Artificial Intelligence and Image Processing", 
        "type": "DefinedTerm"
      }, 
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/08", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Information and Computing Sciences", 
        "type": "DefinedTerm"
      }
    ], 
    "author": [
      {
        "affiliation": {
          "alternateName": "Ericsson (United States)", 
          "id": "https://www.grid.ac/institutes/grid.432790.b", 
          "name": [
            "Speech Technology Applications Research, Telcordia Technologies, Morristown, NJ, USA"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Spiegel", 
        "givenName": "Murray F.", 
        "id": "sg:person.0611321467.22", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0611321467.22"
        ], 
        "type": "Person"
      }
    ], 
    "citation": [
      {
        "id": "https://doi.org/10.1016/0885-2308(91)90017-k", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1035874966"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1162/089120100561674", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1051289266"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1109/icassp.2002.5743825", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1093820210"
        ], 
        "type": "CreativeWork"
      }
    ], 
    "datePublished": "2003-10", 
    "datePublishedReg": "2003-10-01", 
    "description": "This paper describes a 15-year research effort to improve the automatic pronunciation of proper names and details the issues involved in applying those pronunciations to speech synthesis and speech recognition. Our approach consists primarily of a large hand-tuned rule component, supplemented by a comparatively small pronunciation dictionary, both guided by extensive survey and polling data. Compared to other state-of-the-art programs, we use language-class identification to smaller degree. We utilize alternate pronunciations, obtained from the polling data, for both synthesis and recognition purposes. While our approach yields comparatively high accuracies, a comprehensive database of names and their pronunciations verified and authenticated through customer interactions (such as auto-attendants and automated directory assistance) will likely be the best future resource defining the ultimate in accuracy.", 
    "genre": "research_article", 
    "id": "sg:pub.10.1023/a:1025721319650", 
    "inLanguage": [
      "en"
    ], 
    "isAccessibleForFree": false, 
    "isPartOf": [
      {
        "id": "sg:journal.1132409", 
        "issn": [
          "1381-2416", 
          "1572-8110"
        ], 
        "name": "International Journal of Speech Technology", 
        "type": "Periodical"
      }, 
      {
        "issueNumber": "4", 
        "type": "PublicationIssue"
      }, 
      {
        "type": "PublicationVolume", 
        "volumeNumber": "6"
      }
    ], 
    "name": "Proper Name Pronunciations for Speech Technology Applications", 
    "pagination": "419-427", 
    "productId": [
      {
        "name": "readcube_id", 
        "type": "PropertyValue", 
        "value": [
          "6bf5778427968b5fd4b7fd86f1a50035b39ecfdc5558305e3627e8ac61b393a6"
        ]
      }, 
      {
        "name": "doi", 
        "type": "PropertyValue", 
        "value": [
          "10.1023/a:1025721319650"
        ]
      }, 
      {
        "name": "dimensions_id", 
        "type": "PropertyValue", 
        "value": [
          "pub.1022660869"
        ]
      }
    ], 
    "sameAs": [
      "https://doi.org/10.1023/a:1025721319650", 
      "https://app.dimensions.ai/details/publication/pub.1022660869"
    ], 
    "sdDataset": "articles", 
    "sdDatePublished": "2019-04-10T14:59", 
    "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
    "sdPublisher": {
      "name": "Springer Nature - SN SciGraph project", 
      "type": "Organization"
    }, 
    "sdSource": "s3://com-uberresearch-data-dimensions-target-20181106-alternative/cleanup/v134/2549eaecd7973599484d7c17b260dba0a4ecb94b/merge/v9/a6c9fde33151104705d4d7ff012ea9563521a3ce/jats-lookup/v90/0000000001_0000000264/records_8663_00000505.jsonl", 
    "type": "ScholarlyArticle", 
    "url": "http://link.springer.com/10.1023%2FA%3A1025721319650"
  }
]
 

Download the RDF metadata as:  json-ld nt turtle xml License info

HOW TO GET THIS DATA PROGRAMMATICALLY:

JSON-LD is a popular format for linked data which is fully compatible with JSON.

curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1023/a:1025721319650'

N-Triples is a line-based linked data format ideal for batch operations.

curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1023/a:1025721319650'

Turtle is a human-readable linked data format.

curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1023/a:1025721319650'

RDF/XML is a standard XML format for linked data.

curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1023/a:1025721319650'


 

This table displays all metadata directly associated to this object as RDF triples.

70 TRIPLES      21 PREDICATES      30 URIs      19 LITERALS      7 BLANK NODES

Subject Predicate Object
1 sg:pub.10.1023/a:1025721319650 schema:about anzsrc-for:08
2 anzsrc-for:0801
3 schema:author Nf547b27ffdca48efac5d2f1a9160f35f
4 schema:citation https://doi.org/10.1016/0885-2308(91)90017-k
5 https://doi.org/10.1109/icassp.2002.5743825
6 https://doi.org/10.1162/089120100561674
7 schema:datePublished 2003-10
8 schema:datePublishedReg 2003-10-01
9 schema:description This paper describes a 15-year research effort to improve the automatic pronunciation of proper names and details the issues involved in applying those pronunciations to speech synthesis and speech recognition. Our approach consists primarily of a large hand-tuned rule component, supplemented by a comparatively small pronunciation dictionary, both guided by extensive survey and polling data. Compared to other state-of-the-art programs, we use language-class identification to smaller degree. We utilize alternate pronunciations, obtained from the polling data, for both synthesis and recognition purposes. While our approach yields comparatively high accuracies, a comprehensive database of names and their pronunciations verified and authenticated through customer interactions (such as auto-attendants and automated directory assistance) will likely be the best future resource defining the ultimate in accuracy.
10 schema:genre research_article
11 schema:inLanguage en
12 schema:isAccessibleForFree false
13 schema:isPartOf N6a9fd73b1c1347d98090e590596188ac
14 Naf891aa9d8474404997bfc9f8b4de5a5
15 sg:journal.1132409
16 schema:name Proper Name Pronunciations for Speech Technology Applications
17 schema:pagination 419-427
18 schema:productId N4deb54dde1ce473f9acbe7b09773d0f7
19 N61840e4984ac42808737e361154f8733
20 N9e88e459e25845b892c9afa60ecdbffc
21 schema:sameAs https://app.dimensions.ai/details/publication/pub.1022660869
22 https://doi.org/10.1023/a:1025721319650
23 schema:sdDatePublished 2019-04-10T14:59
24 schema:sdLicense https://scigraph.springernature.com/explorer/license/
25 schema:sdPublisher N304548386ef3422890f808b4e90692a2
26 schema:url http://link.springer.com/10.1023%2FA%3A1025721319650
27 sgo:license sg:explorer/license/
28 sgo:sdDataset articles
29 rdf:type schema:ScholarlyArticle
30 N304548386ef3422890f808b4e90692a2 schema:name Springer Nature - SN SciGraph project
31 rdf:type schema:Organization
32 N4deb54dde1ce473f9acbe7b09773d0f7 schema:name doi
33 schema:value 10.1023/a:1025721319650
34 rdf:type schema:PropertyValue
35 N61840e4984ac42808737e361154f8733 schema:name readcube_id
36 schema:value 6bf5778427968b5fd4b7fd86f1a50035b39ecfdc5558305e3627e8ac61b393a6
37 rdf:type schema:PropertyValue
38 N6a9fd73b1c1347d98090e590596188ac schema:issueNumber 4
39 rdf:type schema:PublicationIssue
40 N9e88e459e25845b892c9afa60ecdbffc schema:name dimensions_id
41 schema:value pub.1022660869
42 rdf:type schema:PropertyValue
43 Naf891aa9d8474404997bfc9f8b4de5a5 schema:volumeNumber 6
44 rdf:type schema:PublicationVolume
45 Nf547b27ffdca48efac5d2f1a9160f35f rdf:first sg:person.0611321467.22
46 rdf:rest rdf:nil
47 anzsrc-for:08 schema:inDefinedTermSet anzsrc-for:
48 schema:name Information and Computing Sciences
49 rdf:type schema:DefinedTerm
50 anzsrc-for:0801 schema:inDefinedTermSet anzsrc-for:
51 schema:name Artificial Intelligence and Image Processing
52 rdf:type schema:DefinedTerm
53 sg:journal.1132409 schema:issn 1381-2416
54 1572-8110
55 schema:name International Journal of Speech Technology
56 rdf:type schema:Periodical
57 sg:person.0611321467.22 schema:affiliation https://www.grid.ac/institutes/grid.432790.b
58 schema:familyName Spiegel
59 schema:givenName Murray F.
60 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0611321467.22
61 rdf:type schema:Person
62 https://doi.org/10.1016/0885-2308(91)90017-k schema:sameAs https://app.dimensions.ai/details/publication/pub.1035874966
63 rdf:type schema:CreativeWork
64 https://doi.org/10.1109/icassp.2002.5743825 schema:sameAs https://app.dimensions.ai/details/publication/pub.1093820210
65 rdf:type schema:CreativeWork
66 https://doi.org/10.1162/089120100561674 schema:sameAs https://app.dimensions.ai/details/publication/pub.1051289266
67 rdf:type schema:CreativeWork
68 https://www.grid.ac/institutes/grid.432790.b schema:alternateName Ericsson (United States)
69 schema:name Speech Technology Applications Research, Telcordia Technologies, Morristown, NJ, USA
70 rdf:type schema:Organization
 




Preview window. Press ESC to close (or click here)


...