Bilingual Voice Conversion by Weighted Frequency Warping Based on Formant Space View Full Text


Ontology type: schema:Chapter     


Chapter Info

DATE

2013

AUTHORS

Young-Sun Yun , Richard E. Ladner

ABSTRACT

Voice conversion is a technique that transforms the source speaker’s individuality to that of the target speaker. In this paper, we propose a simple and intuitive voice conversion algorithm that does not use training data between different languages, but uses text-to-speech generated speech rather than real recorded voices. The suggested method finds the transformed frequency by formant space warping. The formant space comprises four representative monophthongs for each language. The warping functions are represented by piecewise linear equations using pairs of four formants at matched monophthongs. Experimental results show the potential of the proposed method. More... »

PAGES

137-144

Book

TITLE

Text, Speech, and Dialogue

ISBN

978-3-642-40584-6
978-3-642-40585-3

Identifiers

URI

http://scigraph.springernature.com/pub.10.1007/978-3-642-40585-3_18

DOI

http://dx.doi.org/10.1007/978-3-642-40585-3_18

DIMENSIONS

https://app.dimensions.ai/details/publication/pub.1038659129


Indexing Status Check whether this publication has been indexed by Scopus and Web Of Science using the SN Indexing Status Tool
Incoming Citations Browse incoming citations for this publication using opencitations.net

JSON-LD is the canonical representation for SciGraph data.

TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT

[
  {
    "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
    "about": [
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/2004", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Linguistics", 
        "type": "DefinedTerm"
      }, 
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/20", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Language, Communication and Culture", 
        "type": "DefinedTerm"
      }
    ], 
    "author": [
      {
        "affiliation": {
          "alternateName": "Hannam University", 
          "id": "https://www.grid.ac/institutes/grid.411970.a", 
          "name": [
            "Dept. of Information and Communication Engineering, Hannam University, Daejeon, Republic of Korea, 306-791"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Yun", 
        "givenName": "Young-Sun", 
        "id": "sg:person.012422257207.75", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.012422257207.75"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "University of Washington", 
          "id": "https://www.grid.ac/institutes/grid.34477.33", 
          "name": [
            "Dept. of Computer Science and Engineering, University of Washington, Seattle, Washington\u00a0USA, 98195"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Ladner", 
        "givenName": "Richard E.", 
        "type": "Person"
      }
    ], 
    "citation": [
      {
        "id": "https://doi.org/10.1016/0167-6393(94)00058-i", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1041350006"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1016/0167-6393(94)00053-d", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1049408984"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1016/0167-6393(94)00052-c", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1051750656"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1109/tasl.2009.2038663", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1061516465"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1109/tasl.2012.2198058", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1061516939"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1109/asru.2003.1318521", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1093759268"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1109/isspit.2004.1433719", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1093765558"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1109/icassp.1997.596120", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1095088016"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1109/icassp.2006.1659962", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1095091226"
        ], 
        "type": "CreativeWork"
      }
    ], 
    "datePublished": "2013", 
    "datePublishedReg": "2013-01-01", 
    "description": "Voice conversion is a technique that transforms the source speaker\u2019s individuality to that of the target speaker. In this paper, we propose a simple and intuitive voice conversion algorithm that does not use training data between different languages, but uses text-to-speech generated speech rather than real recorded voices. The suggested method finds the transformed frequency by formant space warping. The formant space comprises four representative monophthongs for each language. The warping functions are represented by piecewise linear equations using pairs of four formants at matched monophthongs. Experimental results show the potential of the proposed method.", 
    "editor": [
      {
        "familyName": "Habernal", 
        "givenName": "Ivan", 
        "type": "Person"
      }, 
      {
        "familyName": "Matou\u0161ek", 
        "givenName": "V\u00e1clav", 
        "type": "Person"
      }
    ], 
    "genre": "chapter", 
    "id": "sg:pub.10.1007/978-3-642-40585-3_18", 
    "inLanguage": [
      "en"
    ], 
    "isAccessibleForFree": false, 
    "isPartOf": {
      "isbn": [
        "978-3-642-40584-6", 
        "978-3-642-40585-3"
      ], 
      "name": "Text, Speech, and Dialogue", 
      "type": "Book"
    }, 
    "name": "Bilingual Voice Conversion by Weighted Frequency Warping Based on Formant Space", 
    "pagination": "137-144", 
    "productId": [
      {
        "name": "doi", 
        "type": "PropertyValue", 
        "value": [
          "10.1007/978-3-642-40585-3_18"
        ]
      }, 
      {
        "name": "readcube_id", 
        "type": "PropertyValue", 
        "value": [
          "b8279bfbbb24faf0b9d415952f474fcee55bc9ebff8a1b1cd7d708c3c6018475"
        ]
      }, 
      {
        "name": "dimensions_id", 
        "type": "PropertyValue", 
        "value": [
          "pub.1038659129"
        ]
      }
    ], 
    "publisher": {
      "location": "Berlin, Heidelberg", 
      "name": "Springer Berlin Heidelberg", 
      "type": "Organisation"
    }, 
    "sameAs": [
      "https://doi.org/10.1007/978-3-642-40585-3_18", 
      "https://app.dimensions.ai/details/publication/pub.1038659129"
    ], 
    "sdDataset": "chapters", 
    "sdDatePublished": "2019-04-15T21:03", 
    "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
    "sdPublisher": {
      "name": "Springer Nature - SN SciGraph project", 
      "type": "Organization"
    }, 
    "sdSource": "s3://com-uberresearch-data-dimensions-target-20181106-alternative/cleanup/v134/2549eaecd7973599484d7c17b260dba0a4ecb94b/merge/v9/a6c9fde33151104705d4d7ff012ea9563521a3ce/jats-lookup/v90/0000000001_0000000264/records_8690_00000267.jsonl", 
    "type": "Chapter", 
    "url": "http://link.springer.com/10.1007/978-3-642-40585-3_18"
  }
]
 

Download the RDF metadata as:  json-ld nt turtle xml License info

HOW TO GET THIS DATA PROGRAMMATICALLY:

JSON-LD is a popular format for linked data which is fully compatible with JSON.

curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1007/978-3-642-40585-3_18'

N-Triples is a line-based linked data format ideal for batch operations.

curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1007/978-3-642-40585-3_18'

Turtle is a human-readable linked data format.

curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1007/978-3-642-40585-3_18'

RDF/XML is a standard XML format for linked data.

curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1007/978-3-642-40585-3_18'


 

This table displays all metadata directly associated to this object as RDF triples.

106 TRIPLES      23 PREDICATES      36 URIs      20 LITERALS      8 BLANK NODES

Subject Predicate Object
1 sg:pub.10.1007/978-3-642-40585-3_18 schema:about anzsrc-for:20
2 anzsrc-for:2004
3 schema:author N5bc29570df3b4cf0bfe2ec63fa1da275
4 schema:citation https://doi.org/10.1016/0167-6393(94)00052-c
5 https://doi.org/10.1016/0167-6393(94)00053-d
6 https://doi.org/10.1016/0167-6393(94)00058-i
7 https://doi.org/10.1109/asru.2003.1318521
8 https://doi.org/10.1109/icassp.1997.596120
9 https://doi.org/10.1109/icassp.2006.1659962
10 https://doi.org/10.1109/isspit.2004.1433719
11 https://doi.org/10.1109/tasl.2009.2038663
12 https://doi.org/10.1109/tasl.2012.2198058
13 schema:datePublished 2013
14 schema:datePublishedReg 2013-01-01
15 schema:description Voice conversion is a technique that transforms the source speaker’s individuality to that of the target speaker. In this paper, we propose a simple and intuitive voice conversion algorithm that does not use training data between different languages, but uses text-to-speech generated speech rather than real recorded voices. The suggested method finds the transformed frequency by formant space warping. The formant space comprises four representative monophthongs for each language. The warping functions are represented by piecewise linear equations using pairs of four formants at matched monophthongs. Experimental results show the potential of the proposed method.
16 schema:editor N8a7c58a9511147f9ae12128fc57b4bb2
17 schema:genre chapter
18 schema:inLanguage en
19 schema:isAccessibleForFree false
20 schema:isPartOf N5ddcd006348e46f3b38dbffce93e1d77
21 schema:name Bilingual Voice Conversion by Weighted Frequency Warping Based on Formant Space
22 schema:pagination 137-144
23 schema:productId N0232efd133f047c29750c0e5c5acb62c
24 Ne2acd29d2dd74a15a1a865717daa33fc
25 Nf7712f2658fc4c96aa3e1ff9b117c5e2
26 schema:publisher N9ab849fc9edb461d84b3dab950991b93
27 schema:sameAs https://app.dimensions.ai/details/publication/pub.1038659129
28 https://doi.org/10.1007/978-3-642-40585-3_18
29 schema:sdDatePublished 2019-04-15T21:03
30 schema:sdLicense https://scigraph.springernature.com/explorer/license/
31 schema:sdPublisher N0b09685cc5404502a00b1d5ddfa39971
32 schema:url http://link.springer.com/10.1007/978-3-642-40585-3_18
33 sgo:license sg:explorer/license/
34 sgo:sdDataset chapters
35 rdf:type schema:Chapter
36 N0232efd133f047c29750c0e5c5acb62c schema:name dimensions_id
37 schema:value pub.1038659129
38 rdf:type schema:PropertyValue
39 N096b119f00114fa2b411adbca663956d rdf:first N703f0ec77bf84c1299b9dc9a9883b733
40 rdf:rest rdf:nil
41 N0b09685cc5404502a00b1d5ddfa39971 schema:name Springer Nature - SN SciGraph project
42 rdf:type schema:Organization
43 N35022abd60df4a8ab3ba921126f76b3a schema:familyName Habernal
44 schema:givenName Ivan
45 rdf:type schema:Person
46 N5bc29570df3b4cf0bfe2ec63fa1da275 rdf:first sg:person.012422257207.75
47 rdf:rest Nbc8c447ac2ca466ea8b18589153b756e
48 N5ddcd006348e46f3b38dbffce93e1d77 schema:isbn 978-3-642-40584-6
49 978-3-642-40585-3
50 schema:name Text, Speech, and Dialogue
51 rdf:type schema:Book
52 N703f0ec77bf84c1299b9dc9a9883b733 schema:familyName Matoušek
53 schema:givenName Václav
54 rdf:type schema:Person
55 N8a7c58a9511147f9ae12128fc57b4bb2 rdf:first N35022abd60df4a8ab3ba921126f76b3a
56 rdf:rest N096b119f00114fa2b411adbca663956d
57 N9ab849fc9edb461d84b3dab950991b93 schema:location Berlin, Heidelberg
58 schema:name Springer Berlin Heidelberg
59 rdf:type schema:Organisation
60 Nbc8c447ac2ca466ea8b18589153b756e rdf:first Ne9cdff4f5aae40b08f91572f381ac228
61 rdf:rest rdf:nil
62 Ne2acd29d2dd74a15a1a865717daa33fc schema:name readcube_id
63 schema:value b8279bfbbb24faf0b9d415952f474fcee55bc9ebff8a1b1cd7d708c3c6018475
64 rdf:type schema:PropertyValue
65 Ne9cdff4f5aae40b08f91572f381ac228 schema:affiliation https://www.grid.ac/institutes/grid.34477.33
66 schema:familyName Ladner
67 schema:givenName Richard E.
68 rdf:type schema:Person
69 Nf7712f2658fc4c96aa3e1ff9b117c5e2 schema:name doi
70 schema:value 10.1007/978-3-642-40585-3_18
71 rdf:type schema:PropertyValue
72 anzsrc-for:20 schema:inDefinedTermSet anzsrc-for:
73 schema:name Language, Communication and Culture
74 rdf:type schema:DefinedTerm
75 anzsrc-for:2004 schema:inDefinedTermSet anzsrc-for:
76 schema:name Linguistics
77 rdf:type schema:DefinedTerm
78 sg:person.012422257207.75 schema:affiliation https://www.grid.ac/institutes/grid.411970.a
79 schema:familyName Yun
80 schema:givenName Young-Sun
81 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.012422257207.75
82 rdf:type schema:Person
83 https://doi.org/10.1016/0167-6393(94)00052-c schema:sameAs https://app.dimensions.ai/details/publication/pub.1051750656
84 rdf:type schema:CreativeWork
85 https://doi.org/10.1016/0167-6393(94)00053-d schema:sameAs https://app.dimensions.ai/details/publication/pub.1049408984
86 rdf:type schema:CreativeWork
87 https://doi.org/10.1016/0167-6393(94)00058-i schema:sameAs https://app.dimensions.ai/details/publication/pub.1041350006
88 rdf:type schema:CreativeWork
89 https://doi.org/10.1109/asru.2003.1318521 schema:sameAs https://app.dimensions.ai/details/publication/pub.1093759268
90 rdf:type schema:CreativeWork
91 https://doi.org/10.1109/icassp.1997.596120 schema:sameAs https://app.dimensions.ai/details/publication/pub.1095088016
92 rdf:type schema:CreativeWork
93 https://doi.org/10.1109/icassp.2006.1659962 schema:sameAs https://app.dimensions.ai/details/publication/pub.1095091226
94 rdf:type schema:CreativeWork
95 https://doi.org/10.1109/isspit.2004.1433719 schema:sameAs https://app.dimensions.ai/details/publication/pub.1093765558
96 rdf:type schema:CreativeWork
97 https://doi.org/10.1109/tasl.2009.2038663 schema:sameAs https://app.dimensions.ai/details/publication/pub.1061516465
98 rdf:type schema:CreativeWork
99 https://doi.org/10.1109/tasl.2012.2198058 schema:sameAs https://app.dimensions.ai/details/publication/pub.1061516939
100 rdf:type schema:CreativeWork
101 https://www.grid.ac/institutes/grid.34477.33 schema:alternateName University of Washington
102 schema:name Dept. of Computer Science and Engineering, University of Washington, Seattle, Washington USA, 98195
103 rdf:type schema:Organization
104 https://www.grid.ac/institutes/grid.411970.a schema:alternateName Hannam University
105 schema:name Dept. of Information and Communication Engineering, Hannam University, Daejeon, Republic of Korea, 306-791
106 rdf:type schema:Organization
 




Preview window. Press ESC to close (or click here)


...