Speech/speaker recognition using a HMM/GMM hybrid model View Full Text


Ontology type: schema:Chapter     


Chapter Info

DATE

1997

AUTHORS

Elena Rodríguez , Belén Ruíz , Ángel García-Crespo , Fernando García

ABSTRACT

In this paper, a speaker recognition voice based system is presented [5]. We have implemented it in a Sun platform.We train (and test) the system using a Database recorded in several sessions in order to repair the huge effects that the speech variability with time has in the recognition rate system. Several experiments have been made in order to achieve the best configuration in the system set up. This is an important point to take into account in a real world system in which users train the system once and the models generated in the training process are not updated for strategic reasons. The recognition rate obtained for the proposed system is around 93% if the speech came from a microphone is around 90% when the speech came from a phone line. More... »

PAGES

227-234

Book

TITLE

Audio- and Video-based Biometric Person Authentication

ISBN

978-3-540-62660-2
978-3-540-68425-1

Author Affiliations

Identifiers

URI

http://scigraph.springernature.com/pub.10.1007/bfb0016000

DOI

http://dx.doi.org/10.1007/bfb0016000

DIMENSIONS

https://app.dimensions.ai/details/publication/pub.1014441711


Indexing Status Check whether this publication has been indexed by Scopus and Web Of Science using the SN Indexing Status Tool
Incoming Citations Browse incoming citations for this publication using opencitations.net

JSON-LD is the canonical representation for SciGraph data.

TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT

[
  {
    "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
    "about": [
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0801", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Artificial Intelligence and Image Processing", 
        "type": "DefinedTerm"
      }, 
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/08", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Information and Computing Sciences", 
        "type": "DefinedTerm"
      }
    ], 
    "author": [
      {
        "affiliation": {
          "alternateName": "Carlos III University of Madrid", 
          "id": "https://www.grid.ac/institutes/grid.7840.b", 
          "name": [
            "Universidad Carlos III de Madrid, c/ Butarque, 15, 28911\u00a0Legan\u00e9s(Madrid), Spain"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Rodr\u00edguez", 
        "givenName": "Elena", 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Carlos III University of Madrid", 
          "id": "https://www.grid.ac/institutes/grid.7840.b", 
          "name": [
            "Universidad Carlos III de Madrid, c/ Butarque, 15, 28911\u00a0Legan\u00e9s(Madrid), Spain"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Ru\u00edz", 
        "givenName": "Bel\u00e9n", 
        "id": "sg:person.013562204237.63", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.013562204237.63"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Carlos III University of Madrid", 
          "id": "https://www.grid.ac/institutes/grid.7840.b", 
          "name": [
            "Universidad Carlos III de Madrid, c/ Butarque, 15, 28911\u00a0Legan\u00e9s(Madrid), Spain"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Garc\u00eda-Crespo", 
        "givenName": "\u00c1ngel", 
        "id": "sg:person.011537037147.86", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.011537037147.86"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Carlos III University of Madrid", 
          "id": "https://www.grid.ac/institutes/grid.7840.b", 
          "name": [
            "Universidad Carlos III de Madrid, c/ Butarque, 15, 28911\u00a0Legan\u00e9s(Madrid), Spain"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Garc\u00eda", 
        "givenName": "Fernando", 
        "type": "Person"
      }
    ], 
    "datePublished": "1997", 
    "datePublishedReg": "1997-01-01", 
    "description": "In this paper, a speaker recognition voice based system is presented [5]. We have implemented it in a Sun platform.We train (and test) the system using a Database recorded in several sessions in order to repair the huge effects that the speech variability with time has in the recognition rate system. Several experiments have been made in order to achieve the best configuration in the system set up. This is an important point to take into account in a real world system in which users train the system once and the models generated in the training process are not updated for strategic reasons. The recognition rate obtained for the proposed system is around 93% if the speech came from a microphone is around 90% when the speech came from a phone line.", 
    "editor": [
      {
        "familyName": "Big\u00fcn", 
        "givenName": "Josef", 
        "type": "Person"
      }, 
      {
        "familyName": "Chollet", 
        "givenName": "G\u00e9rard", 
        "type": "Person"
      }, 
      {
        "familyName": "Borgefors", 
        "givenName": "Gunilla", 
        "type": "Person"
      }
    ], 
    "genre": "chapter", 
    "id": "sg:pub.10.1007/bfb0016000", 
    "inLanguage": [
      "en"
    ], 
    "isAccessibleForFree": false, 
    "isPartOf": {
      "isbn": [
        "978-3-540-62660-2", 
        "978-3-540-68425-1"
      ], 
      "name": "Audio- and Video-based Biometric Person Authentication", 
      "type": "Book"
    }, 
    "name": "Speech/speaker recognition using a HMM/GMM hybrid model", 
    "pagination": "227-234", 
    "productId": [
      {
        "name": "doi", 
        "type": "PropertyValue", 
        "value": [
          "10.1007/bfb0016000"
        ]
      }, 
      {
        "name": "readcube_id", 
        "type": "PropertyValue", 
        "value": [
          "44ccf733034511a004e30fa11ca44f68a8571c7dd1ec712d5069bf6cc51ad3fa"
        ]
      }, 
      {
        "name": "dimensions_id", 
        "type": "PropertyValue", 
        "value": [
          "pub.1014441711"
        ]
      }
    ], 
    "publisher": {
      "location": "Berlin, Heidelberg", 
      "name": "Springer Berlin Heidelberg", 
      "type": "Organisation"
    }, 
    "sameAs": [
      "https://doi.org/10.1007/bfb0016000", 
      "https://app.dimensions.ai/details/publication/pub.1014441711"
    ], 
    "sdDataset": "chapters", 
    "sdDatePublished": "2019-04-15T21:43", 
    "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
    "sdPublisher": {
      "name": "Springer Nature - SN SciGraph project", 
      "type": "Organization"
    }, 
    "sdSource": "s3://com-uberresearch-data-dimensions-target-20181106-alternative/cleanup/v134/2549eaecd7973599484d7c17b260dba0a4ecb94b/merge/v9/a6c9fde33151104705d4d7ff012ea9563521a3ce/jats-lookup/v90/0000000001_0000000264/records_8693_00000024.jsonl", 
    "type": "Chapter", 
    "url": "http://link.springer.com/10.1007/BFb0016000"
  }
]
 

Download the RDF metadata as:  json-ld nt turtle xml License info

HOW TO GET THIS DATA PROGRAMMATICALLY:

JSON-LD is a popular format for linked data which is fully compatible with JSON.

curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1007/bfb0016000'

N-Triples is a line-based linked data format ideal for batch operations.

curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1007/bfb0016000'

Turtle is a human-readable linked data format.

curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1007/bfb0016000'

RDF/XML is a standard XML format for linked data.

curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1007/bfb0016000'


 

This table displays all metadata directly associated to this object as RDF triples.

94 TRIPLES      22 PREDICATES      27 URIs      20 LITERALS      8 BLANK NODES

Subject Predicate Object
1 sg:pub.10.1007/bfb0016000 schema:about anzsrc-for:08
2 anzsrc-for:0801
3 schema:author N27a3babf97224fb99e4e175c8fae6a6e
4 schema:datePublished 1997
5 schema:datePublishedReg 1997-01-01
6 schema:description In this paper, a speaker recognition voice based system is presented [5]. We have implemented it in a Sun platform.We train (and test) the system using a Database recorded in several sessions in order to repair the huge effects that the speech variability with time has in the recognition rate system. Several experiments have been made in order to achieve the best configuration in the system set up. This is an important point to take into account in a real world system in which users train the system once and the models generated in the training process are not updated for strategic reasons. The recognition rate obtained for the proposed system is around 93% if the speech came from a microphone is around 90% when the speech came from a phone line.
7 schema:editor N4086b41239d146cbb135ebda4b439060
8 schema:genre chapter
9 schema:inLanguage en
10 schema:isAccessibleForFree false
11 schema:isPartOf Ndda2d0946587468999cc813d69ebcebd
12 schema:name Speech/speaker recognition using a HMM/GMM hybrid model
13 schema:pagination 227-234
14 schema:productId N54be7ca699fe4aaca9b251cc30acf874
15 N610f2a9f960f418a85beecdc5b438c5f
16 N7675af0afbcc46ccb88d87fa9ed0c58b
17 schema:publisher Nf3e4bdf9c0b444dfab0406704322e3e3
18 schema:sameAs https://app.dimensions.ai/details/publication/pub.1014441711
19 https://doi.org/10.1007/bfb0016000
20 schema:sdDatePublished 2019-04-15T21:43
21 schema:sdLicense https://scigraph.springernature.com/explorer/license/
22 schema:sdPublisher N6021ec7a140c490aac8966a0520889bc
23 schema:url http://link.springer.com/10.1007/BFb0016000
24 sgo:license sg:explorer/license/
25 sgo:sdDataset chapters
26 rdf:type schema:Chapter
27 N099db03b9af846d19fd3909faa1c18ff schema:affiliation https://www.grid.ac/institutes/grid.7840.b
28 schema:familyName Rodríguez
29 schema:givenName Elena
30 rdf:type schema:Person
31 N21b1d925b37a4b82bc42cc9150a9055d rdf:first Na78d9c569baf4f29985e7b9683b83c9f
32 rdf:rest rdf:nil
33 N27a3babf97224fb99e4e175c8fae6a6e rdf:first N099db03b9af846d19fd3909faa1c18ff
34 rdf:rest Nf5d9a0eaf3c84acea03c8302808f4e86
35 N2b3537099b064438b9e6c1554de3981f schema:familyName Bigün
36 schema:givenName Josef
37 rdf:type schema:Person
38 N4086b41239d146cbb135ebda4b439060 rdf:first N2b3537099b064438b9e6c1554de3981f
39 rdf:rest N7dba5bd6c9444970a4e04bec99507fa1
40 N54be7ca699fe4aaca9b251cc30acf874 schema:name dimensions_id
41 schema:value pub.1014441711
42 rdf:type schema:PropertyValue
43 N6021ec7a140c490aac8966a0520889bc schema:name Springer Nature - SN SciGraph project
44 rdf:type schema:Organization
45 N610f2a9f960f418a85beecdc5b438c5f schema:name doi
46 schema:value 10.1007/bfb0016000
47 rdf:type schema:PropertyValue
48 N7675af0afbcc46ccb88d87fa9ed0c58b schema:name readcube_id
49 schema:value 44ccf733034511a004e30fa11ca44f68a8571c7dd1ec712d5069bf6cc51ad3fa
50 rdf:type schema:PropertyValue
51 N7dba5bd6c9444970a4e04bec99507fa1 rdf:first Nc382561e932e4bb4be59d92476b5abfb
52 rdf:rest N21b1d925b37a4b82bc42cc9150a9055d
53 N93f57608047142c6a95c08800ca61b92 rdf:first Nf658c699164d43a3b2b59f477addb505
54 rdf:rest rdf:nil
55 Na78d9c569baf4f29985e7b9683b83c9f schema:familyName Borgefors
56 schema:givenName Gunilla
57 rdf:type schema:Person
58 Nc382561e932e4bb4be59d92476b5abfb schema:familyName Chollet
59 schema:givenName Gérard
60 rdf:type schema:Person
61 Ndda2d0946587468999cc813d69ebcebd schema:isbn 978-3-540-62660-2
62 978-3-540-68425-1
63 schema:name Audio- and Video-based Biometric Person Authentication
64 rdf:type schema:Book
65 Nf3e4bdf9c0b444dfab0406704322e3e3 schema:location Berlin, Heidelberg
66 schema:name Springer Berlin Heidelberg
67 rdf:type schema:Organisation
68 Nf5d9a0eaf3c84acea03c8302808f4e86 rdf:first sg:person.013562204237.63
69 rdf:rest Nf8e40ae465b6408ea18ae808495f9f71
70 Nf658c699164d43a3b2b59f477addb505 schema:affiliation https://www.grid.ac/institutes/grid.7840.b
71 schema:familyName García
72 schema:givenName Fernando
73 rdf:type schema:Person
74 Nf8e40ae465b6408ea18ae808495f9f71 rdf:first sg:person.011537037147.86
75 rdf:rest N93f57608047142c6a95c08800ca61b92
76 anzsrc-for:08 schema:inDefinedTermSet anzsrc-for:
77 schema:name Information and Computing Sciences
78 rdf:type schema:DefinedTerm
79 anzsrc-for:0801 schema:inDefinedTermSet anzsrc-for:
80 schema:name Artificial Intelligence and Image Processing
81 rdf:type schema:DefinedTerm
82 sg:person.011537037147.86 schema:affiliation https://www.grid.ac/institutes/grid.7840.b
83 schema:familyName García-Crespo
84 schema:givenName Ángel
85 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.011537037147.86
86 rdf:type schema:Person
87 sg:person.013562204237.63 schema:affiliation https://www.grid.ac/institutes/grid.7840.b
88 schema:familyName Ruíz
89 schema:givenName Belén
90 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.013562204237.63
91 rdf:type schema:Person
92 https://www.grid.ac/institutes/grid.7840.b schema:alternateName Carlos III University of Madrid
93 schema:name Universidad Carlos III de Madrid, c/ Butarque, 15, 28911 Leganés(Madrid), Spain
94 rdf:type schema:Organization
 




Preview window. Press ESC to close (or click here)


...