Affine-Invariant Visual Features Contain Supplementary Information to Enhance Speech Recognition


Ontology type: schema:Chapter     


Chapter Info

DATE

2001-08-17

AUTHORS

Sabri Gurbuz , Eric Patterson , Zekeriya Tufekci , John N. Gowdy

ABSTRACT

The performance of audio-based speech recognition systems degrades severely when there is a mismatch between training and usage environments due to background noise. This degradation is due to a loss of ability to extract and distinguish important information from audio features. One of the emerging techniques for dealing with this problem is the addition of visual features in a multimodal recognition system. This paper presents an affine-invariant, multimodal speech recognition system and focuses on the supplementary information that is available from video features.

PAGES

175-181

Book

TITLE

Audio- and Video-Based Biometric Person Authentication

ISBN

978-3-540-42216-7
978-3-540-45344-4

Identifiers

URI

http://scigraph.springernature.com/pub.10.1007/3-540-45344-x_25

DOI

http://dx.doi.org/10.1007/3-540-45344-x_25

DIMENSIONS

https://app.dimensions.ai/details/publication/pub.1020337184



JSON-LD is the canonical representation for SciGraph data.

[
  {
    "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
    "about": [
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0801", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Artificial Intelligence and Image Processing", 
        "type": "DefinedTerm"
      }, 
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/08", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Information and Computing Sciences", 
        "type": "DefinedTerm"
      }
    ], 
    "author": [
      {
        "affiliation": {
          "alternateName": "Clemson University", 
          "id": "https://www.grid.ac/institutes/grid.26090.3d", 
          "name": [
            "Department of Electrical and Computer Engineering, Clemson University, 29634, Clemson, SC, USA"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Gurbuz", 
        "givenName": "Sabri", 
        "id": "sg:person.07522637313.39", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.07522637313.39"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Clemson University", 
          "id": "https://www.grid.ac/institutes/grid.26090.3d", 
          "name": [
            "Department of Electrical and Computer Engineering, Clemson University, 29634, Clemson, SC, USA"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Patterson", 
        "givenName": "Eric", 
        "id": "sg:person.016030767057.18", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.016030767057.18"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Clemson University", 
          "id": "https://www.grid.ac/institutes/grid.26090.3d", 
          "name": [
            "Department of Electrical and Computer Engineering, Clemson University, 29634, Clemson, SC, USA"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Tufekci", 
        "givenName": "Zekeriya", 
        "id": "sg:person.016661051436.44", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.016661051436.44"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Clemson University", 
          "id": "https://www.grid.ac/institutes/grid.26090.3d", 
          "name": [
            "Department of Electrical and Computer Engineering, Clemson University, 29634, Clemson, SC, USA"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Gowdy", 
        "givenName": "John N.", 
        "id": "sg:person.016274023713.84", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.016274023713.84"
        ], 
        "type": "Person"
      }
    ], 
    "citation": [
      {
        "id": "https://doi.org/10.1145/57167.57170", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1007032918"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1098/rstb.1992.0009", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1024824599"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1109/89.536928", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1061242358"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1109/89.799688", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1061242552"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1109/icassp.2000.861829", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1095261252"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1109/icassp.2001.940796", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1095528564"
        ], 
        "type": "CreativeWork"
      }
    ], 
    "datePublished": "2001-08-17", 
    "datePublishedReg": "2001-08-17", 
    "description": "The performance of audio-based speech recognition systems degrades severely when there is a mismatch between training and usage environments due to background noise. This degradation is due to a loss of ability to extract and distinguish important information from audio features. One of the emerging techniques for dealing with this problem is the addition of visual features in a multimodal recognition system. This paper presents an affine-invariant, multimodal speech recognition system and focuses on the supplementary information that is available from video features.", 
    "editor": [
      {
        "familyName": "Bigun", 
        "givenName": "Josef", 
        "type": "Person"
      }, 
      {
        "familyName": "Smeraldi", 
        "givenName": "Fabrizio", 
        "type": "Person"
      }
    ], 
    "genre": "chapter", 
    "id": "sg:pub.10.1007/3-540-45344-x_25", 
    "inLanguage": [
      "en"
    ], 
    "isAccessibleForFree": false, 
    "isPartOf": {
      "isbn": [
        "978-3-540-42216-7", 
        "978-3-540-45344-4"
      ], 
      "name": "Audio- and Video-Based Biometric Person Authentication", 
      "type": "Book"
    }, 
    "name": "Affine-Invariant Visual Features Contain Supplementary Information to Enhance Speech Recognition", 
    "pagination": "175-181", 
    "productId": [
      {
        "name": "doi", 
        "type": "PropertyValue", 
        "value": [
          "10.1007/3-540-45344-x_25"
        ]
      }, 
      {
        "name": "readcube_id", 
        "type": "PropertyValue", 
        "value": [
          "60e6719874b850dc7de2f93b1d604b3acc614693cfbff69e30e90dca6dacddda"
        ]
      }, 
      {
        "name": "dimensions_id", 
        "type": "PropertyValue", 
        "value": [
          "pub.1020337184"
        ]
      }
    ], 
    "publisher": {
      "location": "Berlin, Heidelberg", 
      "name": "Springer Berlin Heidelberg", 
      "type": "Organisation"
    }, 
    "sameAs": [
      "https://doi.org/10.1007/3-540-45344-x_25", 
      "https://app.dimensions.ai/details/publication/pub.1020337184"
    ], 
    "sdDataset": "chapters", 
    "sdDatePublished": "2019-04-16T05:39", 
    "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
    "sdPublisher": {
      "name": "Springer Nature - SN SciGraph project", 
      "type": "Organization"
    }, 
    "sdSource": "s3://com-uberresearch-data-dimensions-target-20181106-alternative/cleanup/v134/2549eaecd7973599484d7c17b260dba0a4ecb94b/merge/v9/a6c9fde33151104705d4d7ff012ea9563521a3ce/jats-lookup/v90/0000000346_0000000346/records_99843_00000001.jsonl", 
    "type": "Chapter", 
    "url": "https://link.springer.com/10.1007%2F3-540-45344-X_25"
  }
]
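
As a rough illustration of how a record shaped like the one above can be read, the following Python sketch pulls out the title, the author names, and the cited DOIs. The `RECORD` literal is a trimmed copy of the JSON-LD above (only the fields the sketch reads, and only two of the six citations), and `summarize` is a hypothetical helper name, not part of any SciGraph tooling.

```python
import json

# Trimmed copy of the record above: only the fields this sketch reads.
RECORD = json.loads("""
[
  {
    "author": [
      {"familyName": "Gurbuz", "givenName": "Sabri"},
      {"familyName": "Patterson", "givenName": "Eric"},
      {"familyName": "Tufekci", "givenName": "Zekeriya"},
      {"familyName": "Gowdy", "givenName": "John N."}
    ],
    "citation": [
      {"id": "https://doi.org/10.1145/57167.57170"},
      {"id": "https://doi.org/10.1098/rstb.1992.0009"}
    ],
    "name": "Affine-Invariant Visual Features Contain Supplementary Information to Enhance Speech Recognition",
    "pagination": "175-181"
  }
]
""")

DOI_PREFIX = "https://doi.org/"


def summarize(record):
    """Collect title, author names, and cited DOIs from one record (hypothetical helper)."""
    node = record[0]  # the record is a one-element JSON array
    authors = [f"{a['givenName']} {a['familyName']}" for a in node.get("author", [])]
    cited = [
        c["id"][len(DOI_PREFIX):]
        for c in node.get("citation", [])
        if c["id"].startswith(DOI_PREFIX)
    ]
    return {"title": node["name"], "authors": authors, "cited_dois": cited}


summary = summarize(RECORD)
print(summary["authors"])
print(summary["cited_dois"])
```

The author array keeps the byline order, so the list comprehension preserves it without any extra sorting.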
 


HOW TO GET THIS DATA PROGRAMMATICALLY:

JSON-LD is a popular format for linked data which is fully compatible with JSON.

curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1007/3-540-45344-x_25'

N-Triples is a line-based linked data format ideal for batch operations.

curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1007/3-540-45344-x_25'

Turtle is a human-readable linked data format.

curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1007/3-540-45344-x_25'

RDF/XML is a standard XML format for linked data.

curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1007/3-540-45344-x_25'
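
The four curl commands differ only in the Accept header, so the same content negotiation can be sketched in Python with the standard library. This is a minimal sketch, assuming the endpoint responds as the curl examples describe; the names `ACCEPT`, `build_request`, and `fetch` are illustrative and not part of any SciGraph client.

```python
import urllib.request

# URL copied from the curl commands above.
RECORD_URL = "https://scigraph.springernature.com/pub.10.1007/3-540-45344-x_25"

# Media types taken from the four curl examples.
ACCEPT = {
    "json-ld": "application/ld+json",
    "nt": "application/n-triples",
    "turtle": "text/turtle",
    "xml": "application/rdf+xml",
}


def build_request(fmt, url=RECORD_URL):
    """Return a GET request asking for the record in the given format."""
    return urllib.request.Request(url, headers={"Accept": ACCEPT[fmt]})


def fetch(fmt, url=RECORD_URL, timeout=30):
    """Send the request and return the body as text (needs network access)."""
    with urllib.request.urlopen(build_request(fmt, url), timeout=timeout) as resp:
        return resp.read().decode("utf-8")


if __name__ == "__main__":
    # Building the request does not touch the network; fetch() does.
    req = build_request("turtle")
    print(req.get_method(), req.full_url, req.get_header("Accept"))
```

Keeping request construction separate from sending makes the header logic easy to check offline; calling `fetch("json-ld")` would correspond to the first curl command.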


 

This table displays all metadata directly associated to this object as RDF triples.

109 TRIPLES      23 PREDICATES      32 URIs      19 LITERALS      8 BLANK NODES

Subject Predicate Object
1 sg:pub.10.1007/3-540-45344-x_25 schema:about anzsrc-for:08
2 anzsrc-for:0801
3 schema:author N9bfde40d1cb345c3ac6e4fd6c8bfb6ca
4 schema:citation https://doi.org/10.1098/rstb.1992.0009
5 https://doi.org/10.1109/89.536928
6 https://doi.org/10.1109/89.799688
7 https://doi.org/10.1109/icassp.2000.861829
8 https://doi.org/10.1109/icassp.2001.940796
9 https://doi.org/10.1145/57167.57170
10 schema:datePublished 2001-08-17
11 schema:datePublishedReg 2001-08-17
12 schema:description The performance of audio-based speech recognition systems degrades severely when there is a mismatch between training and usage environments due to background noise. This degradation is due to a loss of ability to extract and distinguish important information from audio features. One of the emerging techniques for dealing with this problem is the addition of visual features in a multimodal recognition system. This paper presents an affine-invariant, multimodal speech recognition system and focuses on the supplementary information that is available from video features.
13 schema:editor N1b55bd406f3947f4aa557293a1a72d4e
14 schema:genre chapter
15 schema:inLanguage en
16 schema:isAccessibleForFree false
17 schema:isPartOf Nc5f042ac0298461fa8b80d513d521998
18 schema:name Affine-Invariant Visual Features Contain Supplementary Information to Enhance Speech Recognition
19 schema:pagination 175-181
20 schema:productId N47357f74dcb74b12aefd27011b23d0e5
21 Na9f8595d32344850a5494fcf62ea5156
22 Nf4d69fa67881454f9ae1b2e517ad61dc
23 schema:publisher N77a84769aa8547869a51bcb06bce9239
24 schema:sameAs https://app.dimensions.ai/details/publication/pub.1020337184
25 https://doi.org/10.1007/3-540-45344-x_25
26 schema:sdDatePublished 2019-04-16T05:39
27 schema:sdLicense https://scigraph.springernature.com/explorer/license/
28 schema:sdPublisher N3c07fd23163548ce97a9e479e43ab766
29 schema:url https://link.springer.com/10.1007%2F3-540-45344-X_25
30 sgo:license sg:explorer/license/
31 sgo:sdDataset chapters
32 rdf:type schema:Chapter
33 N1b55bd406f3947f4aa557293a1a72d4e rdf:first Ne61c27a021024ec5a06de55ef6b720f8
34 rdf:rest Nefd538f365b947f48d2a99bd2903e1c4
35 N1f298d446887436bb6841553c582fe95 rdf:first sg:person.016274023713.84
36 rdf:rest rdf:nil
37 N3c07fd23163548ce97a9e479e43ab766 schema:name Springer Nature - SN SciGraph project
38 rdf:type schema:Organization
39 N47357f74dcb74b12aefd27011b23d0e5 schema:name doi
40 schema:value 10.1007/3-540-45344-x_25
41 rdf:type schema:PropertyValue
42 N77a84769aa8547869a51bcb06bce9239 schema:location Berlin, Heidelberg
43 schema:name Springer Berlin Heidelberg
44 rdf:type schema:Organisation
45 N9a4e8a21dc004f33beb4aa71b54eb73e rdf:first sg:person.016030767057.18
46 rdf:rest Nc950498bf84c4b69bde909ba0a7d6b99
47 N9bfde40d1cb345c3ac6e4fd6c8bfb6ca rdf:first sg:person.07522637313.39
48 rdf:rest N9a4e8a21dc004f33beb4aa71b54eb73e
49 Na9f8595d32344850a5494fcf62ea5156 schema:name dimensions_id
50 schema:value pub.1020337184
51 rdf:type schema:PropertyValue
52 Nc5f042ac0298461fa8b80d513d521998 schema:isbn 978-3-540-42216-7
53 978-3-540-45344-4
54 schema:name Audio- and Video-Based Biometric Person Authentication
55 rdf:type schema:Book
56 Nc950498bf84c4b69bde909ba0a7d6b99 rdf:first sg:person.016661051436.44
57 rdf:rest N1f298d446887436bb6841553c582fe95
58 Nd7940ed25b634edebd8e6f52e3252e55 schema:familyName Smeraldi
59 schema:givenName Fabrizio
60 rdf:type schema:Person
61 Ne61c27a021024ec5a06de55ef6b720f8 schema:familyName Bigun
62 schema:givenName Josef
63 rdf:type schema:Person
64 Nefd538f365b947f48d2a99bd2903e1c4 rdf:first Nd7940ed25b634edebd8e6f52e3252e55
65 rdf:rest rdf:nil
66 Nf4d69fa67881454f9ae1b2e517ad61dc schema:name readcube_id
67 schema:value 60e6719874b850dc7de2f93b1d604b3acc614693cfbff69e30e90dca6dacddda
68 rdf:type schema:PropertyValue
69 anzsrc-for:08 schema:inDefinedTermSet anzsrc-for:
70 schema:name Information and Computing Sciences
71 rdf:type schema:DefinedTerm
72 anzsrc-for:0801 schema:inDefinedTermSet anzsrc-for:
73 schema:name Artificial Intelligence and Image Processing
74 rdf:type schema:DefinedTerm
75 sg:person.016030767057.18 schema:affiliation https://www.grid.ac/institutes/grid.26090.3d
76 schema:familyName Patterson
77 schema:givenName Eric
78 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.016030767057.18
79 rdf:type schema:Person
80 sg:person.016274023713.84 schema:affiliation https://www.grid.ac/institutes/grid.26090.3d
81 schema:familyName Gowdy
82 schema:givenName John N.
83 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.016274023713.84
84 rdf:type schema:Person
85 sg:person.016661051436.44 schema:affiliation https://www.grid.ac/institutes/grid.26090.3d
86 schema:familyName Tufekci
87 schema:givenName Zekeriya
88 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.016661051436.44
89 rdf:type schema:Person
90 sg:person.07522637313.39 schema:affiliation https://www.grid.ac/institutes/grid.26090.3d
91 schema:familyName Gurbuz
92 schema:givenName Sabri
93 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.07522637313.39
94 rdf:type schema:Person
95 https://doi.org/10.1098/rstb.1992.0009 schema:sameAs https://app.dimensions.ai/details/publication/pub.1024824599
96 rdf:type schema:CreativeWork
97 https://doi.org/10.1109/89.536928 schema:sameAs https://app.dimensions.ai/details/publication/pub.1061242358
98 rdf:type schema:CreativeWork
99 https://doi.org/10.1109/89.799688 schema:sameAs https://app.dimensions.ai/details/publication/pub.1061242552
100 rdf:type schema:CreativeWork
101 https://doi.org/10.1109/icassp.2000.861829 schema:sameAs https://app.dimensions.ai/details/publication/pub.1095261252
102 rdf:type schema:CreativeWork
103 https://doi.org/10.1109/icassp.2001.940796 schema:sameAs https://app.dimensions.ai/details/publication/pub.1095528564
104 rdf:type schema:CreativeWork
105 https://doi.org/10.1145/57167.57170 schema:sameAs https://app.dimensions.ai/details/publication/pub.1007032918
106 rdf:type schema:CreativeWork
107 https://www.grid.ac/institutes/grid.26090.3d schema:alternateName Clemson University
108 schema:name Department of Electrical and Computer Engineering, Clemson University, 29634, Clemson, SC, USA
109 rdf:type schema:Organization
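
In the table above, the ordered author list is encoded as a chain of blank nodes linked by rdf:first and rdf:rest and terminated by rdf:nil. The following Python sketch walks that chain to recover the author order. The node identifiers are copied from rows 35-36, 45-48, and 56-57; `walk_rdf_list` is a hypothetical helper written for this illustration, not a library function.

```python
RDF_NIL = "rdf:nil"

# Each list node maps to (rdf:first, rdf:rest), copied from the table above.
LIST_NODES = {
    "N9bfde40d1cb345c3ac6e4fd6c8bfb6ca": ("sg:person.07522637313.39", "N9a4e8a21dc004f33beb4aa71b54eb73e"),
    "N9a4e8a21dc004f33beb4aa71b54eb73e": ("sg:person.016030767057.18", "Nc950498bf84c4b69bde909ba0a7d6b99"),
    "Nc950498bf84c4b69bde909ba0a7d6b99": ("sg:person.016661051436.44", "N1f298d446887436bb6841553c582fe95"),
    "N1f298d446887436bb6841553c582fe95": ("sg:person.016274023713.84", RDF_NIL),
}

# Object of schema:author in row 3 of the table.
AUTHOR_HEAD = "N9bfde40d1cb345c3ac6e4fd6c8bfb6ca"


def walk_rdf_list(head, nodes):
    """Follow rdf:rest links from head, collecting each rdf:first value in order."""
    items = []
    seen = set()
    node = head
    while node != RDF_NIL:
        if node in seen:
            # Guard against malformed data that loops back on itself.
            raise ValueError(f"cycle detected at list node {node}")
        seen.add(node)
        first, rest = nodes[node]
        items.append(first)
        node = rest
    return items


authors_in_order = walk_rdf_list(AUTHOR_HEAD, LIST_NODES)
print(authors_in_order)
```

The resulting identifiers can be matched against the schema:familyName rows for each sg:person entry, which gives the same order as the AUTHORS line at the top of the record.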
 



