Ontology type: schema:Chapter Open Access: True
2007
AUTHORSNidhal Ben Aloui , Hervé Glotin , Patrick Hebrard
ABSTRACTThis paper presents original and efficient Qualitative Time-Frequency (QTF) speech features for speaker recognition based on a med-term speech dynamics qualitative representation. For each frame of around 150ms, we estimate and binarize a suband voicing activity estimation of 6 frequency subands. We then derive the Allen temporal relations graph between these 6 time intervals. This set of temporal relations, estimated at each frame, feeds a neural network which is trained for speaker recognition. Experiments are conducted on fifty speakers (males and females) of a reference radio database ESTER (40 hours) with continuous speech. Our best model generates around 3% of frame class error, without using information of frame continuity, which is similar to state of the art. Moreover, our QTF generates a simple and light representation using only 15 integers for coding speaker identity. More... »
PAGES1154-1163
Advances in Biometrics
ISBN
978-3-540-74548-8
978-3-540-74549-5
http://scigraph.springernature.com/pub.10.1007/978-3-540-74549-5_120
DOIhttp://dx.doi.org/10.1007/978-3-540-74549-5_120
DIMENSIONShttps://app.dimensions.ai/details/publication/pub.1039141695
JSON-LD is the canonical representation for SciGraph data.
TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT
[
{
"@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json",
"about": [
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0801",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Artificial Intelligence and Image Processing",
"type": "DefinedTerm"
},
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/08",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Information and Computing Sciences",
"type": "DefinedTerm"
}
],
"author": [
{
"affiliation": {
"alternateName": "Universite De Toulon Et Du Var",
"id": "https://www.grid.ac/institutes/grid.12611.35",
"name": [
"Universit\u00e9 du Sud Toulon-Var Laboratoire LSIS, B.P. 20 132 - 83 957 La Garde, France",
"DCNS - Division SIS, Le Mourillon B.P. 403 - 83 055 Toulon,Email: nidhal.ben-aloui@dcn.fr, France"
],
"type": "Organization"
},
"familyName": "Aloui",
"givenName": "Nidhal Ben",
"id": "sg:person.011513455112.08",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.011513455112.08"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Universite De Toulon Et Du Var",
"id": "https://www.grid.ac/institutes/grid.12611.35",
"name": [
"Universit\u00e9 du Sud Toulon-Var Laboratoire LSIS, B.P. 20 132 - 83 957 La Garde, France"
],
"type": "Organization"
},
"familyName": "Glotin",
"givenName": "Herv\u00e9",
"id": "sg:person.016622300103.82",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.016622300103.82"
],
"type": "Person"
},
{
"affiliation": {
"name": [
"DCNS - Division SIS, Le Mourillon B.P. 403 - 83 055 Toulon,Email: nidhal.ben-aloui@dcn.fr, France"
],
"type": "Organization"
},
"familyName": "Hebrard",
"givenName": "Patrick",
"type": "Person"
}
],
"citation": [
{
"id": "https://doi.org/10.1145/182.358434",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1018860881"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1016/s0016-0032(22)90319-9",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1021950006"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/bfb0016002",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1034365148",
"https://doi.org/10.1007/bfb0016002"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1109/89.326615",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1061242263"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1121/1.387808",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1062339100"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1155/s1110865704310024",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1063207896",
"https://doi.org/10.1155/s1110865704310024"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1109/icassp.2001.940795",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1095748895"
],
"type": "CreativeWork"
}
],
"datePublished": "2007",
"datePublishedReg": "2007-01-01",
"description": "This paper presents original and efficient Qualitative Time-Frequency (QTF) speech features for speaker recognition based on a med-term speech dynamics qualitative representation. For each frame of around 150ms, we estimate and binarize a suband voicing activity estimation of 6 frequency subands. We then derive the Allen temporal relations graph between these 6 time intervals. This set of temporal relations, estimated at each frame, feeds a neural network which is trained for speaker recognition. Experiments are conducted on fifty speakers (males and females) of a reference radio database ESTER (40 hours) with continuous speech. Our best model generates around 3% of frame class error, without using information of frame continuity, which is similar to state of the art. Moreover, our QTF generates a simple and light representation using only 15 integers for coding speaker identity.",
"editor": [
{
"familyName": "Lee",
"givenName": "Seong-Whan",
"type": "Person"
},
{
"familyName": "Li",
"givenName": "Stan Z.",
"type": "Person"
}
],
"genre": "chapter",
"id": "sg:pub.10.1007/978-3-540-74549-5_120",
"inLanguage": [
"en"
],
"isAccessibleForFree": true,
"isPartOf": {
"isbn": [
"978-3-540-74548-8",
"978-3-540-74549-5"
],
"name": "Advances in Biometrics",
"type": "Book"
},
"name": "Application of New Qualitative Voicing Time-Frequency Features for Speaker Recognition",
"pagination": "1154-1163",
"productId": [
{
"name": "doi",
"type": "PropertyValue",
"value": [
"10.1007/978-3-540-74549-5_120"
]
},
{
"name": "readcube_id",
"type": "PropertyValue",
"value": [
"ba73384b84c716ede30b8f4420c4033358efd2c4101c9f9b10725a097d96a8a5"
]
},
{
"name": "dimensions_id",
"type": "PropertyValue",
"value": [
"pub.1039141695"
]
}
],
"publisher": {
"location": "Berlin, Heidelberg",
"name": "Springer Berlin Heidelberg",
"type": "Organisation"
},
"sameAs": [
"https://doi.org/10.1007/978-3-540-74549-5_120",
"https://app.dimensions.ai/details/publication/pub.1039141695"
],
"sdDataset": "chapters",
"sdDatePublished": "2019-04-16T05:32",
"sdLicense": "https://scigraph.springernature.com/explorer/license/",
"sdPublisher": {
"name": "Springer Nature - SN SciGraph project",
"type": "Organization"
},
"sdSource": "s3://com-uberresearch-data-dimensions-target-20181106-alternative/cleanup/v134/2549eaecd7973599484d7c17b260dba0a4ecb94b/merge/v9/a6c9fde33151104705d4d7ff012ea9563521a3ce/jats-lookup/v90/0000000346_0000000346/records_99815_00000002.jsonl",
"type": "Chapter",
"url": "https://link.springer.com/10.1007%2F978-3-540-74549-5_120"
}
]
Download the RDF metadata as: json-ld nt turtle xml License info
JSON-LD is a popular format for linked data which is fully compatible with JSON.
curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1007/978-3-540-74549-5_120'
N-Triples is a line-based linked data format ideal for batch operations.
curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1007/978-3-540-74549-5_120'
Turtle is a human-readable linked data format.
curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1007/978-3-540-74549-5_120'
RDF/XML is a standard XML format for linked data.
curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1007/978-3-540-74549-5_120'
This table displays all metadata directly associated to this object as RDF triples.
109 TRIPLES
23 PREDICATES
34 URIs
20 LITERALS
8 BLANK NODES