Ontology type: schema:ScholarlyArticle
2019-01-09
AUTHORSJunbo Ma, Ruili Wang, Wanting Ji, Hao Zheng, En Zhu, Jianping Yin
ABSTRACTA smart environment is one of the application scenarios of the Internet of Things (IoT). In order to provide a ubiquitous smart environment for humans, a variety of technologies are developed. In a smart environment system, sound event detection is one of the fundamental technologies, which can automatically sense sound changes in the environment and detect sound events that cause changes. In this paper, we propose the use of Relational Recurrent Neural Network (RRNN) for polyphonic sound event detection, called RRNN-SED, which utilized the strength of RRNN in long-term temporal context extraction and relational reasoning across a polyphonic sound signal. Different from previous sound event detection methods, which rely heavily on convolutional neural networks or recurrent neural networks, the proposed RRNN-SED method can solve long-lasting and overlapping problems in polyphonic sound event detection. Specifically, since the historical information memorized inside RRNNs is capable of interacting with each other across a polyphonic sound signal, the proposed RRNN-SED method is effective and efficient in extracting temporal context information and reasoning the unique relational characteristic of the target sound events. Experimental results on two public datasets show that the proposed method achieved better sound event detection results in terms of segment-based F-score and segment-based error rate. More... »
PAGES1-19
http://scigraph.springernature.com/pub.10.1007/s11042-018-7142-7
DOIhttp://dx.doi.org/10.1007/s11042-018-7142-7
DIMENSIONShttps://app.dimensions.ai/details/publication/pub.1111312535
JSON-LD is the canonical representation for SciGraph data.
TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT
[
{
"@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json",
"about": [
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0801",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Artificial Intelligence and Image Processing",
"type": "DefinedTerm"
},
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/08",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Information and Computing Sciences",
"type": "DefinedTerm"
}
],
"author": [
{
"affiliation": {
"alternateName": "National University of Defense Technology",
"id": "https://www.grid.ac/institutes/grid.412110.7",
"name": [
"Massey University, Auckland, New Zealand",
"School of Computer, National University of Defense Technology, Changsha, China"
],
"type": "Organization"
},
"familyName": "Ma",
"givenName": "Junbo",
"id": "sg:person.010071173434.24",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.010071173434.24"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Zhejiang Gongshang University",
"id": "https://www.grid.ac/institutes/grid.413072.3",
"name": [
"Massey University, Auckland, New Zealand",
"Zhejiang Gongshang University, Hangzhou, China"
],
"type": "Organization"
},
"familyName": "Wang",
"givenName": "Ruili",
"id": "sg:person.01112556557.70",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01112556557.70"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Zhejiang Gongshang University",
"id": "https://www.grid.ac/institutes/grid.413072.3",
"name": [
"Massey University, Auckland, New Zealand",
"Zhejiang Gongshang University, Hangzhou, China"
],
"type": "Organization"
},
"familyName": "Ji",
"givenName": "Wanting",
"id": "sg:person.010550736405.59",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.010550736405.59"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Nanjing Xiaozhuang University",
"id": "https://www.grid.ac/institutes/grid.440845.9",
"name": [
"College of information engineering, Nanjing Xiaozhuang University, Nanjing, China"
],
"type": "Organization"
},
"familyName": "Zheng",
"givenName": "Hao",
"id": "sg:person.07575706553.15",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.07575706553.15"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "National University of Defense Technology",
"id": "https://www.grid.ac/institutes/grid.412110.7",
"name": [
"School of Computer, National University of Defense Technology, Changsha, China"
],
"type": "Organization"
},
"familyName": "Zhu",
"givenName": "En",
"id": "sg:person.012334352267.67",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.012334352267.67"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Dongguan University of Technology",
"id": "https://www.grid.ac/institutes/grid.459466.c",
"name": [
"Dongguan University of Technology, Dongguan, China"
],
"type": "Organization"
},
"familyName": "Yin",
"givenName": "Jianping",
"id": "sg:person.012631125327.29",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.012631125327.29"
],
"type": "Person"
}
],
"citation": [
{
"id": "https://doi.org/10.1016/j.neunet.2014.09.003",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1013219854"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1016/j.jclepro.2016.10.006",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1017673424"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.3390/app6060162",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1025035487"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1687-4722-2013-1",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1049472325",
"https://doi.org/10.1186/1687-4722-2013-1"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/s11042-015-2967-9",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1054523468",
"https://doi.org/10.1007/s11042-015-2967-9"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/s11042-015-2967-9",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1054523468",
"https://doi.org/10.1007/s11042-015-2967-9"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1155/2007/48317",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1063202294",
"https://doi.org/10.1155/2007/48317"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1109/taslp.2017.2690575",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1085641971"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1016/j.neucom.2017.07.021",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1090741558"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1109/tii.2017.2739340",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1091268684"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1109/icita.2005.231",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1093873558"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1109/ijcnn.2015.7280624",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1094027910"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1109/icassp.2015.7177950",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1095144935"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1109/icassp.2016.7472917",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1095196539"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1109/grc.2005.1547359",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1095596629"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1109/eusipco.2016.7760424",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1095637962"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1109/icassp.2017.7952260",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1095991193"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1109/icassp.2017.7952260",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1095991193"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.21437/interspeech.2016-392",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1099086765"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1016/j.patrec.2018.01.013",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1100726552"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1109/icot.2017.8336092",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1103265657"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1109/comst.2018.2844341",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1104439462"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/s11042-018-6380-z",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1105830556",
"https://doi.org/10.1007/s11042-018-6380-z"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1109/ijcnn.2018.8489470",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1107705379"
],
"type": "CreativeWork"
}
],
"datePublished": "2019-01-09",
"datePublishedReg": "2019-01-09",
"description": "A smart environment is one of the application scenarios of the Internet of Things (IoT). In order to provide a ubiquitous smart environment for humans, a variety of technologies are developed. In a smart environment system, sound event detection is one of the fundamental technologies, which can automatically sense sound changes in the environment and detect sound events that cause changes. In this paper, we propose the use of Relational Recurrent Neural Network (RRNN) for polyphonic sound event detection, called RRNN-SED, which utilized the strength of RRNN in long-term temporal context extraction and relational reasoning across a polyphonic sound signal. Different from previous sound event detection methods, which rely heavily on convolutional neural networks or recurrent neural networks, the proposed RRNN-SED method can solve long-lasting and overlapping problems in polyphonic sound event detection. Specifically, since the historical information memorized inside RRNNs is capable of interacting with each other across a polyphonic sound signal, the proposed RRNN-SED method is effective and efficient in extracting temporal context information and reasoning the unique relational characteristic of the target sound events. Experimental results on two public datasets show that the proposed method achieved better sound event detection results in terms of segment-based F-score and segment-based error rate.",
"genre": "research_article",
"id": "sg:pub.10.1007/s11042-018-7142-7",
"inLanguage": [
"en"
],
"isAccessibleForFree": false,
"isPartOf": [
{
"id": "sg:journal.1044869",
"issn": [
"1380-7501",
"1573-7721"
],
"name": "Multimedia Tools and Applications",
"type": "Periodical"
}
],
"name": "Relational recurrent neural networks for polyphonic sound event detection",
"pagination": "1-19",
"productId": [
{
"name": "readcube_id",
"type": "PropertyValue",
"value": [
"badf6270d32bc0dd2e3e501301c03d2b76a37e57f5279173b04e36b8a08bc8ee"
]
},
{
"name": "doi",
"type": "PropertyValue",
"value": [
"10.1007/s11042-018-7142-7"
]
},
{
"name": "dimensions_id",
"type": "PropertyValue",
"value": [
"pub.1111312535"
]
}
],
"sameAs": [
"https://doi.org/10.1007/s11042-018-7142-7",
"https://app.dimensions.ai/details/publication/pub.1111312535"
],
"sdDataset": "articles",
"sdDatePublished": "2019-04-11T08:37",
"sdLicense": "https://scigraph.springernature.com/explorer/license/",
"sdPublisher": {
"name": "Springer Nature - SN SciGraph project",
"type": "Organization"
},
"sdSource": "s3://com-uberresearch-data-dimensions-target-20181106-alternative/cleanup/v134/2549eaecd7973599484d7c17b260dba0a4ecb94b/merge/v9/a6c9fde33151104705d4d7ff012ea9563521a3ce/jats-lookup/v90/0000000315_0000000315/records_6310_00000000.jsonl",
"type": "ScholarlyArticle",
"url": "https://link.springer.com/10.1007%2Fs11042-018-7142-7"
}
]
Download the RDF metadata as: json-ld nt turtle xml License info
JSON-LD is a popular format for linked data which is fully compatible with JSON.
curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1007/s11042-018-7142-7'
N-Triples is a line-based linked data format ideal for batch operations.
curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1007/s11042-018-7142-7'
Turtle is a human-readable linked data format.
curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1007/s11042-018-7142-7'
RDF/XML is a standard XML format for linked data.
curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1007/s11042-018-7142-7'
This table displays all metadata directly associated to this object as RDF triples.
171 TRIPLES
21 PREDICATES
46 URIs
16 LITERALS
5 BLANK NODES