Speech enhancement via Mel-scale Wiener filtering with a frequency-wise voice activity detector


Ontology type: schema:ScholarlyArticle     


Article Info

DATE

2007-05

AUTHORS

Hwa Soo Kim, Young Man Cho, Han-Jun Kim

ABSTRACT

This paper presents a speech enhancement system that enables a comfortable communication inside an automobile. A couple of novel concepts are proposed in an effort to improve two major building blocks in the existing speech enhancement systems: a voice activity detector (VAD) and a noise filtering algorithm. The proposed VAD classifies a given data frame as speech or noise at each frequency, enabling the frequency-wise updates of noise statistics and thereby improving the effectiveness of the noise filtering algorithms by providing more up-to-date noise statistics. The celebrated Wiener filter is adopted in this paper as the accompanying noise filtering algorithm, which results in significant noise suppression. Yet, the musical noise present in most Wiener filter-based systems prompts the idea of applying the Wiener filter in the Mel-scale in which the human auditory system responds to the external stimulation. It turns out that the Mel-scale Wiener filter creates some masking effects and thereby reduces musical noise significantly, leading to smooth transition between data frames.
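
The abstract names three ingredients: a per-frequency speech/noise decision, a noise estimate refreshed only where noise is detected, and a Wiener gain applied on a Mel-scale grid. The following is a minimal sketch of how such pieces could fit together. It is not the authors' algorithm: the threshold rule, the smoothing factor `alpha`, the gain `floor`, the Hz-to-mel formula, and the band-averaging step are all assumptions chosen for illustration, and every function name is hypothetical.

```python
import math


def hz_to_mel(f_hz):
    """A widely used Hz-to-mel formula (assumed; the abstract does not give one)."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def update_noise(noise_psd, frame_psd, threshold=2.0, alpha=0.9):
    """Assumed frequency-wise VAD: a bin is 'speech' when its power exceeds
    threshold * current noise estimate. Only noise bins refresh the estimate."""
    flags, updated = [], []
    for n, p in zip(noise_psd, frame_psd):
        is_speech = p > threshold * n
        flags.append(is_speech)
        updated.append(n if is_speech else alpha * n + (1.0 - alpha) * p)
    return updated, flags


def wiener_gain(frame_psd, noise_psd, floor=0.05):
    """Spectral Wiener gain: estimated speech power / (speech + noise) power,
    with speech power estimated as max(frame - noise, 0)."""
    gains = []
    for p, n in zip(frame_psd, noise_psd):
        speech = max(p - n, 0.0)
        total = speech + n
        gain = speech / total if total > 0.0 else 0.0
        gains.append(max(gain, floor))  # assumed floor to avoid zeroing a bin
    return gains


def mel_band_of(freqs_hz, n_bands):
    """Map each bin frequency to one of n_bands equal-width mel bands.
    Assumes the highest frequency is greater than 0 Hz."""
    top = hz_to_mel(max(freqs_hz))
    return [min(int(n_bands * hz_to_mel(f) / top), n_bands - 1) for f in freqs_hz]


def pool(values, bands, n_bands):
    """Average the per-bin values that fall into each mel band."""
    sums, counts = [0.0] * n_bands, [0] * n_bands
    for v, b in zip(values, bands):
        sums[b] += v
        counts[b] += 1
    return [s / c if c else 0.0 for s, c in zip(sums, counts)]


# Toy power spectra for one frame (made-up numbers, arbitrary units).
freqs = [0.0, 500.0, 1000.0, 2000.0, 4000.0]
noise = [1.0, 1.0, 1.0, 1.0, 1.0]
frame = [1.0, 10.0, 1.5, 1.0, 8.0]

noise, flags = update_noise(noise, frame)
bands = mel_band_of(freqs, 3)
band_gain = wiener_gain(pool(frame, bands, 3), pool(noise, bands, 3))
bin_gain = [band_gain[b] for b in bands]  # every bin in a band shares one gain
print(flags)
print(bin_gain)
```

Because all bins in a mel band share one gain, neighbouring bins cannot receive wildly different attenuation in this sketch, which is one simple way to picture why a coarser, perceptually spaced grid might yield a smoother result than a per-bin gain.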

PAGES

708

Identifiers

URI

http://scigraph.springernature.com/pub.10.1007/bf02916349

DOI

http://dx.doi.org/10.1007/bf02916349

DIMENSIONS

https://app.dimensions.ai/details/publication/pub.1049829582


JSON-LD is the canonical representation for SciGraph data.

[
  {
    "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
    "about": [
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0906", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Electrical and Electronic Engineering", 
        "type": "DefinedTerm"
      }, 
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/09", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Engineering", 
        "type": "DefinedTerm"
      }
    ], 
    "author": [
      {
        "affiliation": {
          "alternateName": "Seoul National University", 
          "id": "https://www.grid.ac/institutes/grid.31501.36", 
          "name": [
            "School of Mechanical and Aerospace Engineering, Seoul National University, San 56-1, Shillim-dong Kwanak-gu, 151-744, Seoul, Korea"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Kim", 
        "givenName": "Hwa Soo", 
        "id": "sg:person.016513155261.16", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.016513155261.16"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Seoul National University", 
          "id": "https://www.grid.ac/institutes/grid.31501.36", 
          "name": [
            "School of Mechanical and Aerospace Engineering, Seoul National University, San 56-1, Shillim-dong Kwanak-gu, 151-744, Seoul, Korea"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Cho", 
        "givenName": "Young Man", 
        "id": "sg:person.012176153337.61", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.012176153337.61"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "name": [
            "Finetec Century, 1002, Daechi-dong Gangnam-gu, 135-280, Seoul, Korea"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Kim", 
        "givenName": "Han-Jun", 
        "type": "Person"
      }
    ], 
    "citation": [
      {
        "id": "sg:pub.10.1007/bf03185797", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1013474699", 
          "https://doi.org/10.1007/bf03185797"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1016/s0165-1684(03)00061-6", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1015511372"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1016/0885-2308(89)90027-2", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1023489192"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1007/bf02984050", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1044625677", 
          "https://doi.org/10.1007/bf02984050"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://app.dimensions.ai/details/publication/pub.1048496408", 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1109/89.279283", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1061242238"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1109/89.928915", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1061242685"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1109/97.736233", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1061251273"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1109/tassp.1979.1163209", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1061518517"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1109/icassp.1979.1170788", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1086222005"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1109/icassp.1982.1171716", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1086230986"
        ], 
        "type": "CreativeWork"
      }
    ], 
    "datePublished": "2007-05", 
    "datePublishedReg": "2007-05-01", 
    "description": "This paper presents a speech enhancement system that enables a comfortable communication inside an automobile. A couple of novel concepts are proposed in an effort to improve two major building blocks in the existing speech enhancement systems: a voice activity detector (VAD) and a noise filtering algorithm. The proposed VAD classifies a given data frame as speech or noise at each frequency, enabling the frequency-wise updates of noise statistics and thereby improving the effectiveness of the noise filtering algorithms by providing more up-to-date noise statistics. The celebrated Wiener filter is adopted in this paper as the accompanying noise filtering algorithm, which results in significant noise suppression. Yet, the musical noise present in most Wiener filter-based systems prompts the idea of applying the Wiener filter in the Mel-scale in which the human auditory system responds to the external stimulation. It turns out that the Mel-scale Wiener filter creates some masking effects and thereby reduces musical noise significantly, leading to smooth transition between data frames.", 
    "genre": "research_article", 
    "id": "sg:pub.10.1007/bf02916349", 
    "inLanguage": [
      "en"
    ], 
    "isAccessibleForFree": false, 
    "isPartOf": [
      {
        "id": "sg:journal.1295111", 
        "issn": [
          "1011-8861", 
          "1226-4865"
        ], 
        "name": "Journal of Mechanical Science and Technology", 
        "type": "Periodical"
      }, 
      {
        "issueNumber": "5", 
        "type": "PublicationIssue"
      }, 
      {
        "type": "PublicationVolume", 
        "volumeNumber": "21"
      }
    ], 
    "name": "Speech enhancement via Mel-scale Wiener filtering with a frequency-wise voice activity detector", 
    "pagination": "708", 
    "productId": [
      {
        "name": "readcube_id", 
        "type": "PropertyValue", 
        "value": [
          "ed5f61a23e7199e7b5029ca4a812ed635a5ebc10c3b087f2fcc6132d5eebd035"
        ]
      }, 
      {
        "name": "doi", 
        "type": "PropertyValue", 
        "value": [
          "10.1007/bf02916349"
        ]
      }, 
      {
        "name": "dimensions_id", 
        "type": "PropertyValue", 
        "value": [
          "pub.1049829582"
        ]
      }
    ], 
    "sameAs": [
      "https://doi.org/10.1007/bf02916349", 
      "https://app.dimensions.ai/details/publication/pub.1049829582"
    ], 
    "sdDataset": "articles", 
    "sdDatePublished": "2019-04-11T13:54", 
    "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
    "sdPublisher": {
      "name": "Springer Nature - SN SciGraph project", 
      "type": "Organization"
    }, 
    "sdSource": "s3://com-uberresearch-data-dimensions-target-20181106-alternative/cleanup/v134/2549eaecd7973599484d7c17b260dba0a4ecb94b/merge/v9/a6c9fde33151104705d4d7ff012ea9563521a3ce/jats-lookup/v90/0000000371_0000000371/records_130808_00000004.jsonl", 
    "type": "ScholarlyArticle", 
    "url": "http://link.springer.com/10.1007%2FBF02916349"
  }
]
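
Because JSON-LD is plain JSON, a record of this shape can be read with any JSON parser. The sketch below uses Python's standard `json` module on a hypothetical, trimmed record modelled on the fields shown here; the repeated citation entry is included on purpose to exercise the de-duplication step, and the helper names `author_names` and `unique_citation_ids` are illustrative, not part of any SciGraph tooling.

```python
import json

# Hypothetical, trimmed record shaped like a SciGraph JSON-LD export.
# One citation is repeated deliberately to show de-duplication.
RECORD_JSON = """
[
  {
    "id": "sg:pub.10.1007/bf02916349",
    "author": [
      {"familyName": "Kim", "givenName": "Hwa Soo", "type": "Person"},
      {"familyName": "Cho", "givenName": "Young Man", "type": "Person"},
      {"familyName": "Kim", "givenName": "Han-Jun", "type": "Person"}
    ],
    "citation": [
      {"id": "sg:pub.10.1007/bf03185797", "type": "CreativeWork"},
      {"id": "https://doi.org/10.1016/s0165-1684(03)00061-6", "type": "CreativeWork"},
      {"id": "https://doi.org/10.1016/s0165-1684(03)00061-6", "type": "CreativeWork"},
      {"id": "https://doi.org/10.1109/89.279283", "type": "CreativeWork"}
    ]
  }
]
"""


def author_names(record):
    """Return 'Given Family' strings in the order the record lists them."""
    return [f"{a['givenName']} {a['familyName']}" for a in record.get("author", [])]


def unique_citation_ids(record):
    """Return citation ids with repeats dropped, keeping first-seen order."""
    seen, ordered = set(), []
    for citation in record.get("citation", []):
        cid = citation["id"]
        if cid not in seen:
            seen.add(cid)
            ordered.append(cid)
    return ordered


record = json.loads(RECORD_JSON)[0]  # the export is a one-element JSON array
print(author_names(record))
print(unique_citation_ids(record))
```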
 


HOW TO GET THIS DATA PROGRAMMATICALLY:

JSON-LD is a popular format for linked data which is fully compatible with JSON.

curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1007/bf02916349'

N-Triples is a line-based linked data format ideal for batch operations.

curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1007/bf02916349'

Turtle is a human-readable linked data format.

curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1007/bf02916349'

RDF/XML is a standard XML format for linked data.

curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1007/bf02916349'
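
The four curl calls differ only in their Accept header, so the same content negotiation can be sketched in Python with the standard `urllib.request` module. The short format keys (`json-ld`, `nt`, `turtle`, `xml`) and the function names are assumptions made for this example; the MIME types and URL pattern are taken from the commands above. Whether the endpoint still answers is not something this sketch can guarantee, so the network call is kept in a separate function.

```python
import urllib.request

BASE_URL = "https://scigraph.springernature.com/"

# Short keys are this example's own naming; MIME types match the curl commands.
ACCEPT_TYPES = {
    "json-ld": "application/ld+json",
    "nt": "application/n-triples",
    "turtle": "text/turtle",
    "xml": "application/rdf+xml",
}


def build_request(record_id, fmt="json-ld"):
    """Build a GET request whose Accept header selects the RDF serialization."""
    return urllib.request.Request(
        BASE_URL + record_id,
        headers={"Accept": ACCEPT_TYPES[fmt]},
    )


def fetch(record_id, fmt="json-ld", timeout=10):
    """Perform the request and return the body as text (requires network access)."""
    with urllib.request.urlopen(build_request(record_id, fmt), timeout=timeout) as resp:
        return resp.read().decode("utf-8")


# Building the request does not touch the network.
req = build_request("pub.10.1007/bf02916349", "turtle")
print(req.full_url, req.get_header("Accept"))
```

Calling `fetch("pub.10.1007/bf02916349", "nt")` would correspond to the N-Triples curl command.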


 

This table displays all metadata directly associated with this object as RDF triples.

110 TRIPLES      21 PREDICATES      38 URIs      19 LITERALS      7 BLANK NODES

Subject Predicate Object
1 sg:pub.10.1007/bf02916349 schema:about anzsrc-for:09
2 anzsrc-for:0906
3 schema:author N1d6fb2a13d0843fbbbc885655081e0cd
4 schema:citation sg:pub.10.1007/bf02984050
5 sg:pub.10.1007/bf03185797
6 https://app.dimensions.ai/details/publication/pub.1048496408
7 https://doi.org/10.1016/0885-2308(89)90027-2
8 https://doi.org/10.1016/s0165-1684(03)00061-6
9 https://doi.org/10.1109/89.279283
10 https://doi.org/10.1109/89.928915
11 https://doi.org/10.1109/97.736233
12 https://doi.org/10.1109/icassp.1979.1170788
13 https://doi.org/10.1109/icassp.1982.1171716
14 https://doi.org/10.1109/tassp.1979.1163209
15 schema:datePublished 2007-05
16 schema:datePublishedReg 2007-05-01
17 schema:description This paper presents a speech enhancement system that enables a comfortable communication inside an automobile. A couple of novel concepts are proposed in an effort to improve two major building blocks in the existing speech enhancement systems: a voice activity detector (VAD) and a noise filtering algorithm. The proposed VAD classifies a given data frame as speech or noise at each frequency, enabling the frequency-wise updates of noise statistics and thereby improving the effectiveness of the noise filtering algorithms by providing more up-to-date noise statistics. The celebrated Wiener filter is adopted in this paper as the accompanying noise filtering algorithm, which results in significant noise suppression. Yet, the musical noise present in most Wiener filter-based systems prompts the idea of applying the Wiener filter in the Mel-scale in which the human auditory system responds to the external stimulation. It turns out that the Mel-scale Wiener filter creates some masking effects and thereby reduces musical noise significantly, leading to smooth transition between data frames.
18 schema:genre research_article
19 schema:inLanguage en
20 schema:isAccessibleForFree false
21 schema:isPartOf N19dc4b1cecd4428fbb7d3efe98eb2df4
22 N5d1a20c6b78448bfaae830aacbe5d757
23 sg:journal.1295111
24 schema:name Speech enhancement via Mel-scale Wiener filtering with a frequency-wise voice activity detector
25 schema:pagination 708
26 schema:productId N51b0c27209a24af6a1f27c050533bd45
27 N82e939a5061a4d0a82c73e2598b5e079
28 Ndc8fe7a4199541fa9d1ab2af08b66b71
29 schema:sameAs https://app.dimensions.ai/details/publication/pub.1049829582
30 https://doi.org/10.1007/bf02916349
31 schema:sdDatePublished 2019-04-11T13:54
32 schema:sdLicense https://scigraph.springernature.com/explorer/license/
33 schema:sdPublisher N0412e15d58314e53a53c65237341aebb
34 schema:url http://link.springer.com/10.1007%2FBF02916349
35 sgo:license sg:explorer/license/
36 sgo:sdDataset articles
37 rdf:type schema:ScholarlyArticle
38 N0412e15d58314e53a53c65237341aebb schema:name Springer Nature - SN SciGraph project
39 rdf:type schema:Organization
40 N19dc4b1cecd4428fbb7d3efe98eb2df4 schema:issueNumber 5
41 rdf:type schema:PublicationIssue
42 N1d6fb2a13d0843fbbbc885655081e0cd rdf:first sg:person.016513155261.16
43 rdf:rest N75eec563679a43478a7e228d44be34dd
44 N51b0c27209a24af6a1f27c050533bd45 schema:name dimensions_id
45 schema:value pub.1049829582
46 rdf:type schema:PropertyValue
47 N5d1a20c6b78448bfaae830aacbe5d757 schema:volumeNumber 21
48 rdf:type schema:PublicationVolume
49 N75eec563679a43478a7e228d44be34dd rdf:first sg:person.012176153337.61
50 rdf:rest N7f3b9741d1ff42a7a06c0ecfcefc245c
51 N7f3b9741d1ff42a7a06c0ecfcefc245c rdf:first Nda25a431aa4f4c608c3c03e6e79ca335
52 rdf:rest rdf:nil
53 N82e939a5061a4d0a82c73e2598b5e079 schema:name readcube_id
54 schema:value ed5f61a23e7199e7b5029ca4a812ed635a5ebc10c3b087f2fcc6132d5eebd035
55 rdf:type schema:PropertyValue
56 Nc74de6c243c3419baedff2c0d863d390 schema:name Finetec Century, 1002, Daechi-dong Gangnam-gu, 135-280, Seoul, Korea
57 rdf:type schema:Organization
58 Nda25a431aa4f4c608c3c03e6e79ca335 schema:affiliation Nc74de6c243c3419baedff2c0d863d390
59 schema:familyName Kim
60 schema:givenName Han-Jun
61 rdf:type schema:Person
62 Ndc8fe7a4199541fa9d1ab2af08b66b71 schema:name doi
63 schema:value 10.1007/bf02916349
64 rdf:type schema:PropertyValue
65 anzsrc-for:09 schema:inDefinedTermSet anzsrc-for:
66 schema:name Engineering
67 rdf:type schema:DefinedTerm
68 anzsrc-for:0906 schema:inDefinedTermSet anzsrc-for:
69 schema:name Electrical and Electronic Engineering
70 rdf:type schema:DefinedTerm
71 sg:journal.1295111 schema:issn 1011-8861
72 1226-4865
73 schema:name Journal of Mechanical Science and Technology
74 rdf:type schema:Periodical
75 sg:person.012176153337.61 schema:affiliation https://www.grid.ac/institutes/grid.31501.36
76 schema:familyName Cho
77 schema:givenName Young Man
78 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.012176153337.61
79 rdf:type schema:Person
80 sg:person.016513155261.16 schema:affiliation https://www.grid.ac/institutes/grid.31501.36
81 schema:familyName Kim
82 schema:givenName Hwa Soo
83 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.016513155261.16
84 rdf:type schema:Person
85 sg:pub.10.1007/bf02984050 schema:sameAs https://app.dimensions.ai/details/publication/pub.1044625677
86 https://doi.org/10.1007/bf02984050
87 rdf:type schema:CreativeWork
88 sg:pub.10.1007/bf03185797 schema:sameAs https://app.dimensions.ai/details/publication/pub.1013474699
89 https://doi.org/10.1007/bf03185797
90 rdf:type schema:CreativeWork
91 https://app.dimensions.ai/details/publication/pub.1048496408 rdf:type schema:CreativeWork
92 https://doi.org/10.1016/0885-2308(89)90027-2 schema:sameAs https://app.dimensions.ai/details/publication/pub.1023489192
93 rdf:type schema:CreativeWork
94 https://doi.org/10.1016/s0165-1684(03)00061-6 schema:sameAs https://app.dimensions.ai/details/publication/pub.1015511372
95 rdf:type schema:CreativeWork
96 https://doi.org/10.1109/89.279283 schema:sameAs https://app.dimensions.ai/details/publication/pub.1061242238
97 rdf:type schema:CreativeWork
98 https://doi.org/10.1109/89.928915 schema:sameAs https://app.dimensions.ai/details/publication/pub.1061242685
99 rdf:type schema:CreativeWork
100 https://doi.org/10.1109/97.736233 schema:sameAs https://app.dimensions.ai/details/publication/pub.1061251273
101 rdf:type schema:CreativeWork
102 https://doi.org/10.1109/icassp.1979.1170788 schema:sameAs https://app.dimensions.ai/details/publication/pub.1086222005
103 rdf:type schema:CreativeWork
104 https://doi.org/10.1109/icassp.1982.1171716 schema:sameAs https://app.dimensions.ai/details/publication/pub.1086230986
105 rdf:type schema:CreativeWork
106 https://doi.org/10.1109/tassp.1979.1163209 schema:sameAs https://app.dimensions.ai/details/publication/pub.1061518517
107 rdf:type schema:CreativeWork
108 https://www.grid.ac/institutes/grid.31501.36 schema:alternateName Seoul National University
109 schema:name School of Mechanical and Aerospace Engineering, Seoul National University, San 56-1, Shillim-dong Kwanak-gu, 151-744, Seoul, Korea
110 rdf:type schema:Organization
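
In the table, the author order is stored as an RDF list: the `schema:author` object (row 3) is a blank node, and rows 42-43 and 49-52 chain the entries together with `rdf:first` and `rdf:rest` until `rdf:nil`. A minimal sketch of walking that chain follows, with the six triples copied from those rows into a plain Python list; the function name `rdf_list_items` is hypothetical and no RDF library is assumed.

```python
RDF_NIL = "rdf:nil"

# (subject, predicate, object) triples copied from rows 42-43 and 49-52.
TRIPLES = [
    ("N1d6fb2a13d0843fbbbc885655081e0cd", "rdf:first", "sg:person.016513155261.16"),
    ("N1d6fb2a13d0843fbbbc885655081e0cd", "rdf:rest", "N75eec563679a43478a7e228d44be34dd"),
    ("N75eec563679a43478a7e228d44be34dd", "rdf:first", "sg:person.012176153337.61"),
    ("N75eec563679a43478a7e228d44be34dd", "rdf:rest", "N7f3b9741d1ff42a7a06c0ecfcefc245c"),
    ("N7f3b9741d1ff42a7a06c0ecfcefc245c", "rdf:first", "Nda25a431aa4f4c608c3c03e6e79ca335"),
    ("N7f3b9741d1ff42a7a06c0ecfcefc245c", "rdf:rest", "rdf:nil"),
]


def rdf_list_items(triples, head):
    """Follow rdf:first / rdf:rest from the head node and return items in order."""
    first = {s: o for s, p, o in triples if p == "rdf:first"}
    rest = {s: o for s, p, o in triples if p == "rdf:rest"}
    items, visited, node = [], set(), head
    while node != RDF_NIL:
        if node in visited:  # guard against a malformed, cyclic list
            raise ValueError(f"cycle detected at {node}")
        visited.add(node)
        items.append(first[node])
        node = rest[node]
    return items


# Head node is the object of schema:author in row 3.
authors = rdf_list_items(TRIPLES, "N1d6fb2a13d0843fbbbc885655081e0cd")
print(authors)
```

The third item is itself a blank node (rows 58-61 describe it as Han-Jun Kim), which is why it appears as an `N…` identifier rather than an `sg:person` URI.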
 



