Adaptive music resizing with stretching, cropping and insertion View Full Text


Ontology type: schema:ScholarlyArticle     


Article Info

DATE

2012-07-26

AUTHORS

Zhang Liu, Chaokun Wang, Jianmin Wang, Hao Wang, Yiyuan Bai

ABSTRACT

Content-aware music adaption, i.e. music resizing, in temporal constraints starts drawing attention from multimedia communities, because there are plenty of real-world scenarios, e.g. animation production and radio advertisement production. The goal of music resizing is to change the length of a music track to a user preferred length using a series of basic operations, e.g. compression, prolonging, cropping and insertion. The only existing music resizing approach so far, called LyDAR, is based on the lyrics analysis and just utilizes the compression operation to resize a music piece. As a result, LyDAR suffers from some limitations, e.g., it can neither prolong a music track nor compress music pieces with very small stretch rates. In this paper, we propose a content-aware music resizing framework, named MUSIZ. In general, MUSIZ outperforms LyDAR in three aspects: (a) Except for the compression operation, MUSIZ takes advantages of prolonging, cropping and insertion operations to handle the resizing requests of both compression and prolonging. (b) Observing the diversity of quality degradation for different segments, we propose the concept of stretch-resistance to measure the degree of quality degradation after a segment is stretched. The stretch-resistance is modeled based on both acoustical and lyrics features. (c) Cropping and insertion operations are utilized before stretching. We develop the contiguity-preservative cropping and insertion algorithms to remove and insert music segments while smoothing the abrupt change at the joint between the manipulated segments. Comprehensive user studies show that the music tracks resized by MUSIZ achieve better quality than those produced by existing approaches. More... »

PAGES

359-380

References to SciGraph publications

Identifiers

URI

http://scigraph.springernature.com/pub.10.1007/s00530-012-0289-6

DOI

http://dx.doi.org/10.1007/s00530-012-0289-6

DIMENSIONS

https://app.dimensions.ai/details/publication/pub.1044163052


Indexing Status Check whether this publication has been indexed by Scopus and Web Of Science using the SN Indexing Status Tool
Incoming Citations Browse incoming citations for this publication using opencitations.net

JSON-LD is the canonical representation for SciGraph data.

TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT

[
  {
    "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
    "about": [
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/08", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Information and Computing Sciences", 
        "type": "DefinedTerm"
      }, 
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0801", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Artificial Intelligence and Image Processing", 
        "type": "DefinedTerm"
      }
    ], 
    "author": [
      {
        "affiliation": {
          "alternateName": "Department of Computer Science and Technology, Tsinghua University, 100084, Beijing, People\u2019s Republic of China", 
          "id": "http://www.grid.ac/institutes/grid.12527.33", 
          "name": [
            "Department of Computer Science and Technology, Tsinghua University, 100084, Beijing, People\u2019s Republic of China"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Liu", 
        "givenName": "Zhang", 
        "id": "sg:person.016566115360.45", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.016566115360.45"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "School of Software, Tsinghua University, 100084, Beijing, People\u2019s Republic of China", 
          "id": "http://www.grid.ac/institutes/grid.12527.33", 
          "name": [
            "School of Software, Tsinghua University, 100084, Beijing, People\u2019s Republic of China"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Wang", 
        "givenName": "Chaokun", 
        "id": "sg:person.011533665437.99", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.011533665437.99"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "School of Software, Tsinghua University, 100084, Beijing, People\u2019s Republic of China", 
          "id": "http://www.grid.ac/institutes/grid.12527.33", 
          "name": [
            "School of Software, Tsinghua University, 100084, Beijing, People\u2019s Republic of China"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Wang", 
        "givenName": "Jianmin", 
        "id": "sg:person.012303351315.43", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.012303351315.43"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Department of Computer Science and Technology, Tsinghua University, 100084, Beijing, People\u2019s Republic of China", 
          "id": "http://www.grid.ac/institutes/grid.12527.33", 
          "name": [
            "Department of Computer Science and Technology, Tsinghua University, 100084, Beijing, People\u2019s Republic of China"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Wang", 
        "givenName": "Hao", 
        "id": "sg:person.013572656645.92", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.013572656645.92"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Department of Computer Science and Technology, Tsinghua University, 100084, Beijing, People\u2019s Republic of China", 
          "id": "http://www.grid.ac/institutes/grid.12527.33", 
          "name": [
            "Department of Computer Science and Technology, Tsinghua University, 100084, Beijing, People\u2019s Republic of China"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Bai", 
        "givenName": "Yiyuan", 
        "id": "sg:person.014437414013.41", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.014437414013.41"
        ], 
        "type": "Person"
      }
    ], 
    "citation": [
      {
        "id": "sg:pub.10.1007/s11042-005-0895-9", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1012319174", 
          "https://doi.org/10.1007/s11042-005-0895-9"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1007/0-387-28958-5", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1001637565", 
          "https://doi.org/10.1007/0-387-28958-5"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1038/35002078", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1038110076", 
          "https://doi.org/10.1038/35002078"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1007/s10994-006-0586-4", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1025961601", 
          "https://doi.org/10.1007/s10994-006-0586-4"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1007/s00530-002-0065-0", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1053346413", 
          "https://doi.org/10.1007/s00530-002-0065-0"
        ], 
        "type": "CreativeWork"
      }
    ], 
    "datePublished": "2012-07-26", 
    "datePublishedReg": "2012-07-26", 
    "description": "Content-aware music adaption, i.e. music resizing, in temporal constraints starts drawing attention from multimedia communities, because there are plenty of real-world scenarios, e.g. animation production and radio advertisement production. The goal of music resizing is to change the length of a music track to a user preferred length using a series of basic operations, e.g. compression, prolonging, cropping and insertion. The only existing music resizing approach so far, called LyDAR, is based on the lyrics analysis and just utilizes the compression operation to resize a music piece. As a result, LyDAR suffers from some limitations, e.g., it can neither prolong a music track nor compress music pieces with very small stretch rates. In this paper, we propose a content-aware music resizing framework, named MUSIZ. In general, MUSIZ outperforms LyDAR in three aspects: (a) Except for the compression operation, MUSIZ takes advantages of prolonging, cropping and insertion operations to handle the resizing requests of both compression and prolonging. (b) Observing the diversity of quality degradation for different segments, we propose the concept of stretch-resistance to measure the degree of quality degradation after a segment is stretched. The stretch-resistance is modeled based on both acoustical and lyrics features. (c) Cropping and insertion operations are utilized before stretching. We develop the contiguity-preservative cropping and insertion algorithms to remove and insert music segments while smoothing the abrupt change at the joint between the manipulated segments. Comprehensive user studies show that the music tracks resized by MUSIZ achieve better quality than those produced by existing approaches.", 
    "genre": "article", 
    "id": "sg:pub.10.1007/s00530-012-0289-6", 
    "inLanguage": "en", 
    "isAccessibleForFree": false, 
    "isFundedItemOf": [
      {
        "id": "sg:grant.6987449", 
        "type": "MonetaryGrant"
      }, 
      {
        "id": "sg:grant.4907311", 
        "type": "MonetaryGrant"
      }, 
      {
        "id": "sg:grant.8291830", 
        "type": "MonetaryGrant"
      }
    ], 
    "isPartOf": [
      {
        "id": "sg:journal.1358858", 
        "issn": [
          "0942-4962", 
          "1432-1882"
        ], 
        "name": "Multimedia Systems", 
        "publisher": "Springer Nature", 
        "type": "Periodical"
      }, 
      {
        "issueNumber": "4", 
        "type": "PublicationIssue"
      }, 
      {
        "type": "PublicationVolume", 
        "volumeNumber": "19"
      }
    ], 
    "keywords": [
      "music tracks", 
      "compression operation", 
      "real-world scenarios", 
      "quality degradation", 
      "comprehensive user study", 
      "insertion operation", 
      "music pieces", 
      "multimedia community", 
      "user study", 
      "adaptive music", 
      "temporal constraints", 
      "animation production", 
      "lyrics features", 
      "advertisement production", 
      "music segments", 
      "basic operations", 
      "insertion algorithm", 
      "operation", 
      "resizing", 
      "users", 
      "good quality", 
      "algorithm", 
      "requests", 
      "track", 
      "compression", 
      "cropping", 
      "scenarios", 
      "constraints", 
      "framework", 
      "lyric analysis", 
      "small stretch rates", 
      "music", 
      "adaption", 
      "pieces", 
      "advantages", 
      "features", 
      "goal", 
      "concept", 
      "quality", 
      "different segments", 
      "limitations", 
      "prolonging", 
      "segments", 
      "aspects", 
      "attention", 
      "community", 
      "abrupt changes", 
      "results", 
      "analysis", 
      "diversity", 
      "degree", 
      "degradation", 
      "insertion", 
      "length", 
      "series", 
      "rate", 
      "joints", 
      "study", 
      "changes", 
      "production", 
      "stretching", 
      "approach", 
      "paper", 
      "stretch rate", 
      "Content-aware music adaption", 
      "music adaption", 
      "radio advertisement production", 
      "music resizing", 
      "LyDAR", 
      "compress music pieces", 
      "content-aware music", 
      "MUSIZ", 
      "advantages of prolonging", 
      "contiguity-preservative cropping"
    ], 
    "name": "Adaptive music resizing with stretching, cropping and insertion", 
    "pagination": "359-380", 
    "productId": [
      {
        "name": "dimensions_id", 
        "type": "PropertyValue", 
        "value": [
          "pub.1044163052"
        ]
      }, 
      {
        "name": "doi", 
        "type": "PropertyValue", 
        "value": [
          "10.1007/s00530-012-0289-6"
        ]
      }
    ], 
    "sameAs": [
      "https://doi.org/10.1007/s00530-012-0289-6", 
      "https://app.dimensions.ai/details/publication/pub.1044163052"
    ], 
    "sdDataset": "articles", 
    "sdDatePublished": "2022-01-01T18:28", 
    "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
    "sdPublisher": {
      "name": "Springer Nature - SN SciGraph project", 
      "type": "Organization"
    }, 
    "sdSource": "s3://com-springernature-scigraph/baseset/20220101/entities/gbq_results/article/article_578.jsonl", 
    "type": "ScholarlyArticle", 
    "url": "https://doi.org/10.1007/s00530-012-0289-6"
  }
]
 

Download the RDF metadata as:  json-ld nt turtle xml License info

HOW TO GET THIS DATA PROGRAMMATICALLY:

JSON-LD is a popular format for linked data which is fully compatible with JSON.

curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1007/s00530-012-0289-6'

N-Triples is a line-based linked data format ideal for batch operations.

curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1007/s00530-012-0289-6'

Turtle is a human-readable linked data format.

curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1007/s00530-012-0289-6'

RDF/XML is a standard XML format for linked data.

curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1007/s00530-012-0289-6'


 

This table displays all metadata directly associated to this object as RDF triples.

188 TRIPLES      22 PREDICATES      104 URIs      91 LITERALS      6 BLANK NODES

Subject Predicate Object
1 sg:pub.10.1007/s00530-012-0289-6 schema:about anzsrc-for:08
2 anzsrc-for:0801
3 schema:author Ne92a99f504d84999a11950250faf8f74
4 schema:citation sg:pub.10.1007/0-387-28958-5
5 sg:pub.10.1007/s00530-002-0065-0
6 sg:pub.10.1007/s10994-006-0586-4
7 sg:pub.10.1007/s11042-005-0895-9
8 sg:pub.10.1038/35002078
9 schema:datePublished 2012-07-26
10 schema:datePublishedReg 2012-07-26
11 schema:description Content-aware music adaption, i.e. music resizing, in temporal constraints starts drawing attention from multimedia communities, because there are plenty of real-world scenarios, e.g. animation production and radio advertisement production. The goal of music resizing is to change the length of a music track to a user preferred length using a series of basic operations, e.g. compression, prolonging, cropping and insertion. The only existing music resizing approach so far, called LyDAR, is based on the lyrics analysis and just utilizes the compression operation to resize a music piece. As a result, LyDAR suffers from some limitations, e.g., it can neither prolong a music track nor compress music pieces with very small stretch rates. In this paper, we propose a content-aware music resizing framework, named MUSIZ. In general, MUSIZ outperforms LyDAR in three aspects: (a) Except for the compression operation, MUSIZ takes advantages of prolonging, cropping and insertion operations to handle the resizing requests of both compression and prolonging. (b) Observing the diversity of quality degradation for different segments, we propose the concept of stretch-resistance to measure the degree of quality degradation after a segment is stretched. The stretch-resistance is modeled based on both acoustical and lyrics features. (c) Cropping and insertion operations are utilized before stretching. We develop the contiguity-preservative cropping and insertion algorithms to remove and insert music segments while smoothing the abrupt change at the joint between the manipulated segments. Comprehensive user studies show that the music tracks resized by MUSIZ achieve better quality than those produced by existing approaches.
12 schema:genre article
13 schema:inLanguage en
14 schema:isAccessibleForFree false
15 schema:isPartOf Nb784a84c68a441b59901a20835141c5e
16 Ne1fdcba154284905b090a20b708ef887
17 sg:journal.1358858
18 schema:keywords Content-aware music adaption
19 LyDAR
20 MUSIZ
21 abrupt changes
22 adaption
23 adaptive music
24 advantages
25 advantages of prolonging
26 advertisement production
27 algorithm
28 analysis
29 animation production
30 approach
31 aspects
32 attention
33 basic operations
34 changes
35 community
36 comprehensive user study
37 compress music pieces
38 compression
39 compression operation
40 concept
41 constraints
42 content-aware music
43 contiguity-preservative cropping
44 cropping
45 degradation
46 degree
47 different segments
48 diversity
49 features
50 framework
51 goal
52 good quality
53 insertion
54 insertion algorithm
55 insertion operation
56 joints
57 length
58 limitations
59 lyric analysis
60 lyrics features
61 multimedia community
62 music
63 music adaption
64 music pieces
65 music resizing
66 music segments
67 music tracks
68 operation
69 paper
70 pieces
71 production
72 prolonging
73 quality
74 quality degradation
75 radio advertisement production
76 rate
77 real-world scenarios
78 requests
79 resizing
80 results
81 scenarios
82 segments
83 series
84 small stretch rates
85 stretch rate
86 stretching
87 study
88 temporal constraints
89 track
90 user study
91 users
92 schema:name Adaptive music resizing with stretching, cropping and insertion
93 schema:pagination 359-380
94 schema:productId N344b2a13f07c4cc0b41c39b2b56cb098
95 N60c5ec32d1e243b49a152d7ceb7d356f
96 schema:sameAs https://app.dimensions.ai/details/publication/pub.1044163052
97 https://doi.org/10.1007/s00530-012-0289-6
98 schema:sdDatePublished 2022-01-01T18:28
99 schema:sdLicense https://scigraph.springernature.com/explorer/license/
100 schema:sdPublisher Nf39edbe15f6e41b2a10c09aca6209556
101 schema:url https://doi.org/10.1007/s00530-012-0289-6
102 sgo:license sg:explorer/license/
103 sgo:sdDataset articles
104 rdf:type schema:ScholarlyArticle
105 N1553a8a30018486f8b078f438cc7690a rdf:first sg:person.013572656645.92
106 rdf:rest Nc362381319ad40c3a4b6e580e4cb2e4d
107 N344b2a13f07c4cc0b41c39b2b56cb098 schema:name doi
108 schema:value 10.1007/s00530-012-0289-6
109 rdf:type schema:PropertyValue
110 N60c5ec32d1e243b49a152d7ceb7d356f schema:name dimensions_id
111 schema:value pub.1044163052
112 rdf:type schema:PropertyValue
113 Nb784a84c68a441b59901a20835141c5e schema:issueNumber 4
114 rdf:type schema:PublicationIssue
115 Nc362381319ad40c3a4b6e580e4cb2e4d rdf:first sg:person.014437414013.41
116 rdf:rest rdf:nil
117 Ncb30bfdb606e493fb40523197a1e4ab5 rdf:first sg:person.011533665437.99
118 rdf:rest Nf58e12b578eb4a86808c8e0e6dfd1262
119 Ne1fdcba154284905b090a20b708ef887 schema:volumeNumber 19
120 rdf:type schema:PublicationVolume
121 Ne92a99f504d84999a11950250faf8f74 rdf:first sg:person.016566115360.45
122 rdf:rest Ncb30bfdb606e493fb40523197a1e4ab5
123 Nf39edbe15f6e41b2a10c09aca6209556 schema:name Springer Nature - SN SciGraph project
124 rdf:type schema:Organization
125 Nf58e12b578eb4a86808c8e0e6dfd1262 rdf:first sg:person.012303351315.43
126 rdf:rest N1553a8a30018486f8b078f438cc7690a
127 anzsrc-for:08 schema:inDefinedTermSet anzsrc-for:
128 schema:name Information and Computing Sciences
129 rdf:type schema:DefinedTerm
130 anzsrc-for:0801 schema:inDefinedTermSet anzsrc-for:
131 schema:name Artificial Intelligence and Image Processing
132 rdf:type schema:DefinedTerm
133 sg:grant.4907311 http://pending.schema.org/fundedItem sg:pub.10.1007/s00530-012-0289-6
134 rdf:type schema:MonetaryGrant
135 sg:grant.6987449 http://pending.schema.org/fundedItem sg:pub.10.1007/s00530-012-0289-6
136 rdf:type schema:MonetaryGrant
137 sg:grant.8291830 http://pending.schema.org/fundedItem sg:pub.10.1007/s00530-012-0289-6
138 rdf:type schema:MonetaryGrant
139 sg:journal.1358858 schema:issn 0942-4962
140 1432-1882
141 schema:name Multimedia Systems
142 schema:publisher Springer Nature
143 rdf:type schema:Periodical
144 sg:person.011533665437.99 schema:affiliation grid-institutes:grid.12527.33
145 schema:familyName Wang
146 schema:givenName Chaokun
147 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.011533665437.99
148 rdf:type schema:Person
149 sg:person.012303351315.43 schema:affiliation grid-institutes:grid.12527.33
150 schema:familyName Wang
151 schema:givenName Jianmin
152 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.012303351315.43
153 rdf:type schema:Person
154 sg:person.013572656645.92 schema:affiliation grid-institutes:grid.12527.33
155 schema:familyName Wang
156 schema:givenName Hao
157 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.013572656645.92
158 rdf:type schema:Person
159 sg:person.014437414013.41 schema:affiliation grid-institutes:grid.12527.33
160 schema:familyName Bai
161 schema:givenName Yiyuan
162 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.014437414013.41
163 rdf:type schema:Person
164 sg:person.016566115360.45 schema:affiliation grid-institutes:grid.12527.33
165 schema:familyName Liu
166 schema:givenName Zhang
167 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.016566115360.45
168 rdf:type schema:Person
169 sg:pub.10.1007/0-387-28958-5 schema:sameAs https://app.dimensions.ai/details/publication/pub.1001637565
170 https://doi.org/10.1007/0-387-28958-5
171 rdf:type schema:CreativeWork
172 sg:pub.10.1007/s00530-002-0065-0 schema:sameAs https://app.dimensions.ai/details/publication/pub.1053346413
173 https://doi.org/10.1007/s00530-002-0065-0
174 rdf:type schema:CreativeWork
175 sg:pub.10.1007/s10994-006-0586-4 schema:sameAs https://app.dimensions.ai/details/publication/pub.1025961601
176 https://doi.org/10.1007/s10994-006-0586-4
177 rdf:type schema:CreativeWork
178 sg:pub.10.1007/s11042-005-0895-9 schema:sameAs https://app.dimensions.ai/details/publication/pub.1012319174
179 https://doi.org/10.1007/s11042-005-0895-9
180 rdf:type schema:CreativeWork
181 sg:pub.10.1038/35002078 schema:sameAs https://app.dimensions.ai/details/publication/pub.1038110076
182 https://doi.org/10.1038/35002078
183 rdf:type schema:CreativeWork
184 grid-institutes:grid.12527.33 schema:alternateName Department of Computer Science and Technology, Tsinghua University, 100084, Beijing, People’s Republic of China
185 School of Software, Tsinghua University, 100084, Beijing, People’s Republic of China
186 schema:name Department of Computer Science and Technology, Tsinghua University, 100084, Beijing, People’s Republic of China
187 School of Software, Tsinghua University, 100084, Beijing, People’s Republic of China
188 rdf:type schema:Organization
 




Preview window. Press ESC to close (or click here)


...