UPM Institutional Repository

A query-based framework to handle data issues in concatenative sound synthesis


Citation

Mohd. Norowi, Noris and Miranda, Eduardo Reck (2014) A query-based framework to handle data issues in concatenative sound synthesis. In: Malaysian National Conference of Databases 2014 (MaNCoD 2014), 17 Sept. 2014, Universiti Putra Malaysia, Serdang, Selangor. (pp. 81-86). (Unpublished)

Abstract

Concatenative Sound Synthesis (CSS) is a data-driven method for synthesising new sounds. It involves taking in a sound, decomposing it into smaller sound segments, analysing their spectral and other auditory content, and then searching a database of other sound segments for matching segments. The selected segments are then concatenated in sequence and resynthesised to produce new sounds that are based on the original. However, with the increase in processing power, hard disk capacity and network bandwidth, the amount of audio information that can be extracted from each sound segment can become too large, rendering it useless in aiding the matching process. This study looks at the current approaches adopted in matching sound segments in CSS and discusses the challenges that arise from them. These include the trade-offs of extracting huge, multi-dimensional audio features and the need to understand human sound perception in order to minimise the synthesis of mismatching segments. To improve similarity results, a query-based CSS framework is proposed. A proof-of-concept system, ConQuer, was also developed based on this framework; it offers users parametric control so that they can communicate their intended creations to the system for synthesis.
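The general pipeline described in the abstract (segment, analyse, match against a database, concatenate) can be illustrated with a minimal sketch. Everything specific in it is an assumption made for illustration and is not taken from the paper or from ConQuer: the fixed-length segmentation, the two-value feature vector (RMS energy and zero-crossing rate), the Euclidean distance, and all function names are hypothetical.

```python
import math


def split_segments(signal, size):
    """Cut a signal (list of floats) into consecutive fixed-length segments.

    Assumption: fixed-length, non-overlapping segments with size >= 2;
    trailing samples that do not fill a whole segment are dropped.
    """
    return [signal[i:i + size] for i in range(0, len(signal) - size + 1, size)]


def describe(segment):
    """Return a tiny, illustrative feature vector: (RMS energy, zero-crossing rate)."""
    rms = math.sqrt(sum(x * x for x in segment) / len(segment))
    crossings = sum(1 for a, b in zip(segment, segment[1:]) if (a < 0) != (b < 0))
    return (rms, crossings / (len(segment) - 1))


def concatenative_resynthesis(target, corpus, size):
    """Replace each target segment with the closest corpus segment.

    Assumption: "closest" means smallest Euclidean distance between
    feature vectors; the corpus must contain at least one full segment.
    """
    units = split_segments(corpus, size)
    unit_features = [describe(u) for u in units]
    output = []
    for seg in split_segments(target, size):
        feats = describe(seg)
        best = min(range(len(units)),
                   key=lambda i: math.dist(feats, unit_features[i]))
        output.extend(units[best])  # concatenate the selected unit
    return output


if __name__ == "__main__":
    # Corpus: one quiet, rapidly alternating unit and one loud, steady unit.
    corpus = [0.1, -0.1, 0.1, -0.1, 0.9, 0.9, 0.9, 0.9]
    # Target: a loud steady segment followed by a quiet alternating one.
    target = [0.8, 0.8, 0.8, 0.8, 0.2, -0.2, 0.2, -0.2]
    print(concatenative_resynthesis(target, corpus, 4))
```

With only two features the nearest match is easy to inspect by hand. The abstract's concern is the opposite situation, in which the number of extracted features grows so large that distances of this kind stop being a useful guide to perceived similarity.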


Official URL or Download Paper: http://mancod2014.blogspot.my/p/proceedings.html

Additional Metadata

Item Type: Conference or Workshop Item (Paper)
Divisions: Faculty of Computer Science and Information Technology
Keywords: Concatenative sound synthesis; Query-based framework; Multi-dimensional audio features; Sound similarity; ConQuer
Depositing User: Nursyafinaz Mohd Noh
Date Deposited: 18 Jun 2015 06:33
Last Modified: 29 Jul 2016 08:12
URI: http://psasir.upm.edu.my/id/eprint/38837