Please wait a minute...
New Technology of Library and Information Service  2012, Vol. 28 Issue (1): 40-45    DOI: 10.11925/infotech.1003-3513.2012.01.07
Current Issue | Archive | Adv Search |
Audio to Score Alignment Based on Chroma Features and Dynamic Time Warping Algorithm
Zhang Biqiao, Han Shenglong
Department of Information Management, Peking University, Beijing 100871, China
Download:
Export: BibTeX | EndNote (RIS)      
Abstract  This paper accomplishes the alignment between two kinds of score format—MIDI and MusicXML, and two kinds of audio format—WAV and MP3 based on chroma features and dynamic time warping algorithm. In addition, in the process of experiment and evaluation, this paper uses historical recordings as audio materials and labels the ground truth of measures manually in order to know the true effect of the algorithm in real world. The results of the experiment show that the algorithm has achieved the desired effect basically.
Key wordsAudio to score alignment      Chroma features      DTW algorithm     
Received: 17 October 2011      Published: 26 February 2012
: 

TP391.4

 

Cite this article:

Zhang Biqiao, Han Shenglong. Audio to Score Alignment Based on Chroma Features and Dynamic Time Warping Algorithm. New Technology of Library and Information Service, 2012, 28(1): 40-45.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2012.01.07     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2012/V28/I1/40

[1] Orio N. Music Retrieval: A Tutorial and Review [J]. Foundations and Trends in Information Retrieval,2006, 1(1): 1-90.

[2] Bloch J, Dannenberg R B. Real-time Accompaniment of Polyphonic Keyboard Performance [C]. In:Proceedings of the International Computer Music Conference,San Francisco, USA.1985: 279-290.

[3] Logan B. Mel Frequency Cepstral Coefficients for Music Modeling [C]. In: Proceedings of the International Society for Music Information Retrieval, Plymouth, Massachusetts, USA. 2000.

[4] Tzanetakis G,Ermolinskyi A,Cook P. Pitch Histograms in Audio and Symbolic Music Information Retrieval [C]. In: Proceedings of the International Society for Music Information Retrieval, Paris, France.2002: 31-38.

[5] Hu N, Dannenberg R B, Tzanetakis G. Polyphonic Audio Matching and Alignment for Music Retrieval[C]. In:Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics,New Paltz, NY, USA. 2003.

[6] Ewert S, Müller M, Grosche P. High Resolution Audio Synchronization Using Chroma Onset Features [C].In:Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing,Taipei, China.2009.

[7] Devaney J, Mandel M I, Ellis D P W. Improving MIDI-Audio Alignment with Acoustic Features[C]. In:Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics,New Paltz, NY, USA.2009.

[8] Ewert S, Müller M. Refinement Strategies for Music Synchronization[C]. In:Proceedings of the 5th International Synposium on Computer Music Modeling and Retrieval, Coperhagen, Denmark.2008.

[9] Raphael C. Automatic Segmentation of Acoustic Musical Signals Using Hidden Markov Models [J]. IEEE Transactions on PAMI, 1999, 21(4): 360-370.

[10] Cano P, Loscos A, Bonada J. Score-Performance Matching Using HMMs[C]. In:Proceedings of the International Computer Music Conference, San Francisco, USA. 1999: 441-444.

[11] Orio N, Dechelle F. Score Following Using Spectral Analysis and Hidden Markov Models [C]. In:Proceedings of the International Computer Music Conference,San Francisco, USA.2001: 151-154.

[12] Dixon S. Live Tracking of Musical Performances Using On-line Time Warping[C]. In:Proceedings of the 8th International Conference on Digital Audio Effects, Madrial, Spain.2005: 92-97.

[13] Cont A. A Coupled Duration-focused Architecture for Realtime Music to Score Alignment[J]. IEEE Transaction on PAMI, 2010,32(6): 974-987.

[14] Cont A. Realtime Audio to Score Alignment for Polyphonic Music Instruments Using Sparse Non-negative Constraints and Hierarchical HMMs[C]. In:Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Toulouse, France. 2006.

[15] Niedermayer B. Improving Accuracy of Polyphonic Music-to-Score alignment[C]. In: Proceedings of the International Society for Music Information Retrieval,Kobe, Japan.2009: 585-590.

[16] Niedermayer B, Widmer G. A Multi-pass Algorithm for Accurate Audio-to-Score Alignment[C]. In:Proceedings of the International Society for Music Information Retrieval, Utrecht, Netherlands. 2010: 417-422.

[17] Eerola T, Toiviainen P. MIDI Toolbox: MATLAB Tools for Music Research[EB/OL].[2011-08-17]. http://www.jyu.fi/hum/laitokset/musiikki/en/research/coe/materials/miditoolbox/.

[18] Nichols E, Morris D, Basu S. Relationships Between Lyrics and Melody in Popular Music[C]. In:Proceedings of the International Society for Music Information Retrieval, Kobe, Japan.2009: 471-476.

[19] Müller M, Ewert S. Chroma Toolbox: MATLAB Implementations for Extracting Variants of Chroma-based Audio Features[C].In:Proceedings of the International Conference on Music Information Retrieval, Miami,Florida. 2011: 215-220.

[20] Sankoff D, Kruskal J B. Time Warps, String Edits, and Macromolecules: The Theory and Practice of Sequence Comparison[M]. Addison Wesley, 1983.
[1] Zhu Xuefang,Zeng Wei,Zeng Na. Multi-book Detection for Self-service System in Libraries[J]. 现代图书情报技术, 2008, 24(9): 97-101.
[2] Han Shenglong . A New Method of Music Melody Extraction and Its Application[J]. 现代图书情报技术, 2007, 2(3): 77-79.
[3] Zhu Xuefang. Resampling and Recovery of Digital Color Images[J]. 现代图书情报技术, 2003, 19(3): 67-72.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn