|
|
Audio to Score Alignment Based on Chroma Features and Dynamic Time Warping Algorithm |
Zhang Biqiao, Han Shenglong |
Department of Information Management, Peking University, Beijing 100871, China |
|
|
Abstract This paper accomplishes the alignment between two kinds of score format—MIDI and MusicXML, and two kinds of audio format—WAV and MP3 based on chroma features and dynamic time warping algorithm. In addition, in the process of experiment and evaluation, this paper uses historical recordings as audio materials and labels the ground truth of measures manually in order to know the true effect of the algorithm in real world. The results of the experiment show that the algorithm has achieved the desired effect basically.
|
Received: 17 October 2011
Published: 26 February 2012
|
|
[1] Orio N. Music Retrieval: A Tutorial and Review [J]. Foundations and Trends in Information Retrieval,2006, 1(1): 1-90.[2] Bloch J, Dannenberg R B. Real-time Accompaniment of Polyphonic Keyboard Performance [C]. In:Proceedings of the International Computer Music Conference,San Francisco, USA.1985: 279-290.[3] Logan B. Mel Frequency Cepstral Coefficients for Music Modeling [C]. In: Proceedings of the International Society for Music Information Retrieval, Plymouth, Massachusetts, USA. 2000.[4] Tzanetakis G,Ermolinskyi A,Cook P. Pitch Histograms in Audio and Symbolic Music Information Retrieval [C]. In: Proceedings of the International Society for Music Information Retrieval, Paris, France.2002: 31-38.[5] Hu N, Dannenberg R B, Tzanetakis G. Polyphonic Audio Matching and Alignment for Music Retrieval[C]. In:Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics,New Paltz, NY, USA. 2003.[6] Ewert S, Müller M, Grosche P. High Resolution Audio Synchronization Using Chroma Onset Features [C].In:Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing,Taipei, China.2009.[7] Devaney J, Mandel M I, Ellis D P W. Improving MIDI-Audio Alignment with Acoustic Features[C]. In:Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics,New Paltz, NY, USA.2009.[8] Ewert S, Müller M. Refinement Strategies for Music Synchronization[C]. In:Proceedings of the 5th International Synposium on Computer Music Modeling and Retrieval, Coperhagen, Denmark.2008.[9] Raphael C. Automatic Segmentation of Acoustic Musical Signals Using Hidden Markov Models [J]. IEEE Transactions on PAMI, 1999, 21(4): 360-370.[10] Cano P, Loscos A, Bonada J. Score-Performance Matching Using HMMs[C]. In:Proceedings of the International Computer Music Conference, San Francisco, USA. 1999: 441-444.[11] Orio N, Dechelle F. Score Following Using Spectral Analysis and Hidden Markov Models [C]. In:Proceedings of the International Computer Music Conference,San Francisco, USA.2001: 151-154.[12] Dixon S. Live Tracking of Musical Performances Using On-line Time Warping[C]. In:Proceedings of the 8th International Conference on Digital Audio Effects, Madrial, Spain.2005: 92-97.[13] Cont A. A Coupled Duration-focused Architecture for Realtime Music to Score Alignment[J]. IEEE Transaction on PAMI, 2010,32(6): 974-987.[14] Cont A. Realtime Audio to Score Alignment for Polyphonic Music Instruments Using Sparse Non-negative Constraints and Hierarchical HMMs[C]. In:Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Toulouse, France. 2006.[15] Niedermayer B. Improving Accuracy of Polyphonic Music-to-Score alignment[C]. In: Proceedings of the International Society for Music Information Retrieval,Kobe, Japan.2009: 585-590.[16] Niedermayer B, Widmer G. A Multi-pass Algorithm for Accurate Audio-to-Score Alignment[C]. In:Proceedings of the International Society for Music Information Retrieval, Utrecht, Netherlands. 2010: 417-422.[17] Eerola T, Toiviainen P. MIDI Toolbox: MATLAB Tools for Music Research[EB/OL].[2011-08-17]. http://www.jyu.fi/hum/laitokset/musiikki/en/research/coe/materials/miditoolbox/.[18] Nichols E, Morris D, Basu S. Relationships Between Lyrics and Melody in Popular Music[C]. In:Proceedings of the International Society for Music Information Retrieval, Kobe, Japan.2009: 471-476.[19] Müller M, Ewert S. Chroma Toolbox: MATLAB Implementations for Extracting Variants of Chroma-based Audio Features[C].In:Proceedings of the International Conference on Music Information Retrieval, Miami,Florida. 2011: 215-220.[20] Sankoff D, Kruskal J B. Time Warps, String Edits, and Macromolecules: The Theory and Practice of Sequence Comparison[M]. Addison Wesley, 1983. |
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|