0
0.2
0.4
0.6
0.8
1
0
1
2
3
4
5
0
0.2
0.4
0.6
0.8
1
Time Level
Motif Level
Semantic Level
Time Level
Fig. 5. Distance diagram.
similarities. The medley of different motifs from “Auld Lang Syne”, and “Oh, when the
saints” is situated in between these key points. Running a number of experiments using
different songs from diverse genres proved the general applicability of the techniques
we have introduced. Thus, our semantic-based approach to comparing and clustering
different pieces of music which can be variations of the same composition succeeds.
Future work should focus on adopting the methods presented in this paper to sub-
symbolic audio data using hypotheses-based recognition ([8]). Being able to detect dif-
ferent instances of the same song in whatever representation shall finally provide us with
the required techniques to develop a music retrieval system that implements a cluster
index-based access to its document repository. Hence, these techniques will assure that
diverse queries considering the semantics of music will be processable efficiently.
References
1. Cano, P., Batlle, E., Kalker T., Haitsma J.: A Review of Algorithms for Audio Fingerprinting.
In: International Workshop on Multimedia Signal Processing, US Virgin Islands (2002)
2. Dowling, W. J.: Scale and contour: Two components of a theory of memory for melodies. In:
Psychological Review, p. 341 - 354 (1978)
3. Haitsma, J., Kalker, T.: A Highly Robust Audio Fingerprinting System. ISMIR, Paris, France
(2002)
4. Neve, G., Orio, N.: Indexing and Retrieval of Music Documents trought Pattern Analysis and
Data Fusion Techniques. ISMIR, Barcelona, Spain (2004)
5. Nishimura, T. et al.: Music Signal Spotting Retrieval by a Humming Query Using Start
Frame Feature Dependent Continuous Dynamic Programming. In: Music Information Re-
trieval, p. 211 - 218, Bloomington, Indiana (2001)
6. Rentzsch, M.: Entwicklung und Implementierung von Vergleichsoperationen f
¨
ur symboli-
sche Tondokumente, Diploma Thesis. Chemnitz University of Technology, Germany (2005)
7. Seifert, F.: Musikalische Datenbanken, Dissertation. Chemnitz University of Technology,
Germany (2004)
8. Seifert, F.: Prediction-Driven Correlation of Audio with Generic Music Templates.
EuroIMSA, Grindelwald, Switzerland (2005)
9. Shapiro, L. G., Haralick, R. M.: A metric for comparing relational descriptions. In: IEEE
Trans PAMI, p. 90 - 94 (1985)
10. Song, J., Bae, S. Y., Yoon, K.: Mid-Level Music Melody Representation of Polyphonic Audio
for Query-by-Humming System. ISMIR, Paris, France (2002)
11. Youngmoo, K., Wei C., Ricardo, G., Barry, V.: Analysis of a contour-based representation for
melody. Int. Symposium on Music Information Retrieval, Plymouth, Massachusetts (2000)
180