SPL MRPP PhaVoRIT Traktor Pitch'n Time MPEX-2
mean score 5,5573 5,7704 6,4262 6,2622 6,4590 2,3278
std. deviation 3,0029 2,6101 1,8926 2,5684 2,5597 2,1191
per-song mean values comments
Lovesong 1,7 2 7,7 8,2 7 5,1 rapid transients
Smooth Sailing 4,3 4,8 4,8 9 8,1 1,6 sharp transients
Radetzky March 8,4545 8,3636 6,1818 4,7272 4,1818 1,7272 dense arrangement
Narcotic 6 6,1 6,7 6 7,2 1,3 duet vocals
Poison 4,2 5,2 6,6 6,2 7,4 3 noisy characteristic
Kl. Nachtmusik 8,4 7,9 6,6 3,6 5,1 1,3 no transients
Table 2: List of the mean grades for each algorithm. Grades are from 1 (worst) to 10 (best).
Maestro! interactive conducting exhibit. The audio quality
of PhaVoRIT was assessed in a formal user listening test and
has been found to be comparable to some of the best commercially available time-stretching systems.
References
Auger, F. and P. Flandrin (1995, May). Improving the readablility of time-frequency and time-scale representations by the
reassignment method. In IEEE Transactions on Signal Processing, Volume 43, pp. 1068-89.
Bernsee, S. M. (2005, June). The DSP dimension.
Bonada, J. (2000). Automatic technique in frequency domain for
near-lossless time-scale modification of audio. In Proceedings of International Computer Music Conference.
Borchers, J., E. Lee, W. Samminger, and M. Mtihlhduser (2004,
March). Personal orchestra: A real-time audio/video system
for interactive conducting. ACM Multimedia Systems Journal Special Issue on Multimedia Software Engineering 9(5),
458-465.
Dolson, M. (1986). The phase vocoder: A tutorial. Computer Music Journal 1(4), 14-27.
Duxbury, C., M. Davies, and M. Sandler (2001, December). Separation of transient information in musical audio using multiresolution analysis techniques. In Proceedings of the COST
G-6 Conference on Digital Audio Effects (DAFX-01), Limerick, Ireland.
Flanagan, J. L. and R. M. Golden (1966, November). Phase
vocoder. The Bell System Technical Journal 45, 1493-1509.
Garas, J. and P. C. Sommen (1998). Time/pitch scaling using
the constant-Q phase vocoder. In Proceedings of STW's 1998
workshops CSSP98 and SAFE98, pp. 173-176.
Hammer, F. (2001). Time-scale modification using the phase
vocoder. Master's thesis, Institute for Electronic Music and
Acoustics (JEM), Graz University of Music and Dramatic
Arts.
Karrer, T. (2005). Phavorit - a phase vocoder for real-time interactive time-stretching. Master's thesis, RWTH Aachen University.
Laroche, J. and M. Dolson (1997). Phase vocoder: About this
phasiness business. In Proceedings of IEEE ASSP Workshop
on application of signal processing to audio and acoustics,
New Paltz, NY.
Laroche, J. and M. Dolson (1999, May). Improved phase vocoder
time-scale modification of audio. In IEEE Transactions on
Speech and Audio Processing, Volume 7, pp. 323-332.
Lee, E., T. Karrer, and J. Borchers (2006). Towards a framework
for interactive systems to conduct digital audio and video
streams. Computer Music Journal 30(1). To appear.
Lee, E., H. Kiel, S. Dedenbach, I. Gruell, T. Karrer, M. Wolf,
and J. Borchers (2006, April). iSymphony: An adaptive interactive orchestral conducting system for conducting digital audio and video streams. In Extended Abstracts of CHI
2006 Conference on Human Factors in Computing Systems,
Montr6al, Canada. ACM Press.
Lee, E., T. M. Nakra, and J. Borchers (2004, June). You're the
conductor: A realistic interactive conducting system for children. In NIME 2004 International Conference on New Interfaces for Musical Expression, Hamamatsu, Japan, pp. 68-73.
Levine, S. N. and J. 0. Smith III(1998). A sines+transients+noise
audio representation for data compression and time/pitch
scale modications. In 105th Audio Engineering Society Convention, San Francisco.
Masri, P. (1996). Computer Modelling of Sound for Transformation and Synthesis of Musical Signals. Ph. D. thesis, University of Bristol.
Masri, P. and A. Bateman (1996). Improved modelling of attack
transients in music analysis-resynthesis. In Proceedings of
the International Computer Music Conference.
McAulay, R. J. and T. F. Quatieri (1986, August). Speech analysis/synthesis based on a sinusoidal representation. In IEEE
Transactions on Acoustics, Speech, and Signal Processing,
Volume 34, pp. 744-754.
Puckette, M. (1995). Phase-locked vocoder. In IEEE ASSP Conference on Applications of Signal Processing to Audio and
Acoustics, Mohonk, New York.
Ribel, A. (2003). Transient detection and preservation in the
phase vocoder. In Proceedings of the Int. Computer Music
Conference (ICMC'03), Singapore, pp. 247-250.
715