~ICMC 2015 - Sept. 25 - Oct. 1, 2015 - CEMI, University of North Texas Fuzzy Equalization of Musical Genres Marie Gonzalez Department of Electrical Engineering School of Engineering Pontificia Universidad Cat6lica de Chile [email protected] Patricio de la Cuadra Center for Research in Audio Technologies Music Institute Pontificia Universidad Cat6lica de Chile [email protected] Rodrigo F. Cadiz Center for Research in Audio Technologies Music Institute Pontificia Universidad Cat6lica de Chile [email protected] ABSTRACT In this article, we propose an audio equalizer of musical genres based on fuzzy logic. Widely used audio playback software, such as VLC or iTunes, propose genre-specific equalization curves to be applied for the whole duration of the music. These curves are the same for all songs belonging to a specific genre, and they do not take into account the specifics and diversity of each song. We propose a different strategy. Research in music information retrieval has revealed a significant number of audio descriptors that allow for the recognition and description of diverse musical genres. We use some of these descriptors to feed a fuzzy logic inference system, whose outputs are the required equalization levels for each frequency band. The rules of the system are derived from the analysis of a well known music database encompassing ten different musical genres. Our results indicate that our approach works for songs that exhibit multiple genre characteristics, that are difficult to classify into one category, or that mix genres. 1. INTRODUCTION Audio equalization using digital signal processors has been a subject of research for several decades. An equalizer aims for the correction of the magnitude and the phase response of an audio chain [1]. The equalization of audio is required in a great number of distinct situations such as radio broadcasting, recording studios, or music listening while driving or at home. Equalization is useful to correct for uneven frequency responses of sound reproduction systems, which essentially means static equalization curves for all genres. Consequently, it appears that equalization based on genre is unnecessary and not even desirable. This argument strengthens considering all the work and effort that mastering engineers have put into balancing the audio spectrum in the studio. However, we would Copyright: 02015 Marie Gonzilez et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License 3.0 Unported, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. like to emphasize that genre-based equalization is a common reality. Car radios and home audio playback systems contain this kind of equalization. Very popular audio software such as VLC or iTunes base their equalization curves on genre information, as it is shown in figure 1. We simply want to propose a different strategy for genre equalization and we don't have the intention at this moment of arguing against or in favor of it. These curves, although somewhat similar in different software, appear to be based mostly on experience and customized by each vendor, and we have not found scientific or widely accepted methodologies for their design. Similar curves are also present in the majority of audio reproduction equipments and car radios. Each one of these curves aims for a specific musical genre such as rock, pop, jazz or electronic. The user is supposed to select one of them when listening to a particular piece of music. However, what curves should be used for music that mix several styles, songs that contain a significant amount of musical diversity, or when the genre one is listening to does not appear in the lists of available equalizations? In order to tackle this problem, we propose a fuzzy logic-based equalization scheme that allows for a more meaningful equalization of music, including mixed musical genres or genres outside the realm of traditional music. This article is structured as follows. First, fuzzy logic is presented as discussed as a valid technique for non-linear mappings of data. Next, we present our fuzzy logic-based audio equalizer including previous analysis of the audio data, determination of fuzzy inference rules, output variables and computational implementation. In the following section, we describe one simple experiment we conducted in order to test our equalization scheme and later the main results are presented. Finally, conclusions and future work are addressed. 2. FUZZY LOGIC Fuzzy logic [2] [3] [4] [5] [6] is a concept derived from the mathematical branch of fuzzy sets [7] that applies multi-valued logic to sets or groups of objects. In a narrow sense, fuzzy logic refers to a logical system than generalizes traditional two-valued logic for reasoning under uncertainty, allowing multiple values of truth. In a broad sense, it refers to all the -134 -
Top of page Top of page