learning algorithm is also concerned with how to generalize the representa-
tion of that data, i.e., to find a regression or discrimination function that best
describes the data or category. There is a wide variety of algorithms and tech-
niques, and their description would easily fill up several volumes. There is no
ideal one: results usually depend on the problem that is given, the complex-
ity of implementation, and time of execution. Here, we recall some of the key
notions and concepts of machine learning.
5.1.1 Supervised, unsupervised, and reinforcement learning
When dealing with music signals and extracting perceptual information, there
is neces sarily a fair amount of ambiguity and imprecis ion (noise) in the esti-
mated data, not only due to the analysis technique, but also to the inherent
fuzziness of the perceptual information. Therefore, statistics are widely used
and will often play an important role in machine perception—a machine that
can recognize patterns grounded on our senses . If an external teacher provides
a category label or cost for each pattern (i.e., when there is specific feedback
available), the learning is said to be supervised: the learning element is given
the true output for particular inputs. It adapts its internal representation of a
correlation function to best match the information provided by the feedback.
More formally, we say that an example (or sample) is a pair (x, f(x)), where
x is the input and f(x) is the output of the function applied to x. Induc-
tion is the task that, given a collection of examples of f, returns a function h
(the hypothesis) that approximates f. Supervised learning can be incremental
(i.e., update its old hypothesis whenever a new example arrives) or based on a
representative training set of examples. One must use a large enough amount
of training samples, but one must keep some for validation of the hypothesis
function (typically around 30%).
In unsupervised learning or clustering, there is no explicit teacher, and the
system forms “natural” clusters (groupings) of the input patterns. Different
clustering algorithms may lead to different clusters, and the number of clusters
can be specified ahead of time if there is some prior knowledge of the classifi-
cation task. Finally, a third form of learning, reinforcement learning, specifies
only if the tentative classification or decision is right or wrong, which improves
(reinforces) the classifier.
For example, if our task were to classify musical instruments from listening
to their sound, in a supervised context we would first train a classifier by us-
ing a large database of sound recordings for which we know the origin. In an
unsup e rvised learning context, several clusters would be formed, hopefully rep-
resenting different instruments. With reinforcement learning, a new example
with a known target label is computed, and the result is used to improve the
classifier.
80 CHAPTER 5. LEARNING MUSIC SIGNALS