<< Chapter < Page Chapter >> Page >
The audio features used to characterize the sound signal and classify the sample by instrument.

How do we decide what parts of the spectrum are important? The CUIDADO project (2) provided a set of 72 audio features, and research1 has shown that some of the features aremore important in capturing the signal characteristics. We therefore decided to implement a small subset of thesefeatures:

Cepstral Features

  • Mel-Frequency Cepstrum Coefficients (MFCC), k = 2:13

Spectral Features

  • Slope
  • Roll-Off
  • Centroid
  • Spread
  • Skew
  • Kurtosis
  • Odd-to-Even Harmonic Energy Ratio (OER)
  • Tristimulus

Definitions

Cepstral coefficients have received a great deal of attention in the speech processing community, as they tryto extract the characteristics of the filter and model it independently of the signal being produced. This is ideal, as thefilter in our case is the instrument that we are trying to recognize. We work on a Mel scale because it more accurately modelshow the human auditory system perceives different frequencies, i.e. it gives more weight to changes at low frequencies as humans aremore adept at distinguishing low frequency changes.

The centroid correlates to the “brightness” of the sound and is often higher than expected due to the energyfrom harmonics above the fundamental frequency. The spread, skew, and kurtosis are based on the 2nd, 3rd, and 4th moments and, alongwith the slope, help portray spectral shape.

Odd-to-even harmonic energy ratio simply determines whether a sound consists primarily of odd harmonicenergy, of even harmonic energy, or whether the harmonic energy is equally spread.

The tristimulus measure energy as well and were introduced as the timbre equivalent to the color attributes ofvision. Like the OER, it provides clues regarding the distribution of harmonic energy, this time focusing on low, mid, and highharmonics rather than odd and even harmonics. This gives more weight to the first few harmonics, which are perceptually moreimportant.

How we chose features

MFCC have shown to work very well in monophonic environments, as they capture the shape of the spectrumvery effectively. Unfortunately, they are of less use in polyphonic recordings, as the MFCC captures the shape of a spectrum calculatedfrom multiple sources. Most of the work we have seen on this subject uses MFCC regardless, however. They are particularly usefulif only one instrument is playing or is relatively quite salient.

Most wind instruments have their harmonics evenly spread among the odd and even indices, but the clarinet isdistinct in that it produces spectra consisting predominantly of odd ratios, with very little even harmonics appearing at all. Thismakes sense from a physics standpoint, as when played, the clarinet becomes a closed cylinder at one end, therefore allowing only theodd harmonics to resonate. This feature was thus chosen primarily with clarinet classification in mind.

We chose the roll-off and tristimulus as our energy measures, as they were both easy to implement and judged tobe important (1) . Finally, the first four spectral moments and the spectral slope, in both perceptual and spectral models, were shownto be the top ten most important features in the same study and were therefore some of the first features added to our classificationsystem. We note that we had hoped to implement a perceptual model and thereby nearly double our features, but we could not find anaccurate filter model for the mid-ear and thus decided to forgo any features based on perceptual modeling.

For further discussion of these features, along with explicit mathematical formulas, please refer to (1) .

References

  • A.A. Livshin and X. Rodet. “Musical Instrument Identification in Continuous Recordings,” in Proc. of the 7th Int. Conference onDigital Audio Effects, Naples, Italy, October 5-8, 2004.
  • G. Peeters. “A large set of audio features for sound description (similarity and classification) in the CUIDADOproject,” 2003. URL: http://www.ircam.fr/anasyn/peeters/ARTICLES/Peeters_2003_cuidadoaudiofeatures.pdf.

Questions & Answers

what is mutation
Janga Reply
what is a cell
Sifune Reply
how is urine form
Sifune
what is antagonism?
mahase Reply
classification of plants, gymnosperm features.
Linsy Reply
what is the features of gymnosperm
Linsy
how many types of solid did we have
Samuel Reply
what is an ionic bond
Samuel
What is Atoms
Daprince Reply
what is fallopian tube
Merolyn
what is bladder
Merolyn
what's bulbourethral gland
Eduek Reply
urine is formed in the nephron of the renal medulla in the kidney. It starts from filtration, then selective reabsorption and finally secretion
onuoha Reply
State the evolution relation and relevance between endoplasmic reticulum and cytoskeleton as it relates to cell.
Jeremiah
what is heart
Konadu Reply
how is urine formed in human
Konadu
how is urine formed in human
Rahma
what is the diference between a cavity and a canal
Pelagie Reply
what is the causative agent of malaria
Diamond
malaria is caused by an insect called mosquito.
Naomi
Malaria is cause by female anopheles mosquito
Isaac
Malaria is caused by plasmodium Female anopheles mosquitoe is d carrier
Olalekan
a canal is more needed in a root but a cavity is a bad effect
Commander
what are pathogens
Don Reply
In biology, a pathogen (Greek: πάθος pathos "suffering", "passion" and -γενής -genēs "producer of") in the oldest and broadest sense, is anything that can produce disease. A pathogen may also be referred to as an infectious agent, or simply a germ. The term pathogen came into use in the 1880s.[1][2
Zainab
A virus
Commander
Definition of respiration
Muhsin Reply
respiration is the process in which we breath in oxygen and breath out carbon dioxide
Achor
how are lungs work
Commander
where does digestion begins
Achiri Reply
in the mouth
EZEKIEL
what are the functions of follicle stimulating harmones?
Rashima Reply
stimulates the follicle to release the mature ovum into the oviduct
Davonte
what are the functions of Endocrine and pituitary gland
Chinaza
endocrine secrete hormone and regulate body process
Achor
while pituitary gland is an example of endocrine system and it's found in the Brain
Achor
what's biology?
Egbodo Reply
Biology is the study of living organisms, divided into many specialized field that cover their morphology, physiology,anatomy, behaviour,origin and distribution.
Lisah
biology is the study of life.
Alfreda
Biology is the study of how living organisms live and survive in a specific environment
Sifune
Got questions? Join the online conversation and get instant answers!
Jobilize.com Reply

Get Jobilize Job Search Mobile App in your pocket Now!

Get it on Google Play Download on the App Store Now




Source:  OpenStax, Elec 301 projects fall 2005. OpenStax CNX. Sep 25, 2007 Download for free at http://cnx.org/content/col10380/1.3
Google Play and the Google Play logo are trademarks of Google Inc.

Notification Switch

Would you like to follow the 'Elec 301 projects fall 2005' conversation and receive update notifications?

Ask