<< Chapter < Page Chapter >> Page >
ELEC 301 Project Flow Chart

Music classification by genre system diagram

Music Matcher, a collection of scripts and functions, takes a .wav file input, digitally processes it, and creates an output vector characteristic of the sample. A neural network is trained with 20 songs in each genre. Then it analyzes the new song vectors for patterns and predicts an output classification genre.

Music Matcher takes a .wav file, analyzes it, and outputs a music genre. Our system breaks up a .wav file into twenty .5 second windows. From here, the DSP functions are called for each of the twenty windows. Each one of these twenty windows is analyzed by seven DSP functions:

  • Bandwidth
  • Power Spectral Density
  • Total Power (L-2 norm / L-infinity norm)
  • Spectrogram Smoothness
  • High Pass Filter
  • Beat Detection
  • Frequency Cutoff
The values returned from each of these functions is averaged over all twenty windows to give an average value for each song as well as a standard deviation, which tells us how these qualities change over time. That way, our DSP vector has some measure of how each of the functions changed with time.

First, the neural network is trained with 120 songs, 20 of each genre. After we train the neural network, we give it songs it has never seen, and the output of the system is the classification of genre that the neural network determines.

Get Jobilize Job Search Mobile App in your pocket Now!

Get it on Google Play Download on the App Store Now




Source:  OpenStax, Music classification by genre. OpenStax CNX. Jan 22, 2004 Download for free at http://cnx.org/content/col10216/1.5
Google Play and the Google Play logo are trademarks of Google Inc.

Notification Switch

Would you like to follow the 'Music classification by genre' conversation and receive update notifications?

Ask