<< Chapter < Page Chapter >> Page >
This module introduces the concepts of mean, variance, and histograms. This work was done as part of the TaTGAP program, a VIGRE PFUG that pairs Rice students with local high school students. VIGRE is a program of Vertically Integrated Grants for Research and Education in the Mathematical Sciences under the direction of the National Science Foundation. A PFUG is a group of Postdocs, Faculty, Undergraduates and Graduate students formed around the study of a common problem.

By the end of this module, you should be able to analyze large data sets.

Key Concepts:

  1. EEG
  2. Mean
  3. Variance
  4. Standard Deviation
  5. Histogram

Motivation from epilepsy

An EEG (electroencephalogram) is a tool that tracks and records electrical activity (voltage) in the brain. Data from an EEG can be used to diagnose and monitor seizure disorders.

Exercise 1.1

Go to the webpage EEG in Common Epilepsy Syndromes . This page shows the EEG signals recorded from patients with various types of epilepsy. What do you notice about these signals, particularly media files 3, 5, 6, 9, and 10? Can you spot the period in the data where the seizures occur?

While in some cases, it may be easy to recognize a seizure from a plot of EEG data, the EEG itself produces a large set of numbers corresponding to the voltage at discrete points in time. If the sampling rate were 250 Hz, then the EEG would record the voltage every 4 ms. This means that to record the activity in the brain for 10 minutes, the EEG would give 150,000 voltage readings. It would be very difficult to look at such a large set of data and make conclusions about what types of seizures, if any, are present. This module will give you the basic tools needed to analyze large sets of data, such as the data taken from an EEG. Below is a summary of the concepts you will learn to compute in this module.

Summary of Key Concepts

Term Matlab command Definition
Mean mean Average number
Variance var Measure of the spread of the data
Standard Deviation std Square root of the variance
Histogram hist Proportion of numbers that falls within given intervals

Mean

The mean of a set of numbers is the average. If x is a vector of n numbers, then the mean of x is given by

x ¯ = 1 n k = 1 n x k = x 1 + x 2 + . . . + x n n .

In Matlab, the mean of the vector x can be computed by typing mean(x).

Example 2.1

Let x = [1, 7, 2, 5, 9, 6]. Then, x ¯ = ( 1 + 7 + 2 + 5 + 9 + 6 ) / 6 = 5 .

Example 2.2

Suppose you are interviewing for a job where the employees make the following daily salaries:

  • 16 general employees: $100 each
  • 3 managers: $900 each
  • 1 owner: $1700

Then, the mean daily salary is given by

x ¯ = 16 · 100 + 3 · 900 + 1700 16 + 3 + 1 = 6000 / 20 = 300 .

Notice that the mean is $300, which is 3 times the daily salary of 80% of the workers! Thus, if you are simply told the mean of the daily salaries, you will not have an accurate idea of how much money you would be making if you got the job. The variance and standard deviation are two measures that can give you an idea of how well the mean represents the data.

Exercise 2.1

Compute the mean of the vector y = [3, 8, 2, 5, 5, 7], both on paper and using Matlab.

Get Jobilize Job Search Mobile App in your pocket Now!

Get it on Google Play Download on the App Store Now




Source:  OpenStax, The art of the pfug. OpenStax CNX. Jun 05, 2013 Download for free at http://cnx.org/content/col10523/1.34
Google Play and the Google Play logo are trademarks of Google Inc.

Notification Switch

Would you like to follow the 'The art of the pfug' conversation and receive update notifications?

Ask