0.9 Lab 7a - discrete-time random processes (part 1) (Page 5/5)

Page 5 / 5

Notice that we cannot form a density estimate by simply differentiating the empirical CDF, since this function contains discontinuities at thesample locations $X_{i}$ . Rather, we need to estimate the probability that a random variable willfall within a particular interval of the real axis. In this section, we will describe a common method known as the histogram .

The histogram

Our goal is to estimate an arbitrary probability density function, $f_{X} (x)$ , within a finite region of the $x$ -axis. We will do this by partitioning the region into $L$ equally spaced subintervals, or “bins”,and forming an approximation for $f_{X} (x)$ within each bin. Let our region of support start at the value $x_{0}$ , and end at $x_{L}$ . Our $L$ subintervals of this region will be $[x_{0}, x_{1}]$ , $(x_{1}, x_{2}]$ , ..., $(x_{L - 1}, x_{L}]$ . To simplify our notation we will define $b i n (k)$ to represent the interval $(x_{k - 1}, x_{k}]$ , $k = 1, 2, \dots, L$ , and define the quantity $Δ$ to be the length of each subinterval.

\begin{matrix} b i n (k) & = & (x_{k - 1}, x_{k}] k = 1, 2, \dots, L \\ Δ & = & \frac{x_{L} - x_{0}}{L} \end{matrix}

We will also define $\tilde{f} (k)$ to be the probability that $X$ falls into $b i n (k)$ .

\begin{matrix} \tilde{f} (k) & = & P (X \in b i n (k)) \\ = & \int_{x_{k - 1}}^{x_{k}} f_{X} (x) d x \end{matrix}

\begin{matrix} \approx & f_{X} (x) Δ for x \in b i n (k) \end{matrix}

The approximation in [link] only holds for an appropriately small bin width $Δ$ .

Next we introduce the concept of a histogram of a collection of i.i.d. random variables ${X_{1}, X_{2}, \dots, X_{N}}$ . Let us start by defining a function that will indicate whether ornot the random variable $X_{n}$ falls within $b i n (k)$ .

I_{n} (k) = \{\begin{matrix} 1, & if X_{n} \in b i n (k) \\ 0, & if X_{n} \notin b i n (k) \end{matrix})

The histogram of $X_{n}$ at $b i n (k)$ , denoted as $H (k)$ , is simply the number of random variables that fall within $b i n (k)$ . This can be written as

H (k) = \sum_{n = 1}^{N} I_{n} (k) .

We can show that the normalized histogram, $H (k) / N$ , is an unbiased estimate of the probability of $X$ falling in $b i n (k)$ . Let us compute the expected value of the normalized histogram.

\begin{matrix} E [\frac{H (k)}{N}] & = & \frac{1}{N} \sum_{n = 1}^{N} E [I_{n} (k)] \\ = & \frac{1}{N} \sum_{n = 1}^{N} {1 \cdot P (X_{n} \in b i n (k)) + 0 \cdot P (X_{n} \notin b i n (k))} \\ = & \tilde{f} (k) \end{matrix}

The last equality results from the definition of $\tilde{f} (k)$ , and from the assumption that the $X_{n}$ 's have the same distribution. A similar argument may be used to show that the variance of $H (k)$ is given by

\begin{matrix} V a r [\frac{H (k)}{N}] = \frac{1}{N} \tilde{f} (k) (1 - \tilde{f} (k)) . \end{matrix}

Therefore, as $N$ grows large, the bin probabilities $\tilde{f} (k)$ can be approximated by the normalized histogram $H (k) / N$ .

\tilde{f} (k) \approx \frac{H (k)}{N}

Using [link] , we may then approximate the density function $f_{X} (x)$ within $b i n (k)$ by

f_{X} (x) \approx \frac{H (k)}{N Δ} for x \in b i n (k) .

Notice this estimate is a staircase function of $x$ which is constant over each interval $b i n (k)$ . It can also easily be verified that this density estimate integrates to 1.

Exercise

Let $U$ be a uniformly distributed random variable on the interval [0,1]with the following cumulative probability distribution, $F_{U} (u)$ :

F_{U} (u) = \{\begin{matrix} 0, & if u < 0 \\ u, & if 0 \leq u \leq 1 \\ 1, & if u > 1 \end{matrix})

We can calculate the cumulative probability distribution for the new random variable $X = U^{\frac{1}{3}}$ .

\begin{matrix} F_{X} (x) & = & P (X \leq x) \\ = & P (U^{\frac{1}{3}} \leq x) \\ = & P (U \leq x^{3}) \\ = & {(F_{U}, (u)|}_{u = x^{3}} \\ = & \{\begin{matrix} 0, & if x < 0 \\ x^{3}, & if 0 \leq x \leq 1 \\ 1, & if x > 1 \end{matrix}) \end{matrix}

Plot $F_{X} (x)$ for $x \in [0, 1]$ . Also, analytically calculate the probability density $f_{X} (x)$ , and plot it for $x \in [0, 1]$ .

Using $L = 20$ , $x_{0} = 0$ and $x_{L} = 1$ , use Matlab to compute $\tilde{f} (k)$ , the probability of $X$ falling into $b i n (k)$ .

Hint Use the fact that

\tilde{f} (k) = F_{X} (x_{k}) - F_{X} (x_{k - 1})

Plot

\tilde{f} (k)

for

k = 1, \dots, L

using the stem function.

Inlab report

Submit your plots of $F_{X} (x)$ , $f_{X} (x)$ and $\tilde{f} (k)$ . Use stem to plot $\tilde{f} (k)$ , and put all three plots on a single figure using subplot .
Show (mathematically) how $f_{X} (x)$ and $\tilde{f} (k)$ are related.

Generate 1000 samples of a random variable $U$ that is uniformly distributed between 0 and 1 (using the rand command). Then form the random vector $X$ by computing $X = U^{\frac{1}{3}}$ .

Use the Matlab function hist to plot a normalized histogram for your samples of $X$ , using 20 bins uniformly spaced on the interval $[0, 1]$ .

Hint Use the Matlab command H=hist(X,(0.5:19.5)/20) to obtain the histogram, and then normalize H .

Use the stem command to plot the normalized histogram

H (k) / N

and

\tilde{f} (k)

together on the same figure using subplot .

Inlab report

Submit your two stem plots of $H (k) / N$ and $\tilde{f} (k)$ . How do these plots compare?
Discuss the tradeoffs (advantages and the disadvantages) between selecting a very large or very small bin-width.

<< Chapter < Page Page > Chapter >>

Read also:

Get Jobilize Job Search Mobile App in your pocket Now!

100% Free Mobile Applications
Receive real-time job alerts and never miss the right job again

Source: OpenStax, Purdue digital signal processing labs (ece 438). OpenStax CNX. Sep 14, 2009 Download for free at http://cnx.org/content/col10593/1.4

Google Play and the Google Play logo are trademarks of Google Inc.

Notification Switch

Would you like to follow the 'Purdue digital signal processing labs (ece 438)' conversation and receive update notifications?

Ask

©flickr: Abraham	Multicellular Organisms Test By Monty Hartfield Start Test
	NCE Ch 10 Professional Orientation By Anh Dao Start Quiz
	Nutrition and Chronic Disease- Test 2 By Madison Christian Start Flashcards
	36 Biology 36 Sensory Systems MCQ By OpenStax Start Quiz
	37 Biology 37 The Endocrine System MCQ By OpenStax Start Quiz
	14 AP 14 Brain Cranial Nerves MCQ By OpenStax Start Quiz
	MAPEH Test G9 (Physical Education) By Angelica Lito Start Quiz
	7 Microbiology Unit 3 By Madison Christian Start Test
	12 AP 12 Nervous System Essay By OpenStax Start Flashcards
	1 Physiotherapy Flashcards Set 1 By Rhodes Start Flashcards