Transform coding is described and an analysis is performed for the simple 2-dimensional case, including a comparison to PCM.
  • In transform coding (TC), blocks of N input samples are transformed into N transform coefficients, which are then quantized and transmitted. At the decoder, an inverse transform is applied to the quantized coefficients, yielding a reconstruction of the original waveform. By designing individual quantizers in accordance with the statistics of their inputs, it is possible to allocate bits in a more optimal manner, e.g., encoding the “more important” coefficients at a higher bit rate.
    [Figure: block diagram of the transform coder/decoder. The input block x_0, ..., x_(N-1) enters an N × N transform, producing coefficients y_0, y_1, ..., y_(N-1). Each coefficient passes through a scalar quantizer Q, yielding ytilde_0, ytilde_1, ..., ytilde_(N-1), which are transmitted. At the decoder, the quantized coefficients enter an inverse transform, producing the reconstruction xtilde_0, ..., xtilde_(N-1).]
    N × N Transform Coder/Decoder with Scalar Quantization
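The coder/decoder chain in the figure can be sketched in a few lines of NumPy. This is a minimal illustration, not the text's implementation: the uniform mid-tread quantizer and step size below are assumptions chosen just to make the pipeline concrete.

```python
import numpy as np

def transform_code(x, T, quantize):
    """One length-N block: transform, scalar-quantize, inverse-transform."""
    y = T @ x                          # N transform coefficients
    y_tilde = quantize(y)              # quantize each coefficient separately
    return np.linalg.inv(T) @ y_tilde  # reconstruction x_tilde

# toy 2x2 orthogonal transform and a uniform mid-tread quantizer (assumed)
T = np.array([[1.0, 1.0], [1.0, -1.0]]) / np.sqrt(2.0)
step = 0.25
quantize = lambda y: step * np.round(y / step)

x = np.array([0.9, 0.7])
x_tilde = transform_code(x, T, quantize)
```

Because the only lossy step is the quantizer, `x_tilde` matches `x` up to an error on the order of the step size.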
  • Orthogonal Transforms:   From our perspective, an N × N “transform” will be any real-valued linear operation taking N input samples to N output samples, or transform coefficients. This operation can always be written in matrix form
    \[ y(m) = T\,x(m), \qquad T \in \mathbb{R}^{N \times N} \]
    where \( x(m) \) and \( y(m) \) are vectors representing \( N \times 1 \) blocks of input/output elements:
    \begin{align}
    x(m) &= \big[\, x(mN),\; x(mN-1),\; \ldots,\; x(mN-N+1) \,\big]^t \\
    y(m) &= \big[\, y(mN),\; y(mN-1),\; \ldots,\; y(mN-N+1) \,\big]^t.
    \end{align}
    Intuition comes from considering the transform's basis vectors \( \{t_k\} \) defined by the rows of the matrix
    \[ T = \begin{bmatrix} t_0^t \\ t_1^t \\ \vdots \\ t_{N-1}^t \end{bmatrix} \]
    since the coefficient \( y_k = t_k^t x \) can be thought of as the result of a “comparison” between the \(k\)-th basis vector and the input \( x \). These comparisons are defined by the inner product \( \langle t_k, x \rangle = t_k^t x \), which has a geometrical interpretation involving the angle \( \theta_k \) between the vectors \( t_k \) and \( x \):
    \[ \langle t_k, x \rangle = \cos(\theta_k)\, \|t_k\|_2\, \|x\|_2. \]
    When the vectors \( \{t_k\} \) are mutually orthogonal, i.e., \( t_k^t t_l = 0 \) for \( k \neq l \), the transform coefficients represent separate, unrelated features of the input. This property is convenient if the transform coefficients are independently quantized, as is typical in TC schemes.
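The orthogonality of the basis vectors, and the inner-product interpretation of the coefficients, are easy to check numerically. A short sketch, using the 2 × 2 transform from later in this section as the example:

```python
import numpy as np

# rows of T are the basis vectors t_k; orthonormality means T T^t = I
T = np.array([[1.0, 1.0], [1.0, -1.0]]) / np.sqrt(2.0)
G = T @ T.T   # Gram matrix of inner products <t_k, t_l>

# each coefficient y_k = <t_k, x> "compares" x against basis vector t_k;
# the angle theta_k follows from <t_k, x> = cos(theta_k) ||t_k|| ||x||
x = np.array([3.0, 1.0])
y = T @ x
theta0 = np.arccos(y[0] / (np.linalg.norm(T[0]) * np.linalg.norm(x)))
```

Here `G` comes out as the identity (so \( T^{-1} = T^t \)), and `theta0` is the angle between \( x = (3, 1) \) and the 45-degree basis vector \( t_0 \).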

    2 × 2 Transform coder

    Say that a stationary zero-mean Gaussian source \( x(m) \) has autocorrelation \( r_x(0) = 1 \), \( r_x(1) = \rho \), and \( r_x(k) = 0 \) for \( k > 1 \). For a bit rate of \( R \) bits per sample, uniformly quantized PCM implies a mean-squared reconstruction error of

    \[ \sigma_r^2 \big|_{\mathrm{PCM}} = \frac{\Delta^2}{12} \bigg|_{\substack{\Delta = 2 x_{\max}/L \\ L = 2^R}} = \frac{1}{3} \frac{x_{\max}^2}{\sigma_x^2}\, \sigma_x^2\, 2^{-2R} = \gamma_x\, \sigma_x^2\, 2^{-2R}. \]
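The PCM error formula can be sanity-checked by simulation. The sketch below assumes a unit-variance Gaussian source and a 4-sigma quantizer loading \( x_{\max} = 4\sigma_x \) (the loading factor is an assumption, not specified above); the simulated MSE should land close to \( \gamma_x \sigma_x^2 2^{-2R} = \Delta^2/12 \), up to overload effects in the clipped tails.

```python
import numpy as np

rng = np.random.default_rng(0)
R = 6                    # bits per sample
sigma2 = 1.0             # source variance
x_max = 4.0              # overload point: 4-sigma loading (assumed)
L = 2 ** R               # number of quantizer levels
Delta = 2 * x_max / L    # uniform step size

x = rng.standard_normal(200_000)
x_clipped = np.clip(x, -x_max, x_max)
# uniform mid-rise quantizer: reconstruction at cell midpoints
x_tilde = Delta * (np.floor(x_clipped / Delta) + 0.5)
mse_sim = np.mean((x_tilde - x) ** 2)

gamma_x = x_max ** 2 / (3 * sigma2)
mse_formula = gamma_x * sigma2 * 2.0 ** (-2 * R)   # = Delta^2 / 12
```

With 200,000 samples the simulated and predicted errors agree to within a few percent.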

    For transform coding, say we choose linear transform

    \[ T = \begin{bmatrix} t_0^t \\ t_1^t \end{bmatrix} = \frac{1}{\sqrt{2}} \begin{bmatrix} 1 & 1 \\ 1 & -1 \end{bmatrix}. \]

    Setting \( x(m) = \big[\, x(2m),\; x(2m-1) \,\big]^t \) and \( y(m) = T x(m) \), we find that the transformed coefficients have variances

    \begin{align}
    \sigma_{y_0}^2 &= E\big\{ |t_0^t x(m)|^2 \big\} = \tfrac{1}{2} E\big\{ |x(2m) + x(2m-1)|^2 \big\} = \tfrac{1}{2} \big( 2 r_x(0) + 2 r_x(1) \big) = 1 + \rho \\
    \sigma_{y_1}^2 &= E\big\{ |t_1^t x(m)|^2 \big\} = \tfrac{1}{2} E\big\{ |x(2m) - x(2m-1)|^2 \big\} = \tfrac{1}{2} \big( 2 r_x(0) - 2 r_x(1) \big) = 1 - \rho
    \end{align}
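These coefficient variances can be verified by drawing sample blocks directly. A minimal sketch: adjacent samples \( [x(2m),\, x(2m-1)] \) have covariance matrix \( \begin{bmatrix} 1 & \rho \\ \rho & 1 \end{bmatrix} \) under the stated autocorrelation, so we can sample blocks from that Gaussian and measure the variances of the transformed coefficients.

```python
import numpy as np

rho = 0.8
rng = np.random.default_rng(1)

# each row is one block [x(2m), x(2m-1)] with the source's covariance
C = np.array([[1.0, rho], [rho, 1.0]])
X = rng.multivariate_normal([0.0, 0.0], C, size=200_000)

T = np.array([[1.0, 1.0], [1.0, -1.0]]) / np.sqrt(2.0)
Y = X @ T.T                 # y(m) = T x(m), applied to every block
var_y = Y.var(axis=0)       # empirical [sigma_y0^2, sigma_y1^2]
```

For \( \rho = 0.8 \), `var_y` comes out close to \( [1+\rho,\, 1-\rho] = [1.8,\, 0.2] \).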

    and using uniformly-quantized PCM on each coefficient we get mean-squared reconstruction errors

    \begin{align}
    \sigma_{q_0}^2 &= (1+\rho)\, \gamma_x\, 2^{-2R_0} \\
    \sigma_{q_1}^2 &= (1-\rho)\, \gamma_x\, 2^{-2R_1}.
    \end{align}

    We use the same quantizer performance factor \( \gamma_x \) as before since linear operations preserve Gaussianity.
    For orthogonal matrices \( T \), i.e., \( T^{-1} = T^t \), we can show that the mean-squared reconstruction error \( \sigma_r^2 \) equals the mean-squared quantization error:

    \begin{align}
    \sigma_r^2 &:= \frac{1}{N} \sum_{k=0}^{N-1} E\big\{ |\tilde{x}(Nm-k) - x(Nm-k)|^2 \big\} \qquad (\text{here } N = 2) \\
    &= \frac{1}{N} E\big\{ \|\tilde{x}(m) - x(m)\|^2 \big\} \\
    &= \frac{1}{N} E\big\{ \|T^{-1} \tilde{y}(m) - x(m)\|^2 \big\} \\
    &= \frac{1}{N} E\big\{ \big\|T^{-1} \big( y(m) + q(m) \big) - x(m)\big\|^2 \big\} \\
    &= \frac{1}{N} E\big\{ \|T^{-1} T x(m) + T^{-1} q(m) - x(m)\|^2 \big\} \\
    &= \frac{1}{N} E\big\{ \|T^{-1} q(m)\|^2 \big\} \\
    &= \frac{1}{N} E\big\{ q^t(m) \underbrace{(T^{-1})^t T^{-1}}_{I}\, q(m) \big\} \\
    &= \frac{1}{N} E\big\{ \|q(m)\|^2 \big\} = \frac{1}{N} \sum_{k=0}^{N-1} \sigma_{q_k}^2.
    \end{align}
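The key step in this derivation, that an orthogonal \( T^{-1} \) preserves the energy of the quantization error vector, can be demonstrated numerically. A small sketch, treating the coefficient-domain errors as random vectors:

```python
import numpy as np

T = np.array([[1.0, 1.0], [1.0, -1.0]]) / np.sqrt(2.0)
rng = np.random.default_rng(2)

q = rng.standard_normal((100_000, 2))  # coefficient-domain errors q(m), one per row
r = q @ np.linalg.inv(T).T             # reconstruction errors T^{-1} q(m)

# orthogonality of T means ||T^{-1} q(m)||^2 = ||q(m)||^2 for every block
energy_q = (q ** 2).sum(axis=1)
energy_r = (r ** 2).sum(axis=1)
```

The two energy arrays match block by block, not just on average, which is exactly why \( \sigma_r^2 \) reduces to the mean of the \( \sigma_{q_k}^2 \).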

    Since our \( 2 \times 2 \) matrix is indeed orthogonal, we have mean-squared reconstruction error

    \[ \sigma_r^2 \big|_{\mathrm{TC}} = \frac{1}{2} \Big( (1+\rho)\, \gamma_x\, 2^{-2R_0} + (1-\rho)\, \gamma_x\, 2^{-2R_1} \Big) \]

    at a bit rate of \( R_0 + R_1 \) bits per two samples. Comparing TC to PCM at equal bit rates (i.e., \( R_0 + R_1 = 2R \)),

    \[ \frac{\sigma_r^2 \big|_{\mathrm{TC}}}{\sigma_r^2 \big|_{\mathrm{PCM}}} = \frac{\tfrac{1}{2} \Big( (1+\rho)\, \gamma_x\, 2^{-2R_0} + (1-\rho)\, \gamma_x\, 2^{-2(2R-R_0)} \Big)}{\gamma_x\, 2^{-2R}} = (1+\rho)\, 2^{2(R-R_0)-1} + (1-\rho)\, 2^{2(R_0-R)-1}. \]
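The final ratio is easy to explore as a function of the bit allocation. A minimal sketch (the function name `tc_over_pcm` is ours, not the text's):

```python
def tc_over_pcm(R0, R, rho):
    """Ratio of TC to PCM mean-squared reconstruction error (2x2 case)."""
    return ((1 + rho) * 2.0 ** (2 * (R - R0) - 1)
            + (1 - rho) * 2.0 ** (2 * (R0 - R) - 1))

# equal allocation R0 = R1 = R recovers exactly the PCM error (ratio = 1)
ratio_equal = tc_over_pcm(R0=1.0, R=1.0, rho=0.8)

# giving the stronger coefficient more bits beats PCM when rho > 0
ratio_skewed = tc_over_pcm(R0=1.5, R=1.0, rho=0.8)   # = 0.45 + 0.2 = 0.65
```

Minimizing the ratio over \( R_0 \) (set the derivative in \( u = 2^{2(R_0-R)} \) to zero) gives an optimum at \( 2^{2(R_0-R)} = \sqrt{(1+\rho)/(1-\rho)} \), where the ratio equals \( \sqrt{1-\rho^2} \): about 0.6 for \( \rho = 0.8 \), consistent with the curve minima in the figure below.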

    The figure below shows that (i) allocating a higher bit rate to the quantizer with the stronger input signal can reduce the average reconstruction error relative to PCM, and (ii) the gain over PCM is higher when the input signal exhibits stronger correlation \( \rho \). Also note that when \( R_0 = R_1 = R \), there is no gain over PCM, verifying the fact that \( \sigma_r^2 = \sigma_q^2 \) when \( T \) is orthogonal.

    [Figure: two plots of the error ratio \( (\sigma_r^2|_{\mathrm{TC}})/(\sigma_r^2|_{\mathrm{PCM}}) \) versus \( R_0/R \), for \( \rho = 0.8 \) (left) and \( \rho = 0.2 \) (right), each showing curves for \( R = 1 \) and \( R = 2 \). Each curve dips to a minimum below 1 at an allocation with \( R_0 > R \); the dip is much deeper for \( \rho = 0.8 \) (minimum near 0.6) than for \( \rho = 0.2 \) (minimum near 1).]
    Ratio of TC to PCM mean-squared reconstruction errors versus bit rate \( R_0 \) for two values of \( \rho \).





Source:  OpenStax, An introduction to source-coding: quantization, dpcm, transform coding, and sub-band coding. OpenStax CNX. Sep 25, 2009 Download for free at http://cnx.org/content/col11121/1.2