<< Chapter < Page Chapter >> Page >
Specialized DSP hardware typically uses fixed-point number representations for lower cost and complexity and greater speed. Interpretation of two's-complement binary numbers as signed fractions between -1 and 1 allows integer arithmetic to be used for DSP computations, but introduces quantization and overflow errors.

Fixed-point arithmetic is generally used when hardware cost, speed, or complexity is important. Finite-precision quantization issuesusually arise in fixed-point systems, so we concentrate on fixed-point quantization and error analysis in the remainder of this course.For basic signal processing computations such as digital filters and FFTs, the magnitude of the data, the internalstates, and the output can usually be scaled to obtain good performance with a fixed-point implementation.

Two's-complement integer representation

As far as the hardware is concerned, fixed-point number systems represent data as B -bit integers. The two's-complement number system is usually used: k binary integer representation 0 k 2 B 1 1 bit-by-bit inverse k 1 2 B 1 k 0

The most significant bit is known at the sign bit ; it is 0 when the number is non-negative; 1 when the number is negative.

Fractional fixed-point number representation

For the purposes of signal processing, we often regard the fixed-point numbers as binary fractions between -1 1 , by implicitly placing a decimal point after the sign bit.

or x b 0 i B 1 1 b i 2 i This interpretation makes it clearer how to implement digital filters in fixed-point, at least when the coefficients have amagnitude less than 1.

Truncation error

Consider the multiplication of two binary fractions

Note that full-precision multiplication almost doubles thenumber of bits; if we wish to return the product to a B -bit representation, we must truncate the B 1 least significant bits. However, this introduces truncation error (also known as quantization error , or roundoff error if the number is rounded to the nearest B -bit fractional value rather than truncated). Note that this occurs after multiplication .

Overflow error

Consider the addition of two binary fractions;

Note the occurence of wraparound overflow ; this only happens with addition . Obviously, it can be a bad problem.

There are thus two types of fixed-point error: roundoff error, associated with data quantization and multiplication, andoverflow error, associated with data quantization and additions. In fixed-point systems, one must strike a balancebetween these two error sources; by scaling down the data, the occurence of overflow errors is reduced, but the relative sizeof the roundoff error is increased.

Since multiplies require a number of additions, they are especially expensive in terms of hardware(with a complexity proportional to B x B h , where B x is the number of bits in the data, and B h is the number of bits in the filter coefficients). Designers try to minimize both B x and B h , and often choose B x B h !

Get Jobilize Job Search Mobile App in your pocket Now!

Get it on Google Play Download on the App Store Now




Source:  OpenStax, Digital filter structures and quantization error analysis. OpenStax CNX. Jan 02, 2005 Download for free at http://cnx.org/content/col10259/1.1
Google Play and the Google Play logo are trademarks of Google Inc.

Notification Switch

Would you like to follow the 'Digital filter structures and quantization error analysis' conversation and receive update notifications?

Ask