Implementation of Power Efficient Approximate Multiplier

J.Anjana, A.Chitra, K.Shanthalakshmi
Department of ECE, Adhiyamaan College of Engineering, Hosur, India,
Department of ECE, Adhiyamaan College of Engineering, Hosur, India,
Asst Professsor, Departmet of ECE, Adhiyamaan College of Engineering, Hosur, India,

Abstract: Low power is the most important requirement for multimedia devices including various digital signal processing algorithms. In most multimedia signal processing, the final output is disturbed by human senses, which are not accurate. This is the reason for the need to produce exactly correct outputs. Previous study in this process utilizes error reduction through voltage over scaling, utilizing algorithmic and architectural techniques. In this paper, we propose logic complexity reduction by varying partial products to improve the accuracy. We reveal this concept by proposing various type approximate half , full adder and compressor with reduced number of the transistor, and then use in the process of designing approximate multipliers. In addition to this our process usually results in significantly low power dissipation and reduced actual time. We design architectures for both compressions of video and image algorithms using the proposed approximate multipliers. This simulations results a large power savings and area savings with a very little loss in output quality of image, when it is compared to existing system.

Key Terms: Approximate multiplier, partial products, Low-power.

I. Introduction

The idea for modern VLSI design and digital Embedded Systems is the ability of VLSI design and embedded systems to create an impact to the environment[1][2]. The purpose for creating power efficient VLSI architectures in the approximate multipliers is not only to reduce the power dissipation but also to improve the approximation value[3].

In order to improve power efficiency of such computing devices, many attempts have been recommended at various levels, from simulation to architecture, and all the way down to hardware circuit and technology[4]. Embedded system and all the mobile computing devices are habitually required to execute some key elements in the digital signal processing (DSP) applications. Further some improvement in the power efficiency for executing such applications can be made by integrated devices [5]. It has been observed that the use of these specialized processors can improve power efficiency compared with general processors at the same voltage and technology generation [6].Second, many DSP system and classification applications heavily depend on complex mathematical models and are designed to process information that typically contains noise[7]. Thus, for reducing the computational error, they exhibit only slight change in the overall DSP quality and classification precision instead of a severe failure in the multipliers. Such type of the error tolerance has been created by trading accuracy with power consumption [8]. Finally, these algorithms are initially designed and trained with partial products, but they are often converted to fixed point arithmetic due to the area, time and power cost of supporting units in embedded computing devices [9]. Even though this conversion process leads to some loss of computational precision, it usually does not affect the quality of DSP, images and the accuracy of classification applications due to reduced approximate difference [10].

The rest of this paper is organized as follows. Section II details the various related works. Section III gives a clear description about the proposed power efficient approximate multiplication technique. Section IV explains about the simulation and synthesis results. Finally, Section V details the conclusion.

II. Related Works

Approximate computing of the partial products is an emerging design paradigm that creates highly efficient hardware and software implementations by reducing the inherent resilience of applications to in accuracy in their computations[11][12]. Several efforts have been done for approximate computing in both the software and hardware with exact results. Software techniques always improves the performance by reducing the computations or the use of costly operations is avoided whereas some hardware techniques change the design at various levels of partial products to introduce change between output quality and power efficiency [13]. These efforts have been established for the significant approximate computing in the multipliers [14]. The figure 1 shows the generation of the partial products in the multipliers.
Approximate computing have been considered for reducing the error in all the applications that can resilience some loss of accuracy in the image and video with improved performance and power efficiency [14]. Applications including image processing, image recognition, multimedia signal processing and data mining can tolerant some error and does not require perfect output in computation[15]. For these types of applications, approximate multipliers may play an important role as a best alternative for reducing power, area and energy in digital systems that can tolerate some loss of accuracy in the output, so that it is possible to achieve best performance in power efficiency [16]. Such type of process exhibits the property of high inherent algorithmic tolerance to errors[17]. These algorithms are used to process large amounts of input data for multipliers that has significant discharge and may frequently contain imperfections in the output [18]. The fig 2 shows the conversion of partial products.

\[
\begin{array}{cccccccccccc}
2^14 & 2^{13} & 2^{12} & 2^{11} & 2^{10} & 2^9 & 2^8 & 2^7 & 2^6 & 2^5 & 2^4 & 2^3 & 2^2 & 2^1 & 2^0 \\
a_{7,7} & a_{7,6} & a_{7,5} & p_{7,4} & p_{7,3} & p_{7,2} & p_{7,1} & p_{7,0} & p_{6,0} & p_{5,0} & p_{4,0} & p_{3,0} & a_{2,0} & a_{1,0} & a_{0,0} \\
a_{6,6} & a_{5,7} & a_{5,6} & p_{5,5} & p_{5,4} & p_{5,3} & p_{5,2} & p_{5,1} & p_{5,0} & p_{4,0} & p_{3,0} & a_{2,0} & a_{1,0} & a_{0,0} \\
a_{6,5} & g_{7,4} & g_{7,3} & g_{7,2} & g_{7,1} & a_{6,4} & a_{6,3} & a_{6,2} & a_{6,1} & a_{6,0} & a_{5,0} & a_{4,0} & a_{3,0} & a_{2,0} & a_{1,0} \\
a_{6,4} & g_{6,4} & g_{6,3} & g_{6,2} & g_{6,1} & g_{6,0} & a_{5,4} & a_{5,3} & a_{5,2} & a_{5,1} & a_{5,0} & a_{4,0} & a_{3,0} & a_{2,0} & a_{1,0} \\
a_{6,3} & g_{5,4} & g_{5,3} & g_{5,2} & g_{5,1} & g_{5,0} & g_{4,0} & g_{3,0} & g_{2,0} & g_{1,0} & g_{0,0} & a_{4,0} & a_{3,0} & a_{2,0} & a_{1,0} \\
a_{5,4} & g_{4,4} & g_{4,3} & g_{4,2} & g_{4,1} & g_{4,0} & g_{3,0} & g_{2,0} & g_{1,0} & g_{0,0} & a_{3,0} & a_{2,0} & a_{1,0} & a_{0,0} \\
\end{array}
\]

**Fig 2:** Conversion of partial products.

III. Proposed Method

In the proposed multipliers we consider two 8-bit operands \( a_7a_6......a_2a_0 \) and \( b_7b_6......b_2b_0 \) for DADDA multiplier, the partial products of two 8-bit numbers are \( abj \) where \( i,j \) go from 0,1,...7. The partial products form a matrix of 0 rows and 7 columns as show in Fig.3.

\[
\begin{align*}
&\ s_1 = \{a[7] & \& b[0],a[6] & \& b[0],a[5] & \& b[0],a[4] & \& b[0],a[3] & \& b[0],a[2] & \& b[0],a[1] & \& b[0],a[0] & \& b[0]\}; \\
\end{align*}
\]

**Fig 3:** Generation of partial products of the proposed multipliers.

The next is the partial products reduction in each part. The generated partial products is reduced using some basic rules of DADDA multipliers. The final two rows of each part in the reduction of partial products are added using a Carry save Adder. The fig 4 shows the reduction of partial products.
After the process of reduction of the partial products we use all the approximate half adders, full adders and 4-2 compressors for further reduction of the partial products. Then the output of the approximate circuits is given to the carry save adder which produces the final value of 8x8 multiplications. Carry save adder reduces delay when compared with the ripple carry adder so we make use of the carry save adder.

IV. Results And Discussion.

All the generated partial products usually make use of the approximate half adders, full adders, 4-2 compressors to produce the sum and carry row. The various generated sum and carry rows are given as the input to the carry save adder. The fig 5 shows the simulation results of the carry save adder.

So finally we can easily find the value of the approximate multiplication using the DADDA multipliers. Then we need to synthesis the final results in order to calculate the amount of the power dissipation. After the process of synthesis we found that the approximation reduces the power consumption from 135 mW to 113.66 mW. The fig 6 shows the synthesis results of power.
V. Conclusion

In this paper, to propose power efficient approximate multipliers, first the partial products of the multiplier are changed using generate and propagate signals in the initial stage. Approximation of the multipliers is applied using simple OR gate for changed generate partial products. Approximate half-adder, approximate full-adder, and approximate 4:2 compressor are designed to reduce remaining partial products. They are also found to have better precision when it is compared to existing approximate multiplier. The proposed approximate multiplier designs can be used in applications like image processing with little loss in output quality of the image while reducing power and area. The proposed approximate multiplier design technique can be used for any type of parallel multipliers to improve the speed and performance.

References