"A New And Efficient Method Of Implementing Functions And Caseaded

< Back

"A New And Efficient Method Of Implementing Functions And Caseaded Wide Input Functions"

Abstract: The present invention provides an improved configurable logic device providing enhanced flexibility, scalability and providing area efficient implementation of arithmetic operation on (N-1) bit variables comprising a first configurable logic subsystem capable of generating logic OR output in response to functions of N-1 input variables in arithmetic mode, a second configurable logic subsystem capable of generating logic AND output in response to fimctions of N-1 input variables in arithmetic mode, and a configurable logic block connected at its first input to the output of said first configurable logic subsystem, connected at its second input to the output of said second configurable logic subsystem, connected at its third input to the Nth input variable, and connected at its fourth input to a carry/borrow signal; said configurable logic block providing a first output corresponding to carry/borrow value in arithmetic mode, a second output corresponding to logical functions of said N input variables in the logical mode and a third output corresponding to sum/difference value in arithmetic mode.

Get Free WhatsApp Updates!
Notices, Deadlines & Correspondence

Patent Information

Application #

Filing Date

29 October 2004

Publication Number

31/2009

Publication Type

INA

Invention Field

COMPUTER SCIENCE

Status

remfry-sagar@remfry.com

Parent Application

Applicants

STMICROELECTRONICS PVT. LTD.,

PLOT NO.2, 3 & 18, SECTOR 16A, INSTITUTIONAL AREA, NOIDA-201 3001, U.P.INDIA

Inventors

1. VIVEK KUMAR SOOD

C-3959, RAJAJIPURAM, LUCKNOW-226017

Specification

Field of the Invention
The present invention relates to an improved configurable logic device providing enhanced flexibility, scalability and providing area efficient implementation of arithmetic operation on n-bit variables.
Background of the Invention
Conventional adder/subtractor circuits are used in configurable logic devices to perform the most common Eirithmetic operations.
Figure 1 is a schematic diagram of a conventional carry chain circuit, which receives three input signals A, B, and Cin, where Cin is a carry input signal received fi-om another carry chain multiplexer circuit. Input signals A and B are applied to input terminals of XOR gate 101 A. In response, XOR gate 101A provides a carry propagate signal, P.
Carry propagate signal P is applied to an input terminal of XOR gate 102. Carry input signal Cin is applied to the other input terminal of XOR gate 102. In response, XOR gate 102 provides a sum signal S. Table 1 depicts the truth table for the carry chain circuit shown in figure 1.
Figure 2 shows the depiction of subtraction operation A - B by using a convention logic device. The carry propagate signal P is also applied to a control input terminal of multiplexer 103. Input signal A is applied to the "0" input terminal of multiplexer 103 and the carry input signal Cin is provided to the "1" input terminal of multiplexer 103. Depending upon the value of carry propagate signal P, either input signal A or carry input signal Cin is transmitted through multiplexer 103 as carry output signal Cout.
The XNOR gate lOlB is used here instead of the XOR gate lOlA.This is equivalent to inverting input B and using XOR gate lOlA instead.
Implementation of subtraction operation by using two's complement operation is shown in Table 2.
Carry/Borrow chain circuits shown in Figure 1 and Figure 2 have been implemented in a number of different ways in programmable logic devices (PLDs) such as field programmable-gate-arrays-(FPGAs).
A conventional circuit for using a function generator of a programmable logic device to implement carry logic fiinctions is described by the US Patent No. 5818255 that shows one of the methods of implementing the aforementioned truth tables in PLDs. The circuit is further illustrated diagrammatically in Figure 3. The configurable bits 320 and 401-416 are programmed with appropriate values to implement the truth tables shown in Table 1 and Table 2. Signal G is connected to "A" or "B".
It can be easily observed from the aforementioned US Patent that it does not have a dedicated provision to cascade the output S of the 4-input LUT for implementing wide input cascade functions.
Another conventional circuit for implementing dynamic addition subtraction operation is shown in Figure 4 that perform dynamic addition subtraction in 2's complement form by configuring input G3 or input G4 of the LUT as the add-sub signal and then using some additional logic so that the "cin" may become either "0" for addition or "1" for subtraction, since the operation is 2's complement. It is noteworthy that the add-sub signal needs to be connected at two places that is one at the LUT inputs (G3 or G4) and the other at the logic required for initializing the chain to logic 0 or logic 1. MUXCY is used for implementing carry logic as well as for implementing wide input functions by cascading the outputs of the 4-input LUTs. However the cascade element (MUXCY) does not have any provision of implementing XOR gates. Furthermore, additional connectivity in the logic circuit requires an increase in the resources.
An existing Altera device shown in Figure 5 provides an XOR gate (501) at the input "datal" of the LUT. This XOR gate is specifically given for performing dynamic addition/subtraction using 2's complement logic. However providing an XOR gate at only one input of the LUT causes the logical equivalence of the two arithmetic inputs "datal" and "data2" to be lost when performing dynamic addition subtraction operation since only "data2 ± datal" operation can be performed and not "datal ± data2" operation. If this equivalence is required then additionally connectivity has to be provided at the input terminals for "datal" and "data2" so that any signal that reaches "datal" can also reach "data2" and vice-versa any signal that reaches "data2" can also reach "datal". This causes an additional increase in hardware resources due to more connectivity and therefore requires more configuration bits.
Table 3 shows the truth table for the subtraction operation for performing A - B in 2's complement form.
The Bin(2scomp) represents the Bout(2scomp) of the previous subtraction operation.
At the start of the subtraction operation Bin(2scomp) is given a fixed value of logic 1 at-the-LSB.
Equation for a 2s complement subtraction(A-B) operation can be written as:
Diff = ~B ^ A ^ Bin(2scomp) (1)
Bout(2scomp)
= (~B && A) II ( ~B && Bin(2scomp)) || (Bin(2scomp) && A ). (2)
Let us assume that the operation (101011 - 110100) that is equivalent to -21 - (-12) has to be performed using 2's complement operation, which is illustrated in detail by Figure 6.
Also shown are the borrow outs of each stage. The result is 110111, which is the binary representation of-9 in 2's complement form. As can be seen from this example that at the LSB "Bin (2scomp)" requires a value of logic 1. This is shown as "init" in Figure 6.
Ail of the above prior arts implement subtraction using the tvi'o's complement arithmetic. The subtraction is performed by simply inverting one of the operands and making "Cin" as logic 1 for the LSB subtraction. Using two's complement arithmetic it suffices to provide just an adder circuit and generates the requirement of more hardware resources.
Thus a need is felt for an improved logic device that provides a scalable solution for achieving a minimum hardware implementation of arithmetic operations on n-bit variables.
Object and Summary of the Invention
It is an object of the present invention to provide a configurable logic device for performing direct subtraction operation on a given set of input variables.
It is another object of the present invention to provide a configurable logic device for performing a logical operation on a given set of input variables.
It is fiirther an object of the present invention to provide a cascade configurable logic device for performing arithmetic operation on data streams comprising at least two bit data.
To achieve the aforementioned objectives the present invention provides enhanced flexibility, scalability and providing area efficient implementation of arithmetic operation on n-bit variables comprising:
- a first configurable logic subsystem capable of generating logic OR output in response to functions of N-1 input variables in arithmetic mode,
- a second configurable logic subsystem capable of generating logic AND output in response to fiinctions of N-1 input variables in arithmetic mode, and
- a configurable logic block connected at its first input to the output of said first configurable logic subsystem, connected at its second input to the output of said second configurable logic subsystem, connected at its third input to the Nth input variable, and connected at its fourth input to a carry/borrow signal; the said configurable logic block providing a first output corresponding to carry/borrow value in arithmetic mode, a second output corresponding to logical fimctions of said N input variables in the logical mode and a third output corresponding to sum/difference value in the arithmetic mode.
Brief description of the accompanying drawings
Figure 1 illustrates the schematic diagram of a conventional carry chain circuit.
Figure 2 illustrates a conventional logic device for performing subtraction operation.
Figure 3 illustrates a conventional programmable logic device for implementing carry logic functions.
Figure 4 illustrates an existing circuit for performing subtraction using two's complement arithmetic.
Figure 5 illustrates a conventional device for performing dynamic addition and subtraction on a given set of input data.
Figure 6 illustrates conventional bit-by-bit subtraction by using two's complement subtraction
Figure 6 A illustrates the improved configurable logic device in accordance with the present invention.
Figure 7 illustrates the bit-by-bit subtraction by using direct subtraction.
Figure 8 illustrates the embodiment of the configurable logic device present invention for implementing arithmetic and logic operations of n-bit input variables.
Figure 9 illustrates a first sub-structure of the configurable logic device for generating a first arithmetic output in accordance with the present invention.
Figure 10 illustrates a second sub-structure of the configurable logic device for generating a second arithmetic output in accordance with the present invention.
Figure 11 illustrates a third sub-structure of the configurable logic device for generating a third arithmetic output in accordance with the present invention.
Figure 12 illustrates a fourth sub-structure of the configure logic device for generating a fourth arithmetic output in accordance with the present invention.
Figure 13 illustrates a cascaded configurable logic device in accordance with the present invention.
Figure 14 illustrates the internal structure for the cascaded structure of the configurable logic device in accordance with the present invention.
Figures 15, 16 & 17 illustrate another embodiment of the present invention.
Table 1 illustrates the conventional truth table for the carry chain circuit shown in figure 1.
Table 2 illustrates the conventional implementation of subtraction operation by using two's complement operation.
Table 3 illustrates the truth table for the subtraction operation for performing A - B in 2's complement form.
Table 4 illustrates the truth table for implementing direct subtraction in accordance with the present invention.
Detailed description
The present invention is discussed in the light of the derivation of the direct method of subtraction. The logical truth table for the method can be clearly seen from Table 4.
The equations for this operation are as follows:
Diff = A ^ B '^ Bin(direct) (3)
Bout(direct) = (~A && B) ||( B && Bin(direct)) || (~A &«&Bin(direct)) — (4)
The advantage of implementing this equation is that it requires a value of logic 0 for initialization at the start of the chain at the LSB subtraction. This requirement is the same as the requirement for carry chain initialization when performing addition operation, which also requires a fixed value of logic 0 at the start of the chain.
It can be therefore seen that implementing this equation in FPGAs will result in a significant saving in area since the requirement of having a value of logic 0 for the carry/borrow initialization will remove the requirement for having a programmable configuration bit at that place programming of which allowed logic 0 or logic 1 values to pass through while performing two's complement addition and subtraction respectively. In FPGAs the configuration bits have a significant area. Doing away with one configuration bit will save a significant area. Moreover, when performing dynamic addition and subtraction operation the add-sub signal need not control the LSB "Bin". This means that the add-sub signal need not have any additional connectivity apart from being connected to the LUT inputs and can be treated just like any other LUT input. This is in contrast to Figure 4 where the add-sub signal needs to be routed to other logic also; apart from the LUT that implements the addition or subtraction.
Direct method of operation is normally used when dealing with unsigned numbers. However with a slight interpretation of results the direct method of subtraction can also tackle numbers represented in two's complement form. Further, it is shown that how the direct subtraction produces the same result as a two's complement subtraction.
Examining Table 3 and Table 4 it is seen that if Bin (direct) is chosen such that Bin (direct) = -Bin (2scomp) for some given values of operands A and B then for the same values of operands A and B it is found that Bout (direct) = -Bout (2scomp).
Let us assume that the operation (101011 - 110100) that is equivalent to -21 - (-12) has to be performed using 2's complement operation, which is illustrated in detail by Figure 6.
Also shown are the borrow outs of each stage. The result is 110111, which is the binary representation of-9 in 2's complement form. As can be seen from this example that at the LSB "Bin (2scomp)" requires a value of logic 1. This is shown as "init" in Figure 6.
Figure 6A illustrates the schematic of the configurable logic device for implementing the arithmetic, logic operations for an input for n-bit variables. It is seen that the LUTs are coupled to N-1 input data streams for generating an intermediate arithmetic and logic function of said N-1 input data stream. The outputs X and Y from the LUTs are further provided to a logic selector for generating a logical function FG_OUT2 of the N-1 input variables and to two logically configurable subsystems for generating sum/ difference and carry/borrow signals. The logically configurable subsystems are driven by the configuration bits P2 and P3 and are coimected to a carry/ borrow input signal CIN besides being connected to each other for producing carry/ borrow FG_OUT 1 and sum/difference FG_OUT 3 outputs respectively. The switching logic PI is a two-way switch that is connected to a configuration bit PI and its inversion signal at its first input, coupled to Nth input at its second input and coupled to the carry/borrow input signal CIN at its third input to generate an output W that is further coimected at the select input of the logic selector for generating said logical function of N-1 input variables. Switch PI is used for enabling the carry/ borrow input logic to perform arithmetic operation on data streams comprising at least two bits.
We shall now consider that whether the same operands (101011 and 110100) are given to a direct method subtracter and the same subtraction has to be done using the direct method. Figure 7 shows this operation.
As can be seen from this example that at the LSB "Bin(direct)" requires a value of logic 0 for initialization.. This is shown as "init" in Figure 7. This logic 0 value is the same as that required for carry chain initialization in case of binary addition, as can be verified by those skilled in the art
From the example shown in figures 6 & 7 it can be seen that at each stage Bout (direct) = -Bout (2scomp). The value of "Diff remains the same in the two cases. Thus using the direct method of subtraction we can deal with numbers represented in 2's complement as well. Only difference being that Bout (2scomp) = ~Bout (direct). This however is not a problem in FPGAs since the inverter can easily be absorbed in the LUT if intermediate borrow outs are required to be used at places other than the borrow chain.
The present invention therefore implements the addition and subtraction using the above concept such that no additional configuration bits differentiate the addition operation from the subtraction operation, apart from the configuration bits of the look up table. The aim of the present invention is to therefore provide an efficient method of implementing arithmetic operations (addition, subtraction and dynamic addition/subtraction) and to provide efficient means of implementing all wide input cascade functions.
The present invention therefore implements the addition and subtraction using the above concept such that no additional configuration bits differentiate the addition operation from the subtraction operation, apart from the configuration bits of the LUT. The aim of the present invention is thus to provide an efficient method of implementing arithmetic operations (addition, subtraction and dynamic addition/subtraction) as well as provide an efficient means of implementing all \\'ide input cascade functions.

The invention therefore provides means to implement addition/subtraction as well as dynamic addition/subtraction. The cany chain can also be used for the implementation of wide input fiinctions. The said invention consists of the 4-uiput LUT (818) formed using four 2-input LUTs (801),(802),(803) and (804). Outputs of the multiplexers Mux (805) and Mux (806) act as the outputs of the two 3-input LUTs (816) and (817) respectively. Output of Mux (807) as well as Mux (809) generate the output of the 4-input LUT (818). However Mux (807) and (809) are different in the aspect that the Mux (809) takes its input from unit (808), which causes the output of Mux (809) to have a fixed value of logic 0 or the normal 4-input LUT out (composed of inputs 10,11,12 and CBin) depending on the configuration bit (PI). Mux (809) belongs to a dedicated chain structure, which does not disturb the normal 4-input LUT out fimctionality, which is still available at output FG_0UT2. Note that the functionality of Unit (808) is not limited by the implementation shown in Figure 8. Several possible implementations of Unit (808) are possible. One such can be the usage of two AND gates, the outputs of which act as inputs to "0" and "1" of Mux (809). One of the inputs to each of these AND gates are the outputs of Mux (805) and Mux(806) respectively. The other input to each of these AND gates is the configuration bit (PI). Thus when PI is "0" the output of Mux(809) is "0" else it as the normal 4-input LUT out (composed of inputs 10,11,12 and CBin).
Unit (810) is used to implement the sum/difference out both for adder and subtractor respectively. Note that the functionality of Unit (810) is not limited by the implementation shown in the Figure 8. Several possible implementations of Unit (810) are possible.
Signals 10, 11,12,13 and CBin are the inputs to the apparatus shown in Figure 8. The apparatus-can-be-configured-in-these-modes:
1. Arithmetic mode:
• Addition
• Subtraction
• Dynamic Addition/Subtraction.

2. Normal Mode
3. Cascade Mode
LUTFDWCTIOHALITYIN AIUIHMETIC MODE Functionality to be ijplemented in 2-LUTl(s01)-
F-10II
Functionality to be implenented in 2-LUT2(802):
ADDER:
F-10||Il
SUBTCACTOR/dymaic ADD-SUB mode;
F-10||~[lforIl-IO
Fnntionality to be implenented in 2-LUT3(803)--
F = ID&&11
Funtionality to beimplemented in 2-LUT4(S04) -
ADDDER mode:
F=I0&&I1
SUBTRACTOR/DynamicADD-SUB modc:
F—[Dt&Ilftlfia-I]
F = ID&fl.-41fbrIl-10
Addition mode
The sum equation that is implemented is:
Sum = 10 ^ II ^ CBin. (5)
(EQUATION REMOVED)

Equation 8 expresses 10 xnor II in terms of RHS components of Equation 6. Equation 7, which denotes the SUM can now be expressed in terms of Equation 8. Thus, finally it follows from this that equation 5 can be expressed in terms of components of RHS of equation 6. Hence SUM can be generated from the components of equation 6.
In this mode of operation the four 2-input LUTs (801),(802),(803) and (804)are configured to implement the functionality shown in Figure 9 so that "CBout" implements equation 6. The apparatus then performs the addition operation 10 + 11 or
11 +10. Input 12 is pulled to logic 0 for this operation. This causes multiplexers (805)
and (806) to select the output of the 2-input LUTs (802) and (804) respectively, which
thus act as the two inputs of the multiplexer (809). PI of unit 808 is configured to
select the signals (814) and (815) as the inputs to the Multiplexer (809). "CBin" is the
select line of multiplexer (809).
The outputs of 2-input LUTs (801) and (803) act as the inputs of unit (810).
Functionality implemented in (801) and (803) is as shown below.
(801) is configured by the user to implement 10 || II and (803) is configured by the user to implement 10 && II i Both of these are the RHS components of equation 6.
Thus the output of unit (810) now implements the sum equation 5 using the logic of equations-(7)-and-(8):
Figure 10 shows the functionality of the 2-input Function generators (801),(802),(803) and (804) that can be used to implement this equation. Note that these 2-input FGs are actually a part of the 4-input EG (811), which is also shown in Figure 10. Inverter 1008, NAND gate 1009 and XNOR gate 1010 are the additional components required to implement the SUM logic. Signal AS, as shown in Figure 10, is pulled down to logic 0 using many of the possible techniques apparent to those skilled in the art.
Subtractor mode:
Subtraction operation (10-11)
Equation of borrow out is:
Borrow_Out = [ (~I0 || II) && borrow_in ] || [~ 10 && Il&&~borrow_in] — (9)
Equation of the difference (DIFF) remains same as equation of SUM, which is again given, by equations 5 and 8 mentioned before.
Figure 11 shows the functionality of the 2-input LUTs that can be used to implement this equation. The units (801) and (803) have the same functionality as that in the adder mode. However, the functionality of (802) and (804) is different from that in the adder mode. As signal again needs to be pulled to logic 0 similar to the adder mode.
Subtraction operation (II -10)
Equation of borrow out is :
Borrow_Out = [ (~I1 || 10) && borrowjn ] || [~ II && 10] — (10)
Equation of the difference DIFF remains same as equation of SUM, which is again given, by equations 5 and 8 mentioned before.
Figure 12 shows the functionality of the 2-input LUTs that can be used to implement this equation. The units (801) and (803) have the same functionality as that in the adder mode. The functionality of (802) and (804) is different from that of the subtractor (10-11) mode. As signal again needs to be pulled to logic 0 similar to the adder mode.
Dynamic addition subtraction:
This mode has the same implementation as the subtractor modes except that AS becomes a normal input to the apparatus and is controlled by the output of some other circuit rather than being permanently pulled to logic 0.
Figure 13 show the dynamic add-sub mode for performing 10 ± II. Note how in this mode the AS signal is not pulled to ground but behaves like any other input to the apparatus and control the operation of addition and subtraction. When AS is logic 0 subtraction is performed and when AS is logic 1 addition is performed.
Similarly dynamic addition/subtraction for performing II ± 10 can be implemented.
Note that in any of the above implementations, apart from the changing configuration bits of the LUT no additional configuration bit is required to differentiate between the addition or the subtraction operation, which means that it requires less configuration bits as compared to few of the Prior arts mentioned above apart from other advantages to-be-mentioned-later.
Normal-mode:
In this mode of operation FG_OUTl generates a fixed value of logic 0 using bit "PI" of Figure 8 while FG_OUT2 generates the normal 4-input LUT functionality. This logic 0 value can be used for two purposes. Firstly it causes the start of the carry/cascade chain in an apparatus that is above the apparatus that causes this initialization. Secondly it can be used for preventing the unnecessary toggling of the unused portions of the carry/cascade chain. Refer Figure 14, wherein (Pll) is configured such that Mux (Mil) generates a value of logic 0. FG_OUT2 generates any 4-input function of inputs "I0","I1","I2" and "13". Thus, even when initializing, the normal 4-input functionality of (1401) is not disturbed.
Cascade-mode:
This mode is identical to the arithmetic mode except that now the functionality of the 4-rnput LUT is not limited by the values mentioned in Figure 9. The 4-input LUT (818) shown in Figure 8 generates any four input fimction of inputs 10, 11,12 and CBin. This function is obtained at the output FG_OUTl which is the output of Mux(807) (M21 and M31 in Figure 14). Referring again to Figure 14 it is seen that since FG_OUTl connects to CBin of the apparatus just above it therefore this whole cascaded structure forms a cascade chain that can implement wide functions. If
(1402) has to implement the first element of a carry/cascade structure then (1401) is configured in the normal mode. Output of Mux (Mil) goes to logic 0. This causes Mux (M21) to select the signal at its "0" input as its output. Thus the output of Mux (M21) can have any function of three inputs 10, II and 12; the inputs belonging to structures such as (1402). For the Arithmetic mode this function can be a half adder or half subtractor. For the cascade mode this can be any Boolean function of three variables formed using 10, II and 12. Now say one requires the result of the cascade or carry function operation to be propagated to any other place other than the "CBin" of the apparatus belonging to the chain; then the same function as is available at FG_OUTl can be made available at FG_OUT2. Thus for example FG_OUTl of
(1403) is required to be used at some other logic other than the chain, then (P32) is programmed such that the signal at input "0" of Mux (M32) is propagated as the output of (M32). This causes FG_OUTl and FG_OUT2 of (1403) to have the same functionality. Thus FG_0UT2 of (1403) can be used to tap the intermediate or the end carry/cascade outs.
It can be seen from Figure 14 that since the output of one 4-input LUT acts as the input of another 4-input LUT (using multiplexers like M11,M21 and M31 in Figure 14) , the output of this 4-input LUT acts as an input of another 4-input LUT(using similar type of muxes) and so on therefore this structure of the cascaded 4-input LUTs can implement wide input functions without any restraint on the functionality to be implemented in the cascade structure.
Figures 15 and 16 show another embodiment of the invention. Here the inputs to (810) are taken in a different manner. These are taken from points (814) and (815) which are the outputs of muxes (805) and (806) respectively, instead of being taken from the outputs of the 2-input LUTs (801) and (803) which was shown previously in Figure 8. Mux (1602) is further added as shown in Figure 16. Following discussion explains how this embodiment functions and its advantages.
Considering Figure 17, it is seen that it is identical to Figure 8 except that the inputs to (810) are taken from points (814) and (815) instead of from the outputs of (801) and (803) as were taken in Figure 8. If the structure shown in Figure 17 has to implement the adder operation then the same functionality as shown in Figure 10 can be implemented in Figure 17 and the apparatus works just as fine. Now suppose the structure shown in Figure 17 has to implement the subtractor operation like that shown in Figure 11 (for 10 - II) or Figure 12(for II - 10). If the same functionality of Figure 11 or Figvu-e 12 is implemented in Figure 17 it can easily be verified by those skilled in the art that the DIFF function obtained at FG_OUT3 in Figure 17 will actually be the inverted of that which is actually required and obtained at FG_OUT3 of Figure ll(for 10 - II) or Figure 12(for II - 10). Now since the DIFF function is actually a 3-input XOR function of operands "I0","n" and "CBin" therefore if any one of the inputs to this XOR function is inverted then we can obtain the non inverted value of the DIFF.function in Figure 17 . This value will be correct and will match the functionality of Figure 11 or Figure 12 . Here if it be chosen to invert CBin (which is Bin(direct) for the subtractor case) then we can obtain the desired functionality at the DIFF output of Figure 17. From Table 3 and Table 4 it is seen that for the same value of operands "A" and "B" if we choose Bin(2scomp) such that that Bin(2scomp) = ~Bin(direct) ,we find that Bout(2scomp) = ~Bout(direct). Thus inverting Bin(direct) is actually equivalent of passing Bin(2scomp). Therefore in a chain like that shown in Figure 14 but with (1401),(1402) and (1403) replaced by structure of Figure 17 and the chain implementing a subtractor operation ,if at the LSB we invert CBin(Bin(direct)) then all subsequent Bouts will be inverted such that the DIFF output obtained is the correct value. This is equivalent of a 2scomplement subtraction operation in which we pass Bin(2somp) as equal to "1" at the LSB. However doing that would mean making unit(808) of the apparatus just below the LSB of the chain, generate both logic 0(for addition) or logic 1 (for subtraction). This would be true for an apparatus like (1401) shown in Figure 14, which would require its "Cbout" to be configured to logic 0 as well as logic 1 .Additionally we would require the add-sub signal also to be routed to this place for implementing dynamic add-sub. This method would require two additional configuration bits.
However, a better method exists which is shown in Figure 16. This method involves the addition of Mux (1602) along with a single configuration bit (1601). Mux (1602) when configured to select the value at "0" input would pass the value of the 3-input LUT (816) through the NAND gate (1603) belonging to unit (810). This NAND gate
now acts as a simple inverter since one of its input is at logic 1. Since CBin of the apparatus just below the apparatus from which the chain starts is configured to logic 0 as previously shown in Figure 14, therefore the value of the 3-input LUT (816) is now available at FG_0UT3. Thus at this point we can obtain any 3 input functions of inputs "I0","H" and "12". This three input function can be the 2-input XOR of any of the 2-inputs belonging to "I0","n" or "12". In such a case it becomes the sum/diff out of the half adder or the half subtracter. Further since "CBin" is pulled to logic 0 therefore the output of the other 3-input LUT (815) passes through Mux (809) and is available as "CBout" of the LSB operation. In case of an adder this can be the carry-out of the half adder while in case of the subtracter this can be the borrow-out of the half subtractor, which now will be the inversion of Bout (direct) i.e. it is now Bout (2scomp). Since at the LSB operation we obtain inverted value of Bout (direct) which will act as the ~Bin (direct) of the next stage it follows from the previous discussions that Bout of this stage will also be ~Bout(direct) which will further act as the input of the next stage and so on . Thus at all subsequent stages we would obtain the inverted value of Bout(direct) Since we would obtain the inverted value of Bout(direct) at all stages this means we obtain the correct value of DIFF at all stages since as explained previously we would have obtained the inverted value of DIFF had we passed Bout(direct) without inverting using the implementation of Figure 17. When implementing dynamic add-sub operation any of the inputs "10" or "11" or "12: can be used for the add-sub signal since the inputs to unit-808 are now the outputs of the 3-input LUT structures (816) and (817 . This can be considered in contrast to Figure 5 (Prior Art : Altera( Stratix device) data sheet) where the add-sub signal cannot be swapped with any other signal apart from "datal". Further more in the current invention any of the 3-inputs "I0","n" or "12" can be used as the operands of the arithmetic operation, which can be considered in contrast to Figure 4 (Prior Art: Reference Virtex II data sheet) where one of the operands has to be connected to either of inputs "Gl" or "G2" or Figure 5 (Prior Art : Altera( Stratix device) data sheet) where the operands can be placed only on "datal" or "data2". Thus the invention provides three input logical equivalence between the operands of the arithmetic operation as well as between the operands and the add-sub signal. Increasing the logical equivalence makes it more software friendly since the solution space for the algorithms increase as now they can bring a particular signal to any of the inputs "I0","ir' or "12" for performing arithmetic operations. The three input logically equivalence can be exploited in a number of ways by those skilled in the art. One such use is the provision of carry insertion. Any of the 3 inputs "10","ir' or "12" can be used for insertion of external carry. Further when implementing multipliers the intermediate product terms can be absorbed in the LUTs. Note that the 3-input LUT structures (816) and (817) shown in Figure 16 can be implemented in number of ways; not limited to the implementation shown in Figure 16 . This three-output, five input Function Generator can implement efficient dynamic as well as fixed addition and subtraction apart from implementing the normal 4-input LUT functions. Apart from that it allows cascading of 4-input LUTs which causes the implementation of very wide functions without any additional hardware and without any functional limitations caused due to some additional cascade element. The addition and subtraction operations are dependent only on the configuration bits of the LUT. No additional configuration bit differentiates between the subtraction and the addition operation. Further, three inputs of the LUT become logically equivalent for arithmetic performing operations i.e. not only the operands used in the arithmetic operations can be swapped with each other but also the operands and the add-sub signal.

We Claim:
1. An improved configurable logic device providing enhanced flexibility,
scalability and providing area efficient implementation of arithmetic operation
on (N-1) bit variables comprising:
a first configurable logic subsystem capable of generating logic OR output in response to functions of N-1 input variables in arithmetic mode,
a second configurable logic subsystem capable of generating logic AND output in response to functions of N-1 input variables in arithmetic mode, and
a configurable logic block connected at its first input to the output of said first configurable logic subsystem, connected at its second input to the output of said second configurable logic subsystem, connected at its third input to the Nth input variable, and connected at its fourth input to a carry/borrow signal; the said configurable logic block providing a first output corresponding to carry/borrow value in arithmetic mode, a second output corresponding to logical functions of said N input variables in the logical mode and a third output corresponding to sum/difference value in the arithmetic mode.
2. An improved configurable logic device as claimed in claim 1, wherein each of
said first and second configurable logic subsystem comprising:
a pair of N-1 input Look Up Tables for generating a logic OR and logic AND output of said function of N-1 input variables, and a two input selector connected at its inputs to the output of said pair of N-1 input look up tables for generating a signal corresponding to a logic OR or logic AND function of N-1 input variables.
3. An improved configurable logic device as claimed in claim 2, wherein said N-1 Look Up Table is a two input Look Up Table.
4. An improved configurable logic device as claimed in claim 1, wherein said configurable logic block comprising:
a. a first switching means connected to said Nth input variable at its first
input, said carry/borrow signal at the second input and a configuration
bit at its third input, for generating a control signal.
b. a first selection means connected at its control input to the output of
the said switching means and connected at its first and second inputs to
the output of said first and second configurable logic subsystem; for
generating an output corresponding to said logical function of N input
variables, in logical mode.
c. a third configurable logic subsystem cormected at its first input to the
output of said first configurable logic sub system , connected at the
second input to the output of the second configurable logic subsystem,
connected at the third input to the said carryA)orrow signal and
connected at the fourth input to a configuration bit, for generating a
signal corresponding to carry/borrow value in the arithmetic mode. Further, the said third configurable logic sub system being capable of performing either the selection among its first and second inputs based on the said carry/borrow signal or capable of generating a constant logic value. The above selection being done by the configuration bit. d. a fourth configurable logic subsystem cormected at its first input to the output of first configurable logic sub system, cormected at the second input to the output of the second configurable logic subsystem, connected at the third input to the carry/borrow signal and connected at the fouth input to a configuration bit, for generating an output corresponding to sum/difference value in the arithmetic mode. Further, the said fourth configurable logic subsystem being capable of either passing the value of the first configurable logic subsystem to the output or of performing the XNOR operation of the said carry/borrow signal with the NANDing operation of the first input and the complement of the second input The above selection being done by the configuration bit.
5. An improved configurable logic device as claimed in claim 4, wherein said first selection means is a two input multiplexer.
6. An improved configurable logic device as claimed in claim 4, wherein said fourth configurable logic subsystem comprising:
a. a logic nand-gate connected at its first input to the output of one look
up table of said pair of N-1 input look up tables of said first logic
subsystem and connected at its second input to the output of a logic
inverter, said logic inverter cormected at its input to the output of one
look up table of said pair of N-1 input look up tables of said second
logic subsystem, and
b. a logic xnor-gate cormected at its first input to the output of said logic
nand-gate and connected to said carry/borrow signal at its second
input.
7. An improved configurable logic device as claimed in claim 4, wherein said third configurable logic subsystem comprising at least one selector coupled to a plurality of pass transistors.
8. An improved cascaded configurable logic device structure providing enhanced flexibility, scalability and providing area efficient implementation of arithmetic operations on (N-1) bit variables as claimed in claim 1, comprising at least two configurable logic devices cormected to each other for generating at least two bit arithmetic mode output.
9. An improved cascaded configurable logic device structure as claimed in claim 8, wherein the carry/ borrow signal of the first of said configurable logic devices is connected to the carry/borrow value output of the second of said configurable logic devices.
10. A method for providing enhanced flexibility, scalability and area efficient
implementation of arithmetic operation on (N-1) bit variables comprising steps
of:
a. generating logical OR output by a first configurable logic subsystem in
response to an input function of N-1 variables;
b. generating logical AND output by a second configurable logic
subsystem in response to an input fimction of N-1 variables; and
c. connecting the output of said first configurable logic subsystem to the
first input of a configurable logic block and connecting the output of
said second configurable logic subsystem to the second input of said
configurable logic block, the Nth input variable being coupled to the
third input of said configurable logic block and connecting a
carry/borrow signal to the fourth input of said configurable logic block
for generating a first output corresponding to carry/borrow value in
arithmetic mode, a second output corresponding to logical functions of
said N input variables in the logical mode and a third output
corresponding to sum/difference value in arithmetic mode.
11. An improved configurable logic device providing enhanced flexibility,
scalability and providing area efficient implementation of arithmetic operation
on (N-1) bit variables substantially as herein described with reference to and
as illustrated by the accompanying drawings.

Documents

Application Documents

#	Name	Date
1	2154-del-2004-abstract.pdf	2011-08-21
1	2154-del-2004-petition-138.pdf	2011-08-21
2	2154-del-2004-pa.pdf	2011-08-21
2	2154-del-2004-claims.pdf	2011-08-21
3	2154-del-2004-form-5.pdf	2011-08-21
3	2154-del-2004-correspondence-others.pdf	2011-08-21
4	2154-del-2004-description (complete).pdf	2011-08-21
4	2154-del-2004-form-3.pdf	2011-08-21
5	2154-del-2004-form-2.pdf	2011-08-21
5	2154-del-2004-description (provisional).pdf	2011-08-21
6	2154-del-2004-form-1.pdf	2011-08-21
6	2154-del-2004-drawings (provisional).pdf	2011-08-21
7	2154-del-2004-drawings.pdf	2011-08-21
8	2154-del-2004-form-1.pdf	2011-08-21
8	2154-del-2004-drawings (provisional).pdf	2011-08-21
9	2154-del-2004-form-2.pdf	2011-08-21
9	2154-del-2004-description (provisional).pdf	2011-08-21
10	2154-del-2004-description (complete).pdf	2011-08-21
10	2154-del-2004-form-3.pdf	2011-08-21
11	2154-del-2004-correspondence-others.pdf	2011-08-21
11	2154-del-2004-form-5.pdf	2011-08-21
12	2154-del-2004-pa.pdf	2011-08-21
12	2154-del-2004-claims.pdf	2011-08-21
13	2154-del-2004-petition-138.pdf	2011-08-21
13	2154-del-2004-abstract.pdf	2011-08-21