

## Design and Implementation of Hybrid Lut/Multiplexer Fpga Logic Architectures

Thukaram Gugulothu & Kurumaiah.N

Assistant Professor, Department Of Ece, Nalla Narasimhareddy Education Society's Group Of Institutions,Hyd

Assistant Professor Nalla Narasimhareddy Education Society's Group Of Institutions, Hyd

### Abstract:

*Hybrid configurable logic block architectures* for field-programmable door clusters that contain a blend of query tables and solidified multiplexers are assessed toward the objective of higher logic thickness and zone decrease. Innovation mapping enhancements that objective the proposed architectures are additionally executed inside ABC. Both representing complex logic block and steering zone while keeping up mapping profundity. For fracturable architectures, the proposed design of this paper investigation the logic size, zone and power utilization utilizing Xilinx 14.2.

### **Keywords:**

FPGA, Multiplexer logic element, Complex logic block, mapping technologies

### I. Introduction

A field-programmable entryway exhibit (FPGA) is a block of programmable logic that can actualize multi-level logic capacities. FPGAs are most ordinarily utilized as particular ware chips that can be customized to actualize vast capacities. In any case, little blocks of FPGA logic can be valuable segments on-chip to enable the client of the chip to alter some portion of the chip's logical capacity. A FPGA block must execute both combinational logic capacities and interconnect to have the capacity to develop multi-level logic capacities. There are a few unique technologies for programming FPGAs, however most logic forms are probably not going to execute antifuses or comparable hard programming technologies.

All through the historical backdrop of fieldprogrammable door exhibits (FPGAs), query tables (LUTs) have been the essential logic (LE) used acknowledge element to combinational logic. A K-input LUT is non specific and exceptionally adaptable ready to actualize any K-input Boolean capacity. The utilization of LUTs improves innovation mapping as the issue is diminished to a chart covering issue. In any case, an exponential region cost is paid as bigger LUTs are considered. The estimation of K in the vicinity of 4 and 6 is commonly found in industry and the scholarly world, and this range has been exhibited to offer a decent region/execution



trade off. As of late, various different works have investigated elective FPGA LE architectures for execution change to close the huge hole amongst FPGAs and applicationparticular coordinated circuits (ASICs)

### LOOKUP TABLES

The fundamental strategy used to manufacture a combinational logic block (CLB) likewise called a logic element in a SRAM-based FPGA is the query table (LUT). As appeared in Figure, the query table is a SRAM that is utilized to execute a fact table. Each address in the SRAM speaks to a blend of contributions to the logic element. The esteem put away at that address speaks to the estimation of the capacity for that information mix. A n-input work requires a SRAM with areas.



#### Fig -1 Lookup Tables

Since a fundamental SRAM isn't timed, the query table logic element works much as any other logic door as its sources of info changes, its yield changes after some postponement.

### **PROGRAMMING A LOOKUP TABLE**

Dissimilar to a run of the mill logic door, the capacity spoke to by the logic element can be changed by changing the estimations of the bits put away in the SRAM. Subsequently, the ninput logic element can speak to capacities (however some of these capacities are changes of each other).



Fig-2 Programming A Lookup Table

An ordinary logic element has four information sources. The deferral through the query table is autonomous of the bits put away in the SRAM, so the postponement through the logic element is the same for all capacities. This implies, for instance, a query table-based logic element will show the same postponement for a 4-input XOR and a 4-input NAND. Interestingly, a 4-input XOR worked with static CMOS logic is impressively slower than a 4-input NAND. Obviously, the static logic door is for the most part speedier than the logic element. Logic



e-ISSN: 2348-6848 p-ISSN: 2348-795X Volume 04 Issue-17 December 2017

elements for the most part contain registers flipflounders and locks and also combinational logic. A flip-slump or lock is little contrasted with the combinational logic element (in sharp differentiation to the circumstance in custom VLSI), so it bodes well to add it to the combinational logic element. Utilizing a different cell for the memory element would basically take up steering assets. The memory element is associated with the yield; regardless of whether it stores a given esteem is controlled by its clock and empower inputs.

In this paper, we propose consolidating (a few) solidified multiplexers (MUXs) in the FPGA logic blocks as a methods for expanding silicon region productivity and logic thickness. The MUXbased logic blocks for the FPGAs have accomplishment in seen early business architectures, for example, the Actel ACT-1/2/3 architectures, and productive mapping to these structures has been contemplated in the mid 1990s. Be that as it may, their utilization in business chips has wound down, maybe halfway because of the simplicity with which logic capacities can be mapped into LUTs, disentangling the whole PC helped design (CAD) stream. By the by, it is broadly comprehended that the LUTs are wasteful at executing MUXs, and that MUXs are often utilized as a part of logic circuits. To underscore the wastefulness of LUTs executing MUXs,

think about that as a six info LUT (6-LUT) is basically a 64-to-1 MUX (to choose 1 of 64 truth-table lines) and 64-SRAM arrangement cells, yet it can just understand a 4-to-1 MUX (4 data+2 select=6 inputs). In this paper, we display a six-input LE in light of a 4-to-1 MUX, MUX4, that can understand a subset of sixinput Boolean logic capacities, and another hybrid complex logic block (CLB) that contains a blend of MUX4s and 6-LUTs. The proposed MUX4s are little contrasted and a 6-LUT (15% of 6-LUT territory), and can proficiently delineate {2,3}-input capacities and some {4,5,6}-input capacities. What's more, we investigate factorability of Les the capacity to part the LEs into various littler elements in both LUTs and MUX4s to expand logic thickness. The proportion of LEs that ought to be LUTs versus MUX4s is additionally investigated toward enhancing logic thickness for both nonfracturable and fracturable FPGA architectures. To encourage the engineering investigation, we built up a CAD stream for mapping into the proposed hybrid CLBs, made utilizing ABC and VPR, and depict innovation mapping methods that empower the choice of logic works that can be installed into the MUX4 elements. The fundamental commitments in this paper are as per the following.

1) Two hybrid CLB architectures (nonfracturable and fracturable) that contain a



blend of MUX4 LEs and the customary LUTsyielding up to 8% territory investment funds.2) Mapping systems called Natural Mux andMuxMap focused toward the hybrid CLBengineering that streamline for territory, whileprotecting the first mapping profundity.

3) A full post-place-and-course engineering assessment with VTR7, and CHStone benchmarks encouraged by LegUp-HLS, the Verilog-to-Routing venture indicating sway on both zone and postponement.

Contrasted and the preparatory distribution, we have performed transistor level displaying of the MUX4 LE. additionally considered the fracturable architectures, and brought together the open source toolflow from C through LegUp-HLS to the VTR stream. Scanty crossbars (versus full crossbars in the past work) have likewise been incorporated into our CLBs, expanding displaying exactness. The new transistor-level demonstrating of the MUX4 likewise gives more precise outcomes as contrasted and the past work. Results have additionally been extended with the incorporation of timing comes about and also bigger building proportion clears.

### **II.** Literature Review

Late works have demonstrated that the heterogeneous architectures and amalgamation strategies can significantly affect enhancing logic thickness and deferral, narrowing the

ASIC- FPGA hole. Works by Anderson and Wang with "gated" LUTs, at that point with awry LUT LEs, demonstrate that the LUT elements introduce in business FPGAs give superfluous adaptability. Toward enhanced deferral and region, the macrocell-based FPGA architectures have been proposed. These investigations depict noteworthy changes to the customary FPGA architectures, while the progressions proposed here expand on architectures utilized as a part of industry and the scholarly community. So also, and-inverter cones have been proposed as substitutes for the LUTs, propelled by and-inverter diagrams (AIGs).

Purnaprajna and Ienne investigated the likelihood of repurposing the current MUXs contained inside the Xilinx Logic Slices. Like this work, they utilize the ABC need cut mapped and VPR for pressing, place, and course. Be that as it may, their work is basically based demonstrating а postpone normal accelerate of 16% utilizing just ten of 19 VTR7 benchmarks.

In this article, we contemplate the innovation mapping issue for a novel fieldprogrammable door cluster (FPGA) engineering that is based onk-input single-yield programmable logic exhibit (PLA-) like cells, or, k/m-macrocells. Every cell in this design can execute a solitary yield capacity of up to k inputs and up to m item terms. We build up an



extremely productive innovation mapping calculation, km stream, for this new kind of engineering. The test comes about demonstrate that our calculation can accomplish profundity optimality on all the experiments in an arrangement of 16 Microelectronics Center of North Carolina (MCNC) benchmarks. Moreover it is demonstrated that on this arrangement of benchmarks, with just a generally modest number of item terms ( $m \le k+3$ ), the k/m-full scale cellbased FPGAs can accomplish the same or comparative mapping profundity contrasted and the customary kinput single-yield query table-(k-LUT-) based FPGAs. We likewise research the aggregate zone and deferral of k/mfull scale cell-based FPGAs and contrast them and those of the generally utilized 4-LUT-based FPGAs. The trial comes about demonstrate that k/m-large scale cell-based FPGAs can outflank 4-LUT-based FPGAs as far as both deferral and zone after situation and steering by VPR on this arrangement of benchmarks.

This paper presents exploratory estimations of the contrasts between a 90-nm CMOS field programmable door cluster (FPGA) and 90-nm CMOS standard-cell application particular coordinated circuits (ASICs) as far as logic thickness, circuit speed, and power utilization for center logic. We are persuaded to make these estimations to empower framework designers to settle on better educated decisions between these two media and to offer

understanding to FPGA producers on the inadequacies to assault and, consequently, enhance FPGAs. We portray the system by which the estimations were acquired and demonstrate that, for circuits containing just look-into table-based logic and flip-slumps, the proportion of silicon territory required to execute them in FPGAs and ASICs is by and large 35. Current FPGAs likewise contain "hard" blocks. for example, multiplier/aggregators and block recollections. We find that these blocks decrease this normal zone hole altogether to as meager as 18 for our benchmarks, and we gauge that broad utilization of these hard blocks could possibly bring down the hole to beneath five. The proportion of basic from FPGA to ASIC, way delay, is approximately three to four with less impact from block memory and hard multipliers. The dynamic power utilization proportion is around 14 times and, with hard blocks, this hole for the most part winds up noticeably littler.

In this paper the new building proposition are routinely produced in both

the scholarly world and industry. For FPGA's to keep on growing, it is vital that these new engineering thoughts are decently and precisely assessed, with the goal that those commendable thoughts can be

incorporated into future chips. Normally, this assessment is finished utilizing experimentation. Be that as it may, the utilization of



e-ISSN: 2348-6848 p-ISSN: 2348-795X Volume 04 Issue-17 December 2017

experimentation is perilous, since it requires making suppositions with respect to the devices and design of the gadget being referred to. On the off chance that these suspicions are not exact, the conclusions from the trials may not be significant. In this paper, we examine the affectability of **FPGA** compositional conclusions to test varieties. To influence our examination to solid, we assess the affectability of four beforehand distributed and surely understood FPGA engineering comes about: query table size, switch block topology, group size, and memory estimate. It is demonstrated that these analyses are essentially influenced by the presumptions, instruments, and procedures utilized as a part of the examinations.

### **Proposed architectures**

#### III.

# A. MUX4: 4-to-1 Multiplexer Logic Element

The MUX4 LE appeared in Fig. 3 comprises of a 4-to-1 MUX with discretionary reversal on its sources of info that permit the acknowledgment of any {2,3}-input work, some {4,5}-input capacities, and one 6-input work a 4-to-1 MUX itself with discretionary reversal on the information inputs. A 4-to-1 MUX matches the information stick check of a 6-LUT, taking into account reasonable examinations concerning the

network and intra group directing. Any two info Boolean capacity can be effortlessly executed in the MUX4: the two capacity sources of info can be fixing to the select lines and reality table esteems (logic-0or logic-1) can be steered to the information inputs as needs be. For three-input capacities; consider that Shannon deterioration around one variable produces cofactors with at most two factors. A moment disintegration of the cofactors around one of their two residual factors produces cofactors with at most one variable. Such single-variable cofactors can be bolstered to the information inputs (the discretionary reversal might be required), with the decay factors sustaining the select sources of info. In like manner, elements of more than four sources of info can be actualized in the MUX4 as long as Shannon decay as for any two data sources produces cofactors with at most one information.



Fig.3. MUX4 LE depicting optional data input inversions

### International Journal of Research



Available at <u>https://edupediapublications.org/journals</u>

e-ISSN: 2348-6848 p-ISSN: 2348-795X Volume 04 Issue-17 December 2017

## B. Logic Elements, Fracturability, and MUX4-Based Variants

Two groups of architectures were made:

- 1) Without fracturable LEs
- 2) With fracturable Les

In this paper, the fracturable LEs allude to a compositional element on which at least one logic capacities can be alternatively mapped. Nonfracturable LEs allude to a compositional element on which just a single logic work is mapped. In the nonfracturable architectures, the UX4 element appeared in Fig. 3 is utilized together with nonfracturable 6-LUTs. This element shares an indistinguishable number of contributions from a 6-LUT loaning for reasonable examination concerning the input availability. For the fracturable design, we think about an eight-input LE, firmly coordinated with the versatile logic module in late Altera Stratix FPGA families.For the MUX4 variation, Dual MUX4, we utilize two MUX4s inside a solitary eight-input LE. In the arrangement, appeared in Fig. 4, the two MUX4s are wired to have devoted select data sources and shared information inputs. This design enables this structure to delineate autonomous (no mutual information sources) three-input capacities, while bigger capacities might be mapped reliant on the shared contributions between the two capacities. An engineering in which a 4-to-1 MUX (MUX4) is broken into two littler 2-to-1 MUXs was considered.



Fig.4. Dual MUX4 LE that utilizes dedicated select inputs and shared data Inputs

### C. Hybrid Complex Logic Block

A wide range of architectures were viewed as the first being a nonfracturable engineering. In the nonfracturable engineering, the CLB has 40 sources of info and ten essential LEs (BLEs), with each BLE having six information sources and one yield. Fig.5 demonstrates this nonfracturable CLB design with BLEs that contain a discretionary enroll. We change the proportion of MUX4s to LUTs inside the ten elements CLB from 1:9 to 5:5 MUX4s:6-LUTs. The MUX4 element is proposed to work in conjunction with 6-LUTs, making a hybrid CLB with a blend of 6-LUTs and MUX4s (or MUX4 variations).



### International Journal of Research

Available at https://edupediapublications.org/journals

e-ISSN: 2348-6848 p-ISSN: 2348-795X Volume 04 Issue-17 December 2017



Fig. 5. Hybrid CLB with a half drained intra-CLB crossbar portraying BLE internals for nonfracturable (one discretionary enlist and one Fig. 6 demonstrates the vield) design. association of our CLB and inner BLEs. For fracturable architectures, the CLB has 80 information sources and ten BLEs, with each BLE having eight data sources and two yields imitating an Altera Stratix Adaptive-LUT. A similar range of MUX4 to LUT proportions was additionally performed. Fig. 4 demonstrates the fracturable engineering with eight contributions to each BLE that contains two discretionary registers. We assess fracturability of LEs versus nonfracturable LEs with regards to MUX4 elements since fracturable LUTs are regular in business architectures. For instance, Altera Adaptive 6-LUTs in Stratix IV and Xilinx Virtex 5 6-LUTs can be broken into two littler LUTs with a few confinements on inputs.



Fig.6. Hybrid CLB with a half ousted intra-CLB crossbar portraying BLE internals for a fracturable (two discretionary registers and two yields) engineering.

### D. Area Modeling

1) MUX4 Logic Element: Introductory evaluations of the MUX4 element demonstrated that the MUX4 is  $\sim 10\%$  the region of a 6-LUT generally speaking. A 4-to-1 MUX can be acknowledged with three 2-to-1 MUXs. Subsequently, the MUX4 element contains seven 2-to-1 MUXs, four SRAM cells, and four altogether (see inverters Fig. 1). The discretionary reversal utilizes the four SRAM cells, while whatever remains of the LE setup is performed through directing. What's more, the profundity of the MUX tree is divided contrasted and the 6-LUT, which has six 2-to-1 MUXs on its longest ways. Moderately, expecting consistent pass transistor measuring and that the territory of a 2-to-1 MUX and six transistor SRAM cell are generally identical, the



MUX4 element has (1/16)th the SRAM region and(1/8)th the MUX zone of a 6-LUT.

These evaluations were changed utilizing transistor level displaying of the circuit blocks. Transistorlevel advancement of the constituent circuit blocks of a FPGA requires а comprehension of the ideal region postpone tradeoffs for every individual circuit block. This requires separating a delegate basic way, which is a way whose creation of blocks and topology will be like the basic way of a particular design. Separating the delegate basic way enables us to judge to what degree every individual block is timing basic, which in this manner builds up a region postpone tradeoff objectives for each block. This is in accordance with the transistorlevel advancement instrument grew beforehand. We utilize the aftereffects of earlier work to set up the ideal region defer tradeoff for 6-LUTs in traditional island-style FPGA engineering with regular structural parameters. The subsequent 6-LUT postpone fills in as a perspective for improvement for the circuits considered in this paper: in the intrigue of amplifying region diminishment while enabling execution to be kept up (disregarding the contrasts in cell tallies between mapping to a traditional LUT and the LEs proposed in this paper), we endeavor to coordinate the deferral of a 6-LUT while limiting the zone of each of the variations of the MUX4 circuits. Transistor level displaying and enhancements depended on a prescient 22-nm

[21], while elite process the territory demonstrate introduced in earlier work [20] was utilized to assess the zone of different circuit structures. With this system, we decided a region defer ideal 6-LUT has a zone of 930 least width transistors, and a most pessimistic scenario deferral of 261 ps. For the MUX4 cell and Dual MUX4 cell, a base region and least postpone cell was made. The base region MUX4 cell has a zone of 95 least width transistors and a postponement of 204 ps; all transistors were least width for this situation, and as the base zone solution for this circuit could meet (and enhance) the worstcase defer focus of a 6-LUT. Thus, the Dual MUX4 cell has a territory of 249 least width transistors while meeting the most pessimistic scenario postpone prerequisite. Be that as it may, we utilized the base postpone design for both the MUX4 and Dual MUX4 elements for whatever is left of the investigation as there isn't a noteworthy increment in region over the base zone design.

2) FPGA Area Model: Although deciding the territory of a MUX4 element in respect to a 6-LUT is vital, we have to likewise analyze worldwide FPGA region considering the quantity of CLB tiles, zone overheads inside the CLB and directing zone per CLB. All through this paper, worldwide FPGA region was evaluated expecting that, per tile, half of the region is entomb group and intra bunch



e-ISSN: 2348-6848 p-ISSN: 2348-795X Volume 04 Issue-17 December 2017

directing, 30% of the zone is utilized for LUTs, and 20% for registers and different incidental logic, following Anderson and Wang and a private correspondence. Note that this 50%-30%- 20% model is a gauge in light of a customary full FPGA design where-by the directing and inside CLB crossbars are advanced toward 6-LUTs. Creation of a streamlined FPGA using our new MUX4 elements would without a doubt change said show. Be that as it may, upgrading the whole directing design toward our MUX4 variations, measuring the steering engineering, and shutting the circle by making a more precise model is out of the extent of this work. Utilizing this model, we can mention some objective facts about the hybrid CLB engineering. The 30% that normallywould represent ten 6-LUT LEs inside the tile is presently part between the littler MUX4 elements and 6-LUTs.

## IV. Technology mapping using ABC

ABC was used for advancement mapping, with changes that consider MUX4-embeddable limit ID and MUX2-embeddable limit indentification because of fracturable MUX4s and custom mapping. Within data structure used inside the ABC is an AIG, where the logic circuit is addressed using 2-data AND entryways with inverters. Need Cuts mapping in ABC (summoned with the if arrange) was modified to play out our custom advancement mapping. This mapper crosses the AIG from fundamental commitments to basic yields finding moderate mappings for internal centers finally the basic yields, using a dynamic programming approach. The need removes mapper plays various passes on the AIG to find the best cut per center. For significance orchestrated mapping, the mapper at first sorts out mapping significance by then streamlines for an area discarding cuts whose decision would grow the general significance of the mapped organize. In light of this standard mapper, two mapper varieties were made and surveyed. The foremost variety, Natural Mux, evaluates and recognizes internal limits that are MUX4-embeddable, cynic of the goal designing; i.e., this stream uses the default require removes mapping and plays a post getting ready progress to perceive MUX4embeddable limits. From this mapping, we can evaluate what region hold reserves are possible with no mapper changes. The second variety MuxMap, locale weights the MUX4embeddable cuts in regard to 6-LUT cuts, in this way developing a slant for decision/making of MUX4-embeddable solutions.

## A. Modeling using VPR

VPR was utilized to perform engineering assessment. The standard ten 6-LUT CLB engineering in 40-nm included with the VPR dispersion was utilized for pattern demonstrating. The hybrid CLBs appeared in Figs. 3 and 4 were displayed utilizing the XML-



based VPR engineering dialect. The scrap from the design petition for the physical block solidified MUX4 element, this code indicates a MUX4 as a six-input one-yield black box to the VPR. What's more, since all MUX4s can likewise be mapped to the 6-LUTs, an extra mode was added to the 6-LUT physical block. The mode idea permits the VPR packer to pack LUTs into LUTs (obviously), yet in addition empowers MUX4s to be pressed into the LUTs. The architectures with CLBs having MUX4: LUT proportions from 1:9 to 5:5 were made from the standard 40-nm architectures with delays acquired through circuit reenactments of the MUX4 variations. Significantly, we made minor alterations to the VPR pressing calculation itself, so that the MUX4 net rundown elements are wanted to be stuffed into the MUX4 Les in the engineering (while at the same time constraining pressing MUX4 net rundown elements into LUTs). The adjustments included changing the fascination work amid the CLB pressing. One change was to guarantee that the logic capacities that were MUX4 embeddable were specially pressed into a physical MUX4 element and not into a LUT. Another was to apply a negative weight on MUX4-embeddable capacities when the current CLB's physical MUX4 elements are altogether involved likewise keeping MUX4-embeddable capacities from being set into the LUTs. Without this, the MUX4 net rundown elements

may unnecessarily devour LUTs, which ought to be held, where conceivable, for those net rundown elements that request their adaptability. This turns out to be doubly vital for fracturable architectures, since their pressing issue is more complex. Without this alteration, a critical CLB utilization increment was seen over all benchmark sets.

### VI. Conclusion

In this paper we proposed another hybrid CLB engineering containing MUX4 hard MUX elements and demonstrated systems for effectively mapping to these architectures. We moreover given investigation of the benchmark suites mapping, examining post the of capacities appropriation inside every benchmark suite. The zone diminishments for nonfracturable architectures. is 8% and MUX4:LUT proportion is 4:6 and on account of fracturable design the zone decreases are 2%.The CHStone benchmarks being highlevel blended with LegUp-HLS additionally indicated barely better execution and this could be because of the way LegUp performs HLS on the CHStone benchmarks themselves. Generally, the expansion of MUX4s to FPGA architectures insignificantly affect FMax and show potential for enhancing logic-thickness in nonfracturable architectures and unassuming potential for enhancing logic thickness in fracturable design.



### References

[1] J. Climbed et al., "The VTR venture: Architecture and CAD for FPGAs from verilog to directing," inProc. ACM/SIGDA FPGA, 2012, pp. 77–86.

[2] Y. Hara, H. Tomiyama, S. Honda, and H. Takada, "Proposition and quantitative examination of the CHStone benchmark program suite for down to earth C-based abnormal state combination," J. Inf. Process., vol. 17,pp. 242–254, Oct. 2009.

[3] A. Canis et al., "LegUp: High-level combination for FPGA-based processor/quickening agent frameworks," in Proc. ACM/SIGDA FPGA, 2011, pp. 33– 36.

[4] E. Ahmed and J. Rose, "The impact of LUT and group measure on deepsubmicron FPGA execution and thickness," IEEE Trans. Large Scale Integr. (VLSI), vol. 12, no. 3, pp. 288– 298, Mar. 2004.

[5] J. Rose, R. Francis, D. Lewis, and P. Chow, "Engineering of fieldprogrammable door exhibits: The impact of logic block usefulness on territory effectiveness," IEEE J. Strong State Circuits, vol. 25, no. 5,pp. 1217–1225, Oct. 1990.

[6] H. Parandeh-Afshar, H. Benbihi, D. Novo, and P. Ienne, "Reevaluating FPGAs: Elude the adaptability abundance of LUTs with andinverter cones," in Proc. ACM/SIGDA FPGA, 2012, pp. 119–128. [7] J. Anderson and Q. Wang, "Enhancing logic thickness through synthesisinspired engineering," inProc. IEEE FPL, Aug./Sep. 2009, pp. 105–111.

[8] J. Anderson and Q. Wang, "Zone proficient FPGA logic elements: Architecture and union," inProc. ASP DAC, 2011, pp. 369–375.

[9] J. Cong, H. Huang, and X. Yuan,
"Innovation mapping and engineering evalution for k/m-macrocell-based FPGAs,"ACM Trans.
Design Autom. Electron. Syst., vol. 10, no. 1, pp. 3–23, Jan. 2005