Welcome to **E-XFL.COM** # Understanding <u>Embedded - FPGAs (Field Programmable Gate Array)</u> Embedded - FPGAs, or Field Programmable Gate Arrays, are advanced integrated circuits that offer unparalleled flexibility and performance for digital systems. Unlike traditional fixed-function logic devices, FPGAs can be programmed and reprogrammed to execute a wide array of logical operations, enabling customized functionality tailored to specific applications. This reprogrammability allows developers to iterate designs quickly and implement complex functions without the need for custom hardware. #### **Applications of Embedded - FPGAs** The versatility of Embedded - FPGAs makes them indispensable in numerous fields. In telecommunications. | Details | | |--------------------------------|--------------------------------------------------------------| | Product Status | Obsolete | | Number of LABs/CLBs | 1936 | | Number of Logic Elements/Cells | 4598 | | Total RAM Bits | 61952 | | Number of I/O | 352 | | Number of Gates | 52000 | | Voltage - Supply | 3V ~ 3.6V | | Mounting Type | Surface Mount | | Operating Temperature | 0°C ~ 85°C (TJ) | | Package / Case | 432-LBGA Exposed Pad, Metal | | Supplier Device Package | 432-MBGA (40x40) | | Purchase URL | https://www.e-xfl.com/product-detail/xilinx/xc4052xl-1bg432c | Email: info@E-XFL.COM Address: Room A, 16/F, Full Win Commercial Centre, 573 Nathan Road, Mongkok, Hong Kong #### Input Thresholds The input thresholds of 5V devices can be globally configured for either TTL (1.2 V threshold) or CMOS (2.5 V threshold), just like XC2000 and XC3000 inputs. The two global adjustments of input threshold and output level are independent of each other. The XC4000XL family has an input threshold of 1.6V, compatible with both 3.3V CMOS and TTL levels. #### Global Signal Access to Logic There is additional access from global clocks to the F and G function generator inputs. #### Configuration Pin Pull-Up Resistors During configuration, these pins have weak pull-up resistors. For the most popular configuration mode, Slave Serial, the mode pins can thus be left unconnected. The three mode inputs can be individually configured with or without weak pull-up or pull-down resistors. A pull-down resistor value of $4.7~\mathrm{k}\Omega$ is recommended. The three mode inputs can be individually configured with or without weak pull-up or pull-down resistors after configuration. The PROGRAM input pin has a permanent weak pull-up. #### Soft Start-up Like the XC3000A, XC4000 Series devices have "Soft Start-up." When the configuration process is finished and the device starts up, the first activation of the outputs is automatically slew-rate limited. This feature avoids potential ground bounce when all outputs are turned on simultaneously. Immediately after start-up, the slew rate of the individual outputs is, as in the XC4000 family, determined by the individual configuration option. #### XC4000 and XC4000A Compatibility Existing XC4000 bitstreams can be used to configure an XC4000E device. XC4000A bitstreams must be recompiled for use with the XC4000E due to improved routing resources, although the devices are pin-for-pin compatible. ### Additional Improvements in XC4000X Only #### Increased Routing New interconnect in the XC4000X includes twenty-two additional vertical lines in each column of CLBs and twelve new horizontal lines in each row of CLBs. The twelve "Quad Lines" in each CLB row and column include optional repowering buffers for maximum speed. Additional high-performance routing near the IOBs enhances pin flexibility. #### Faster Input and Output A fast, dedicated early clock sourced by global clock buffers is available for the IOBs. To ensure synchronization with the regular global clocks, a Fast Capture latch driven by the early clock is available. The input data can be initially loaded into the Fast Capture latch with the early clock, then transferred to the input flip-flop or latch with the low-skew global clock. A programmable delay on the input can be used to avoid hold-time requirements. See "IOB Input Signals" on page 20 for more information. #### Latch Capability in CLBs Storage elements in the XC4000X CLB can be configured as either flip-flops or latches. This capability makes the FPGA highly synthesis-compatible. #### IOB Output MUX From Output Clock A multiplexer in the IOB allows the output clock to select either the output data or the IOB clock enable as the output to the pad. Thus, two different data signals can share a single output pad, effectively doubling the number of device outputs without requiring a larger, more expensive package. This multiplexer can also be configured as an AND-gate to implement a very fast pin-to-pin path. See "IOB Output Signals" on page 23 for more information. #### Additional Address Bits Larger devices require more bits of configuration data. A daisy chain of several large XC4000X devices may require a PROM that cannot be addressed by the eighteen address bits supported in the XC4000E. The XC4000X Series therefore extends the addressing in Master Parallel configuration mode to 22 bits. # **Detailed Functional Description** XC4000 Series devices achieve high speed through advanced semiconductor technology and improved architecture. The XC4000E and XC4000X support system clock rates of up to 80 MHz and internal performance in excess of 150 MHz. Compared to older Xilinx FPGA families, XC4000 Series devices are more powerful. They offer on-chip edge-triggered and dual-port RAM, clock enables on I/O flip-flops, and wide-input decoders. They are more versatile in many applications, especially those involving RAM. Design cycles are faster due to a combination of increased routing resources and more sophisticated software. #### **Basic Building Blocks** Xilinx user-programmable gate arrays include two major configurable elements: configurable logic blocks (CLBs) and input/output blocks (IOBs). - CLBs provide the functional elements for constructing the user's logic. - IOBs provide the interface between the package pins and internal signal lines. Three other types of circuits are also available: - 3-State buffers (TBUFs) driving horizontal longlines are associated with each CLB. - Wide edge decoders are available around the periphery of each device. - An on-chip oscillator is provided. Programmable interconnect resources provide routing paths to connect the inputs and outputs of these configurable elements to the appropriate networks. The functionality of each circuit block is customized during configuration by programming internal static memory cells. The values stored in these memory cells determine the logic functions and interconnections implemented in the FPGA. Each of these available circuits is described in this section. ### **Configurable Logic Blocks (CLBs)** Configurable Logic Blocks implement most of the logic in an FPGA. The principal CLB elements are shown in Figure 1. Two 4-input function generators (F and G) offer unrestricted versatility. Most combinatorial logic functions need four or fewer inputs. However, a third function generator (H) is provided. The H function generator has three inputs. Either zero, one, or two of these inputs can be the outputs of F and G; the other input(s) are from outside the CLB. The CLB can, therefore, implement certain functions of up to nine variables, like parity check or expandable-identity comparison of two sets of four inputs. Each CLB contains two storage elements that can be used to store the function generator outputs. However, the storage elements and function generators can also be used independently. These storage elements can be configured as flip-flops in both XC4000E and XC4000X devices; in the XC4000X they can optionally be configured as latches. DIN can be used as a direct input to either of the two storage elements. H1 can drive the other through the H function generator. Function generator outputs can also drive two outputs independent of the storage element outputs. This versatility increases logic capacity and simplifies routing. Thirteen CLB inputs and four CLB outputs provide access to the function generators and storage elements. These inputs and outputs connect to the programmable interconnect resources outside the block. #### **Function Generators** Four independent inputs are provided to each of two function generators (F1 - F4 and G1 - G4). These function generators, with outputs labeled F' and G', are each capable of implementing any arbitrarily defined Boolean function of four inputs. The function generators are implemented as memory look-up tables. The propagation delay is therefore independent of the function implemented. A third function generator, labeled H', can implement any Boolean function of its three inputs. Two of these inputs can optionally be the F' and G' functional generator outputs. Alternatively, one or both of these inputs can come from outside the CLB (H2, H0). The third input must come from outside the block (H1). Signals from the function generators can exit the CLB on two outputs. F' or H' can be connected to the X output. G' or H' can be connected to the Y output. A CLB can be used to implement any of the following functions: - any function of up to four variables, plus any second function of up to four unrelated variables, plus any third function of up to three unrelated variables<sup>1</sup> - any single function of five variables - any function of four variables together with some functions of six variables - · some functions of up to nine variables. Implementing wide functions in a single block reduces both the number of blocks required and the delay in the signal path, achieving both increased capacity and speed. The versatility of the CLB function generators significantly improves system speed. In addition, the design-software tools can deal with each function generator independently. This flexibility improves cell usage. <sup>1.</sup> When three separate functions are generated, one of the function outputs must be captured in a flip-flop internal to the CLB. Only two unregistered function generator outputs are available from the CLB. Supported CLB memory configurations and timing modes for single- and dual-port modes are shown in Table 3. XC4000 Series devices are the first programmable logic devices with edge-triggered (synchronous) and dual-port RAM accessible to the user. Edge-triggered RAM simplifies system timing. Dual-port RAM doubles the effective throughput of FIFO applications. These features can be individually programmed in any XC4000 Series CLB. #### Advantages of On-Chip and Edge-Triggered RAM The on-chip RAM is extremely fast. The read access time is the same as the logic delay. The write access time is slightly slower. Both access times are much faster than any off-chip solution, because they avoid I/O delays. Edge-triggered RAM, also called synchronous RAM, is a feature never before available in a Field Programmable Gate Array. The simplicity of designing with edge-triggered RAM, and the markedly higher achievable performance, add up to a significant improvement over existing devices with on-chip RAM. Three application notes are available from Xilinx that discuss edge-triggered RAM: "XC4000E Edge-Triggered and Dual-Port RAM Capability," "Implementing FIFOs in XC4000E RAM," and "Synchronous and Asynchronous FIFO Designs." All three application notes apply to both XC4000E and XC4000X RAM. **Table 3: Supported RAM Modes** | | 16 | 16 | 32 | Edge- | Level- | |-------------|----|----|----|-----------|-----------| | | х | х | x | Triggered | Sensitive | | | 1 | 2 | 1 | Timing | Timing | | Single-Port | V | 1 | 1 | 1 | <b>V</b> | | Dual-Port | 1 | | | V | | #### **RAM Configuration Options** The function generators in any CLB can be configured as RAM arrays in the following sizes: - Two 16x1 RAMs: two data inputs and two data outputs with identical or, if preferred, different addressing for each RAM - One 32x1 RAM: one data input and one data output. One F or G function generator can be configured as a 16x1 RAM while the other function generators are used to implement any function of up to 5 inputs. Additionally, the XC4000 Series RAM may have either of two timing modes: - Edge-Triggered (Synchronous): data written by the designated edge of the CLB clock. WE acts as a true clock enable. - Level-Sensitive (Asynchronous): an external WE signal acts as the write strobe. The selected timing mode applies to both function generators within a CLB when both are configured as RAM. The number of read ports is also programmable: - Single Port: each function generator has a common read and write port - Dual Port: both function generators are configured together as a single 16x1 dual-port RAM with one write port and two read ports. Simultaneous read and write operations to the same or different addresses are supported. RAM configuration options are selected by placing the appropriate library symbol. #### **Choosing a RAM Configuration Mode** The appropriate choice of RAM mode for a given design should be based on timing and resource requirements, desired functionality, and the simplicity of the design process. Recommended usage is shown in Table 4. The difference between level-sensitive, edge-triggered, and dual-port RAM is only in the write operation. Read operation and timing is identical for all modes of operation. **Table 4: RAM Mode Selection** | | Level-Sens itive | Edge-Trigg<br>ered | Dual-Port<br>Edge-Trigg<br>ered | |----------------------------|------------------|--------------------|---------------------------------| | Use for New Designs? | No | Yes | Yes | | Size (16x1,<br>Registered) | 1/2 CLB | 1/2 CLB | 1 CLB | | Simultaneous<br>Read/Write | No | No | Yes | | Relative<br>Performance | Х | 2X | 2X (4X<br>effective) | #### **RAM Inputs and Outputs** The F1-F4 and G1-G4 inputs to the function generators act as address lines, selecting a particular memory cell in each look-up table. The functionality of the CLB control signals changes when the function generators are configured as RAM. The DIN/H2, H1, and SR/H0 lines become the two data inputs (D0, D1) and the Write Enable (WE) input for the 16x2 memory. When the 32x1 configuration is selected, D1 acts as the fifth address bit and D0 is the data input. The contents of the memory cell(s) being addressed are available at the F' and G' function-generator outputs. They can exit the CLB through its X and Y outputs, or can be captured in the CLB flip-flop(s). Configuring the CLB function generators as Read/Write memory does not affect the functionality of the other por- Figure 4: 16x2 (or 16x1) Edge-Triggered Single-Port RAM Figure 5: 32x1 Edge-Triggered Single-Port RAM (F and G addresses are identical) 6-14 May 14, 1999 (Version 1.6) Figure 14: Detail of XC4000E Dedicated Carry Logic ### Input/Output Blocks (IOBs) User-configurable input/output blocks (IOBs) provide the interface between external package pins and the internal logic. Each IOB controls one package pin and can be configured for input, output, or bidirectional signals. Figure 15 shows a simplified block diagram of the XC4000E IOB. A more complete diagram which includes the boundary scan logic of the XC4000E IOB can be found in Figure 40 on page 43, in the "Boundary Scan" section. The XC4000X IOB contains some special features not included in the XC4000E IOB. These features are highlighted in a simplified block diagram found in Figure 16, and discussed throughout this section. When XC4000X special features are discussed, they are clearly identified in the text. Any feature not so identified is present in both XC4000E and XC4000X devices. #### IOB Input Signals Two paths, labeled I1 and I2 in Figure 15 and Figure 16, bring input signals into the array. Inputs also connect to an input register that can be programmed as either an edge-triggered flip-flop or a level-sensitive latch. The choice is made by placing the appropriate library symbol. For example, IFD is the basic input flip-flop (rising edge triggered), and ILD is the basic input latch (transparent-High). Variations with inverted clocks are available, and some combinations of latches and flip-flops can be implemented in a single IOB, as described in the *XACT Libraries Guide*. The XC4000E inputs can be globally configured for either TTL (1.2V) or 5.0 volt CMOS thresholds, using an option in the bitstream generation software. There is a slight input hysteresis of about 300mV. The XC4000E output levels are also configurable; the two global adjustments of input threshold and output level are independent. Inputs on the XC4000XL are TTL compatible and 3.3V CMOS compatible. Outputs on the XC4000XL are pulled to the 3.3V positive supply. The inputs of XC4000 Series 5-Volt devices can be driven by the outputs of any 3.3-Volt device, if the 5-Volt inputs are in TTL mode. Supported sources for XC4000 Series device inputs are shown in Table 8. or clear on reset and after configuration. Other than the global GSR net, no user-controlled set/reset signal is available to the I/O flip-flops. The choice of set or clear applies to both the initial state of the flip-flop and the response to the Global Set/Reset pulse. See "Global Set/Reset" on page 11 for a description of how to use GSR. #### **JTAG Support** Embedded logic attached to the IOBs contains test structures compatible with IEEE Standard 1149.1 for boundary scan testing, permitting easy chip and board-level testing. More information is provided in "Boundary Scan" on page 42. #### **Three-State Buffers** A pair of 3-state buffers is associated with each CLB in the array. (See Figure 27 on page 30.) These 3-state buffers can be used to drive signals onto the nearest horizontal longlines above and below the CLB. They can therefore be used to implement multiplexed or bidirectional buses on the horizontal longlines, saving logic resources. Programmable pull-up resistors attached to these longlines help to implement a wide wired-AND function. The buffer enable is an active-High 3-state (i.e. an active-Low enable), as shown in Table 13. Another 3-state buffer with similar access is located near each I/O block along the right and left edges of the array. (See Figure 33 on page 34.) The horizontal longlines driven by the 3-state buffers have a weak keeper at each end. This circuit prevents undefined floating levels. However, it is overridden by any driver, even a pull-up resistor. Special longlines running along the perimeter of the array can be used to wire-AND signals coming from nearby IOBs or from internal longlines. These longlines form the wide edge decoders discussed in "Wide Edge Decoders" on page 27. #### Three-State Buffer Modes The 3-state buffers can be configured in three modes: - · Standard 3-state buffer - Wired-AND with input on the I pin - Wired OR-AND #### Standard 3-State Buffer All three pins are used. Place the library element BUFT. Connect the input to the I pin and the output to the O pin. The T pin is an active-High 3-state (i.e. an active-Low enable). Tie the T pin to Ground to implement a standard buffer. #### Wired-AND with Input on the I Pin The buffer can be used as a Wired-AND. Use the WAND1 library symbol, which is essentially an open-drain buffer. WAND4, WAND8, and WAND16 are also available. See the *XACT Libraries Guide* for further information. The T pin is internally tied to the I pin. Connect the input to the I pin and the output to the O pin. Connect the outputs of all the WAND1s together and attach a PULLUP symbol. #### **Wired OR-AND** The buffer can be configured as a Wired OR-AND. A High level on either input turns off the output. Use the WOR2AND library symbol, which is essentially an open-drain 2-input OR gate. The two input pins are functionally equivalent. Attach the two inputs to the I0 and I1 pins and tie the output to the O pin. Tie the outputs of all the WOR2ANDs together and attach a PULLUP symbol. #### Three-State Buffer Examples Figure 21 shows how to use the 3-state buffers to implement a wired-AND function. When all the buffer inputs are High, the pull-up resistor(s) provide the High output. Figure 22 shows how to use the 3-state buffers to implement a multiplexer. The selection is accomplished by the buffer 3-state signal. Pay particular attention to the polarity of the T pin when using these buffers in a design. Active-High 3-state (T) is identical to an active-Low output enable, as shown in Table 13. **Table 13: Three-State Buffer Functionality** | IN | Т | OUT | |----|---|-----| | X | 1 | Z | | IN | 0 | IN | Figure 21: Open-Drain Buffers Implement a Wired-AND Function Figure 22: 3-State Buffers Implement a Multiplexer ### Wide Edge Decoders Dedicated decoder circuitry boosts the performance of wide decoding functions. When the address or data field is wider than the function generator inputs, FPGAs need multi-level decoding and are thus slower than PALs. XC4000 Series CLBs have nine inputs. Any decoder of up to nine inputs is, therefore, compact and fast. However, there is also a need for much wider decoders, especially for address decoding in large microprocessor systems. An XC4000 Series FPGA has four programmable decoders located on each edge of the device. The inputs to each decoder are any of the IOB I1 signals on that edge plus one local interconnect per CLB row or column. Each row or column of CLBs provides up to three variables or their compliments., as shown in Figure 23. Each decoder generates a High output (resistor pull-up) when the AND condition of the selected inputs, or their complements, is true. This is analogous to a product term in typical PAL devices. Each of these wired-AND gates is capable of accepting up to 42 inputs on the XC4005E and 72 on the XC4013E. There are up to 96 inputs for each decoder on the XC4028X and 132 on the XC4052X. The decoders may also be split in two when a larger number of narrower decoders are required, for a maximum of 32 decoders per device. The decoder outputs can drive CLB inputs, so they can be combined with other logic to form a PAL-like AND/OR structure. The decoder outputs can also be routed directly to the chip outputs. For fastest speed, the output should be on the same chip edge as the decoder. Very large PALs can be emulated by ORing the decoder outputs in a CLB. This decoding feature covers what has long been considered a weakness of older FPGAs. Users often resorted to external PALs for simple but fast decoding functions. Now, the dedicated decoders in the XC4000 Series device can implement these functions fast and efficiently. To use the wide edge decoders, place one or more of the WAND library symbols (WAND1, WAND4, WAND8, WAND16). Attach a DECODE attribute or property to each WAND symbol. Tie the outputs together and attach a PUL- LUP symbol. Location attributes or properties such as L (left edge) or TR (right half of top edge) should also be used to ensure the correct placement of the decoder inputs. Figure 23: XC4000 Series Edge Decoding Example Figure 24: XC4000 Series Oscillator Symbol #### **On-Chip Oscillator** XC4000 Series devices include an internal oscillator. This oscillator is used to clock the power-on time-out, for configuration memory clearing, and as the source of CCLK in Master configuration modes. The oscillator runs at a nominal 8 MHz frequency that varies with process, Vcc, and temperature. The output frequency falls between 4 and 10 MHz. The oscillator output is optionally available after configuration. Any two of four resynchronized taps of a built-in divider are also available. These taps are at the fourth, ninth, fourteenth and nineteenth bits of the divider. Therefore, if the primary oscillator output is running at the nominal 8 MHz, the user has access to an 8 MHz clock, plus any two of 500 kHz, 16kHz, 490Hz and 15Hz (up to 10% lower for low-voltage devices). These frequencies can vary by as much as -50% or +25%. These signals can be accessed by placing the OSC4 library element in a schematic or in HDL code (see Figure 24). The oscillator is automatically disabled after configuration if the OSC4 symbol is not used in the design. ## **Programmable Interconnect** All internal connections are composed of metal segments with programmable switching points and switching matrices to implement the desired routing. A structured, hierarchical matrix of routing resources is provided to achieve efficient automated routing. The XC4000E and XC4000X share a basic interconnect structure. XC4000X devices, however, have additional routing not available in the XC4000E. The extra routing resources allow high utilization in high-capacity devices. All XC4000X-specific routing resources are clearly identified throughout this section. Any resources not identified as XC4000X-specific are present in all XC4000 Series devices. This section describes the varied routing resources available in XC4000 Series devices. The implementation software automatically assigns the appropriate resources based on the density and timing requirements of the design. #### **Interconnect Overview** There are several types of interconnect. - CLB routing is associated with each row and column of the CLB array. - IOB routing forms a ring (called a VersaRing) around the outside of the CLB array. It connects the I/O with the internal logic blocks. Global routing consists of dedicated networks primarily designed to distribute clocks throughout the device with minimum delay and skew. Global routing can also be used for other high-fanout signals. Five interconnect types are distinguished by the relative length of their segments: single-length lines, double-length lines, quad and octal lines (XC4000X only), and longlines. In the XC4000X, direct connects allow fast data flow between adjacent CLBs, and between IOBs and CLBs. Extra routing is included in the IOB pad ring. The XC4000X also includes a ring of octal interconnect lines near the IOBs to improve pin-swapping and routing to locked pins. XC4000E/X devices include two types of global buffers. These global buffers have different properties, and are intended for different purposes. They are discussed in detail later in this section. #### **CLB Routing Connections** A high-level diagram of the routing resources associated with one CLB is shown in Figure 25. The shaded arrows represent routing present only in XC4000X devices. Table 14 shows how much routing of each type is available in XC4000E and XC4000X CLB arrays. Clearly, very large designs, or designs with a great deal of interconnect, will route more easily in the XC4000X. Smaller XC4000E designs, typically requiring significantly less interconnect, do not require the additional routing. Figure 27 on page 30 is a detailed diagram of both the XC4000E and the XC4000X CLB, with associated routing. The shaded square is the programmable switch matrix, present in both the XC4000E and the XC4000X. The L-shaped shaded area is present only in XC4000X devices. As shown in the figure, the XC4000X block is essentially an XC4000E block with additional routing. CLB inputs and outputs are distributed on all four sides, providing maximum routing flexibility. In general, the entire architecture is symmetrical and regular. It is well suited to established placement and routing algorithms. Inputs, outputs, and function generators can freely swap positions within a CLB to avoid routing congestion during the placement and routing operation. Figure 34: XC4000E Global Net Distribution Figure 35: XC4000X Global Net Distribution 6-36 May 14, 1999 (Version 1.6) Figure 36: Any BUFGLS (GCK1 - GCK8) Can Drive Any or All Clock Inputs on the Device #### **Global Early Buffers** Each corner of the XC4000X device has two Global Early buffers. The primary purpose of the Global Early buffers is to provide an earlier clock access than the potentially heavily-loaded Global Low-Skew buffers. A clock source applied to both buffers will result in the Global Early clock edge occurring several nanoseconds earlier than the Global Low-Skew buffer clock edge, due to the lighter loading. Global Early buffers also facilitate the fast capture of device inputs, using the Fast Capture latches described in "IOB Input Signals" on page 20. For Fast Capture, take a single clock signal, and route it through both a Global Early buffer and a Global Low-Skew buffer. (The two buffers share an input pad.) Use the Global Early buffer to clock the Fast Capture latch, and the Global Low-Skew buffer to clock the normal input flip-flop or latch, as shown in Figure 17 on page 23. The Global Early buffers can also be used to provide a fast Clock-to-Out on device output pins. However, an early clock in the output flip-flop IOB must be taken into consideration when calculating the internal clock speed for the design. The Global Early buffers at the left and right edges of the chip have slightly different capabilities than the ones at the top and bottom. Refer to Figure 37, Figure 38, and Figure 35 on page 36 while reading the following explanation. Each Global Early buffer can access the eight vertical Global lines for all CLBs in the quadrant. Therefore, only one-fourth of the CLB clock pins can be accessed. This restriction is in large part responsible for the faster speed of the buffers, relative to the Global Low-Skew buffers. Figure 37: Left and Right BUFGEs Can Drive Any or All Clock Inputs in Same Quadrant or Edge (GCK1 is shown. GCK2, GCK5 and GCK6 are similar.) The left-side Global Early buffers can each drive two of the four vertical lines accessing the IOBs on the entire left edge of the device. The right-side Global Early buffers can each drive two of the eight vertical lines accessing the IOBs on the entire right edge of the device. (See Figure 37.) Each left and right Global Early buffer can also drive half of the IOBs along either the top or bottom edge of the device, using a dedicated line that can only be accessed through the Global Early buffers. The top and bottom Global Early buffers can drive half of the IOBs along either the left or right edge of the device, as shown in Figure 38. They can only access the top and bottom IOBs via the CLB global lines. Figure 38: Top and Bottom BUFGEs Can Drive Any or All Clock Inputs in Same Quadrant (GCK8 is shown. GCK3, GCK4 and GCK7 are similar.) Figure 41: XC4000 Series Boundary Scan Logic #### **Instruction Set** The XC4000 Series boundary scan instruction set also includes instructions to configure the device and read back the configuration data. The instruction set is coded as shown in Table 17. #### **Bit Sequence** The bit sequence within each IOB is: In, Out, 3-State. The input-only M0 and M2 mode pins contribute only the In bit to the boundary scan I/O data register, while the output-only M1 pin contributes all three bits. The first two bits in the I/O data register are TDO.T and TDO.O, which can be used for the capture of internal signals. The final bit is BSCANT.UPD, which can be used to drive an internal net. These locations are primarily used by Xilinx for internal testing. From a cavity-up view of the chip (as shown in XDE or Epic), starting in the upper right chip corner, the boundary scan data-register bits are ordered as shown in Figure 42. The device-specific pinout tables for the XC4000 Series include the boundary scan locations for each IOB pin. BSDL (Boundary Scan Description Language) files for XC4000 Series devices are available on the Xilinx FTP site. #### **Including Boundary Scan in a Schematic** If boundary scan is only to be used during configuration, no special schematic elements need be included in the schematic or HDL code. In this case, the special boundary scan pins TDI, TMS, TCK and TDO can be used for user functions after configuration. To indicate that boundary scan remain enabled after configuration, place the BSCAN library symbol and connect the TDI, TMS, TCK and TDO pad symbols to the appropriate pins, as shown in Figure 43. Even if the boundary scan symbol is used in a schematic, the input pins TMS, TCK, and TDI can still be used as inputs to be routed to internal logic. Care must be taken not to force the chip into an undesired boundary scan state by inadvertently applying boundary scan input patterns to these pins. The simplest way to prevent this is to keep TMS High, and then apply whatever signal is desired to TDI and TCK. The default option, and the most practical one, is for DONE to go High first, disconnecting the configuration data source and avoiding any contention when the I/Os become active one clock later. Reset/Set is then released another clock period later to make sure that user-operation starts from stable internal conditions. This is the most common sequence, shown with heavy lines in Figure 47, but the designer can modify it to meet particular requirements. Normally, the start-up sequence is controlled by the internal device oscillator output (CCLK), which is asynchronous to the system clock. XC4000 Series offers another start-up clocking option, UCLK\_NOSYNC. The three events described above need not be triggered by CCLK. They can, as a configuration option, be triggered by a user clock. This means that the device can wake up in synchronism with the user system. When the UCLK\_SYNC option is enabled, the user can externally hold the open-drain DONE output Low, and thus stall all further progress in the start-up sequence until DONE is released and has gone High. This option can be used to force synchronization of several FPGAs to a common user clock, or to guarantee that all devices are successfully configured before any I/Os go active. If either of these two options is selected, and no user clock is specified in the design or attached to the device, the chip could reach a point where the configuration of the device is complete and the Done pin is asserted, but the outputs do not become active. The solution is either to recreate the bitstream specifying the start-up clock as CCLK, or to supply the appropriate user clock. #### Start-up Sequence The Start-up sequence begins when the configuration memory is full, and the total number of configuration clocks received since $\overline{\text{INIT}}$ went High equals the loaded value of the length count. The next rising clock edge sets a flip-flop Q0, shown in Figure 48. Q0 is the leading bit of a 5-bit shift register. The outputs of this register can be programmed to control three events. - The release of the open-drain DONE output - The change of configuration-related pins to the user function, activating all IOBs. - The termination of the global Set/Reset initialization of all CLB and IOB storage elements. The DONE pin can also be wire-ANDed with DONE pins of other FPGAs or with other external signals, and can then be used as input to bit Q3 of the start-up register. This is called "Start-up Timing Synchronous to Done In" and is selected by either CCLK SYNC or UCLK SYNC. When DONE is not used as an input, the operation is called "Start-up Timing Not Synchronous to DONE In," and is selected by either CCLK\_NOSYNC or UCLK\_NOSYNC. As a configuration option, the start-up control register beyond Q0 can be clocked either by subsequent CCLK pulses or from an on-chip user net called STARTUP.CLK. These signals can be accessed by placing the STARTUP library symbol. #### **Start-up from CCLK** If CCLK is used to drive the start-up, Q0 through Q3 provide the timing. Heavy lines in Figure 47 show the default timing, which is compatible with XC2000 and XC3000 devices using early DONE and late Reset. The thin lines indicate all other possible timing options. ### XC4000E/EX/XL Program Readback Switching Characteristic Guidelines Testing of the switching parameters is modeled after testing methods specified by MIL-M-38510/605. All devices are 100% functionally tested. Internal timing parameters are not measured directly. They are derived from benchmark timing patterns that are taken at device introduction, prior to any process improvements. The following guidelines reflect worst-case values over the recommended operating conditions. #### E/EX | | Description | 5 | Symbol | Min | Max | Units | |-----------|------------------------------------------------|---|-------------------|-----|-----|-------| | rdbk.TRIG | rdbk.TRIG setup to initiate and abort Readback | 1 | T <sub>RTRC</sub> | 200 | - | ns | | | rdbk.TRIG hold to initiate and abort Readback | 2 | T <sub>RCRT</sub> | 50 | - | ns | | rdclk.1 | rdbk.DATA delay | 7 | T <sub>RCRD</sub> | - | 250 | ns | | | rdbk.RIP delay | 6 | T <sub>RCRR</sub> | - | 250 | ns | | | High time | 5 | T <sub>RCH</sub> | 250 | 500 | ns | | | Low time | 4 | T <sub>RCL</sub> | 250 | 500 | ns | Note 1: Timing parameters apply to all speed grades. Note 2: If rdbk.TRIG is High prior to Finished, Finished will trigger the first Readback. #### XL | | Description | 5 | Symbol | Min | Max | Units | |-----------|------------------------------------------------|---|-------------------|-----|-----|-------| | rdbk.TRIG | rdbk.TRIG setup to initiate and abort Readback | 1 | T <sub>RTRC</sub> | 200 | - | ns | | | rdbk.TRIG hold to initiate and abort Readback | 2 | T <sub>RCRT</sub> | 50 | - | ns | | rdclk.1 | rdbk.DATA delay | 7 | T <sub>RCRD</sub> | - | 250 | ns | | | rdbk.RIP delay | 6 | T <sub>RCRR</sub> | - | 250 | ns | | | High time | 5 | T <sub>RCH</sub> | 250 | 500 | ns | | | Low time | 4 | T <sub>RCL</sub> | 250 | 500 | ns | Note 1: Timing parameters apply to all speed grades. Note 2: If rdbk.TRIG is High prior to Finished, Finished will trigger the first Readback. #### Master Serial Mode In Master Serial mode, the CCLK output of the lead FPGA drives a Xilinx Serial PROM that feeds the FPGA DIN input. Each rising edge of the CCLK output increments the Serial PROM internal address counter. The next data bit is put on the SPROM data output, connected to the FPGA DIN pin. The lead FPGA accepts this data on the subsequent rising CCLK edge. The lead FPGA then presents the preamble data—and all data that overflows the lead device—on its DOUT pin. There is an internal pipeline delay of 1.5 CCLK periods, which means that DOUT changes on the falling CCLK edge, and the next FPGA in the daisy chain accepts data on the subsequent rising CCLK edge. In the bitstream generation software, the user can specify Fast ConfigRate, which, starting several bits into the first frame, increases the CCLK frequency by a factor of eight. For actual timing values please refer to "Configuration Switching Characteristics" on page 68. Be sure that the serial PROM and slaves are fast enough to support this data rate. XC2000, XC3000/A, and XC3100A devices do not support the Fast ConfigRate option. The SPROM CE input can be driven from either LDC or DONE. Using LDC avoids potential contention on the DIN pin, if this pin is configured as user-I/O, but LDC is then restricted to be a permanently High user output after configuration. Using DONE can also avoid contention on DIN, provided the early DONE option is invoked. Figure 51 on page 60 shows a full master/slave system. The leftmost device is in Master Serial mode. Master Serial mode is selected by a <000> on the mode pins (M2, M1, M0). | | Description | , | Symbol | Min | Max | Units | |-------|-------------|---|-------------------|-----|-----|-------| | CCLK | DIN setup | 1 | T <sub>DSCK</sub> | 20 | | ns | | COLIN | DIN hold | 2 | T <sub>CKDS</sub> | 0 | | ns | Notes: 1. At power-up, Vcc must rise from 2.0 V to Vcc min in less than 25 ms, otherwise delay configuration by pulling PROGRAM Low until Vcc is valid. 2. Master Serial mode timing is based on testing in slave mode. Figure 53: Master Serial Mode Programming Switching Characteristics #### Synchronous Peripheral Mode Synchronous Peripheral mode can also be considered Slave Parallel mode. An external signal drives the CCLK input(s) of the FPGA(s). The first byte of parallel configuration data must be available at the Data inputs of the lead FPGA a short setup time before the rising CCLK edge. Subsequent data bytes are clocked in on every eighth consecutive rising CCLK edge. The same CCLK edge that accepts data, also causes the RDY/BUSY output to go High for one CCLK period. The pin name is a misnomer. In Synchronous Peripheral mode it is really an ACKNOWLEDGE signal. Synchronous operation does not require this response, but it is a meaningful signal for test purposes. Note that RDY/BUSY is pulled High with a high-impedance pullup prior to $\overline{\text{INIT}}$ going High. The lead FPGA serializes the data and presents the preamble data (and all data that overflows the lead device) on its DOUT pin. There is an internal delay of 1.5 CCLK periods, which means that DOUT changes on the falling CCLK edge, and the next FPGA in the daisy chain accepts data on the subsequent rising CCLK edge. In order to complete the serial shift operation, 10 additional CCLK rising edges are required after the last data byte has been loaded, plus one more CCLK cycle for each daisy-chained device. Synchronous Peripheral mode is selected by a <011> on the mode pins (M2, M1, M0). Figure 56: Synchronous Peripheral Mode Circuit Diagram 6-64 | | Description | Symbol | Min | Max | Units | |------|------------------------|------------------|-----|-----|-------| | | INIT (High) setup time | T <sub>IC</sub> | 5 | | μs | | | D0 - D7 setup time | T <sub>DC</sub> | 60 | | ns | | CCLK | D0 - D7 hold time | T <sub>CD</sub> | 0 | | ns | | CCLK | CCLK High time | T <sub>CCH</sub> | 50 | | ns | | | CCLK Low time | T <sub>CCL</sub> | 60 | | ns | | | CCLK Frequency | F <sub>CC</sub> | | 8 | MHz | Notes: 1. Peripheral Synchronous mode can be considered Slave Parallel mode. An external CCLK provides timing, clocking in the **first** data byte on the **second** rising edge of CCLK after INIT goes High. Subsequent data bytes are clocked in on every eighth consecutive rising edge of CCLK. - 2. The RDY/BUSY line goes High for one CCLK period after data has been clocked in, although synchronous operation does not require such a response. - 3. The pin name RDY/BUSY is a misnomer. In Synchronous Peripheral mode this is really an ACKNOWLEDGE signal. - 4. Note that data starts to shift out serially on the DOUT pin 0.5 CCLK periods after it was loaded in parallel. Therefore, additional CCLK pulses are clearly required after the last byte has been loaded. Figure 57: Synchronous Peripheral Mode Programming Switching Characteristics # **Configuration Switching Characteristics** ## Master Modes (XC4000E/EX) | Description | | Symbol | Min | Max | Units | |----------------------------|------------|-------------------|-----|------|------------| | | M0 = High | T <sub>POR</sub> | 10 | 40 | ms | | Power-On Reset | M0 = Low | T <sub>POR</sub> | 40 | 130 | ms | | Program Latency | am Latency | | 30 | 200 | μs per | | | | | | | CLB column | | CCLK (output) Delay | | T <sub>ICCK</sub> | 40 | 250 | μs | | CCLK (output) Period, slow | | T <sub>CCLK</sub> | 640 | 2000 | ns | | CCLK (output) Period, fast | | T <sub>CCLK</sub> | 80 | 250 | ns | # Master Modes (XC4000XL) | Description | | Symbol | Min | Max | Units | |----------------------------|-----------|-------------------|-----|------|------------| | | M0 = High | T <sub>POR</sub> | 10 | 40 | ms | | Power-On Reset M0 = Low | | T <sub>POR</sub> | 40 | 130 | ms | | Program Latency | atency | | 30 | 200 | μs per | | | | | | | CLB column | | CCLK (output) Delay | | T <sub>ICCK</sub> | 40 | 250 | μs | | CCLK (output) Period, slow | | T <sub>CCLK</sub> | 540 | 1600 | ns | | CCLK (output) Period, fast | | T <sub>CCLK</sub> | 67 | 200 | ns | # Slave and Peripheral Modes (All) | Description | Symbol | Min | Max | Units | |--------------------------------|-------------------|-----|-----|----------------------| | Power-On Reset | T <sub>POR</sub> | 10 | 33 | ms | | Program Latency | T <sub>Pl</sub> | 30 | 200 | μs per<br>CLB column | | CCLK (input) Delay (required) | T <sub>ICCK</sub> | 4 | | μs | | CCLK (input) Period (required) | T <sub>CCLK</sub> | 100 | | ns | Table 25: Component Availability Chart for XC4000E FPGAs | Р | PINS | 84 | 100 | 100 | 120 | 144 | 156 | 160 | 191 | 208 | 208 | 223 | 225 | 240 | 240 | 299 | 304 | |------------|------|----------------|----------------|----------------|---------------|----------------|---------------|----------------|---------------|-------------------|----------------|---------------|---------------|-------------------|----------------|---------------|------------------| | יד | YPE | Plast.<br>PLCC | Plast.<br>PQFP | Plast.<br>VQFP | Ceram.<br>PGA | Plast.<br>TQFP | Ceram.<br>PGA | Plast.<br>PQFP | Ceram.<br>PGA | High-Perf.<br>QFP | Plast.<br>PQFP | Ceram.<br>PGA | Plast.<br>BGA | High-Perf.<br>QFP | Plast.<br>PQFP | Ceram.<br>PGA | High-Perf.<br>QF | | CC | DDE | PC84 | PQ100 | VQ100 | PG120 | TQ144 | PG156 | PQ160 | PG191 | HQ208 | PQ208 | PG223 | BG225 | HQ240 | PQ240 | PG299 | HQ304 High-Perf. | | | -4 | CI | CI | CI | CI | | | | | | | | | | | | | | XC4003E | -3 | СІ | СІ | СІ | СІ | | | | | | | | | | | | | | AC4003E | -2 | СІ | СІ | СІ | CI | | | | | | | | | | | | | | | -1 | С | С | С | С | | | | | | | | | | | | | | | -4 | СІ | CI | | | СІ | СІ | CI | | | CI | | | | | | | | XC4005E | -3 | СІ | СІ | | | СІ | СІ | СІ | | | СІ | | | | | | | | AC4005E | -2 | СІ | СІ | | | CI | СІ | СІ | | | СІ | | | | | | | | | -1 | С | С | | | С | С | С | | | С | | | | | | | | | -4 | СІ | | | | CI | CI | СІ | | | CI | | | | | | | | XC4006E | -3 | СІ | | | | CI | CI | СІ | | | CI | | | | | | | | AC4000E | -2 | CI | | | | CI | CI | CI | | | CI | | | | | | | | | -1 | С | | | | С | С | С | | | С | | | | | | | | | -4 | CI | | | | | | CI | CI | | CI | | | | | | | | XC4008E | -3 | CI | | | | | | СІ | CI | | CI | | | | | | | | AC4000L | -2 | CI | | | | | | CI | CI | | CI | | | | | | | | | -1 | С | | | | | | С | С | | С | | | | | | | | | -4 | CI | | | | | | CI | СІ | CI | CI | | CI | | | | | | XC4010E | -3 | CI | | | | | | CI | CI | CI | CI | | CI | | | | | | AC4010L | -2 | CI | | | | | | CI | CI | CI | CI | | CI | | | | | | | -1 | С | | | | | | С | С | С | С | | С | | | | | | | -4 | | | | | | | CI | | CI | CI | CI | CI | CI | CI | | | | XC4013E | -3 | | | | | | | CI | | CI | CI | CI | CI | CI | CI | | | | 7.0 10 10L | -2 | | | | | | | CI | | CI | CI | CI | CI | CI | CI | | | | | -1 | | | | | | | С | | С | С | С | С | С | С | | | | | -4 | | | | | | | | | CI | | CI | | CI | | | | | XC4020E | -3 | | | | | | | | | CI | | CI | | CI | | | | | 7.0 10202 | -2 | | | | | | | | | CI | | CI | | CI | | | | | | -1 | | | | | | | | | С | | С | | С | | | | | <u>.</u> | -4 | | | | | | | | | | | CI | | CI | | CI | CI | | XC4025E | -3 | | | | | | | | | | | CI | | CI | | CI | CI | | 1/29/99 | -2 | | | | | | | | | | | С | | С | | С | С | C = Commercial $T_J = 0^{\circ}$ to +85°C I= Industrial $T_J = -40^{\circ}$ C to +100°C Table 26: Component Availability Chart for XC4000EX FPGAs | F | PINS | 208 | 240 | 299 | 304 | 352 | 411 | 432 | |----------|------|-------------------|-------------------|---------------|-------------------|---------------|---------------|---------------| | TYPE | | High-Perf.<br>QFP | High-Perf.<br>QFP | Ceram.<br>PGA | High-Perf.<br>QFP | Plast.<br>BGA | Ceram.<br>PGA | Plast.<br>BGA | | C | ODE | HQ208 | HQ240 | PG299 | HQ304 | BG352 | PG411 | BG432 | | | -4 | CI | CI | CI | CI | CI | | | | XC4028EX | -3 | CI | CI | CI | CI | CI | | | | | -2 | С | С | С | С | С | | | | | -4 | | CI | | CI | CI | CI | CI | | XC4036EX | -3 | | CI | | CI | CI | CI | CI | | | -2 | | С | | С | С | С | С | 1/29/99 C = Commercial $T_J = 0^{\circ}$ to +85°C I= Industrial $T_J = -40^{\circ}$ C to +100°C # **User I/O Per Package** Table 27, Table 28, and Table 29 show the number of user I/Os available in each package for XC4000-Series devices. Call your local sales office for the latest availability information, or see the Xilinx website at http://www.xilinx.com for the latest revision of the specifications. Table 27: User I/O Chart for XC4000XL FPGAs | | | | Maximum User Accessible I/O by Package Type | | | | | | | | | | | | | | | | | | | | | |------------|-----|------|---------------------------------------------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|------|-------|------|-------|-------|------|------|-------| | <b>.</b> . | Max | PC84 | PQ100 | VQ100 | TQ144 | HT144 | HQ160 | PQ160 | TQ176 | HT176 | HQ208 | PQ208 | HQ240 | PQ240 | BG256 | G299 | HQ304 | G352 | PG411 | BG432 | G475 | G259 | BG560 | | Device | I/O | П | ď | > | Ĕ | I | Ĭ | ď | Ĕ | I | Ĭ | ď | Ĭ | ď | ă | ď | Ĭ | ă | ď | ă | ď | ď | ă | | XC4002XL | 64 | 61 | 64 | 64 | | | | | | | | | | | | | | | | | | | | | XC4005XL | 112 | 61 | 77 | 77 | 112 | | | 112 | | | | 112 | | | | | | | | | | | | | XC4010XL | 160 | 61 | 77 | | 113 | | | 129 | 145 | | | 160 | | | 160 | | | | | | | | | | XC4013XL | 192 | | | | | 113 | | 129 | | 145 | | 160 | | 192 | 192 | | | | | | | | | | XC4020XL | 224 | | | | | 113 | | 129 | | 145 | | 160 | | 192 | 205 | | | | | | | | | | XC4028XL | 256 | | | | | | 129 | | | | 160 | | 193 | | 205 | 256 | 256 | 256 | | | | | | | XC4036XL | 288 | | | | | | 129 | | | | 160 | | 193 | | | | 256 | 288 | 288 | 288 | | | | | XC4044XL | 320 | | | | | | 129 | | | | 160 | | 193 | | | | 256 | 289 | 320 | 320 | | | | | XC4052XL | 352 | | | | | | | | | | | | 193 | | | | 256 | | 352 | 352 | | | 352 | | XC4062XL | 384 | | | | | | | | | | | | 193 | | | | 256 | | | 352 | 384 | | 384 | | XC4085XL | 448 | | | | | | | | | | | | | | | | | | | 352 | | 448 | 448 | 1/29/99 Table 28: User I/O Chart for XC4000E FPGAs | | | | Maximum User Accessible I/O by Package Type | | | | | | | | | | | | | | | |---------|------------|------|---------------------------------------------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------| | Device | Max<br>I/O | PC84 | PQ100 | /Q100 | PG120 | TQ144 | PG156 | PQ160 | PG191 | HQ208 | PQ208 | PG223 | BG225 | HQ240 | PQ240 | PG299 | HQ304 | | XC4003E | 80 | 61 | 77 | 77 | 80 | | | | _ | | | _ | | | | | _ | | XC4005E | 112 | 61 | 77 | | | 112 | 112 | 112 | | | 112 | | | | | | | | XC4006E | 128 | 61 | | | | 113 | 125 | 128 | | | 128 | | | | | | | | XC4008E | 144 | 61 | | | | | | 129 | 144 | | 144 | | | | | | | | XC4010E | 160 | 61 | | | | | | 129 | 160 | 160 | 160 | | 160 | | | | | | XC4013E | 192 | | | | | | | 129 | | 160 | 160 | 192 | 192 | 192 | 192 | | | | XC4020E | 224 | | | | | | | | | 160 | | 192 | | 193 | | | | | XC4025E | 256 | | | | | | | | | | | 192 | | 193 | | 256 | 256 | 1/29/99 Table 29: User I/O Chart for XC4000EX FPGAs | | Max | Maximum User Accessible I/O by Package Type | | | | | | | | | |----------|-----|---------------------------------------------|-------|-------|-------|-------|-------|-------|--|--| | Device | I/O | HQ208 | HQ240 | PG299 | HQ304 | BG352 | PG411 | BG432 | | | | XC4028EX | 256 | 160 | 193 | 256 | 256 | 256 | | | | | | XC4036EX | 288 | | 193 | | 256 | 288 | 288 | 288 | | | 1/29/99 # XC4000 Series Electrical Characteristics and Device-Specific Pinout Table For the latest Electrical Characteristics and package/pinout information for each XC4000 Family, see the Xilinx web site at <a href="http://www.xilinx.com/xlnx/xweb/xil\_publications\_index.jsp">http://www.xilinx.com/xlnx/xweb/xil\_publications\_index.jsp</a> # **Ordering Information** X9020 #### **Revision Control** | Version | Description | |---------------|---------------------------------------------------------------------------------------------------------| | 3/30/98 (1.5) | Updated XC4000XL timing and added XC4002XL | | 1/29/99 (1.5) | Updated pin diagrams | | 5/14/99 (1.6) | Replaced Electrical Specification and pinout pages for E, EX, and XL families with separate updates and | | | added URL link for electrical specifications/pinouts for Web users |