# E·XFL



Welcome to E-XFL.COM

#### Understanding <u>Embedded - FPGAs (Field</u> <u>Programmable Gate Array)</u>

Embedded - FPGAs, or Field Programmable Gate Arrays, are advanced integrated circuits that offer unparalleled flexibility and performance for digital systems. Unlike traditional fixed-function logic devices, FPGAs can be programmed and reprogrammed to execute a wide array of logical operations, enabling customized functionality tailored to specific applications. This reprogrammability allows developers to iterate designs quickly and implement complex functions without the need for custom hardware.

#### **Applications of Embedded - FPGAs**

The versatility of Embedded - FPGAs makes them indispensable in numerous fields. In telecommunications.

#### Details

| Details                        |                                                                |
|--------------------------------|----------------------------------------------------------------|
| Product Status                 | Active                                                         |
| Number of LABs/CLBs            | 62190                                                          |
| Number of Logic Elements/Cells | 1088325                                                        |
| Total RAM Bits                 | 58265600                                                       |
| Number of I/O                  | 624                                                            |
| Number of Gates                | -                                                              |
| Voltage - Supply               | 0.922V ~ 0.979V                                                |
| Mounting Type                  | Surface Mount                                                  |
| Operating Temperature          | -40°C ~ 100°C (TJ)                                             |
| Package / Case                 | 1517-BBGA, FCBGA                                               |
| Supplier Device Package        | 1517-FCBGA (40x40)                                             |
| Purchase URL                   | https://www.e-xfl.com/product-detail/xilinx/xcku085-1flva1517i |
|                                |                                                                |

Email: info@E-XFL.COM

Address: Room A, 16/F, Full Win Commercial Centre, 573 Nathan Road, Mongkok, Hong Kong

## **Summary of Features**

### **Processing System Overview**

UltraScale+ MPSoCs feature dual and quad core variants of the ARM Cortex-A53 (APU) with dual-core ARM Cortex-R5 (RPU) processing system (PS). Some devices also include a dedicated ARM Mali<sup>™</sup>-400 MP2 graphics processing unit (GPU). See Table 2.

|     | CG Devices               | EG Devices               | EV Devices               |  |  |  |  |  |
|-----|--------------------------|--------------------------|--------------------------|--|--|--|--|--|
| APU | Dual-core ARM Cortex-A53 | Quad-core ARM Cortex-A53 | Quad-core ARM Cortex-A53 |  |  |  |  |  |
| RPU | Dual-core ARM Cortex-R5  | Dual-core ARM Cortex-R5  | Dual-core ARM Cortex-R5  |  |  |  |  |  |
| GPU | -                        | Mali-400MP2              | Mali-400MP2              |  |  |  |  |  |
| VCU | -                        | _                        | H.264/H.265              |  |  |  |  |  |

To support the processors' functionality, a number of peripherals with dedicated functions are included in the PS. For interfacing to external memories for data or configuration storage, the PS includes a multi-protocol dynamic memory controller, a DMA controller, a NAND controller, an SD/eMMC controller and a Quad SPI controller. In addition to interfacing to external memories, the APU also includes a Level-1 (L1) and Level-2 (L2) cache hierarchy; the RPU includes an L1 cache and Tightly Coupled memory subsystem. Each has access to a 256KB on-chip memory.

For high-speed interfacing, the PS includes 4 channels of transmit (TX) and receive (RX) pairs of transceivers, called PS-GTR transceivers, supporting data rates of up to 6.0Gb/s. These transceivers can interface to the high-speed peripheral blocks to support PCIe Gen2 root complex or end point in x1, x2, or x4 configurations; Serial-ATA (SATA) at 1.5Gb/s, 3.0Gb/s, or 6.0Gb/s data rates; and up to two lanes of Display Port at 1.62Gb/s, 2.7Gb/s, or 5.4Gb/s data rates. The PS-GTR transceivers can also interface to components over USB 3.0 and Serial Gigabit Media Independent Interface (SGMII).

For general connectivity, the PS includes: a pair of USB 2.0 controllers, which can be configured as host, device, or On-The-Go (OTG); an I2C controller; a UART; and a CAN2.0B controller that conforms to ISO11898-1. There are also four triple speed Ethernet MACs and 128 bits of GPIO, of which 78 bits are available through the MIO and 96 through the EMIO.

High-bandwidth connectivity based on the ARM AMBA® AXI4 protocol connects the processing units with the peripherals and provides interface between the PS and the programmable logic (PL).

For additional information, go to: <u>DS891</u>, *Zynq UltraScale+ MPSoC Overview*.

## I/O, Transceiver, PCIe, 100G Ethernet, and 150G Interlaken

Data is transported on and off chip through a combination of the high-performance parallel SelectIO<sup>™</sup> interface and high-speed serial transceiver connectivity. I/O blocks provide support for cutting-edge memory interface and network protocols through flexible I/O standard and voltage support. The serial transceivers in the UltraScale architecture-based devices transfer data up to 32.75Gb/s, enabling 25G+ backplane designs with dramatically lower power per bit than previous generation transceivers. All transceivers, except the PS-GTR, support the required data rates for PCIe Gen3, and Gen4 (rev 0.5), and integrated blocks for PCIe enable UltraScale devices to support up to Gen4 x8 and Gen3 x16 Endpoint and Root Port designs. Integrated blocks for 150Gb/s Interlaken and 100Gb/s Ethernet (100G MAC/PCS) extend the capabilities of UltraScale devices, enabling simple, reliable support for Nx100G switch and bridge applications. Virtex UltraScale+ HBM devices include Cache Coherent Interconnect for Accelerators (CCIX) ports for coherently sharing data with different processors.

### **Clocks and Memory Interfaces**

UltraScale devices contain powerful clock management circuitry, including clock synthesis, buffering, and routing components that together provide a highly capable framework to meet design requirements. The clock network allows for extremely flexible distribution of clocks to minimize the skew, power consumption, and delay associated with clock signals. The clock management technology is tightly integrated with dedicated memory interface circuitry to enable support for high-performance external memories, including DDR4. In addition to parallel memory interfaces, UltraScale devices support serial memories, such as hybrid memory cube (HMC).

### Routing, SSI, Logic, Storage, and Signal Processing

Configurable Logic Blocks (CLBs) containing 6-input look-up tables (LUTs) and flip-flops, DSP slices with 27x18 multipliers, 36Kb block RAMs with built-in FIFO and ECC support, and 4Kx72 UltraRAM blocks (in UltraScale+ devices) are all connected with an abundance of high-performance, low-latency interconnect. In addition to logical functions, the CLB provides shift register, multiplexer, and carry logic functionality as well as the ability to configure the LUTs as distributed memory to complement the highly capable and configurable block RAMs. The DSP slice, with its 96-bit-wide XOR functionality, 27-bit pre-adder, and 30-bit A input, performs numerous independent functions including multiply accumulate, multiply add, and pattern detect. In addition to the device interconnect, in devices using SSI technology, signals can cross between super-logic regions (SLRs) using dedicated, low-latency interface tiles. These combined routing resources enable easy support for next-generation bus data widths. Virtex UltraScale+ HBM devices include up to 8GB of high bandwidth memory.

## Configuration, Encryption, and System Monitoring

The configuration and encryption block performs numerous device-level functions critical to the successful operation of the FPGA or MPSoC. This high-performance configuration block enables device configuration from external media through various protocols, including PCIe, often with no requirement to use multi-function I/O pins during configuration. The configuration block also provides 256-bit AES-GCM decryption capability at the same performance as unencrypted configuration. Additional features include SEU detection and correction, partial reconfiguration support, and battery-backed RAM or eFUSE technology for AES key storage to provide additional security. The System Monitor enables the monitoring of the physical environment via on-chip temperature and supply sensors and can also monitor up to 17 external analog inputs. With UltraScale+ MPSoCs, the device is booted via the Configuration and Security Unit (CSU), which supports secure boot via the 256-bit AES-GCM and SHA/384 blocks. The cryptographic engines in the CSU can be used in the MPSoC after boot for user encryption.

### **Migrating Devices**

UltraScale and UltraScale+ families provide footprint compatibility to enable users to migrate designs from one device or family to another. Any two packages with the same footprint identifier code are footprint compatible. For example, Kintex UltraScale devices in the A1156 packages are footprint compatible with Kintex UltraScale+ devices in the A1156 packages. Likewise, Virtex UltraScale devices in the B2104 packages are compatible with Virtex UltraScale+ devices and Kintex UltraScale devices in the B2104 packages. All valid device/package combinations are provided in the Device-Package Combinations and Maximum I/Os tables in this document. Refer to UG583, UltraScale Architecture PCB Design User Guide for more detail on migrating between UltraScale and UltraScale+ devices and packages.

## Kintex UltraScale+ FPGA Feature Summary

#### Table 5: Kintex UltraScale+ FPGA Feature Summary

|                                           | KU3P    | KU5P    | KU9P    | KU11P   | KU13P   | KU15P     |
|-------------------------------------------|---------|---------|---------|---------|---------|-----------|
| System Logic Cells                        | 355,950 | 474,600 | 599,550 | 653,100 | 746,550 | 1,143,450 |
| CLB Flip-Flops                            | 325,440 | 433,920 | 548,160 | 597,120 | 682,560 | 1,045,440 |
| CLB LUTs                                  | 162,720 | 216,960 | 274,080 | 298,560 | 341,280 | 522,720   |
| Max. Distributed RAM (Mb)                 | 4.7     | 6.1     | 8.8     | 9.1     | 11.3    | 9.8       |
| Block RAM Blocks                          | 360     | 480     | 912     | 600     | 744     | 984       |
| Block RAM (Mb)                            | 12.7    | 16.9    | 32.1    | 21.1    | 26.2    | 34.6      |
| UltraRAM Blocks                           | 48      | 64      | 0       | 80      | 112     | 128       |
| UltraRAM (Mb)                             | 13.5    | 18.0    | 0       | 22.5    | 31.5    | 36.0      |
| CMTs (1 MMCM and 2 PLLs)                  | 4       | 4       | 4       | 8       | 4       | 11        |
| Max. HP I/O <sup>(1)</sup>                | 208     | 208     | 208     | 416     | 208     | 572       |
| Max. HD I/O <sup>(2)</sup>                | 96      | 96      | 96      | 96      | 96      | 96        |
| DSP Slices                                | 1,368   | 1,824   | 2,520   | 2,928   | 3,528   | 1,968     |
| System Monitor                            | 1       | 1       | 1       | 1       | 1       | 1         |
| GTH Transceiver 16.3Gb/s                  | 0       | 0       | 28      | 32      | 28      | 44        |
| GTY Transceivers 32.75Gb/s <sup>(3)</sup> | 16      | 16      | 0       | 20      | 0       | 32        |
| Transceiver Fractional PLLs               | 8       | 8       | 14      | 26      | 14      | 38        |
| PCIe Gen3 x16 and Gen4 x8                 | 1       | 1       | 0       | 4       | 0       | 5         |
| 150G Interlaken                           | 0       | 0       | 0       | 1       | 0       | 4         |
| 100G Ethernet w/RS-FEC                    | 0       | 1       | 0       | 2       | 0       | 4         |

#### Notes:

1. HP = High-performance I/O with support for I/O voltage from 1.0V to 1.8V.

2. HD = High-density I/O with support for I/O voltage from 1.2V to 3.3V.

3. GTY transceiver line rates are package limited: SFVB784 to 12.5Gb/s; FFVA676, FFVD900, and FFVA1156 to 16.3Gb/s. See Table 6.

### Kintex UltraScale+ Device-Package Combinations and Maximum I/Os

| Table 6: Kintex UltraScale+ | Dovico Dockago | Combinations a | nd Maximum L/Oc |
|-----------------------------|----------------|----------------|-----------------|
|                             | Device-Package | compinations a | nu waximum 1705 |

| Dookogo                 | Package            | KU3P               | KU5P               | KU9P               | KU11P              | KU13P              | KU15P              |
|-------------------------|--------------------|--------------------|--------------------|--------------------|--------------------|--------------------|--------------------|
| Package<br>(1)(2)(4)    | Dimensions<br>(mm) | HD, HP<br>GTH, GTY |
| SFVB784 <sup>(3)</sup>  | 23x23              | 96, 208<br>0, 16   | 96, 208<br>0, 16   |                    |                    |                    |                    |
| FFVA676 <sup>(3)</sup>  | 27x27              | 48, 208<br>0, 16   | 48, 208<br>0, 16   |                    |                    |                    |                    |
| FFVB676                 | 27x27              | 72, 208<br>0, 16   | 72, 208<br>0, 16   |                    |                    |                    |                    |
| FFVD900 <sup>(3)</sup>  | 31x31              | 96, 208<br>0, 16   | 96, 208<br>0, 16   |                    | 96, 312<br>16, 0   |                    |                    |
| FFVE900                 | 31x31              |                    |                    | 96, 208<br>28, 0   |                    | 96, 208<br>28, 0   |                    |
| FFVA1156 <sup>(3)</sup> | 35x35              |                    |                    |                    | 48, 416<br>20, 8   |                    | 48, 468<br>20, 8   |
| FFVE1517                | 40x40              |                    |                    |                    | 96, 416<br>32, 20  |                    | 96, 416<br>32, 24  |
| FFVA1760                | 42.5x42.5          |                    |                    |                    |                    |                    | 96, 416<br>44, 32  |
| FFVE1760                | 42.5x42.5          |                    |                    |                    |                    |                    | 96, 572<br>32, 24  |

#### Notes:

1. Go to Ordering Information for package designation details.

2. FF packages have 1.0mm ball pitch. SF packages have 0.8mm ball pitch.

3. GTY transceiver line rates are package limited: SFVB784 to 12.5Gb/s; FFVA676, FFVD900, and FFVA1156 to 16.3Gb/s.

4. Packages with the same last letter and number sequence, e.g., A676, are footprint compatible with all other UltraScale architecture-based devices with the same sequence. The footprint compatible devices within this family are outlined. See the <u>UltraScale Architecture Product Selection Guide</u> for details on inter-family migration.

## Virtex UltraScale+ FPGA Feature Summary

#### Table 9: Virtex UltraScale+ FPGA Feature Summary

|                                           | VU3P    | VU5P      | VU7P      | VU9P      | VU11P     | VU13P     | VU31P   | VU33P   | VU35P     | VU37P     |
|-------------------------------------------|---------|-----------|-----------|-----------|-----------|-----------|---------|---------|-----------|-----------|
| System Logic Cells                        | 862,050 | 1,313,763 | 1,724,100 | 2,586,150 | 2,835,000 | 3,780,000 | 961,800 | 961,800 | 1,906,800 | 2,851,800 |
| CLB Flip-Flops                            | 788,160 | 1,201,154 | 1,576,320 | 2,364,480 | 2,592,000 | 3,456,000 | 879,360 | 879,360 | 1,743,360 | 2,607,360 |
| CLB LUTs                                  | 394,080 | 600,577   | 788,160   | 1,182,240 | 1,296,000 | 1,728,000 | 439,680 | 439,680 | 871,680   | 1,303,680 |
| Max. Distributed RAM (Mb)                 | 12.0    | 18.3      | 24.1      | 36.1      | 36.2      | 48.3      | 12.5    | 12.5    | 24.6      | 36.7      |
| Block RAM Blocks                          | 720     | 1,024     | 1,440     | 2,160     | 2,016     | 2,688     | 672     | 672     | 1,344     | 2,016     |
| Block RAM (Mb)                            | 25.3    | 36.0      | 50.6      | 75.9      | 70.9      | 94.5      | 23.6    | 23.6    | 47.3      | 70.9      |
| UltraRAM Blocks                           | 320     | 470       | 640       | 960       | 960       | 1,280     | 320     | 320     | 640       | 960       |
| UltraRAM (Mb)                             | 90.0    | 132.2     | 180.0     | 270.0     | 270.0     | 360.0     | 90.0    | 90.0    | 180.0     | 270.0     |
| HBM DRAM (GB)                             | _       | _         | _         | -         | _         | _         | 4       | 8       | 8         | 8         |
| CMTs (1 MMCM and 2 PLLs)                  | 10      | 20        | 20        | 30        | 12        | 16        | 4       | 4       | 8         | 12        |
| Max. HP I/O <sup>(1)</sup>                | 520     | 832       | 832       | 832       | 624       | 832       | 208     | 208     | 416       | 624       |
| DSP Slices                                | 2,280   | 3,474     | 4,560     | 6,840     | 9,216     | 12,288    | 2,880   | 2,880   | 5,952     | 9,024     |
| System Monitor                            | 1       | 2         | 2         | 3         | 3         | 4         | 1       | 1       | 2         | 3         |
| GTY Transceivers 32.75Gb/s <sup>(2)</sup> | 40      | 80        | 80        | 120       | 96        | 128       | 32      | 32      | 64        | 96        |
| Transceiver Fractional PLLs               | 20      | 40        | 40        | 60        | 48        | 64        | 16      | 16      | 32        | 48        |
| PCIe Gen3 x16 and Gen4 x8                 | 2       | 4         | 4         | 6         | 3         | 4         | 4       | 4       | 5         | 6         |
| CCIX Ports <sup>(3)</sup>                 | _       | _         | _         | _         | _         | _         | 4       | 4       | 4         | 4         |
| 150G Interlaken                           | 3       | 4         | 6         | 9         | 6         | 8         | 0       | 0       | 2         | 4         |
| 100G Ethernet w/RS-FEC                    | 3       | 4         | 6         | 9         | 9         | 12        | 2       | 2       | 5         | 8         |

#### Notes:

1. HP = High-performance I/O with support for I/O voltage from 1.0V to 1.8V.

2. GTY transceivers in the FLGF1924 package support data rates up to 16.3Gb/s. See Table 10.

3. A CCIX port requires the use of a PCIe Gen3 x16 / Gen4 x8 block.

### Virtex UltraScale+ Device-Package Combinations and Maximum I/Os

| Package<br>(1)(2)(3)    | Package                  | VU3P    | VU5P    | VU7P    | VU9P     | VU11P   | VU13P    | VU31P   | VU33P   | VU35P   | VU37P   |
|-------------------------|--------------------------|---------|---------|---------|----------|---------|----------|---------|---------|---------|---------|
| (1)(2)(3)               | Dimensions<br>(mm)       | HP, GTY | HP, GTY | HP, GTY | HP, GTY  | HP, GTY | HP, GTY  | HP, GTY | HP, GTY | HP, GTY | HP, GTY |
| FFVC1517                | 40x40                    | 520, 40 |         |         |          |         |          |         |         |         |         |
| FLGF1924 <sup>(4)</sup> | 45x45                    |         |         |         |          | 624, 64 |          |         |         |         |         |
| FLVA2104                | 47.5x47.5                |         | 832, 52 | 832, 52 |          |         |          |         |         |         |         |
| FLGA2104                | 47.5x47.5                |         |         |         | 832, 52  |         |          |         |         |         |         |
| FHGA2104                | 52.5x52.5 <sup>(5)</sup> |         |         |         |          |         | 832, 52  |         |         |         |         |
| FLVB2104                | 47.5x47.5                |         | 702, 76 | 702, 76 |          |         |          |         |         |         |         |
| FLGB2104                | 47.5x47.5                |         |         |         | 702, 76  | 572, 76 |          |         |         |         |         |
| FHGB2104                | 52.5x52.5 <sup>(5)</sup> |         |         |         |          |         | 702, 76  |         |         |         |         |
| FLVC2104                | 47.5x47.5                |         | 416, 80 | 416, 80 |          |         |          |         |         |         |         |
| FLGC2104                | 47.5x47.5                |         |         |         | 416, 104 | 416, 96 |          |         |         |         |         |
| FHGC2104                | 52.5x52.5 <sup>(5)</sup> |         |         |         |          |         | 416, 104 |         |         |         |         |
| FSGD2104                | 47.5x47.5                |         |         |         | 676, 76  | 572, 76 |          |         |         |         |         |
| FIGD2104                | 52.5x52.5 <sup>(5)</sup> |         |         |         |          |         | 676, 76  |         |         |         |         |
| FLGA2577                | 52.5x52.5                |         |         |         | 448, 120 | 448, 96 | 448, 128 |         |         |         |         |
| FSVH1924                | 45x45                    |         |         |         | -        |         |          | 208, 32 |         |         |         |
| FSVH2104                | 47.5x47.5                |         |         |         |          |         |          |         | 208, 32 | 416, 64 |         |
| FSVH2892                | 55x55                    |         |         |         |          |         |          |         |         | 416, 64 | 624, 96 |

#### Table 10: Virtex UltraScale+ Device-Package Combinations and Maximum I/Os

#### Notes:

1. Go to Ordering Information for package designation details.

2. All packages have 1.0mm ball pitch.

3. Packages with the same last letter and number sequence, e.g., A2104, are footprint compatible with all other UltraScale architecture-based devices with the same sequence. The footprint compatible devices within this family are outlined. See the <u>UltraScale Architecture Product Selection Guide</u> for details on inter-family migration.

4. GTY transceivers in the FLGF1924 package support data rates up to 16.3Gb/s.

5. These 52.5x52.5mm overhang packages have the same PCB ball footprint as the corresponding 47.5x47.5mm packages (i.e., the same last letter and number sequence) and are footprint compatible.

## Zynq UltraScale+: CG Device Feature Summary

Table 11: Zynq UltraScale+: CG Device Feature Summary

|                                         | ZU2CG                                                                                                                           | ZU3CG          | ZU4CG                     | ZU5CG                            | ZU6CG                     | ZU7CG            | ZU9CG         |  |
|-----------------------------------------|---------------------------------------------------------------------------------------------------------------------------------|----------------|---------------------------|----------------------------------|---------------------------|------------------|---------------|--|
| Application Processing Unit             | Dual-core ARM Cortex-A53 MPCore with CoreSight; NEON & Single/Double Precision Floating Point; 32KB/32KB L1 Cache, 1MB L2 Cache |                |                           |                                  |                           |                  |               |  |
| Real-Time Processing Unit               | Dua                                                                                                                             | Il-core ARM Co | rtex-R5 with C<br>32KB/33 | oreSight; Singl<br>2KB L1 Cache, | e/Double Preci<br>and TCM | sion Floating Po | pint;         |  |
| Embedded and External<br>Memory         | 256k                                                                                                                            | (B On-Chip Me  | mory w/ECC; E<br>External | xternal DDR4;<br>Quad-SPI; NAN   | DDR3; DDR3L<br>ID; eMMC   | ; LPDDR4; LPD    | DR3;          |  |
| General Connectivity                    | 214 PS I/O;                                                                                                                     | UART; CAN; U   | SB 2.0; I2C; S            | PI; 32b GPIO;<br>Timer Counters  | Real Time Cloc            | k; WatchDog T    | imers; Triple |  |
| High-Speed Connectivity                 | 2                                                                                                                               | 1 PS-GTR; PCI  | e Gen1/2; Seria           | al ATA 3.1; Dis                  | playPort 1.2a;            | USB 3.0; SGM     | 1             |  |
| System Logic Cells                      | 103,320                                                                                                                         | 154,350        | 192,150                   | 256,200                          | 469,446                   | 504,000          | 599,550       |  |
| CLB Flip-Flops                          | 94,464                                                                                                                          | 141,120        | 175,680                   | 234,240                          | 429,208                   | 460,800          | 548,160       |  |
| CLB LUTs                                | 47,232                                                                                                                          | 70,560         | 87,840                    | 117,120                          | 214,604                   | 230,400          | 274,080       |  |
| Distributed RAM (Mb)                    | 1.2                                                                                                                             | 1.8            | 2.6                       | 3.5                              | 6.9                       | 6.2              | 8.8           |  |
| Block RAM Blocks                        | 150                                                                                                                             | 216            | 128                       | 144                              | 714                       | 312              | 912           |  |
| Block RAM (Mb)                          | 5.3                                                                                                                             | 7.6            | 4.5                       | 5.1                              | 25.1                      | 11.0             | 32.1          |  |
| UltraRAM Blocks                         | 0                                                                                                                               | 0              | 48                        | 64                               | 0                         | 96               | 0             |  |
| UltraRAM (Mb)                           | 0                                                                                                                               | 0              | 14.0                      | 18.0                             | 0                         | 27.0             | 0             |  |
| DSP Slices                              | 240                                                                                                                             | 360            | 728                       | 1,248                            | 1,973                     | 1,728            | 2,520         |  |
| CMTs                                    | 3                                                                                                                               | 3              | 4                         | 4                                | 4                         | 8                | 4             |  |
| Max. HP I/O <sup>(1)</sup>              | 156                                                                                                                             | 156            | 156                       | 156                              | 208                       | 416              | 208           |  |
| Max. HD I/O <sup>(2)</sup>              | 96                                                                                                                              | 96             | 96                        | 96                               | 120                       | 48               | 120           |  |
| System Monitor                          | 2                                                                                                                               | 2              | 2                         | 2                                | 2                         | 2                | 2             |  |
| GTH Transceiver 16.3Gb/s <sup>(3)</sup> | 0                                                                                                                               | 0              | 16                        | 16                               | 24                        | 24               | 24            |  |
| GTY Transceivers 32.75Gb/s              | 0                                                                                                                               | 0              | 0                         | 0                                | 0                         | 0                | 0             |  |
| Transceiver Fractional PLLs             | 0                                                                                                                               | 0              | 8                         | 8                                | 12                        | 12               | 12            |  |
| PCIe Gen3 x16 and Gen4 x8               | 0                                                                                                                               | 0              | 2                         | 2                                | 0                         | 2                | 0             |  |
| 150G Interlaken                         | 0                                                                                                                               | 0              | 0                         | 0                                | 0                         | 0                | 0             |  |
| 100G Ethernet w/ RS-FEC                 | 0                                                                                                                               | 0              | 0                         | 0                                | 0                         | 0                | 0             |  |

#### Notes:

1. HP = High-performance I/O with support for I/O voltage from 1.0V to 1.8V.

2. HD = High-density I/O with support for I/O voltage from 1.2V to 3.3V.

3. GTH transceivers in the SFVC784 package support data rates up to 12.5Gb/s. See Table 12.

## Zynq UltraScale+: EG Device Feature Summary

#### Table 13: Zynq UltraScale+: EG Device Feature Summary

|                                         | ZU2EG   | ZU3EG        | ZU4EG        | ZU5EG          | ZU6EG                       | ZU7EG                         | ZU9EG                  | ZU11EG         | ZU15EG         | ZU17EG     | ZU19EG    |
|-----------------------------------------|---------|--------------|--------------|----------------|-----------------------------|-------------------------------|------------------------|----------------|----------------|------------|-----------|
| Application Processing Unit             | Quad-co | re ARM Corte | x-A53 MPCor  | e with CoreSig | ght; NEON & S               | Single/Double                 | Precision Flo          | ating Point; 3 | 2KB/32KB L1    | Cache, 1MB | L2 Cache  |
| Real-Time Processing Unit               |         | Dual-core    | ARM Cortex-  | R5 with Cores  | Sight; Single/              | Double Precis                 | ion Floating P         | oint; 32KB/32  | 2KB L1 Cache   | , and TCM  |           |
| Embedded and External<br>Memory         |         |              | 256KB (      | Dn-Chip Memo   | ory w/ECC; Ex<br>External C | kternal DDR4;<br>Quad-SPI; NA | DDR3; DDR3<br>ND; eMMC | BL; LPDDR4; I  | _PDDR3;        |            |           |
| General Connectivity                    |         | 214 PS I/0   | D; UART; CAN | ; USB 2.0; 12  | C; SPI; 32b C               | GPIO; Real Tir                | ne Clock; Wa           | tchDog Timer   | s; Triple Time | r Counters |           |
| High-Speed Connectivity                 |         |              | 4 PS         | S-GTR; PCIe G  | Gen1/2; Seria               | I ATA 3.1; Dis                | playPort 1.2a          | ; USB 3.0; S   | GMH            |            |           |
| Graphic Processing Unit                 |         |              |              |                | ARM Mali-4                  | 100 MP2; 64K                  | B L2 Cache             |                |                |            |           |
| System Logic Cells                      | 103,320 | 154,350      | 192,150      | 256,200        | 469,446                     | 504,000                       | 599,550                | 653,100        | 746,550        | 926,194    | 1,143,450 |
| CLB Flip-Flops                          | 94,464  | 141,120      | 175,680      | 234,240        | 429,208                     | 460,800                       | 548,160                | 597,120        | 682,560        | 846,806    | 1,045,440 |
| CLB LUTs                                | 47,232  | 70,560       | 87,840       | 117,120        | 214,604                     | 230,400                       | 274,080                | 298,560        | 341,280        | 423,403    | 522,720   |
| Distributed RAM (Mb)                    | 1.2     | 1.8          | 2.6          | 3.5            | 6.9                         | 6.2                           | 8.8                    | 9.1            | 11.3           | 8.0        | 9.8       |
| Block RAM Blocks                        | 150     | 216          | 128          | 144            | 714                         | 312                           | 912                    | 600            | 744            | 796        | 984       |
| Block RAM (Mb)                          | 5.3     | 7.6          | 4.5          | 5.1            | 25.1                        | 11.0                          | 32.1                   | 21.1           | 26.2           | 28.0       | 34.6      |
| UltraRAM Blocks                         | 0       | 0            | 48           | 64             | 0                           | 96                            | 0                      | 80             | 112            | 102        | 128       |
| UltraRAM (Mb)                           | 0       | 0            | 14.0         | 18.0           | 0                           | 27.0                          | 0                      | 22.5           | 31.5           | 28.7       | 36.0      |
| DSP Slices                              | 240     | 360          | 728          | 1,248          | 1,973                       | 1,728                         | 2,520                  | 2,928          | 3,528          | 1,590      | 1,968     |
| CMTs                                    | 3       | 3            | 4            | 4              | 4                           | 8                             | 4                      | 8              | 4              | 11         | 11        |
| Max. HP I/O <sup>(1)</sup>              | 156     | 156          | 156          | 156            | 208                         | 416                           | 208                    | 416            | 208            | 572        | 572       |
| Max. HD I/O <sup>(2)</sup>              | 96      | 96           | 96           | 96             | 120                         | 48                            | 120                    | 96             | 120            | 96         | 96        |
| System Monitor                          | 2       | 2            | 2            | 2              | 2                           | 2                             | 2                      | 2              | 2              | 2          | 2         |
| GTH Transceiver 16.3Gb/s <sup>(3)</sup> | 0       | 0            | 16           | 16             | 24                          | 24                            | 24                     | 32             | 24             | 44         | 44        |
| GTY Transceivers 32.75Gb/s              | 0       | 0            | 0            | 0              | 0                           | 0                             | 0                      | 16             | 0              | 28         | 28        |
| Transceiver Fractional PLLs             | 0       | 0            | 8            | 8              | 12                          | 12                            | 12                     | 24             | 12             | 36         | 36        |
| PCIe Gen3 x16 and Gen4 x8               | 0       | 0            | 2            | 2              | 0                           | 2                             | 0                      | 4              | 0              | 4          | 5         |
| 150G Interlaken                         | 0       | 0            | 0            | 0              | 0                           | 0                             | 0                      | 1              | 0              | 2          | 4         |
| 100G Ethernet w/ RS-FEC                 | 0       | 0            | 0            | 0              | 0                           | 0                             | 0                      | 2              | 0              | 2          | 4         |

#### Notes:

1. HP = High-performance I/O with support for I/O voltage from 1.0V to 1.8V.

2. HD = High-density I/O with support for I/O voltage from 1.2V to 3.3V.

3. GTH transceivers in the SFVC784 package support data rates up to 12.5Gb/s. See Table 14.

## Zynq UltraScale+: EG Device Feature Summary

| Table 1 | 15: Zyng Ul                  | traScale+: EV | <b>/ Device F</b> | eature | Summary |
|---------|------------------------------|---------------|-------------------|--------|---------|
|         | · · · _ <b>J</b> · · · · · · |               |                   |        | J       |

|                                         |                                                                                                                                | -                                                                 |                                |  |  |  |
|-----------------------------------------|--------------------------------------------------------------------------------------------------------------------------------|-------------------------------------------------------------------|--------------------------------|--|--|--|
|                                         | ZU4EV                                                                                                                          | ZU5EV                                                             | ZU7EV                          |  |  |  |
| Application Processing Unit             | Quad-core ARM Cortex-A53 MPCore with CoreSight; NEON & Single/Double Precision Floating Po<br>32KB/32KB L1 Cache, 1MB L2 Cache |                                                                   |                                |  |  |  |
| Real-Time Processing Unit               | Dual-core ARM Cortex-                                                                                                          | R5 with CoreSight; Single/Double F<br>32KB/32KB L1 Cache, and TCM | Precision Floating Point;      |  |  |  |
| Embedded and External<br>Memory         | 256KB On-Chip Memory                                                                                                           | w/ECC; External DDR4; DDR3; DE<br>External Quad-SPI; NAND; eMMC   | DR3L; LPDDR4; LPDDR3;          |  |  |  |
| General Connectivity                    | 214 PS I/O; UART; CAN; USB 2                                                                                                   | .0; I2C; SPI; 32b GPIO; Real Time<br>Timer Counters               | Clock; WatchDog Timers; Triple |  |  |  |
| High-Speed Connectivity                 | 4 PS-GTR; PCIe Gen                                                                                                             | 1/2; Serial ATA 3.1; DisplayPort 1                                | .2a; USB 3.0; SGMII            |  |  |  |
| Graphic Processing Unit                 |                                                                                                                                | ARM Mali-400 MP2; 64KB L2 Cache                                   | 9                              |  |  |  |
| Video Codec                             | 1                                                                                                                              | 1                                                                 | 1                              |  |  |  |
| System Logic Cells                      | 192,150                                                                                                                        | 256,200                                                           | 504,000                        |  |  |  |
| CLB Flip-Flops                          | 175,680                                                                                                                        | 234,240                                                           | 460,800                        |  |  |  |
| CLB LUTs                                | 87,840                                                                                                                         | 117,120                                                           | 230,400                        |  |  |  |
| Distributed RAM (Mb)                    | 2.6                                                                                                                            | 3.5                                                               | 6.2                            |  |  |  |
| Block RAM Blocks                        | 128                                                                                                                            | 144                                                               | 312                            |  |  |  |
| Block RAM (Mb)                          | 4.5                                                                                                                            | 5.1                                                               | 11.0                           |  |  |  |
| UltraRAM Blocks                         | 48                                                                                                                             | 64                                                                | 96                             |  |  |  |
| UltraRAM (Mb)                           | 14.0                                                                                                                           | 18.0                                                              | 27.0                           |  |  |  |
| DSP Slices                              | 728                                                                                                                            | 1,248                                                             | 1,728                          |  |  |  |
| CMTs                                    | 4                                                                                                                              | 4                                                                 | 8                              |  |  |  |
| Max. HP I/O <sup>(1)</sup>              | 156                                                                                                                            | 156                                                               | 416                            |  |  |  |
| Max. HD I/O <sup>(2)</sup>              | 96                                                                                                                             | 96                                                                | 48                             |  |  |  |
| System Monitor                          | 2                                                                                                                              | 2                                                                 | 2                              |  |  |  |
| GTH Transceiver 16.3Gb/s <sup>(3)</sup> | 16                                                                                                                             | 16                                                                | 24                             |  |  |  |
| GTY Transceivers 32.75Gb/s              | 0                                                                                                                              | 0                                                                 | 0                              |  |  |  |
| Transceiver Fractional PLLs             | 8                                                                                                                              | 8                                                                 | 12                             |  |  |  |
| PCIe Gen3 x16 and Gen4 x8               | 2                                                                                                                              | 2                                                                 | 2                              |  |  |  |
| 150G Interlaken                         | 0                                                                                                                              | 0                                                                 | 0                              |  |  |  |
| 100G Ethernet w/ RS-FEC                 | 0                                                                                                                              | 0                                                                 | 0                              |  |  |  |

#### Notes:

1. HP = High-performance I/O with support for I/O voltage from 1.0V to 1.8V.

2. HD = High-density I/O with support for I/O voltage from 1.2V to 3.3V.

3. GTH transceivers in the SFVC784 package support data rates up to 12.5Gb/s. See Table 16.

### Zynq UltraScale+: EG Device-Package Combinations and Maximum I/Os

| Package<br>(1)(2)(3)(4) | Package            | ZU4EV              | ZU5EV              | ZU7EV<br>HD, HP<br>GTH, GTY |  |  |
|-------------------------|--------------------|--------------------|--------------------|-----------------------------|--|--|
|                         | Dimensions<br>(mm) | HD, HP<br>GTH, GTY | HD, HP<br>GTH, GTY |                             |  |  |
| SFVC784 <sup>(5)</sup>  | 23x23              | 96, 156<br>4, 0    | 96, 156<br>4, 0    |                             |  |  |
| FBVB900                 | 31x31              | 48, 156<br>16, 0   | 48, 156<br>16, 0   | 48, 156<br>16, 0            |  |  |
| FFVC1156                | 35x35              |                    |                    | 48, 312<br>20, 0            |  |  |
| FFVF1517                | 40x40              |                    |                    | 48, 416<br>24, 0            |  |  |

Table 16: Zynq UltraScale+: EV Device-Package Combinations and Maximum I/Os

#### Notes:

- 1. Go to Ordering Information for package designation details.
- 2. FB/FF packages have 1.0mm ball pitch. SF packages have 0.8mm ball pitch.
- 3. All device package combinations bond out 4 PS-GTR transceivers.
- 4. GTH transceivers in the SFVC784 package support data rates up to 12.5Gb/s.
- 5. Packages with the same last letter and number sequence, e.g., B900, are footprint compatible with all other UltraScale architecture-based devices with the same sequence. The footprint compatible devices within this family are outlined.

## **Device Layout**

UltraScale devices are arranged in a column-and-grid layout. Columns of resources are combined in different ratios to provide the optimum capability for the device density, target market or application, and device cost. At the core of UltraScale+ MPSoCs is the processing system that displaces some of the full or partial columns of programmable logic resources. Figure 1 shows a device-level view with resources grouped together. For simplicity, certain resources such as the processing system, integrated blocks for PCIe, configuration logic, and System Monitor are not shown.

| Transceivers | CLB, DSP, Block RAM | I/O, Clocking, Memory Interface Logic | CLB, DSP, Block RAM | I/O, Clocking, Memory Interface Logic | CLB, DSP, Block RAM | Transceivers |  |
|--------------|---------------------|---------------------------------------|---------------------|---------------------------------------|---------------------|--------------|--|
|--------------|---------------------|---------------------------------------|---------------------|---------------------------------------|---------------------|--------------|--|

DS890\_01\_101712

Figure 1: FPGA with Columnar Resources

Resources within the device are divided into segmented clock regions. The height of a clock region is 60 CLBs. A bank of 52 I/Os, 24 DSP slices, 12 block RAMs, or 4 transceiver channels also matches the height of a clock region. The width of a clock region is essentially the same in all cases, regardless of device size or the mix of resources in the region, enabling repeatable timing results. Each segmented clock region

contains vertical and horizontal clock routing that span its full height and width. These horizontal and vertical clock routes can be segmented at the clock region boundary to provide a flexible, high-performance, low-power clock distribution architecture. Figure 2 is a representation of an FPGA divided into regions.



Figure 2: Column-Based FPGA Divided into Clock Regions

## Processing System (PS)

Zynq UltraScale+ MPSoCs consist of a PS coupled with programmable logic. The contents of the PS varies between the different Zynq UltraScale+ devices. All devices contain an APU, an RPU, and many peripherals for connecting the multiple processing engines to external components. The EG and EV devices contain a GPU and the EV devices contain a video codec unit (VCU). The components of the PS are connected together and to the PL through a multi-layered ARM AMBA AXI non-blocking interconnect that supports multiple simultaneous master-slave transactions. Traffic through the interconnect can be regulated by the quality of service (QoS) block in the interconnect. Twelve dedicated AXI 32-bit, 64-bit, or 128-bit ports connect the PL to high-speed interconnect and DDR in the PS via a FIFO interface.

There are four independently controllable power domains: the PL plus three within the PS (full power, lower power, and battery power domains). Additionally, many peripherals support clock gating and power gating to further reduce dynamic and static power consumption.

### **Application Processing Unit (APU)**

The APU has a feature-rich dual-core or quad-core ARM Cortex-A53 processor. Cortex-A53 cores are 32-bit/64-bit application processors based on ARM-v8A architecture, offering the best performance-to-power ratio. The ARMv8 architecture supports hardware virtualization. Each of the Cortex-A53 cores has: 32KB of instruction and data L1 caches, with parity and ECC protection respectively; a NEON SIMD engine; and a single and double precision floating point unit. In addition to these blocks, the APU consists of a snoop control unit and a 1MB L2 cache with ECC protection to enhance system-level performance. The snoop control unit keeps the L1 caches coherent thus eliminating the need of spending software bandwidth for coherency. The APU also has a built-in interrupt controller supporting virtual interrupts. The APU communicates to the rest of the PS through 128-bit AXI coherent extension (ACE) port via Cache Coherent Interconnect (CCI) block, using the System Memory Management Unit (SMMU). The APU is also connected to the Programmable Logic (PL), through the 128-bit accelerator coherency port

(ACP), providing a low latency coherent port for accelerators in the PL. To support real-time debug and trace, each core also has an Embedded Trace Macrocell (ETM) that communicates with the ARM CoreSight<sup>™</sup> Debug System.

### **Real-Time Processing Unit (RPU)**

The RPU in the PS contains a dual-core ARM Cortex-R5 PS. Cortex-R5 cores are 32-bit real-time processor cores based on ARM-v7R architecture. Each of the Cortex-R5 cores has 32KB of level-1 (L1) instruction and data cache with ECC protection. In addition to the L1 caches, each of the Cortex-R5 cores also has a 128KB tightly coupled memory (TCM) interface for real-time single cycle access. The RPU also has a dedicated interrupt controller. The RPU can operate in either split or lock-step mode. In split mode, both processors run independently of each other. In lock-step mode, they run in parallel with each other, with integrated comparator logic, and the TCMs are used as 256KB unified memory. The RPU communicates with the rest of the PS via the 128-bit AXI-4 ports connected to the low power domain switch. It also communicates directly with the PL through 128-bit low latency AXI-4 ports. To support real-time debug and trace each core also has an embedded trace macrocell (ETM) that communicates with the ARM CoreSight Debug System.

### **External Memory**

The PS can interface to many types of external memories through dedicated memory controllers. The dynamic memory controller supports DDR3, DDR3L, DDR4, LPDDR3, and LPDDR4 memories. The multi-protocol DDR memory controller can be configured to access a 2GB address space in 32-bit addressing mode and up to 32GB in 64-bit addressing mode using a single or dual rank configuration of 8-bit, 16-bit, or 32-bit DRAM memories. Both 32-bit and 64-bit bus access modes are protected by ECC using extra bits.

The SD/eMMC controller supports 1 and 4 bit data interfaces at low, default, high-speed, and ultra-high-speed (UHS) clock rates. This controller also supports 1-, 4-, or 8-bit-wide eMMC interfaces that are compliant to the eMMC 4.51 specification. eMMC is one of the primary boot and configuration modes for Zynq UltraScale+ MPSoCs and supports boot from managed NAND devices. The controller has a built-in DMA for enhanced performance.

The Quad-SPI controller is one of the primary boot and configuration devices. It supports 4-byte and 3-byte addressing modes. In both addressing modes, single, dual-stacked, and dual-parallel configurations are supported. Single mode supports a quad serial NOR flash memory, while in double stacked and double parallel modes, it supports two quad serial NOR flash memories.

The NAND controller is based on ONFI3.1 specification. It has an 8-pin interface and provides 200Mb/s of bandwidth in synchronous mode. It supports 24 bits of ECC thus enabling support for SLC NAND memories. It has two chip-selects to support deeper memory and a built-in DMA for enhanced performance.

## Stacked Silicon Interconnect (SSI) Technology

Many challenges associated with creating high-capacity devices are addressed by Xilinx with the second generation of the pioneering 3D SSI technology. SSI technology enables multiple super-logic regions (SLRs) to be combined on a passive interposer layer, using proven manufacturing and assembly techniques from industry leaders, to create a single device with more than 20,000 low-power inter-SLR connections. Dedicated interface tiles within the SLRs provide ultra-high bandwidth, low latency connectivity to other SLRs. Table 19 shows the number of SLRs in devices that use SSI technology and their dimensions.

|                            | Kintex Virtex<br>UltraScale UltraScale |       |       |       | Virtex<br>UltraScale+ |       |      |      |      |       |       |       |       |       |       |
|----------------------------|----------------------------------------|-------|-------|-------|-----------------------|-------|------|------|------|-------|-------|-------|-------|-------|-------|
| Device                     | KU085                                  | KU115 | VU125 | VU160 | VU190                 | VU440 | VU5P | VU7P | VU9P | VU11P | VU13P | VU31P | VU33P | VU35P | VU37P |
| # SLRs                     | 2                                      | 2     | 2     | 3     | 3                     | 3     | 2    | 2    | 3    | 3     | 4     | 1     | 1     | 2     | 3     |
| SLR Width<br>(in regions)  | 6                                      | 6     | 6     | 6     | 6                     | 9     | 6    | 6    | 6    | 8     | 8     | 8     | 8     | 8     | 8     |
| SLR Height<br>(in regions) | 5                                      | 5     | 5     | 5     | 5                     | 5     | 5    | 5    | 5    | 4     | 4     | 4     | 4     | 4     | 4     |

## **Clock Management**

The clock generation and distribution components in UltraScale devices are located adjacent to the columns that contain the memory interface and input and output circuitry. This tight coupling of clocking and I/O provides low-latency clocking to the I/O for memory interfaces and other I/O protocols. Within every clock management tile (CMT) resides one mixed-mode clock manager (MMCM), two PLLs, clock distribution buffers and routing, and dedicated circuitry for implementing external memory interfaces.

### **Mixed-Mode Clock Manager**

The mixed-mode clock manager (MMCM) can serve as a frequency synthesizer for a wide range of frequencies and as a jitter filter for incoming clocks. At the center of the MMCM is a voltage-controlled oscillator (VCO), which speeds up and slows down depending on the input voltage it receives from the phase frequency detector (PFD).

There are three sets of programmable frequency dividers (D, M, and O) that are programmable by configuration and during normal operation via the Dynamic Reconfiguration Port (DRP). The pre-divider D reduces the input frequency and feeds one input of the phase/frequency comparator. The feedback divider M acts as a multiplier because it divides the VCO output frequency before feeding the other input of the phase comparator. D and M must be chosen appropriately to keep the VCO within its specified frequency range. The VCO has eight equally-spaced output phases (0°, 45°, 90°, 135°, 180°, 225°, 270°, and 315°). Each phase can be selected to drive one of the output dividers, and each divider is programmable by configuration to divide by any integer from 1 to 128.

The MMCM has three input-jitter filter options: low bandwidth, high bandwidth, or optimized mode. Low-Bandwidth mode has the best jitter attenuation. High-Bandwidth mode has the best phase offset. Optimized mode allows the tools to find the best setting.

## **Block RAM**

Every UltraScale architecture-based device contains a number of 36 Kb block RAMs, each with two completely independent ports that share only the stored data. Each block RAM can be configured as one 36Kb RAM or two independent 18Kb RAMs. Each memory access, read or write, is controlled by the clock. Connections in every block RAM column enable signals to be cascaded between vertically adjacent block RAMs, providing an easy method to create large, fast memory arrays, and FIFOs with greatly reduced power consumption.

All inputs, data, address, clock enables, and write enables are registered. The input address is always clocked (unless address latching is turned off), retaining data until the next operation. An optional output data pipeline register allows higher clock rates at the cost of an extra cycle of latency. During a write operation, the data output can reflect either the previously stored data or the newly written data, or it can remain unchanged. Block RAM sites that remain unused in the user design are automatically powered down to reduce total power consumption. There is an additional pin on every block RAM to control the dynamic power gating feature.

### Programmable Data Width

Each port can be configured as  $32K \times 1$ ;  $16K \times 2$ ;  $8K \times 4$ ;  $4K \times 9$  (or 8);  $2K \times 18$  (or 16);  $1K \times 36$  (or 32); or  $512 \times 72$  (or 64). Whether configured as block RAM or FIFO, the two ports can have different aspect ratios without any constraints. Each block RAM can be divided into two completely independent 18Kb block RAMs that can each be configured to any aspect ratio from  $16K \times 1$  to  $512 \times 36$ . Everything described previously for the full 36Kb block RAM also applies to each of the smaller 18Kb block RAMs. Only in simple dual-port (SDP) mode can data widths of greater than 18bits (18Kb RAM) or 36 bits (36Kb RAM) be accessed. In this mode, one port is dedicated to read operation, the other to write operation. In SDP mode, one side (read or write) can be variable, while the other is fixed to 32/36 or 64/72. Both sides of the dual-port 36Kb RAM can be of variable width.

### **Error Detection and Correction**

Each 64-bit-wide block RAM can generate, store, and utilize eight additional Hamming code bits and perform single-bit error correction and double-bit error detection (ECC) during the read process. The ECC logic can also be used when writing to or reading from external 64- to 72-bit-wide memories.

### **FIFO Controller**

Each block RAM can be configured as a 36Kb FIFO or an 18Kb FIFO. The built-in FIFO controller for single-clock (synchronous) or dual-clock (asynchronous or multirate) operation increments the internal addresses and provides four handshaking flags: full, empty, programmable full, and programmable empty. The programmable flags allow the user to specify the FIFO counter values that make these flags go active. The FIFO width and depth are programmable with support for different read port and write port widths on a single FIFO. A dedicated cascade path allows for easy creation of deeper FIFOs.

## UltraRAM

UltraRAM is a high-density, dual-port, synchronous memory block available in UltraScale+ devices. Both of the ports share the same clock and can address all of the 4K x 72 bits. Each port can independently read from or write to the memory array. UltraRAM supports two types of write enable schemes. The first mode is consistent with the block RAM byte write enable mode. The second mode allows gating the data and parity byte writes separately. UltraRAM blocks can be connected together to create larger memory arrays. Dedicated routing in the UltraRAM column enables the entire column height to be connected together. If additional density is required, all the UltraRAM columns in an SLR can be connected together with a few fabric resources to create single instances of RAM approximately 100Mb in size. This makes UltraRAM an ideal solution for replacing external memories such as SRAM. Cascadable anywhere from 288Kb to 100Mb, UltraRAM provides the flexibility to fulfill many different memory requirements.

### **Error Detection and Correction**

Each 64-bit-wide UltraRAM can generate, store and utilize eight additional Hamming code bits and perform single-bit error correction and double-bit error detection (ECC) during the read process.

## High Bandwidth Memory (HBM)

Virtex UltraScale+ HBM devices incorporate 4GB HBM stacks adjacent to the FPGA die. Using stacked silicon interconnect technology, the FPGA communicates to the HBM stacks through memory controllers that connect to dedicated low-inductance interconnect in the silicon interposer. Each Virtex UltraScale+ HBM FPGA contains one or two HBM stacks, resulting in up to 8GB of HBM per FPGA.

The FPGA has 32 HBM AXI interfaces used to communicate with the HBM. Through a built-in switch mechanism, any of the 32 HBM AXI interfaces can access any memory address on either one or both of the HBM stacks due to the flexible addressing feature. This flexible connection between the FPGA and the HBM stacks results in easy floorplanning and timing closure. The memory controllers perform read and write reordering to improve bus efficiency. Data integrity is ensured through error checking and correction (ECC) circuitry.

## **Configurable Logic Block**

Every Configurable Logic Block (CLB) in the UltraScale architecture contains 8 LUTs and 16 flip-flops. The LUTs can be configured as either one 6-input LUT with one output, or as two 5-input LUTs with separate outputs but common inputs. Each LUT can optionally be registered in a flip-flop. In addition to the LUTs and flip-flops, the CLB contains arithmetic carry logic and multiplexers to create wider logic functions.

Each CLB contains one slice. There are two types of slices: SLICEL and SLICEM. LUTs in the SLICEM can be configured as 64-bit RAM, as 32-bit shift registers (SRL32), or as two SRL16s. CLBs in the UltraScale architecture have increased routing and connectivity compared to CLBs in previous-generation Xilinx devices. They also have additional control signals to enable superior register packing, resulting in overall higher device utilization.

#### **E** XILINX.

The ordering information shown in Figure 3 applies to all packages in the Kintex UltraScale and Virtex UltraScale FPGAs. Refer to the Package Marking section of <u>UG575</u>, *UltraScale and UltraScale+ FPGAs Packaging and Pinouts User Guide* for a more detailed explanation of the device markings.



 L1 and -H1 are the ordering codes for the -1L and -1H speed grades, respectively.
See UG575: UltraScale and UltraScale+ FPGAs Packaging and Pinouts User Guide for more information. DS890\_03\_050316

Figure 3: Kintex UltraScale and Virtex UltraScale FPGA Ordering Information

The ordering information shown in Figure 4 applies to all packages in the Kintex UltraScale+ and Virtex UltraScale+ FPGAs, and Figure 5 applies to Zynq UltraScale+s.

The -1L and -2L speed grades in the UltraScale+ families can run at one of two different V<sub>CCINT</sub> operating voltages. At 0.72V, they operate at similar performance to the Kintex UltraScale and Virtex UltraScale devices with up to 30% reduction in power consumption. At 0.85V, they consume similar power to the Kintex UltraScale and Virtex UltraScale devices, but operate over 30% faster.

For UltraScale+ devices, the information in this document is pre-release, provided ahead of silicon ordering availability. Please contact your Xilinx sales representative for more information on Early Access Programs.



1) -L1 and -L2 are the ordering codes for the low power -1L and -2L speed grades, respectively.

DS890 04 042816

Figure 4: UltraScale+ FPGA Ordering Information



1) -L1 and -L2 are the ordering codes for the low power -1L and -2L speed grades, respectively.

DS890\_05\_042816

Figure 5: Zynq UltraScale+ Ordering Information

## **Revision History**

The following table shows the revision history for this document:

| Date       | Version | Description of Revisions                                                                                                                                                                                                                                                                                                                               |
|------------|---------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| 02/15/2017 | 2.11    | Updated Table 1, Table 9: Converted HBM from Gb to GB. Updated Table 11, Table 13, and Table 15: Updated DSP count for Zynq UltraScale+ MPSoCs. Updated Cache Coherent Interconnect for Accelerators (CCIX). Updated High Bandwidth Memory (HBM). Updated Table 21: Added-2E speed grade to all UltraScale+ devices. Removed -3E from XCZU2 and XCZU3. |
| 11/09/2016 | 2.10    | Updated Table 1. Added HBM devices to Table 9, Table 10, Table 19 and new High<br>Bandwidth Memory (HBM) section. Added Cache Coherent Interconnect for Accelerators<br>(CCIX) section.                                                                                                                                                                |
| 09/27/2016 | 2.9     | Updated Table 5, Table 12, Table 13, and Table 14.                                                                                                                                                                                                                                                                                                     |
| 06/03/2016 | 2.8     | Added Zynq UltraScale+ MPSoC CG devices: Added Table 2. Updated Table 11, Table 12, Table 21, and Figure 5. Created separate tables for EG and EV devices: Table 13, Table 14, Table 15, and Table 16.                                                                                                                                                 |
|            |         | Updated Table 1, Table 3, Table 5 and notes, Table 6 and notes, Table 7, Table 9, Table 10, Processing System Overview, and Processing System (PS) details.                                                                                                                                                                                            |
| 02/17/2016 | 2.7     | Added Migrating Devices. Updated Table 4, Table 5, Table 6, Table 10, Table 11, Table 12, and Figure 4.                                                                                                                                                                                                                                                |
| 12/15/2015 | 2.6     | Updated Table 1, Table 5, Table 6, Table 9, Table 12, and Configuration.                                                                                                                                                                                                                                                                               |
| 11/24/2015 | 2.5     | Updated Configuration, Encryption, and System Monitoring, Table 5, Table 9, Table 11, and Table 21.                                                                                                                                                                                                                                                    |
| 10/15/2015 | 2.4     | Updated Table 1, Table 3, Table 5, Table 7, Table 9, and Table 11 with System Logic Cells.<br>Updated Figure 3. Updated Table 19.                                                                                                                                                                                                                      |
| 09/29/2015 | 2.3     | Added A1156 to KU095 in Table 4. Updated Table 5. Updated Max. Distributed RAM in Table 9. Updated Distributed RAM in Table 11. Added Table 19. Updated Table 21. Updated Figure 3.                                                                                                                                                                    |
| 08/14/2015 | 2.2     | Updated Table 1. Added XCKU025 to Table 3, Table 4, and Table 21. Updated Table 7, Table 9, Table 11, Table 12, Table 18. Updated System Monitor. Added voltage information to Table 21.                                                                                                                                                               |
| 04/27/2015 | 2.1     | Updated Table 1, Table 3, Table 4, Table 5, Table 6, Table 7, Table 10, Table 11, Table 12, Table 17, I/O, Transceiver, PCIe, 100G Ethernet, and 150G Interlaken, Integrated Interface Blocks for PCI Express Designs, USB 3.0/2.0, Clock Management, System Monitor, and Figure 3.                                                                    |
| 02/23/2015 | 2.0     | UltraScale+ device information (Kintex UltraScale+ FPGA, Virtex UltraScale+ FPGA, and Zynq UltraScale+ MPSoC) added throughout document.                                                                                                                                                                                                               |
| 12/16/2014 | 1.6     | Updated Table 1; I/O, Transceiver, PCIe, 100G Ethernet, and 150G Interlaken; Table 3, Table 7; Table 8; and Table 17.                                                                                                                                                                                                                                  |
| 11/17/2014 | 1.5     | Updated I/O, Transceiver, PCIe, 100G Ethernet, and 150G Interlaken; Table 1; Table 4; Table 7; Table 8; Table 17; Input/Output; and Figure 3.                                                                                                                                                                                                          |
| 09/16/2014 | 1.4     | Updated Logic Cell information in Table 1. Updated Table 3; I/O, Transceiver, PCIe, 100G Ethernet, and 150G Interlaken; Table 7; Table 8; Integrated Block for 100G Ethernet; and Figure 3.                                                                                                                                                            |
| 05/20/2014 | 1.3     | Updated Table 8.                                                                                                                                                                                                                                                                                                                                       |
| 05/13/2014 | 1.2     | Added Ordering Information. Updated Table 1, Clocks and Memory Interfaces, Table 3, Table 7 (removed XCVU145; added XCVU190), Table 8 (removed XCVU145; removed FLVD1924 from XCVU160; added XCVU190; updated Table Notes), Table 17, Integrated Interface Blocks for PCI Express Designs, and Integrated Block for Interlaken, and Memory Interfaces. |

## Disclaimer

The information disclosed to you hereunder (the "Materials") is provided solely for the selection and use of Xilinx products. To the maximum extent permitted by applicable law: (1) Materials are made available "AS IS" and with all faults, Xilinx hereby DISCLAIMS ALL WARRANTIES AND CONDITIONS, EXPRESS, IMPLIED, OR STATUTORY, INCLUDING BUT NOT LIMITED TO WARRANTIES OF MERCHANTABILITY, NON-INFRINGEMENT, OR FITNESS FOR ANY PARTICULAR PURPOSE; and (2) Xilinx shall not be liable (whether in contract or tort, including negligence, or under any other theory of liability) for any loss or damage of any kind or nature related to, arising under, or in connection with, the Materials (including your use of the Materials), including for any direct, indirect, special, incidental, or consequential loss or damage (including loss of data, profits, goodwill, or any type of loss or damage suffered as a result of any action brought by a third party) even if such damage or loss was reasonably foreseeable or Xilinx had been advised of the possibility of the same. Xilinx assumes no obligation to correct any errors contained in the Materials or to notify you of updates to the Materials or to product specifications. You may not reproduce, modify, distribute, or publicly display the Materials without prior written consent. Certain products are subject to the terms and conditions of Xilinx's limited warranty, please refer to Xilinx's Terms of Sale which can be viewed at <a href="http://www.xilinx.com/legal.htm#tos">http://www.xilinx.com/legal.htm#tos</a>; IP cores may be subject to be fail-safe or for use in any application requiring fail-safe performance; you assume sole risk and liability for use of Xilinx products in such critical applications, please refer to Xilinx's Terms of Sale which can be viewed at <a href="http://www.xilinx.com/legal.htm#tos">http://www.xilinx.com/legal.htm#tos</a>.

This document contains preliminary information and is subject to change without notice. Information provided herein relates to products and/or services not yet available for sale, and provided solely for information purposes and are not intended, or to be construed, as an offer for sale or an attempted commercialization of the products and/or services referred to herein.

## **Automotive Applications Disclaimer**

AUTOMOTIVE PRODUCTS (IDENTIFIED AS "XA" IN THE PART NUMBER) ARE NOT WARRANTED FOR USE IN THE DEPLOYMENT OF AIRBAGS OR FOR USE IN APPLICATIONS THAT AFFECT CONTROL OF A VEHICLE ("SAFETY APPLICATION") UNLESS THERE IS A SAFETY CONCEPT OR REDUNDANCY FEATURE CONSISTENT WITH THE ISO 26262 AUTOMOTIVE SAFETY STANDARD ("SAFETY DESIGN"). CUSTOMER SHALL, PRIOR TO USING OR DISTRIBUTING ANY SYSTEMS THAT INCORPORATE PRODUCTS, THOROUGHLY TEST SUCH SYSTEMS FOR SAFETY PURPOSES. USE OF PRODUCTS IN A SAFETY APPLICATION WITHOUT A SAFETY DESIGN IS FULLY AT THE RISK OF CUSTOMER, SUBJECT ONLY TO APPLICABLE LAWS AND REGULATIONS GOVERNING LIMITATIONS ON PRODUCT LIABILITY.