



Welcome to **E-XFL.COM** 

#### Embedded - System On Chip (SoC): The Heart of Modern Embedded Systems

Embedded - System On Chip (SoC) refers to an integrated circuit that consolidates all the essential components of a computer system into a single chip. This includes a microprocessor, memory, and other peripherals, all packed into one compact and efficient package. SoCs are designed to provide a complete computing solution, optimizing both space and power consumption, making them ideal for a wide range of embedded applications.

What are **Embedded - System On Chip (SoC)**?

**System On Chip (SoC)** integrates multiple functions of a computer or electronic system onto a single chip. Unlike traditional multi-chip solutions. SoCs combine a central

| Details                    |                                                                                                                                                                                                           |
|----------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Product Status             | Active                                                                                                                                                                                                    |
| Architecture               | MCU, FPGA                                                                                                                                                                                                 |
| Core Processor             | Quad ARM® Cortex®-A53 MPCore $^{\text{\tiny M}}$ with CoreSight $^{\text{\tiny M}}$ , Dual ARM® Cortex $^{\text{\tiny M}}$ -R5 with CoreSight $^{\text{\tiny M}}$ , ARM Mali $^{\text{\tiny M}}$ -400 MP2 |
| Flash Size                 | -                                                                                                                                                                                                         |
| RAM Size                   | 256KB                                                                                                                                                                                                     |
| Peripherals                | DMA, WDT                                                                                                                                                                                                  |
| Connectivity               | CANbus, EBI/EMI, Ethernet, I <sup>2</sup> C, MMC/SD/SDIO, SPI, UART/USART, USB OTG                                                                                                                        |
| Speed                      | 500MHz, 600MHz, 1.2GHz                                                                                                                                                                                    |
| Primary Attributes         | Zynq®UltraScale+™ FPGA, 256K+ Logic Cells                                                                                                                                                                 |
| Operating<br>Temperature   | 0°C ~ 100°C (TJ)                                                                                                                                                                                          |
| Package / Case             | 900-BBGA, FCBGA                                                                                                                                                                                           |
| Supplier Device<br>Package | 900-FCBGA (31x31)                                                                                                                                                                                         |
| Purchase URL               | https://www.e-xfl.com/product-detail/xilinx/xczu5eg-1fbvb900e                                                                                                                                             |

Email: info@E-XFL.COM

Address: Room A, 16/F, Full Win Commercial Centre, 573 Nathan Road, Mongkok, Hong Kong



# **Programmable Logic (PL)**

## **Configurable Logic Blocks (CLB)**

- Look-up tables (LUT)
- Flip-flops
- Cascadable adders

### 36Kb Block RAM

- True dual-port
- Up to 72 bits wide
- Configurable as dual 18Kb

### **UltraRAM**

- 288Kb dual-port
- 72 bits wide
- Error checking and correction

### **DSP Blocks**

- 27 x 18 signed multiply
- 48-bit adder/accumulator
- 27-bit pre-adder

### **Programmable I/O Blocks**

- Supports LVCMOS, LVDS, and SSTL
- 1.0V to 3.3V I/O
- Programmable I/O delay and SerDes

## JTAG Boundary-Scan

• IEEE Std 1149.1 Compatible Test Interface

### **PCI Express**

- Supports Root complex and End Point configurations
- Supports up to Gen4 speeds
- Up to five integrated blocks in select devices

## 100G Ethernet MAC/PCS

- IEEE Std 802.3 compliant
- CAUI-10 (10x 10.3125Gb/s) or CAUI-4 (4x 25.78125Gb/s)
- RSFEC (IEEE Std 802.3bj) in CAUI-4 configuration
- Up to four integrated blocks in select devices

### Interlaken

- Interlaken spec 1.2 compliant
- 64/67 encoding
- 12 x 12.5Gb/s or 6 x 25Gb/s
- Up to four integrated blocks in select devices

## Video Encoder/Decoder (VCU)

- Available in EV devices
- Accessible from either PS or PL
- Simultaneous encode and decode
- H.264 and H.265 support

## **System Monitor in PL**

- On-chip voltage and temperature sensing
- 10-bit 200KSPS ADC with up to 17 external inputs



Table 3: Zynq UltraScale+ MPSoC: EG Device Feature Summary

|                                         | ZU2EG                                                                                                                           | ZU3EG                                                                                                       | ZU4EG   | ZU5EG         | ZU6EG         | ZU7EG          | ZU9EG         | ZU11EG       | ZU15EG  | ZU17EG  | ZU19EG    |
|-----------------------------------------|---------------------------------------------------------------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------|---------|---------------|---------------|----------------|---------------|--------------|---------|---------|-----------|
| Application Processing Unit             | Quad-core ARM Cortex-A53 MPCore with CoreSight; NEON & Single/Double Precision Floating Point; 32KB/32KB L1 Cache, 1MB L2 Cache |                                                                                                             |         |               |               |                |               |              |         |         |           |
| Real-Time Processing Unit               |                                                                                                                                 | Dual-core ARM Cortex-R5 with CoreSight; Single/Double Precision Floating Point; 32KB/32KB L1 Cache, and TCM |         |               |               |                |               |              |         |         |           |
| Embedded and External<br>Memory         |                                                                                                                                 | 256KB On-Chip Memory w/ECC; External DDR4; DDR3; DDR3L; LPDDR4; LPDDR3; External Quad-SPI; NAND; eMMC       |         |               |               |                |               |              |         |         |           |
| General Connectivity                    |                                                                                                                                 | 214 PS I/O; UART; CAN; USB 2.0; I2C; SPI; 32b GPIO; Real Time Clock; WatchDog Timers; Triple Timer Counters |         |               |               |                |               |              |         |         |           |
| High-Speed Connectivity                 |                                                                                                                                 |                                                                                                             | 4 PS    | S-GTR; PCIe C | Sen1/2; Seria | I ATA 3.1; Dis | playPort 1.2a | ; USB 3.0; S | GMII    |         |           |
| Graphic Processing Unit                 |                                                                                                                                 |                                                                                                             |         |               | ARM Mali™-    | 400 MP2; 64I   | KB L2 Cache   |              |         |         |           |
| System Logic Cells                      | 103,320                                                                                                                         | 154,350                                                                                                     | 192,150 | 256,200       | 469,446       | 504,000        | 599,550       | 653,100      | 746,550 | 926,194 | 1,143,450 |
| CLB Flip-Flops                          | 94,464                                                                                                                          | 141,120                                                                                                     | 175,680 | 234,240       | 429,208       | 460,800        | 548,160       | 597,120      | 682,560 | 846,806 | 1,045,440 |
| CLB LUTs                                | 47,232                                                                                                                          | 70,560                                                                                                      | 87,840  | 117,120       | 214,604       | 230,400        | 274,080       | 298,560      | 341,280 | 423,403 | 522,720   |
| Distributed RAM (Mb)                    | 1.2                                                                                                                             | 1.8                                                                                                         | 2.6     | 3.5           | 6.9           | 6.2            | 8.8           | 9.1          | 11.3    | 8.0     | 9.8       |
| Block RAM Blocks                        | 150                                                                                                                             | 216                                                                                                         | 128     | 144           | 714           | 312            | 912           | 600          | 744     | 796     | 984       |
| Block RAM (Mb)                          | 5.3                                                                                                                             | 7.6                                                                                                         | 4.5     | 5.1           | 25.1          | 11.0           | 32.1          | 21.1         | 26.2    | 28.0    | 34.6      |
| UltraRAM Blocks                         | 0                                                                                                                               | 0                                                                                                           | 48      | 64            | 0             | 96             | 0             | 80           | 112     | 102     | 128       |
| UltraRAM (Mb)                           | 0                                                                                                                               | 0                                                                                                           | 14.0    | 18.0          | 0             | 27.0           | 0             | 22.5         | 31.5    | 28.7    | 36.0      |
| DSP Slices                              | 240                                                                                                                             | 360                                                                                                         | 728     | 1,248         | 1,973         | 1,728          | 2,520         | 2,928        | 3,528   | 1,590   | 1,968     |
| CMTs                                    | 3                                                                                                                               | 3                                                                                                           | 4       | 4             | 4             | 8              | 4             | 8            | 4       | 11      | 11        |
| Max. HP I/O <sup>(1)</sup>              | 156                                                                                                                             | 156                                                                                                         | 156     | 156           | 208           | 416            | 208           | 416          | 208     | 572     | 572       |
| Max. HD I/O <sup>(2)</sup>              | 96                                                                                                                              | 96                                                                                                          | 96      | 96            | 120           | 48             | 120           | 96           | 120     | 96      | 96        |
| System Monitor                          | 2                                                                                                                               | 2                                                                                                           | 2       | 2             | 2             | 2              | 2             | 2            | 2       | 2       | 2         |
| GTH Transceiver 16.3Gb/s <sup>(3)</sup> | 0                                                                                                                               | 0                                                                                                           | 16      | 16            | 24            | 24             | 24            | 32           | 24      | 44      | 44        |
| GTY Transceivers 32.75Gb/s              | 0                                                                                                                               | 0                                                                                                           | 0       | 0             | 0             | 0              | 0             | 16           | 0       | 28      | 28        |
| Transceiver Fractional PLLs             | 0                                                                                                                               | 0                                                                                                           | 8       | 8             | 12            | 12             | 12            | 24           | 12      | 36      | 36        |
| PCIe Gen3 x16 and Gen4 x8               | 0                                                                                                                               | 0                                                                                                           | 2       | 2             | 0             | 2              | 0             | 4            | 0       | 4       | 5         |
| 150G Interlaken                         | 0                                                                                                                               | 0                                                                                                           | 0       | 0             | 0             | 0              | 0             | 1            | 0       | 2       | 4         |
| 100G Ethernet w/ RS-FEC                 | 0                                                                                                                               | 0                                                                                                           | 0       | 0             | 0             | 0              | 0             | 2            | 0       | 2       | 4         |

- 1. HP = High-performance I/O with support for I/O voltage from 1.0V to 1.8V.
- HD = High-density I/O with support for I/O voltage from 1.2V to 3.3V.
   GTH transceivers in the SFVC784 package support data rates up to 12.5Gb/s. See Table 4.



Table 4: Zynq UltraScale+ MPSoC: EG Device-Package Combinations and Maximum I/Os

| Package (1)(2)(3)(4)(5) | Package<br>Dimensions<br>(mm) | ZU2EG              | ZU3EG              | ZU4EG              | ZU5EG              | ZU6EG              | ZU7EG              | ZU9EG              | ZU11EG             | ZU15EG             | ZU17EG             | ZU19EG             |
|-------------------------|-------------------------------|--------------------|--------------------|--------------------|--------------------|--------------------|--------------------|--------------------|--------------------|--------------------|--------------------|--------------------|
|                         |                               | HD, HP<br>GTH, GTY |
| SBVA484 <sup>(6)</sup>  | 19x19                         | 24, 58<br>0, 0     | 24, 58<br>0, 0     |                    |                    |                    |                    |                    |                    |                    |                    |                    |
| SFVA625                 | 21x21                         | 24, 156<br>0, 0    | 24, 156<br>0, 0    |                    |                    |                    |                    |                    |                    |                    |                    |                    |
| SFVC784 <sup>(7)</sup>  | 23x23                         | 96, 156<br>0, 0    | 96, 156<br>0, 0    | 96, 156<br>4, 0    | 96, 156<br>4, 0    |                    |                    |                    |                    |                    |                    |                    |
| FBVB900                 | 31x31                         |                    |                    | 48, 156<br>16, 0   | 48, 156<br>16, 0   |                    | 48, 156<br>16, 0   |                    |                    |                    |                    |                    |
| FFVC900                 | 31x31                         |                    |                    |                    |                    | 48, 156<br>16, 0   |                    | 48, 156<br>16, 0   |                    | 48, 156<br>16, 0   |                    |                    |
| FFVB1156                | 35x35                         |                    |                    |                    |                    | 120, 208<br>24, 0  |                    | 120, 208<br>24, 0  |                    | 120, 208<br>24, 0  |                    |                    |
| FFVC1156                | 35x35                         |                    |                    |                    |                    |                    | 48, 312<br>20, 0   |                    | 48, 312<br>20, 0   |                    |                    |                    |
| FFVB1517                | 40x40                         |                    |                    |                    |                    |                    |                    |                    | 72, 416<br>16, 0   |                    | 72, 572<br>16, 0   | 72, 572<br>16, 0   |
| FFVF1517                | 40x40                         |                    |                    |                    |                    |                    | 48, 416<br>24, 0   |                    | 48, 416<br>32, 0   |                    |                    |                    |
| FFVC1760                | 42.5x42.5                     |                    |                    |                    |                    |                    |                    |                    | 96, 416<br>32, 16  |                    | 96, 416<br>32, 16  | 96, 416<br>32, 16  |
| FFVD1760                | 42.5x42.5                     |                    |                    |                    |                    |                    |                    |                    |                    |                    | 48, 260<br>44, 28  | 48, 260<br>44, 28  |
| FFVE1924                | 45x45                         |                    |                    |                    |                    |                    |                    |                    |                    |                    | 96, 572<br>44, 0   | 96, 572<br>44, 0   |

- 1. Go to Ordering Information for package designation details. (5)
- 2. FB/FF packages have 1.0mm ball pitch. SB/SF packages have 0.8mm ball pitch.
- 3. All device package combinations bond out 4 PS-GTR transceivers.
- 4. All device package combinations bond out 214 PS I/O except ZU2EG and ZU3EG in the SBVA484 and SFVA625 packages, which bond out 170 PS I/Os.
- 5. Packages with the same last letter and number sequence, e.g., A484, are footprint compatible with all other UltraScale devices with the same sequence. The footprint compatible devices within this family are outlined.
- 6. All 58 HP I/O pins are powered by the same  $V_{CCO}$  supply.
- 7. GTH transceivers in the SFVC784 package support data rates up to 12.5Gb/s.



Table 5: Zynq UltraScale+ MPSoC: EV Device Feature Summary

|                                         | ZU4EV                                                                                                                           | ZU5EV                               | ZU7EV               |  |  |  |  |
|-----------------------------------------|---------------------------------------------------------------------------------------------------------------------------------|-------------------------------------|---------------------|--|--|--|--|
| Application Processing Unit             | Quad-core ARM Cortex-A53 MPCore with CoreSight; NEON & Single/Double Precision Floating Point; 32KB/32KB L1 Cache, 1MB L2 Cache |                                     |                     |  |  |  |  |
| Real-Time Processing Unit               | Dual-core ARM Cortex-R5 with CoreSight; Single/Double Precision Floating Point; 32KB/32KB L1 Cache, and TCM                     |                                     |                     |  |  |  |  |
| Embedded and External<br>Memory         | 256KB On-Chip Memory w/ECC; External DDR4; DDR3; DDR3L; LPDDR4; LPDDR3; External Quad-SPI; NAND; eMMC                           |                                     |                     |  |  |  |  |
| General Connectivity                    | 214 PS I/O; UART; CAN; USB 2.0; I2C; SPI; 32b GPIO; Real Time Clock; WatchDog Timers; Triple Timer Counters                     |                                     |                     |  |  |  |  |
| High-Speed Connectivity                 | 4 PS-GTR; PCIe Gen                                                                                                              | n1/2; Serial ATA 3.1; DisplayPort 1 | .2a; USB 3.0; SGMII |  |  |  |  |
| Graphic Processing Unit                 | А                                                                                                                               | RM Mali™-400 MP2; 64KB L2 Cach      | ne                  |  |  |  |  |
| Video Codec                             | 1                                                                                                                               | 1                                   | 1                   |  |  |  |  |
| System Logic Cells                      | 192,150                                                                                                                         | 256,200                             | 504,000             |  |  |  |  |
| CLB Flip-Flops                          | 175,680                                                                                                                         | 234,240                             | 460,800             |  |  |  |  |
| CLB LUTs                                | 87,840                                                                                                                          | 117,120                             | 230,400             |  |  |  |  |
| Distributed RAM (Mb)                    | 2.6                                                                                                                             | 3.5                                 | 6.2                 |  |  |  |  |
| Block RAM Blocks                        | 128                                                                                                                             | 144                                 | 312                 |  |  |  |  |
| Block RAM (Mb)                          | 4.5                                                                                                                             | 5.1                                 | 11.0                |  |  |  |  |
| UltraRAM Blocks                         | 48                                                                                                                              | 64                                  | 96                  |  |  |  |  |
| UltraRAM (Mb)                           | 14.0                                                                                                                            | 18.0                                | 27.0                |  |  |  |  |
| DSP Slices                              | 728                                                                                                                             | 1,248                               | 1,728               |  |  |  |  |
| CMTs                                    | 4                                                                                                                               | 4                                   | 8                   |  |  |  |  |
| Max. HP I/O <sup>(1)</sup>              | 156                                                                                                                             | 156                                 | 416                 |  |  |  |  |
| Max. HD I/O <sup>(2)</sup>              | 96                                                                                                                              | 96                                  | 48                  |  |  |  |  |
| System Monitor                          | 2                                                                                                                               | 2                                   | 2                   |  |  |  |  |
| GTH Transceiver 16.3Gb/s <sup>(3)</sup> | 16                                                                                                                              | 16                                  | 24                  |  |  |  |  |
| GTY Transceivers 32.75Gb/s              | 0                                                                                                                               | 0                                   | 0                   |  |  |  |  |
| Transceiver Fractional PLLs             | 8                                                                                                                               | 8                                   | 12                  |  |  |  |  |
| PCIe Gen3 x16 and Gen4 x8               | 2                                                                                                                               | 2                                   | 2                   |  |  |  |  |
| 150G Interlaken                         | 0                                                                                                                               | 0                                   | 0                   |  |  |  |  |
| 100G Ethernet w/ RS-FEC                 | 0                                                                                                                               | 0                                   | 0                   |  |  |  |  |

- HP = High-performance I/O with support for I/O voltage from 1.0V to 1.8V.
   HD = High-density I/O with support for I/O voltage from 1.2V to 3.3V.
   GTH transceivers in the SFVC784 package support data rates up to 12.5Gb/s. See Table 6.



Table 6: Zynq UltraScale+ MPSoC: EV Device-Package Combinations and Maximum I/Os

| Package (1)(2)(3)(4)   | Package         | ZU4EV              | ZU5EV              | ZU7EV              |
|------------------------|-----------------|--------------------|--------------------|--------------------|
|                        | Dimensions (mm) | HD, HP<br>GTH, GTY | HD, HP<br>GTH, GTY | HD, HP<br>GTH, GTY |
| SFVC784 <sup>(5)</sup> | 23x23           | 96, 156<br>4, 0    | 96, 156<br>4, 0    |                    |
| FBVB900                | 31x31           | 48, 156<br>16, 0   | 48, 156<br>16, 0   | 48, 156<br>16, 0   |
| FFVC1156               | 35x35           |                    |                    | 48, 312<br>20, 0   |
| FFVF1517               | 40x40           |                    |                    | 48, 416<br>24, 0   |

- 1. Go to Ordering Information for package designation details.
- 2. FB/FF packages have 1.0mm ball pitch. SF packages have 0.8mm ball pitch.
- 3. All device package combinations bond out 4 PS-GTR transceivers.
- 4. Packages with the same last letter and number sequence, e.g., C784, are footprint compatible with all other UltraScale devices with the same sequence. The footprint compatible devices within this family are outlined.
- 5. GTH transceivers in the SFVC784 package support data rates up to 12.5Gb/s.



# **Processing System**

## **Application Processing Unit (APU)**

The key features of the APU include:

- 64-bit guad-core ARM Cortex-A53 MPCores. Features associated with each core include:
  - o ARM v8-A Architecture
  - Operating target frequency: up to 1.5GHz
  - Single and double precision floating point:4 SP / 2 DP FLOPs
  - NEON Advanced SIMD support with single and double precision floating point instructions
  - o A64 instruction set in 64-bit operating mode, A32/T32 instruction set in 32-bit operating mode
  - Level 1 cache (separate instruction and data, 32KB each for each Cortex-A53 CPU)
    - 2-way set-associative Instruction Cache with parity support
    - 4-way set-associative Data Cache with ECC support
  - Integrated memory management unit (MMU) per processor core
  - TrustZone for secure mode operation
  - Virtualization support
- Ability to operate in single processor, symmetric quad processor, and asymmetric quad-processor modes
- Integrated 16-way set-associative 1MB Unified Level 2 cache with ECC support
- Interrupts and Timers
  - Generic interrupt controller (GIC-400)
  - ARM generic timers (4 timers per CPU)
  - One watchdog timer (WDT)
  - One global timer
  - Two triple timers/counters (TTC)
- Little and big endian support
  - Big endian support in BE8 mode
- CoreSight debug and trace support
  - Embedded Trace Macrocell (ETM) for instruction trace
  - Cross trigger interface (CTI) enabling hardware breakpoints and triggers
- ACP interface to PL for I/O coherency and Level 2 cache allocation
- ACE interface to PL for full coherency
- Power island gating on each processor core
- Optional eFUSE disable per core



## **Xilinx Memory Protection Unit (XMPU)**

- Region based memory protection unit
- Up to 16 regions
- Each region supports address alignment of 1MB or 4KB
- Regions can overlap; the higher region number has priority
- Each region can be independently enabled or disabled
- Each region has a start and end address

## **Graphics Processing Unit (GPU)**

- Supports OpenGL ES 1.1 & 2.0
- Supports OpenVG 1.1
- Operating target frequency: up to 667MHz
- Single Geometry Processor and two Pixel processor
- Pixel Fill Rate: 2 Mpixel/sec/MHz
- Triangle Rate: 0.11 Mtriangles/sec/MHz
- 64KB Level 2 Cache (read-only)
- 4X and 16X Anti-aliasing Support
- ETC1 texture compression to reduce external memory bandwidth
- Extensive texture format support
  - o RGBA 8888, 565, 1556
  - o Mono 8, 16
  - YUV format support
- Automatic load balancing across different graphics shader engines
- 2D and 3D graphic acceleration
- Up to 4K texture input and 4K render output resolutions
- Each geometry processor and pixel processor supports 4KB page MMU
- Power island gating on each GPU engine and shared cache
- Optional eFUSE disable

## **Dynamic Memory Controller (DDRC)**

- DDR3, DDR3L, DDR4, LPDDR3, LPDDR4
- Target data rate: Up to 2400Mb/s DDR4 operation in -1 speed grade
- 32-bit and 64-bit bus width support for DDR4, DDR3, DDR3L, or LPDDR3 memories, and 32-bit bus width support for LPDDR4 memory
- ECC support (using extra bits)
- Up to a total DRAM capacity of 32GB



- Low power modes
  - Active/precharge power down
  - o Self-refresh, including clean exit from self-refresh after a controller power cycle
- Enhanced DDR training by allowing software to measure read/write eye and make delay adjustments dynamically
- Independent performance monitors for read path and write path
- Integration of PHY Debug Access Port (DAP) into JTAG for testing

The DDR memory controller is multi-ported and enables the PS and the PL to have shared access to a common memory. The DDR controller features six AXI slave ports for this purpose:

- Two 128-bit AXI ports from the ARM Cortex-A53 CPU(s), RPU (ARM Cortex-R5 and LPD peripherals), GPU, high speed peripherals (USB3, PCIe & SATA), and High Performance Ports (HPO & HP1) from the PL through the Cache Coherent Interconnect (CCI)
- One 64-bit port is dedicated for the ARM Cortex-R5 CPU(s)
- One 128-bit AXI port from the DisplayPort and HP2 port from the PL
- One 128-bit AXI port from HP3 and HP4 ports from the PL
- One 128-bit AXI port from General DMA and HP5 from the PL

### **High-Speed Connectivity Peripherals**

#### **PCIe**

- Compliant with the PCI Express Base Specification 2.1
- Fully compliant with PCI Express transaction ordering rules
- Lane width: x1, x2, or x4 at Gen1 or Gen2 rates
- 1 Virtual Channel
- Full duplex PCIe port
- End Point and single PCIe link Root Port
- Root Port supports Enhanced Configuration Access Mechanism (ECAM), Cfg Transaction generation
- Root Port support for INTx, and MSI
- Endpoint support for MSI or MSI-X
  - 1 physical function, no SR-IOV
  - No relaxed or ID ordering
  - Fully configurable BARs
  - o INTx not recommended, but can be generated
  - Endpoint to support configurable target/slave apertures with address translation and Interrupt capability



- Audio support
  - A single stream carries up to 8 LPCM channels at 192kHz with 24-bit resolution
  - Supports compressed formats including DRA, Dolby MAT, and DTS HD
  - Multi-Stream Transport can extend the number of audio channels
  - Audio copy protection
  - o 2-channel streaming or input from the PL
  - o Multi-channel non-streaming audio from a memory audio frame buffer
- Includes a System Time Clock (STC) compliant with ISO/IEC 13818-1
- Boot-time display using minimum resources

### **Platform Management Unit (PMU)**

- Performs system initialization during boot
- Acts as a delegate to the application and real-time processors during sleep state
- Initiates power-up and restart after the wake-up request
- Maintains the system power state at all time
- Manages the sequence of low-level events required for power-up, power-down, reset, clock gating, and power gating of islands and domains
- Provides error management (error handling and reporting)
- Provides safety check functions (e.g., memory scrubbing)

The PMU includes the following blocks:

- Platform management processor
- Fixed ROM for boot-up of the device
- 128KB RAM with ECC for optional user/firmware code
- Local and global registers to manage power-down, power-up, reset, clock gating, and power gating requests
- Interrupt controller with 16 interrupts from other modules and the inter-processor communication interface (IPI)
- GPI and GPO interfaces to and from PS I/O and PL
- JTAG interface for PMU debug
- Optional User-Defined Firmware



- Full duplex flow control with recognition of incoming pause frames and hardware generation of transmitted pause frames
- 802.1Q VLAN tagging with recognition of incoming VLAN and priority tagged frames
- Supports IEEE Std 1588 v2

### SD/SDIO 3.0 Controller

In addition to secure digital (SD) devices, this controller also supports eMMC 4.51.

- Host mode support only
- Built-in DMA
- 1/4-Bit SD Specification, version 3.0
- 1/4/8-Bit eMMC Specification, version 4.51
- Supports primary boot from SD Card and eMMC (Managed NAND)
- High speed, default speed, and low-speed support
- 1 and 4-bit data interface support
  - Low speed clock 0-400KHz
  - Default speed 0-25MHz
  - High speed clock 0-50MHz
- High speed Interface
  - o SD UHS-1: 208MHz
  - o eMMC HS200: 200MHz
- Memory, I/O, and SD cards
- Power control modes
- Data FIFO interface up to 512B

#### **UART**

- Programmable baud rate generator
- 6, 7, or 8 data bits
- 1, 1.5, or 2 stop bits
- Odd, even, space, mark, or no parity
- Parity, framing, and overrun error detection
- Line break generation and detection
- Automatic echo, local loopback, and remote loopback channel modes
- Modem control signals: CTS, RTS, DSR, DTR, RI, and DCD (from EMIO only)



#### SPI

- Full-duplex operation offers simultaneous receive and transmit
- 128B deep read and write FIFO
- Master or slave SPI mode
- Up to 3 chip select lines
- Multi-master environment
- Identifies an error condition if more than one master detected
- Selectable master clock reference
- Software can poll for status or be interrupt driven

#### **12C**

- 128-bit buffer size
- Both normal (100kHz) and fast bus data rates (400kHz)
- Master or slave mode
- Normal or extended addressing
- I2C bus hold for slow host service

#### **GPIO**

- Up to 128 GPIO bits
  - Up to 78-bits from MIO and 96-bits from EMIO
- Each GPIO bit can be dynamically programmed as input or output
- Independent reset values for each bit of all registers
- Interrupt request generation for each GPIO signals
- Single Channel (Bit) write capability for all control registers include data output register, direction control register, and interrupt clear register
- Read back in output mode

#### CAN

- Conforms to the ISO 11898 -1, CAN2.0A, and CAN 2.0B standards
- Both standard (11-bit identifier) and extended (29-bit identifier) frames
- Bit rates up to 1Mb/s
- Transmit and Receive message FIFO with a depth of 64 messages
- Watermark interrupts for TXFIFO and RXFIFO
- Automatic re-transmission on errors or arbitration loss in normal mode
- Acceptance filtering of 4 acceptance filters



- 2 chip selects
- Programmable access timing
- 1.8V and 3.3V I/O
- Built-in DMA for improved performance

#### **Quad-SPI Controller**

- 4 bytes (32-bit) and 3 bytes (24-bit) address width
- Maximum SPI Clock at Master Mode at 150MHz
- Single, Dual-Parallel, and Dual-Stacked mode
- 32-bit AXI Linear Address Mapping Interface for read operation
- Up to 2 chip select signals
- Write Protection Signal
- Hold signals
- 4-bit bidirectional I/O signals
- x1/x2/x4 Read speed required
- x1 write speed required only
- 64 byte Entry FIFO depth to improve QSPI read efficiency
- Built-in DMA for improved performance

# Video Encoder/Decoder (VCU)

Zynq UltraScale+ MPSoCs include a Video codec (encoder/decoder) available in the devices designated with the EV suffix. The VCU is located in the PL and can be accessed from either the PL or PS.

- Simultaneous Encode and Decode through separate cores
- H.264 high profile level 5.2 (4Kx2K-60)
- H.265 (HEVC) main, main10 profile, level 5.1, high Tier, up to 4Kx2K-60 rate
- 8 and 10 bit encoding
- 4:2:0 and 4:2:2 chroma sampling
- 8Kx4K-15 rate
- Multi-stream up to total of 4Kx2K-60 rate
- Low Latency mode
- Can share the PS DRAM or use dedicated DRAM in the PL
- Clock/power management
- OpenMax Linux drivers



### Interconnect

All the blocks are connected to each other and to the PL through a multi-layered ARM Advanced Microprocessor Bus Architecture (AMBA) AXI interconnect. The interconnect is non-blocking and supports multiple simultaneous master-slave transactions.

The interconnect is designed with latency sensitive masters, such as the ARM CPU, having the shortest paths to memory, and bandwidth critical masters, such as the potential PL masters, having high throughput connections to the slaves with which they need to communicate.

Traffic through the interconnect can be regulated through the Quality of Service (QoS) block in the interconnect. The QoS feature is used to regulate traffic generated by the CPU, DMA controller, and a combined entity representing the masters in the IOP.

## **PS Interfaces**

PS interfaces include external interfaces going off-chip or signals going from PS to PL.

### **PS External Interfaces**

The Zynq UltraScale+ MPSoC's external interfaces use dedicated pins that cannot be assigned as PL pins. These include:

- Clock, reset, boot mode, and voltage reference
- Up to 78 dedicated multiplexed I/O (MIO) pins, software-configurable to connect to any of the internal I/O peripherals and static memory controllers
- 32-bit or 64-bit DDR4/DDR3/DDR3L/LPDDR3 memories with optional ECC
- 32-bit LPDDR4 memory with optional ECC
- 4 channels (TX and RX pair) for transceivers

### **MIO Overview**

The IOP peripherals communicate to external devices through a shared pool of up to 78 dedicated multiplexed I/O (MIO) pins. Each peripheral can be assigned one of several pre-defined groups of pins, enabling a flexible assignment of multiple devices simultaneously. Although 78 pins are not enough for simultaneous use of all the I/O peripherals, most IOP interface signals are available to the PL, allowing use of standard PL I/O pins when powered up and properly configured. Extended multiplexed I/O (EMIO) allows unmapped PS peripherals to access PL I/O.

Port mappings can appear in multiple locations. For example, there are up to 12 possible port mappings for CAN pins. The PS Configuration Wizard (PCW) tool aids in peripheral and static memory pin mapping.



Table 8: MIO Peripheral Interface Mapping

| Peripheral<br>Interface                  | MIO                                        | ЕМІО                                                                                                                                                                                                                                                      |
|------------------------------------------|--------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Quad-SPI<br>NAND                         | Yes                                        | No                                                                                                                                                                                                                                                        |
| USB2.0: 0,1                              | Yes: External PHY                          | No                                                                                                                                                                                                                                                        |
| SDIO 0,1                                 | Yes                                        | Yes                                                                                                                                                                                                                                                       |
| SPI: 0,1<br>I2C: 0,1<br>CAN: 0,1<br>GPIO | Yes  CAN: External PHY GPIO: Up to 78 bits | Yes  CAN: External PHY GPIO: Up to 96 bits                                                                                                                                                                                                                |
| GigE: 0,1,2,3                            | RGMII v2.0:<br>External PHY                | Supports GMII, RGMII v2.0 (HSTL), RGMII v1.3, MII, SGMII, and 1000BASE-X in Programmable Logic                                                                                                                                                            |
| UART: 0,1                                | Simple UART:<br>Only two pins (TX and RX)  | <ul> <li>Full UART (TX, RX, DTR, DCD, DSR, RI, RTS, and CTS) requires either:</li> <li>Two Processing System (PS) pins (RX and TX) through MIO and six additional Programmable Logic (PL) pins, or</li> <li>Eight Programmable Logic (PL) pins</li> </ul> |
| Debug Trace Ports                        | Yes: Up to 16 trace bits                   | Yes: Up to 32 trace bits                                                                                                                                                                                                                                  |
| Processor JTAG                           | Yes                                        | Yes                                                                                                                                                                                                                                                       |

### Transceiver (PS-GTR)

The four PS-GTR transceivers, which reside in the full power domain (FPD), support data rates of up to 6.0Gb/s. All the protocols cannot be pinned out at the same time. At any given time, four differential pairs can be pinned out using the transceivers. This is user programmable via the high-speed I/O multiplexer (HS-MIO).

- A Quad transceiver PS-GTR (TX/RX pair) able to support following standards simultaneously
  - o x1, x2, or x4 lane of PCIe at Gen1 (2.5Gb/s) or Gen2 (5.0Gb/s) rates
  - o 1 or 2 lanes of DisplayPort (TX only) at 1.62Gb/s, 2.7Gb/s, or 5.4Gb/s
  - o 1 or 2 SATA channels at 1.5Gb/s, 3.0Gb/s, or 6.0Gb/s
  - o 1 or 2 USB3.0 channels at 5.0Gb/s
  - o 1-4 Ethernet SGMII channels at 1.25Gb/s
- Provides flexible host-programmable multiplexing function for connecting the transceiver resources to the PS masters (DisplayPort, PCIe, Serial-ATA, USB3.0, and GigE).



### **High-Performance AXI Ports**

The high-performance AXI4 ports provide access from the PL to DDR and high-speed interconnect in the PS. The six dedicated AXI memory ports from the PL to the PS are configurable as either 128-bit, 64-bit, or 32-bit interfaces. These interfaces connect the PL to the memory interconnect via a FIFO interface. Two of the AXI interfaces support I/O coherent access to the APU caches.

Each high-performance AXI port has these characteristics:

- Reduced latency between PL and processing system memory
- 1KB deep FIFO
- Configurable either as 128-bit, 64-bit, or 32-bit AXI interfaces
- Multiple AXI command issuing to DDR

### Accelerator Coherency Port (ACP)

The Zynq UltraScale+ MPSoC accelerator coherency port (ACP) is a 64-bit AXI slave interface that provides connectivity between the APU and a potential accelerator function in the PL. The ACP directly connects the PL to the snoop control unit (SCU) of the ARM Cortex-A53 processors, enabling cache-coherent access to CPU data in the L2 cache. The ACP provides a low latency path between the PS and a PL-based accelerator when compared with a legacy cache flushing and loading scheme. The ACP only snoops access in the CPU L2 cache, providing coherency in hardware. It does not support coherency on the PL side. So this interface is ideal for a DMA or an accelerator in the PL that only requires coherency on the CPU cache memories. For example, if a MicroBlaze™ processor in the PL is attached to the ACP interface, the cache of MicroBlaze processor will not be coherent with Cortex-A53 caches.

### AXI Coherency Extension (ACE)

The Zynq UltraScale+ MPSoC AXI coherency extension (ACE) is a 64-bit AXI4 slave interface that provides connectivity between the APU and a potential accelerator function in the PL. The ACE directly connects the PL to the snoop control unit (SCU) of the ARM Cortex-A53 processors, enabling cache-coherent access to Cache Coherent Interconnect (CCI). The ACE provides a low-latency path between the PS and a PL-based accelerator when compared with a legacy cache flushing and loading scheme. The ACE snoops accesses to the CCI and the PL side, thus, providing full coherency in hardware. This interface can be used to hook up a cached interface in the PL to the PS as caches on both the Cortex-A53 memories and the PL master are snooped thus providing full coherency. For example, if a MicroBlaze processor in the PL is hooked up using an ACE interface, then Cortex-A53 and MicroBlaze processor caches will be coherent with each other.



# **Programmable Logic**

This section covers the information about blocks in the Programmable Logic (PL).

# **Device Layout**

UltraScale architecture-based devices are arranged in a column-and-grid layout. Columns of resources are combined in different ratios to provide the optimum capability for the device density, target market or application, and device cost. At the core of UltraScale+ MPSoCs is the processing system that displaces some of the full or partial columns of programmable logic resources. Figure 1 shows a device-level view with resources grouped together. For simplicity, certain resources such as the processing system, integrated blocks for PCIe, configuration logic, and System Monitor are not shown.



Figure 1: Device with Columnar Resources

Resources within the device are divided into segmented clock regions. The height of a clock region is 60 CLBs. A bank of 52 I/Os, 24 DSP slices, 12 block RAMs, or 4 transceiver channels also matches the height of a clock region. The width of a clock region is essentially the same in all cases, regardless of device size or the mix of resources in the region, enabling repeatable timing results. Each segmented clock region contains vertical and horizontal clock routing that span its full height and width. These horizontal and vertical clock routes can be segmented at the clock region boundary to provide a flexible, high-performance, low-power clock distribution architecture. Figure 2 is a representation of a device divided into regions.



### 3-State Digitally Controlled Impedance and Low Power I/O Features

The 3-state Digitally Controlled Impedance (T\_DCI) can control the output drive impedance (series termination) or can provide parallel termination of an input signal to  $V_{CCO}$  or split (Thevenin) termination to  $V_{CCO}/2$ . This allows users to eliminate off-chip termination for signals using T\_DCI. In addition to board space savings, the termination automatically turns off when in output mode or when 3-stated, saving considerable power compared to off-chip termination. The I/Os also have low power modes for IBUF and IDELAY to provide further power savings, especially when used to implement memory interfaces.

## I/O Logic

### Input and Output Delay

All inputs and outputs can be configured as either combinatorial or registered. Double data rate (DDR) is supported by all inputs and outputs. Any input or output can be individually delayed by up to 1,250ps of delay with a resolution of 5–15ps. Such delays are implemented as IDELAY and ODELAY. The number of delay steps can be set by configuration and can also be incremented or decremented while in use. The IDELAY and ODELAY can be cascaded together to double the amount of delay in a single direction.

#### **ISERDES** and **OSERDES**

Many applications combine high-speed, bit-serial I/O with slower parallel operation inside the device. This requires a serializer and deserializer (SerDes) inside the I/O logic. Each I/O pin possesses an IOSERDES (ISERDES and OSERDES) capable of performing serial-to-parallel or parallel-to-serial conversions with programmable widths of 2, 4, or 8 bits. These I/O logic features enable high-performance interfaces, such as Gigabit Ethernet/1000BaseX/SGMII, to be moved from the transceivers to the SelectIO interface.

# **High-Speed Serial Transceivers**

Ultra-fast serial data transmission between devices on the same PCB, over backplanes, and across even longer distances is becoming increasingly important for scaling to 100 Gb/s and 400 Gb/s line cards. Specialized dedicated on-chip circuitry and differential I/O capable of coping with the signal integrity issues are required at these high data rates.

Three types of transceivers are used in Zynq UltraScale+ MPSoCs: GTH, GTY, and PS-GTR. All transceivers are arranged in groups of four, known as a transceiver Quad. Each serial transceiver is a combined transmitter and receiver. Table 10 compares the available transceivers.



#### **PLL**

With fewer features than the MMCM, the two PLLs in a clock management tile are primarily present to provide the necessary clocks to the dedicated memory interface circuitry. The circuit at the center of the PLLs is similar to the MMCM, with PFD feeding a VCO and programmable M, D, and O counters. There are two divided outputs to the device fabric per PLL as well as one clock plus one enable signal to the memory interface circuitry.

Zynq UltraScale+ MPSoCs are equipped with five additional PLLs in the PS for independently configuring the four primary clock domains with the PS: the APU, the RPU, the DDR controller, and the I/O peripherals.

## **Clock Distribution**

Clocks are distributed throughout Zynq UltraScale+ MPSoCs via buffers that drive a number of vertical and horizontal tracks. There are 24 horizontal clock routes per clock region and 24 vertical clock routes per clock region with 24 additional vertical clock routes adjacent to the MMCM and PLL. Within a clock region, clock signals are routed to the device logic (CLBs, etc.) via 16 gateable leaf clocks.

Several types of clock buffers are available. The BUFGCE and BUFCE\_LEAF buffers provide clock gating at the global and leaf levels, respectively. BUFGCTRL provides glitchless clock muxing and gating capability. BUFGCE\_DIV has clock gating capability and can divide a clock by 1 to 8. BUFG\_GT performs clock division from 1 to 8 for the transceiver clocks. In MPSoCs, clocks can be transferred from the PS to the PL using dedicated buffers.

# **Memory Interfaces**

Memory interface data rates continue to increase, driving the need for dedicated circuitry that enables high performance, reliable interfacing to current and next-generation memory technologies. Every Zynq UltraScale+ MPSoC includes dedicated physical interfaces (PHY) blocks located between the CMT and I/O columns that support implementation of high-performance PHY blocks to external memories such as DDR4, DDR3, QDRII+, and RLDRAM3. The PHY blocks in each I/O bank generate the address/control and data bus signaling protocols as well as the precision clock/data alignment required to reliably communicate with a variety of high-performance memory standards. Multiple I/O banks can be used to create wider memory interfaces.

As well as external parallel memory interfaces, Zynq UltraScale+ MPSoC can communicate to external serial memories, such as Hybrid Memory Cube (HMC), via the high-speed serial transceivers. All transceivers in the UltraScale architecture support the HMC protocol, up to 15Gb/s line rates. UltraScale architecture-based devices support the highest bandwidth HMC configuration of 64 lanes with a single device.



# **Configurable Logic Block**

Every Configurable Logic Block (CLB) in the UltraScale architecture contains 8 LUTs and 16 flip-flops. The LUTs can be configured as either one 6-input LUT with one output, or as two 5-input LUTs with separate outputs but common inputs. Each LUT can optionally be registered in a flip-flop. In addition to the LUTs and flip-flops, the CLB contains arithmetic carry logic and multiplexers to create wider logic functions.

Each CLB contains one slice. There are two types of slices: SLICEL and SLICEM. LUTs in the SLICEM can be configured as 64-bit RAM, as 32-bit shift registers (SRL32), or as two SRL16s. CLBs in the UltraScale architecture have increased routing and connectivity compared to CLBs in previous-generation Xilinx devices. They also have additional control signals to enable superior register packing, resulting in overall higher device utilization.

## Interconnect

Various length vertical and horizontal routing resources in the UltraScale architecture that span 1, 2, 4, 5, 12, or 16 CLBs ensure that all signals can be transported from source to destination with ease, providing support for the next generation of wide data buses to be routed across even the highest capacity devices while simultaneously improving quality of results and software run time.

## **Block RAM**

Every UltraScale architecture-based device contains a number of 36Kb block RAMs, each with two completely independent ports that share only the stored data. Each block RAM can be configured as one 36Kb RAM or two independent 18Kb RAMs. Each memory access, read or write, is controlled by the clock. Connections in every block RAM column enable signals to be cascaded between vertically adjacent block RAMs, providing an easy method to create large, fast memory arrays, and FIFOs with greatly reduced power consumption.

All inputs, data, address, clock enables, and write enables are registered. The input address is always clocked (unless address latching is turned off), retaining data until the next operation. An optional output data pipeline register allows higher clock rates at the cost of an extra cycle of latency. During a write operation, the data output can reflect either the previously stored data or the newly written data, or it can remain unchanged. Block RAM sites that remain unused in the user design are automatically powered down to reduce total power consumption. There is an additional pin on every block RAM to control the dynamic power gating feature.



In FPGAs and the MPSoC PL, sensor outputs and up to 17 user-allocated external analog inputs are digitized using a 10-bit 200 kilo-sample-per-second (kSPS) ADC, and the measurements are stored in registers that can be accessed via internal FPGA (DRP), JTAG, PMBus, or I2C interfaces. The I2C interface and PMBus allow the on-chip monitoring to be easily accessed by the System Manager/Host before and after device configuration.

The System Monitor in the MPSoC PS uses a 10-bit, 1 mega-sample-per-second (MSPS) ADC to digitize the sensor inputs. The measurements are stored in registers and are accessed via the Advanced Peripheral Bus (APB) interface by the processors and the PMU in the PS.

# **Packaging**

The UltraScale architecture-based devices are available in a variety of organic flip-chip and lidless flip-chip packages supporting different quantities of I/Os and transceivers. Maximum supported performance can depend on the style of package and its material. Always refer to the specific device data sheet for performance specifications by package type.

In flip-chip packages, the silicon device is attached to the package substrate using a high-performance flip-chip process. Decoupling capacitors are mounted on the package substrate to optimize signal integrity under simultaneous switching of outputs (SSO) conditions.

# **System-Level Features**

Several functions span both the PS and PL and include:

- Reset Management
- Clock Management
- Power Domains
- PS Boot and Device Configuration
- Hardware and Software Debug Support

## **Reset Management**

The reset management function provides the ability to reset the entire device or individual units within it. The PS supports these reset functions and signals:

- External and internal power-on reset signal
- Warm reset
- Watchdog timer reset
- User resets to PL
- Software, watchdog timer, or JTAG provided resets
- Security violation reset (locked down reset)