




| Details                    |                                                                                                       |
|----------------------------|-------------------------------------------------------------------------------------------------------|
| Product Status             | Active                                                                                                |
| Architecture               | MCU, FPGA                                                                                             |
| Core Processor             | Quad ARM® Cortex®-A53 MPCore™ with CoreSight™, Dual ARM® Cortex™-R5 with CoreSight™, ARM Mali™-400 MP2 |
| Flash Size                 | -                                                                                                     |
| RAM Size                   | 256KB                                                                                                 |
| Peripherals                | DMA, WDT                                                                                              |
| Connectivity               | CANbus, EBI/EMI, Ethernet, I<sup>2</sup>C, MMC/SD/SDIO, SPI, UART/USART, USB OTG                      |
| Speed                      | 533MHz, 600MHz, 1.3GHz                                                                                |
| Primary Attributes         | Zynq® UltraScale+™ FPGA, 154K+ Logic Cells                                                            |
| Operating<br>Temperature   | 0°C ~ 100°C (TJ)                                                                                      |
| Package / Case             | 784-BFBGA, FCBGA                                                                                      |
| Supplier Device<br>Package | 784-FCBGA (23x23)                                                                                     |
| Purchase URL               | https://www.e-xfl.com/product-detail/xilinx/xczu3eg-l2sfvc784e                                        |




#### ARM Mali-400 Based GPU

- Supports OpenGL ES 1.1 and 2.0
- Supports OpenVG 1.1
- GPU frequency: Up to 667MHz
- Single Geometry Processor, Two Pixel Processors
- Pixel Fill Rate: 2 Mpixels/sec/MHz
- Triangle Rate: 0.11 Mtriangles/sec/MHz
- 64KB L2 Cache
- Power island gating
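
The fill and triangle rates above are quoted per MHz, so absolute peak figures follow by multiplying by the GPU clock. The sketch below does that arithmetic at the listed 667MHz maximum. The function and constant names are illustrative only, and the results are theoretical peaks rather than measured throughput.

```python
# Figures copied from the GPU feature list; names are illustrative.
GPU_MAX_MHZ = 667
PIXEL_FILL_PER_MHZ = 2.0      # Mpixels/sec per MHz
TRIANGLE_RATE_PER_MHZ = 0.11  # Mtriangles/sec per MHz


def peak_rate(per_mhz: float, clock_mhz: float) -> float:
    """Scale a per-MHz rate to an absolute rate in millions of units per second."""
    return per_mhz * clock_mhz


if __name__ == "__main__":
    fill = peak_rate(PIXEL_FILL_PER_MHZ, GPU_MAX_MHZ)
    tris = peak_rate(TRIANGLE_RATE_PER_MHZ, GPU_MAX_MHZ)
    print(f"Peak pixel fill rate: {fill:.0f} Mpixels/sec")
    print(f"Peak triangle rate:   {tris:.2f} Mtriangles/sec")
```

At a lower GPU clock the same function gives the proportionally lower peak.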

### **External Memory Interfaces**

- Multi-protocol dynamic memory controller
- 32-bit or 64-bit interfaces to DDR4, DDR3, DDR3L, or LPDDR3 memories, and 32-bit interface to LPDDR4 memory
- ECC support in 64-bit and 32-bit modes
- Up to 32GB of address space using single or dual rank of 8-, 16-, or 32-bit-wide memories
- Static memory interfaces
  - eMMC4.51 Managed NAND flash support
  - ONFI3.1 NAND flash with 24-bit ECC
  - 1-bit SPI, 2-bit SPI, 4-bit SPI (Quad-SPI), or two Quad-SPI (8-bit) serial NOR flash

#### **8-Channel DMA Controller**

- Two DMA controllers of 8-channels each
- Memory-to-memory, memory-to-peripheral, peripheral-to-memory, and scatter-gather transaction support

#### **Serial Transceivers**

- Four dedicated PS-GTR receivers and transmitters support data rates up to 6.0Gb/s
  - Supports SGMII tri-speed Ethernet, PCI Express® Gen2, Serial-ATA (SATA), USB3.0, and DisplayPort

### **Dedicated I/O Peripherals and Interfaces**

- PCI Express compliant with PCIe® 2.1 base specification
  - Root complex and End Point configurations
  - x1, x2, and x4 at Gen1 or Gen2 rates
- SATA Host
  - 1.5, 3.0, and 6.0Gb/s data rates as defined by SATA Specification, revision 3.1
  - Supports up to two channels
- DisplayPort Controller
  - Up to 5.4Gb/s rate
  - Up to two TX lanes (no RX support)

- Four 10/100/1000 tri-speed Ethernet MAC peripherals with IEEE Std 802.3 and IEEE Std 1588 revision 2.0 support
  - Scatter-gather DMA capability
  - Recognition of IEEE Std 1588 rev.2 PTP frames
  - GMII, RGMII, and SGMII interfaces
  - Jumbo frames
- Two USB 3.0/2.0 Device, Host, or OTG peripherals, each supporting up to 12 endpoints
  - USB 3.0/2.0 compliant device IP core
  - Super-speed, high-speed, full-speed, and low-speed modes
  - Intel XHCI-compliant USB host
- Two full CAN 2.0B-compliant CAN bus interfaces
  - CAN 2.0-A and CAN 2.0-B and ISO 11898-1 standard compliant
- Two SD/SDIO 2.0/eMMC4.51 compliant controllers
- Two full-duplex SPI ports with three peripheral chip selects
- Two high-speed UARTs (up to 1Mb/s)
- Two master and slave I2C interfaces
- Up to 78 flexible multiplexed I/O (MIO) (up to three banks of 26 I/Os) for peripheral pin assignment
- Up to 96 EMIOs (up to three banks of 32 I/Os) connected to the PL

#### Interconnect

- High-bandwidth connectivity within PS and between PS and PL
- ARM AMBA® AXI4-based
- QoS support for latency and bandwidth control
- Cache Coherent Interconnect (CCI)

### **System Memory Management**

- System Memory Management Unit (SMMU)
- Xilinx Memory Protection Unit (XMPU)

### **Platform Management Unit**

- Power gates PS peripherals, power islands, and power domains
- Clock gates PS peripheral user firmware option

### **Configuration and Security Unit**

- Boots PS and configures PL
- Supports secure and non-secure boot modes

### **System Monitor in PS**

- On-chip voltage and temperature sensing



# **Feature Summary**

Table 1: Zynq UltraScale+ MPSoC: CG Device Feature Summary

|                                         | ZU2CG                                                                                                                           | ZU3CG                                                                                                       | ZU4CG                     | ZU5CG                           | ZU6CG                    | ZU7CG         | ZU9CG         |  |  |
|-----------------------------------------|---------------------------------------------------------------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------|---------------------------|---------------------------------|--------------------------|---------------|---------------|--|--|
| Application Processing Unit             | Dual-core ARM Cortex-A53 MPCore with CoreSight; NEON & Single/Double Precision Floating Point; 32KB/32KB L1 Cache, 1MB L2 Cache |  |  |  |  |  |  |  |  |
| Real-Time Processing Unit               | Dual-core ARM Cortex-R5 with CoreSight; Single/Double Precision Floating Point; 32KB/32KB L1 Cache, and TCM |  |  |  |  |  |  |  |  |
| Embedded and External<br>Memory         | 256KB On-Chip Memory w/ECC; External DDR4; DDR3; DDR3L; LPDDR4; LPDDR3; External Quad-SPI; NAND; eMMC |  |  |  |  |  |  |  |  |
| General Connectivity                    | 214 PS I/O; UART; CAN; USB 2.0; I2C; SPI; 32b GPIO; Real Time Clock; WatchDog Timers; Triple Timer Counters |  |  |  |  |  |  |  |  |
| High-Speed Connectivity                 | 4 PS-GTR; PCIe Gen1/2; Serial ATA 3.1; DisplayPort 1.2a; USB 3.0; SGMII |  |  |  |  |  |  |  |  |
| System Logic Cells                      | 103,320                                                                                                                         | 154,350                                                                                                     | 192,150                   | 256,200                         | 469,446                  | 504,000       | 599,550       |  |  |
| CLB Flip-Flops                          | 94,464                                                                                                                          | 141,120                                                                                                     | 175,680                   | 234,240                         | 429,208                  | 460,800       | 548,160       |  |  |
| CLB LUTs                                | 47,232                                                                                                                          | 70,560                                                                                                      | 87,840                    | 117,120                         | 214,604                  | 230,400       | 274,080       |  |  |
| Distributed RAM (Mb)                    | 1.2                                                                                                                             | 1.8                                                                                                         | 2.6                       | 3.5                             | 6.9                      | 6.2           | 8.8           |  |  |
| Block RAM Blocks                        | 150                                                                                                                             | 216                                                                                                         | 128                       | 144                             | 714                      | 312           | 912           |  |  |
| Block RAM (Mb)                          | 5.3                                                                                                                             | 7.6                                                                                                         | 4.5                       | 5.1                             | 25.1                     | 11.0          | 32.1          |  |  |
| UltraRAM Blocks                         | 0                                                                                                                               | 0                                                                                                           | 48                        | 64                              | 0                        | 96            | 0             |  |  |
| UltraRAM (Mb)                           | 0                                                                                                                               | 0                                                                                                           | 14.0                      | 18.0                            | 0                        | 27.0          | 0             |  |  |
| DSP Slices                              | 240                                                                                                                             | 360                                                                                                         | 728                       | 1,248                           | 1,973                    | 1,728         | 2,520         |  |  |
| CMTs                                    | 3                                                                                                                               | 3                                                                                                           | 4                         | 4                               | 4                        | 8             | 4             |  |  |
| Max. HP I/O <sup>(1)</sup>              | 156                                                                                                                             | 156                                                                                                         | 156                       | 156                             | 208                      | 416           | 208           |  |  |
| Max. HD I/O <sup>(2)</sup>              | 96                                                                                                                              | 96                                                                                                          | 96                        | 96                              | 120                      | 48            | 120           |  |  |
| System Monitor                          | 2                                                                                                                               | 2                                                                                                           | 2                         | 2                               | 2                        | 2             | 2             |  |  |
| GTH Transceiver 16.3Gb/s <sup>(3)</sup> | 0                                                                                                                               | 0                                                                                                           | 16                        | 16                              | 24                       | 24            | 24            |  |  |
| GTY Transceivers 32.75Gb/s              | 0                                                                                                                               | 0                                                                                                           | 0                         | 0                               | 0                        | 0             | 0             |  |  |
| Transceiver Fractional PLLs             | 0                                                                                                                               | 0                                                                                                           | 8                         | 8                               | 12                       | 12            | 12            |  |  |
| PCIe Gen3 x16 and Gen4 x8               | 0                                                                                                                               | 0                                                                                                           | 2                         | 2                               | 0                        | 2             | 0             |  |  |
| 150G Interlaken                         | 0                                                                                                                               | 0                                                                                                           | 0                         | 0                               | 0                        | 0             | 0             |  |  |
| 100G Ethernet w/ RS-FEC                 | 0                                                                                                                               | 0                                                                                                           | 0                         | 0                               | 0                        | 0             | 0             |  |  |

- 1. HP = High-performance I/O with support for I/O voltage from 1.0V to 1.8V.
- 2. HD = High-density I/O with support for I/O voltage from 1.2V to 3.3V.
- 3. GTH transceivers in the SFVC784 package support data rates up to 12.5Gb/s. See Table 2.
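
Two relationships can be read off the Table 1 rows and used as a transcription check: CLB flip-flops are twice the CLB LUT count, and the Block RAM capacity tracks the block count. The sketch below tests both for the CG devices. The 36Kb-per-block figure and the 1Mb = 1024Kb convention are assumptions that are not stated in this excerpt, and the dictionary layout and function name are illustrative.

```python
# Values copied from Table 1 (CG devices): (CLB FFs, CLB LUTs, BRAM blocks, BRAM Mb).
TABLE_1 = {
    "ZU2CG": (94_464, 47_232, 150, 5.3),
    "ZU3CG": (141_120, 70_560, 216, 7.6),
    "ZU4CG": (175_680, 87_840, 128, 4.5),
    "ZU5CG": (234_240, 117_120, 144, 5.1),
    "ZU6CG": (429_208, 214_604, 714, 25.1),
    "ZU7CG": (460_800, 230_400, 312, 11.0),
    "ZU9CG": (548_160, 274_080, 912, 32.1),
}

# Assumption (not stated in this excerpt): each Block RAM block holds 36Kb,
# and 1Mb is counted as 1024Kb.
BRAM_KB_PER_BLOCK = 36
KB_PER_MB = 1024


def row_is_consistent(ffs: int, luts: int, blocks: int, bram_mb: float) -> bool:
    """Return True when FFs == 2 * LUTs and the BRAM capacity matches the block count."""
    ff_ok = ffs == 2 * luts
    bram_ok = round(blocks * BRAM_KB_PER_BLOCK / KB_PER_MB, 1) == bram_mb
    return ff_ok and bram_ok


if __name__ == "__main__":
    for device, row in TABLE_1.items():
        print(device, "consistent" if row_is_consistent(*row) else "MISMATCH")
```

A mismatch would point to a transcription error in the table rather than to a property of the silicon.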



Table 2: Zynq UltraScale+ MPSoC: CG Device-Package Combinations and Maximum I/Os

| Package (1)(2)(3)(4)(5)    | Package<br>Dimensions<br>(mm) | ZU2CG              | ZU3CG              | ZU4CG              | ZU5CG              | ZU6CG              | ZU7CG              | ZU9CG              |
|----------------------------|-----------------|--------------------|--------------------|--------------------|--------------------|--------------------|--------------------|--------------------|
|                            |                 | HD, HP<br>GTH, GTY | HD, HP<br>GTH, GTY | HD, HP<br>GTH, GTY | HD, HP<br>GTH, GTY | HD, HP<br>GTH, GTY | HD, HP<br>GTH, GTY | HD, HP<br>GTH, GTY |
| SBVA484 <sup>(6)</sup>     | 19x19           | 24, 58<br>0, 0     | 24, 58<br>0, 0     |                    |                    |                    |                    |                    |
| SFVA625                    | 21x21           | 24, 156<br>0, 0    | 24, 156<br>0, 0    |                    |                    |                    |                    |                    |
| SFVC784 <sup>(7)</sup>     | 23x23           | 96, 156<br>0, 0    | 96, 156<br>0, 0    | 96, 156<br>4, 0    | 96, 156<br>4, 0    |                    |                    |                    |
| FBVB900                    | 31x31           |                    |                    | 48, 156<br>16, 0   | 48, 156<br>16, 0   |                    | 48, 156<br>16, 0   |                    |
| FFVC900                    | 31x31           |                    |                    |                    |                    | 48, 156<br>16, 0   |                    | 48, 156<br>16, 0   |
| FFVB1156                   | 35x35           |                    |                    |                    |                    | 120, 208<br>24, 0  |                    | 120, 208<br>24, 0  |
| FFVC1156                   | 35x35           |                    |                    |                    |                    |                    | 48, 312<br>20, 0   |                    |
| FFVF1517                   | 40x40           |                    |                    |                    |                    |                    | 48, 416<br>24, 0   |                    |

- 1. Go to Ordering Information for package designation details.
- 2. FB/FF packages have 1.0mm ball pitch. SB/SF packages have 0.8mm ball pitch.
- 3. All device package combinations bond out 4 PS-GTR transceivers.
- 4. All device package combinations bond out 214 PS I/O except ZU2CG and ZU3CG in the SBVA484 and SFVA625 packages, which bond out 170 PS I/Os.
- 5. Packages with the same last letter and number sequence, e.g., A484, are footprint compatible with all other UltraScale devices with the same sequence. The footprint compatible devices within this family are outlined.
- 6. All 58 HP I/O pins are powered by the same  $V_{\text{CCO}}$  supply.
- 7. GTH transceivers in the SFVC784 package support data rates up to 12.5Gb/s.
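
Each populated cell in the device-package tables packs four counts in the order given by the sub-header: HD and HP I/Os on the first line, GTH and GTY transceivers on the second, with `<br>` marking the line break in this markdown rendering. A minimal sketch of a parser for that cell format follows. The `IoCount` type and `parse_cell` function are hypothetical helpers, not part of any Xilinx tool.

```python
from typing import NamedTuple, Optional


class IoCount(NamedTuple):
    """Counts from one table cell, in the sub-header order HD, HP / GTH, GTY."""
    hd: int
    hp: int
    gth: int
    gty: int


def parse_cell(cell: str) -> Optional[IoCount]:
    """Parse a cell such as '96, 156<br>4, 0'; return None for an empty cell."""
    text = cell.strip()
    if not text:
        return None  # device is not offered in this package
    io_part, xcvr_part = text.split("<br>")
    hd, hp = (int(v) for v in io_part.split(","))
    gth, gty = (int(v) for v in xcvr_part.split(","))
    return IoCount(hd, hp, gth, gty)


if __name__ == "__main__":
    # ZU4CG in the SFVC784 package, as listed in Table 2.
    print(parse_cell("96, 156<br>4, 0"))
```

Returning `None` for blank cells keeps "not available in this package" distinct from a combination with zero I/Os.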



Table 4: Zynq UltraScale+ MPSoC: EG Device-Package Combinations and Maximum I/Os

| Package (1)(2)(3)(4)(5) | Package<br>Dimensions<br>(mm) | ZU2EG              | ZU3EG              | ZU4EG              | ZU5EG              | ZU6EG              | ZU7EG              | ZU9EG              | ZU11EG             | ZU15EG             | ZU17EG             | ZU19EG             |
|-------------------------|-------------------------------|--------------------|--------------------|--------------------|--------------------|--------------------|--------------------|--------------------|--------------------|--------------------|--------------------|--------------------|
|                         |                               | HD, HP<br>GTH, GTY |
| SBVA484 <sup>(6)</sup>  | 19x19                         | 24, 58<br>0, 0     | 24, 58<br>0, 0     |                    |                    |                    |                    |                    |                    |                    |                    |                    |
| SFVA625                 | 21x21                         | 24, 156<br>0, 0    | 24, 156<br>0, 0    |                    |                    |                    |                    |                    |                    |                    |                    |                    |
| SFVC784 <sup>(7)</sup>  | 23x23                         | 96, 156<br>0, 0    | 96, 156<br>0, 0    | 96, 156<br>4, 0    | 96, 156<br>4, 0    |                    |                    |                    |                    |                    |                    |                    |
| FBVB900                 | 31x31                         |                    |                    | 48, 156<br>16, 0   | 48, 156<br>16, 0   |                    | 48, 156<br>16, 0   |                    |                    |                    |                    |                    |
| FFVC900                 | 31x31                         |                    |                    |                    |                    | 48, 156<br>16, 0   |                    | 48, 156<br>16, 0   |                    | 48, 156<br>16, 0   |                    |                    |
| FFVB1156                | 35x35                         |                    |                    |                    |                    | 120, 208<br>24, 0  |                    | 120, 208<br>24, 0  |                    | 120, 208<br>24, 0  |                    |                    |
| FFVC1156                | 35x35                         |                    |                    |                    |                    |                    | 48, 312<br>20, 0   |                    | 48, 312<br>20, 0   |                    |                    |                    |
| FFVB1517                | 40x40                         |                    |                    |                    |                    |                    |                    |                    | 72, 416<br>16, 0   |                    | 72, 572<br>16, 0   | 72, 572<br>16, 0   |
| FFVF1517                | 40x40                         |                    |                    |                    |                    |                    | 48, 416<br>24, 0   |                    | 48, 416<br>32, 0   |                    |                    |                    |
| FFVC1760                | 42.5x42.5                     |                    |                    |                    |                    |                    |                    |                    | 96, 416<br>32, 16  |                    | 96, 416<br>32, 16  | 96, 416<br>32, 16  |
| FFVD1760                | 42.5x42.5                     |                    |                    |                    |                    |                    |                    |                    |                    |                    | 48, 260<br>44, 28  | 48, 260<br>44, 28  |
| FFVE1924                | 45x45                         |                    |                    |                    |                    |                    |                    |                    |                    |                    | 96, 572<br>44, 0   | 96, 572<br>44, 0   |

- 1. Go to Ordering Information for package designation details.
- 2. FB/FF packages have 1.0mm ball pitch. SB/SF packages have 0.8mm ball pitch.
- 3. All device package combinations bond out 4 PS-GTR transceivers.
- 4. All device package combinations bond out 214 PS I/O except ZU2EG and ZU3EG in the SBVA484 and SFVA625 packages, which bond out 170 PS I/Os.
- 5. Packages with the same last letter and number sequence, e.g., A484, are footprint compatible with all other UltraScale devices with the same sequence. The footprint compatible devices within this family are outlined.
- 6. All 58 HP I/O pins are powered by the same  $V_{CCO}$  supply.
- 7. GTH transceivers in the SFVC784 package support data rates up to 12.5Gb/s.



Table 6: Zynq UltraScale+ MPSoC: EV Device-Package Combinations and Maximum I/Os

| Package (1)(2)(3)(4)    | Package<br>Dimensions<br>(mm) | ZU4EV              | ZU5EV              | ZU7EV              |
|-------------------------|-----------------|--------------------|--------------------|--------------------|
|                         |                 | HD, HP<br>GTH, GTY | HD, HP<br>GTH, GTY | HD, HP<br>GTH, GTY |
| SFVC784 <sup>(5)</sup>  | 23x23           | 96, 156<br>4, 0    | 96, 156<br>4, 0    |                    |
| FBVB900                 | 31x31           | 48, 156<br>16, 0   | 48, 156<br>16, 0   | 48, 156<br>16, 0   |
| FFVC1156                | 35x35           |                    |                    | 48, 312<br>20, 0   |
| FFVF1517                | 40x40           |                    |                    | 48, 416<br>24, 0   |

- 1. Go to Ordering Information for package designation details.
- 2. FB/FF packages have 1.0mm ball pitch. SF packages have 0.8mm ball pitch.
- 3. All device package combinations bond out 4 PS-GTR transceivers.
- 4. Packages with the same last letter and number sequence, e.g., C784, are footprint compatible with all other UltraScale devices with the same sequence. The footprint compatible devices within this family are outlined.
- 5. GTH transceivers in the SFVC784 package support data rates up to 12.5Gb/s.



ASIC-class capabilities afforded by the UltraScale MPSoC architecture while supporting rapid system development.

The inclusion of an application processor enables high-level operating system support, e.g., Linux. Other standard operating systems used with the Cortex-A53 processor are also available for the Zynq UltraScale+ MPSoC family. The PS and the PL are on separate power domains, enabling users to power down the PL for power management if required. The processors in the PS always boot first, allowing a software-centric approach to PL configuration. PL configuration is managed by software running on the CPU, so the device boots similarly to an ASSP.



## **Processing System**

### **Application Processing Unit (APU)**

The key features of the APU include:

- 64-bit quad-core ARM Cortex-A53 MPCores. Features associated with each core include:
  - ARMv8-A Architecture
  - Operating target frequency: up to 1.5GHz
  - Single and double precision floating point: 4 SP / 2 DP FLOPs
  - NEON Advanced SIMD support with single and double precision floating point instructions
  - A64 instruction set in 64-bit operating mode, A32/T32 instruction set in 32-bit operating mode
  - Level 1 cache (separate instruction and data, 32KB each for each Cortex-A53 CPU)
    - 2-way set-associative Instruction Cache with parity support
    - 4-way set-associative Data Cache with ECC support
  - Integrated memory management unit (MMU) per processor core
  - TrustZone for secure mode operation
  - Virtualization support
- Ability to operate in single processor, symmetric quad processor, and asymmetric quad-processor modes
- Integrated 16-way set-associative 1MB Unified Level 2 cache with ECC support
- Interrupts and Timers
  - Generic interrupt controller (GIC-400)
  - ARM generic timers (4 timers per CPU)
  - One watchdog timer (WDT)
  - One global timer
  - Two triple timers/counters (TTC)
- Little and big endian support
  - Big endian support in BE8 mode
- CoreSight debug and trace support
  - Embedded Trace Macrocell (ETM) for instruction trace
  - Cross trigger interface (CTI) enabling hardware breakpoints and triggers
- ACP interface to PL for I/O coherency and Level 2 cache allocation
- ACE interface to PL for full coherency
- Power island gating on each processor core
- Optional eFUSE disable per core
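
The core count, target frequency, and floating-point figures above can be combined into a rough peak-throughput estimate. The sketch below assumes that "4 SP / 2 DP FLOPs" means operations per clock cycle per core, which the list does not state explicitly, so the result is a theoretical upper bound under that assumption and not a benchmark figure. All names are illustrative.

```python
# Figures from the APU feature list.
CORES = 4
MAX_GHZ = 1.5

# Assumption: the "4 SP / 2 DP FLOPs" entry is per clock cycle, per core.
SP_FLOPS_PER_CYCLE = 4
DP_FLOPS_PER_CYCLE = 2


def peak_gflops(flops_per_cycle: int, cores: int = CORES, ghz: float = MAX_GHZ) -> float:
    """Theoretical peak in GFLOPS: cores x clock (GHz) x FLOPs per cycle."""
    return cores * ghz * flops_per_cycle


if __name__ == "__main__":
    print(f"Peak single precision: {peak_gflops(SP_FLOPS_PER_CYCLE):.1f} GFLOPS")
    print(f"Peak double precision: {peak_gflops(DP_FLOPS_PER_CYCLE):.1f} GFLOPS")
```

Dual-core CG devices or lower speed grades can be modelled by passing different `cores` and `ghz` values.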



- Low power modes
  - Active/precharge power down
  - Self-refresh, including clean exit from self-refresh after a controller power cycle
- Enhanced DDR training by allowing software to measure read/write eye and make delay adjustments dynamically
- Independent performance monitors for read path and write path
- Integration of PHY Debug Access Port (DAP) into JTAG for testing

The DDR memory controller is multi-ported and enables the PS and the PL to have shared access to a common memory. The DDR controller features six AXI slave ports for this purpose:

- Two 128-bit AXI ports from the ARM Cortex-A53 CPU(s), RPU (ARM Cortex-R5 and LPD peripherals), GPU, high-speed peripherals (USB3, PCIe & SATA), and High Performance Ports (HP0 & HP1) from the PL through the Cache Coherent Interconnect (CCI)
- One 64-bit port dedicated to the ARM Cortex-R5 CPU(s)
- One 128-bit AXI port from the DisplayPort and HP2 port from the PL
- One 128-bit AXI port from HP3 and HP4 ports from the PL
- One 128-bit AXI port from General DMA and HP5 from the PL
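
The five bullets above account for the six AXI slave ports because the first bullet describes two ports. A small sketch of that port map as a data structure makes the tally explicit. The `AxiPort` type and the short source labels are illustrative names, not register or signal names from the device.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AxiPort:
    width_bits: int
    sources: str  # informal label summarizing the masters behind the port


# One entry per DDR controller AXI slave port, following the list above.
DDR_AXI_PORTS = [
    AxiPort(128, "CCI (APU, RPU/LPD, GPU, USB3/PCIe/SATA, HP0/HP1), first port"),
    AxiPort(128, "CCI (same masters), second port"),
    AxiPort(64, "Cortex-R5 CPU(s)"),
    AxiPort(128, "DisplayPort and HP2"),
    AxiPort(128, "HP3 and HP4"),
    AxiPort(128, "General DMA and HP5"),
]

if __name__ == "__main__":
    for index, port in enumerate(DDR_AXI_PORTS):
        print(f"port {index}: {port.width_bits:>3}-bit  {port.sources}")
    total = sum(port.width_bits for port in DDR_AXI_PORTS)
    print(f"{len(DDR_AXI_PORTS)} ports, {total} data bits in aggregate")
```

The aggregate width is only a bookkeeping figure; sustained bandwidth depends on the memory configuration and arbitration, which this sketch does not model.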

### **High-Speed Connectivity Peripherals**

#### **PCIe**

- Compliant with the PCI Express Base Specification 2.1
- Fully compliant with PCI Express transaction ordering rules
- Lane width: x1, x2, or x4 at Gen1 or Gen2 rates
- 1 Virtual Channel
- Full duplex PCIe port
- End Point and single PCIe link Root Port
- Root Port supports Enhanced Configuration Access Mechanism (ECAM), Cfg Transaction generation
- Root Port support for INTx, and MSI
- Endpoint support for MSI or MSI-X
  - 1 physical function, no SR-IOV
  - No relaxed or ID ordering
  - Fully configurable BARs
  - INTx not recommended, but can be generated
  - Endpoint to support configurable target/slave apertures with address translation and Interrupt capability
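
The lane widths and generations listed above translate into link bandwidth as follows. The sketch uses the standard PCIe signaling rates (2.5GT/s per lane for Gen1, 5.0GT/s for Gen2) and 8b/10b line encoding, which come from the PCI Express specification rather than from this document. The result is the per-direction data rate before packet and protocol overhead. The function name is illustrative.

```python
# Per-lane signaling rates in GT/s (from the PCI Express specification).
LINE_RATE_GT_S = {"Gen1": 2.5, "Gen2": 5.0}

# Gen1 and Gen2 use 8b/10b encoding: 8 data bits for every 10 line bits.
ENCODING_DATA_BITS = 8
ENCODING_LINE_BITS = 10

SUPPORTED_LANE_WIDTHS = (1, 2, 4)


def link_data_rate_gbps(generation: str, lanes: int) -> float:
    """Per-direction data rate in Gb/s after 8b/10b, before protocol overhead."""
    if lanes not in SUPPORTED_LANE_WIDTHS:
        raise ValueError(f"unsupported lane width: x{lanes}")
    per_lane = LINE_RATE_GT_S[generation] * ENCODING_DATA_BITS / ENCODING_LINE_BITS
    return per_lane * lanes


if __name__ == "__main__":
    for gen in ("Gen1", "Gen2"):
        for lanes in SUPPORTED_LANE_WIDTHS:
            print(f"{gen} x{lanes}: {link_data_rate_gbps(gen, lanes):.1f} Gb/s per direction")
```

Achievable payload throughput is lower once transaction-layer headers and flow control are accounted for.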



- Audio support
  - A single stream carries up to 8 LPCM channels at 192kHz with 24-bit resolution
  - Supports compressed formats including DRA, Dolby MAT, and DTS HD
  - Multi-Stream Transport can extend the number of audio channels
  - Audio copy protection
  - 2-channel streaming or input from the PL
  - Multi-channel non-streaming audio from a memory audio frame buffer
- Includes a System Time Clock (STC) compliant with ISO/IEC 13818-1
- Boot-time display using minimum resources
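
As a worked example of the audio figures above, the raw bit rate of one uncompressed stream is the product of channel count, sample rate, and sample resolution. The sketch below computes it for the maximum listed configuration. It ignores any framing or transport overhead, and the function name is illustrative.

```python
def lpcm_bit_rate(channels: int, sample_rate_hz: int, bits_per_sample: int) -> int:
    """Raw LPCM payload rate in bits per second (no framing overhead)."""
    return channels * sample_rate_hz * bits_per_sample


if __name__ == "__main__":
    # Maximum single-stream configuration from the list: 8 channels, 192kHz, 24-bit.
    rate = lpcm_bit_rate(channels=8, sample_rate_hz=192_000, bits_per_sample=24)
    print(f"{rate} bit/s = {rate / 1e6:.3f} Mb/s")
```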

### **Platform Management Unit (PMU)**

- Performs system initialization during boot
- Acts as a delegate to the application and real-time processors during sleep state
- Initiates power-up and restart after the wake-up request
- Maintains the system power state at all times
- Manages the sequence of low-level events required for power-up, power-down, reset, clock gating, and power gating of islands and domains
- Provides error management (error handling and reporting)
- Provides safety check functions (e.g., memory scrubbing)

The PMU includes the following blocks:

- Platform management processor
- Fixed ROM for boot-up of the device
- 128KB RAM with ECC for optional user/firmware code
- Local and global registers to manage power-down, power-up, reset, clock gating, and power gating requests
- Interrupt controller with 16 interrupts from other modules and the inter-processor communication interface (IPI)
- GPI and GPO interfaces to and from PS I/O and PL
- JTAG interface for PMU debug
- Optional User-Defined Firmware



### **Configuration Security Unit (CSU)**

- Triple redundant Secure Processor Block (SPB) with built-in ECC
- Crypto Interface Block consisting of
  - 256-bit AES-GCM
  - SHA-3/384
  - 4096-bit RSA
- Key Management Unit
- Built-in DMA
- PCAP interface
- Supports ROM validation during pre-configuration stage
- Loads First Stage Boot Loader (FSBL) into OCM in either secure or non-secure boot modes
- Supports voltage, temperature, and frequency monitoring after configuration

### Xilinx Peripheral Protection Unit (XPPU)

- Provides peripheral protection support
- Up to 20 masters simultaneously
- Multiple aperture sizes
- Access control for a specified set of address apertures on a per-master basis
- 64KB peripheral apertures, with access controlled on a per-peripheral basis

### I/O Peripherals

The IOP unit contains the data communication peripherals. Key features of the IOP include:

### Triple-Speed Gigabit Ethernet

- Compatible with IEEE Std 802.3 and supports 10/100/1000Mb/s transfer rates (Full and Half duplex)
- Supports jumbo frames
- Built-in Scatter-Gather DMA capability
- Statistics counter registers for RMON/MIB
- Multiple I/O types (1.8, 2.5, 3.3V) on RGMII interface with external PHY
- GMII interface to the PL to support interfaces such as TBI, SGMII, and RGMII v2.0
- Automatic pad and cyclic redundancy check (CRC) generation on transmitted frames
- Transmit and receive IP, TCP, and UDP checksum offload
- MDIO interface for physical layer management



- Full duplex flow control with recognition of incoming pause frames and hardware generation of transmitted pause frames
- 802.1Q VLAN tagging with recognition of incoming VLAN and priority tagged frames
- Supports IEEE Std 1588 v2
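
For readers less familiar with the 802.1Q tagging mentioned above, the sketch below decodes the tag fields from a raw Ethernet frame in software. It follows the standard tag layout (TPID 0x8100, then a 16-bit TCI holding a 3-bit priority, a 1-bit DEI, and a 12-bit VLAN ID). It only illustrates the frame format and is not a description of how the MAC hardware is implemented; `parse_vlan_tag` is a hypothetical helper.

```python
import struct

TPID_8021Q = 0x8100  # Tag Protocol Identifier for a single 802.1Q tag


def parse_vlan_tag(frame: bytes):
    """Return (priority, dei, vlan_id) if the frame carries an 802.1Q tag, else None."""
    if len(frame) < 16:
        return None
    # Bytes 0-11 are destination and source MAC; the tag, if present, follows.
    tpid, tci = struct.unpack_from("!HH", frame, 12)
    if tpid != TPID_8021Q:
        return None
    return (tci >> 13, (tci >> 12) & 0x1, tci & 0x0FFF)


if __name__ == "__main__":
    tagged = bytes(12) + struct.pack("!HH", TPID_8021Q, (5 << 13) | 100) + b"\x08\x00"
    print(parse_vlan_tag(tagged))
```

A priority-tagged frame is simply one whose VLAN ID field is 0, so the same decoder covers both cases named in the feature list.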

#### SD/SDIO 3.0 Controller

In addition to secure digital (SD) devices, this controller also supports eMMC 4.51.

- Host mode support only
- Built-in DMA
- 1/4-Bit SD Specification, version 3.0
- 1/4/8-Bit eMMC Specification, version 4.51
- Supports primary boot from SD Card and eMMC (Managed NAND)
- High speed, default speed, and low-speed support
- 1 and 4-bit data interface support
  - Low speed clock 0-400kHz
  - Default speed 0-25MHz
  - High speed clock 0-50MHz
- High speed Interface
  - SD UHS-1: 208MHz
  - eMMC HS200: 200MHz
- Memory, I/O, and SD cards
- Power control modes
- Data FIFO interface up to 512B

#### **UART**

- Programmable baud rate generator
- 6, 7, or 8 data bits
- 1, 1.5, or 2 stop bits
- Odd, even, space, mark, or no parity
- Parity, framing, and overrun error detection
- Line break generation and detection
- Automatic echo, local loopback, and remote loopback channel modes
- Modem control signals: CTS, RTS, DSR, DTR, RI, and DCD (from EMIO only)



#### SPI

- Full-duplex operation offers simultaneous receive and transmit
- 128B deep read and write FIFO
- Master or slave SPI mode
- Up to 3 chip select lines
- Multi-master environment
- Identifies an error condition if more than one master detected
- Selectable master clock reference
- Software can poll for status or be interrupt driven

#### **I2C**

- 128-bit buffer size
- Both normal (100kHz) and fast bus data rates (400kHz)
- Master or slave mode
- Normal or extended addressing
- I2C bus hold for slow host service

#### **GPIO**

- Up to 128 GPIO bits
  - Up to 78-bits from MIO and 96-bits from EMIO
- Each GPIO bit can be dynamically programmed as input or output
- Independent reset values for each bit of all registers
- Interrupt request generation for each GPIO signal
- Single Channel (Bit) write capability for all control registers, including the data output register, direction control register, and interrupt clear register
- Read back in output mode

#### CAN

- Conforms to the ISO 11898-1, CAN 2.0A, and CAN 2.0B standards
- Both standard (11-bit identifier) and extended (29-bit identifier) frames
- Bit rates up to 1Mb/s
- Transmit and Receive message FIFO with a depth of 64 messages
- Watermark interrupts for TXFIFO and RXFIFO
- Automatic re-transmission on errors or arbitration loss in normal mode
- Acceptance filtering with 4 acceptance filters
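
The acceptance filtering listed above can be pictured with a generic mask-and-match model, sketched below. This is an assumption for illustration: the data sheet does not describe the controller's filter registers, so the `(filter_id, mask)` pairs and the function names are hypothetical. A frame is accepted when its identifier matches any filter on the bits selected by that filter's mask.

```python
def matches(frame_id: int, filter_id: int, mask: int) -> bool:
    """True when frame_id equals filter_id on every bit set in mask."""
    return (frame_id & mask) == (filter_id & mask)


def accepted(frame_id: int, filters) -> bool:
    """True when at least one (filter_id, mask) pair matches the identifier."""
    return any(matches(frame_id, fid, mask) for fid, mask in filters)


if __name__ == "__main__":
    # Up to four filters in this model; here one accepts standard IDs 0x100-0x1FF.
    filters = [(0x100, 0x700)]
    print(accepted(0x123, filters), accepted(0x223, filters))
```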





Figure 2: Column-Based Device Divided into Clock Regions

### Input/Output

All Zynq UltraScale+ MPSoCs have I/O pins for communicating to external components. In addition, in the MPSoC's PS, there are another 78 I/Os that the I/O peripherals use to communicate to external components, referred to as multiplexed I/O (MIO). If more than 78 pins are required by the I/O peripherals, the I/O pins in the PL can be used to extend the MPSoC interfacing capability, referred to as extended MIO (EMIO).

The number of I/O pins in the PL of Zynq UltraScale+ MPSoCs varies depending on device and package. Each I/O is configurable and can comply with a large number of I/O standards. The I/Os are classed as high-performance (HP), or high-density (HD). The HP I/Os are optimized for highest performance operation, from 1.0V to 1.8V. The HD I/Os are reduced-feature I/Os organized in banks of 24, providing voltage support from 1.2V to 3.3V.

All I/O pins are organized in banks, with 52 HP pins per bank or 24 HD pins per bank. Each bank has one common  $V_{CCO}$  output buffer power supply, which also powers certain input buffers. Some single-ended input buffers require an internally generated or an externally applied reference voltage ( $V_{REF}$ ).  $V_{REF}$  pins can be driven directly from the PCB or internally generated using the internal  $V_{REF}$  generator circuitry present in each bank.

#### I/O Electrical Characteristics

Single-ended outputs use a conventional CMOS push/pull output structure driving High towards  $V_{CCO}$  or Low towards ground, and can be put into a high-Z state. The system designer can specify the slew rate and the output strength. The input is always active but is usually ignored while the output is active. Each pin can optionally have a weak pull-up or a weak pull-down resistor.

Most signal pin pairs can be configured as differential input pairs or output pairs. Differential input pin pairs can optionally be terminated with a  $100\Omega$  internal resistor. All UltraScale architecture-based devices support differential standards beyond LVDS, including RSDS, BLVDS, differential SSTL, and differential HSTL. Each of the I/Os supports memory I/O standards, such as single-ended and differential HSTL as well as single-ended and differential SSTL. The Zynq UltraScale+ family includes support for MIPI with a dedicated D-PHY in the I/O bank.



#### 3-State Digitally Controlled Impedance and Low Power I/O Features

The 3-state Digitally Controlled Impedance (T\_DCI) can control the output drive impedance (series termination) or can provide parallel termination of an input signal to  $V_{CCO}$  or split (Thevenin) termination to  $V_{CCO}/2$ . This allows users to eliminate off-chip termination for signals using T\_DCI. In addition to board space savings, the termination automatically turns off when in output mode or when 3-stated, saving considerable power compared to off-chip termination. The I/Os also have low power modes for IBUF and IDELAY to provide further power savings, especially when used to implement memory interfaces.

### I/O Logic

### Input and Output Delay

All inputs and outputs can be configured as either combinatorial or registered. Double data rate (DDR) is supported by all inputs and outputs. Any input or output can be individually delayed by up to 1,250ps with a resolution of 5–15ps. Such delays are implemented as IDELAY and ODELAY. The number of delay steps can be set by configuration and can also be incremented or decremented while in use. The IDELAY and ODELAY can be cascaded together to double the amount of delay in a single direction.

#### **ISERDES** and **OSERDES**

Many applications combine high-speed, bit-serial I/O with slower parallel operation inside the device. This requires a serializer and deserializer (SerDes) inside the I/O logic. Each I/O pin possesses an IOSERDES (ISERDES and OSERDES) capable of performing serial-to-parallel or parallel-to-serial conversions with programmable widths of 2, 4, or 8 bits. These I/O logic features enable high-performance interfaces, such as Gigabit Ethernet/1000BaseX/SGMII, to be moved from the transceivers to the SelectIO interface.

# **High-Speed Serial Transceivers**

Ultra-fast serial data transmission between devices on the same PCB, over backplanes, and across even longer distances is becoming increasingly important for scaling to 100 Gb/s and 400 Gb/s line cards. Specialized dedicated on-chip circuitry and differential I/O capable of coping with the signal integrity issues are required at these high data rates.

Three types of transceivers are used in Zynq UltraScale+ MPSoCs: GTH, GTY, and PS-GTR. All transceivers are arranged in groups of four, known as a transceiver Quad. Each serial transceiver is a combined transmitter and receiver. Table 10 compares the available transceivers.



Table 10: Transceiver Information

|                | Zynq UltraScale+ MPSoCs                                  |                                                           |                                                                                        |
|----------------|----------------------------------------------------------|-----------------------------------------------------------|----------------------------------------------------------------------------------------|
| Type           | PS-GTR                                                   | GTH                                                       | GTY                                                                                    |
| Qty            | 4                                                        | 0–44                                                      | 0–28                                                                                   |
| Max. Data Rate | 6.0Gb/s                                                  | 16.3Gb/s                                                  | 32.75Gb/s                                                                              |
| Min. Data Rate | 1.25Gb/s                                                 | 0.5Gb/s                                                   | 0.5Gb/s                                                                                |
| Applications   | <ul><li>PCIe Gen2</li><li>USB</li><li>Ethernet</li></ul> | <ul><li>Backplane</li><li>PCIe Gen4</li><li>HMC</li></ul> | <ul><li>100G+ Optics</li><li>Chip-to-Chip</li><li>25G+ Backplane</li><li>HMC</li></ul> |

The following information in this section pertains to the GTH and GTY only.

The serial transmitter and receiver are independent circuits that use an advanced phase-locked loop (PLL) architecture to multiply the reference frequency input by certain programmable numbers between 4 and 25 to become the bit-serial data clock. Each transceiver has a large number of user-definable features and parameters. All of these can be defined during device configuration, and many can also be modified during operation.

#### **Transmitter**

The transmitter is fundamentally a parallel-to-serial converter with a conversion ratio of 16, 20, 32, 40, 64, or 80 for the GTH and 16, 20, 32, 40, 64, 80, 128, or 160 for the GTY. This allows the designer to trade off datapath width against timing margin in high-performance designs. These transmitter outputs drive the PC board with a single-channel differential output signal. TXOUTCLK is the appropriately divided serial data clock and can be used directly to register the parallel data coming from the internal logic. The incoming parallel data is fed through an optional FIFO and has additional hardware support for the 8B/10B, 64B/66B, or 64B/67B encoding schemes to provide a sufficient number of transitions. The bit-serial output signal drives two package pins with differential signals. This output signal pair has programmable signal swing as well as programmable pre- and post-emphasis to compensate for PC board losses and other interconnect characteristics. For shorter channels, the swing can be reduced to reduce power consumption.
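
The trade-off between datapath width and timing margin can be made concrete with a little arithmetic. The sketch below assumes the parallel word clock is simply the line rate divided by the conversion ratio, and that the usable payload rate is the line rate scaled by the efficiency of the chosen encoding. Function and constant names are illustrative and not taken from any Xilinx tool.

```python
from fractions import Fraction

GTH_WIDTHS = (16, 20, 32, 40, 64, 80)
GTY_WIDTHS = GTH_WIDTHS + (128, 160)

# Payload bits per transmitted bit for the encodings named in the text.
ENCODING_EFFICIENCY = {
    "8B/10B": Fraction(8, 10),
    "64B/66B": Fraction(64, 66),
    "64B/67B": Fraction(64, 67),
}


def word_clock_mhz(line_rate_mbps: int, width: int, widths=GTH_WIDTHS) -> Fraction:
    """Parallel-side clock (MHz) for a given line rate (Mb/s) and conversion ratio."""
    if width not in widths:
        raise ValueError(f"unsupported conversion ratio: {width}")
    return Fraction(line_rate_mbps, width)


def payload_rate_mbps(line_rate_mbps: int, encoding: str) -> Fraction:
    """Payload rate (Mb/s) left after line-coding overhead."""
    return line_rate_mbps * ENCODING_EFFICIENCY[encoding]


if __name__ == "__main__":
    for width in (40, 80):
        print(width, float(word_clock_mhz(16_300, width)), "MHz")
    print(float(payload_rate_mbps(16_300, "8B/10B")), "Mb/s payload")
```

Doubling the width halves the clock the internal logic must meet, at the cost of a wider datapath.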

#### Receiver

The receiver is fundamentally a serial-to-parallel converter, changing the incoming bit-serial differential signal into a parallel stream of words, each 16, 20, 32, 40, 64, or 80 bits in the GTH or 16, 20, 32, 40, 64, 80, 128, or 160 for the GTY. This allows the designer to trade off internal datapath width against logic timing margin. The receiver takes the incoming differential data stream, feeds it through programmable DC automatic gain control, linear and decision feedback equalizers (to compensate for PC board, cable, optical and other interconnect characteristics), and uses the reference clock input to initiate clock recognition. There is no need for a separate clock line. The data pattern uses non-return-to-zero (NRZ) encoding and optionally ensures sufficient data transitions by using the selected encoding scheme. Parallel data is then transferred into the device logic using the RXUSRCLK clock. For short channels, the transceivers offer a special low-power mode (LPM) to reduce power consumption by approximately 30%. The receiver DC automatic gain control and linear and decision feedback equalizers can optionally "auto-adapt" to automatically learn and compensate for different interconnect characteristics. This enables even more margin for tough 10G+ and 25G+ backplanes.

## **Integrated Block for 100G Ethernet**

Compliant to the IEEE Std 802.3ba, the 100G Ethernet integrated blocks in the UltraScale architecture provide low latency 100Gb/s Ethernet ports with a wide range of user customization and statistics gathering. With support for 10 x 10.3125Gb/s (CAUI) and 4 x 25.78125Gb/s (CAUI-4) configurations, the integrated block includes both the 100G MAC and PCS logic with support for IEEE Std 1588v2 1-step and 2-step hardware timestamping.

In UltraScale+ devices, the 100G Ethernet blocks contain a Reed Solomon Forward Error Correction (RS-FEC) block, compliant to IEEE Std 802.3bj, that can be used with the Ethernet block or stand alone in user applications. These families also support an OTN mapping mode in which the PCS can operate without using the MAC.
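
As a quick check on the CAUI and CAUI-4 lane rates quoted above, the sketch below shows that both configurations carry the same aggregate line rate, and that removing the 64B/66B line-coding overhead used by the 100GBASE-R PCS leaves exactly 100Gb/s. Exact rational arithmetic is used to avoid rounding; the helper name is illustrative.

```python
from fractions import Fraction


def aggregate_rate_gbps(lanes: int, lane_rate_gbps: str) -> Fraction:
    """Total line rate (Gb/s) across all lanes, computed exactly."""
    return lanes * Fraction(lane_rate_gbps)


caui = aggregate_rate_gbps(10, "10.3125")    # 10 lanes (CAUI)
caui4 = aggregate_rate_gbps(4, "25.78125")   # 4 lanes (CAUI-4)
payload = caui * Fraction(64, 66)            # strip 64B/66B overhead

if __name__ == "__main__":
    print(float(caui), float(caui4), float(payload))
```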

# **Clock Management**

The clock generation and distribution components in UltraScale architecture-based devices are located adjacent to the columns that contain the memory interfacing and input and output circuitry. This tight coupling of clocking and I/O provides low-latency clocking to the I/O for memory interfaces and other I/O protocols. Within every clock management tile (CMT) resides one mixed-mode clock manager (MMCM), two PLLs, clock distribution buffers and routing, and dedicated circuitry for implementing external memory interfaces.

### **Mixed-Mode Clock Manager**

The mixed-mode clock manager (MMCM) can serve as a frequency synthesizer for a wide range of frequencies and as a jitter filter for incoming clocks. At the center of the MMCM is a voltage-controlled oscillator (VCO), which speeds up and slows down depending on the input voltage it receives from the phase frequency detector (PFD).

Three sets of programmable frequency dividers (D, M, and O) are programmable by configuration and during normal operation via the Dynamic Reconfiguration Port (DRP). The pre-divider D reduces the input frequency and feeds one input of the phase/frequency comparator. The feedback divider M acts as a multiplier because it divides the VCO output frequency before feeding the other input of the phase comparator. D and M must be chosen appropriately to keep the VCO within its specified frequency range. The VCO has eight equally-spaced output phases (0°, 45°, 90°, 135°, 180°, 225°, 270°, and 315°). Each phase can be selected to drive one of the output dividers, and each divider is programmable by configuration to divide by any integer from 1 to 128.
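
The relationship between the three dividers can be summarized as f_VCO = f_IN × M / D and f_OUT = f_VCO / O. The sketch below applies those two relations and rejects settings that push the VCO outside a caller-supplied range. The VCO limits in the example are placeholders only; the real limits depend on device and speed grade and must be taken from the corresponding data sheet. The function name is hypothetical.

```python
def mmcm_frequencies(f_in_mhz: float, d: int, m: float, o: int,
                     vco_min_mhz: float, vco_max_mhz: float):
    """Return (f_vco, f_out) in MHz for pre-divider D, feedback divider M, output divider O."""
    if not 1 <= o <= 128:
        raise ValueError("output divider must be an integer from 1 to 128")
    f_vco = f_in_mhz * m / d
    if not vco_min_mhz <= f_vco <= vco_max_mhz:
        raise ValueError(f"VCO at {f_vco} MHz is outside the allowed range")
    return f_vco, f_vco / o


if __name__ == "__main__":
    # Placeholder VCO range (800-1600 MHz) chosen for illustration only.
    print(mmcm_frequencies(100.0, d=1, m=12, o=8,
                           vco_min_mhz=800.0, vco_max_mhz=1600.0))
```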

The MMCM has three input-jitter filter options: low bandwidth, high bandwidth, or optimized mode. Low-Bandwidth mode has the best jitter attenuation. High-Bandwidth mode has the best phase offset. Optimized mode allows the tools to find the best setting.

The MMCM can have a fractional counter in either the feedback path (acting as a multiplier) or in one output path. Fractional counters allow non-integer increments of 1/8 and can thus increase frequency synthesis capabilities by a factor of 8. The MMCM can also provide fixed or dynamic phase shift in small increments that depend on the VCO frequency. At 1,600MHz, the phase-shift timing increment is 11.2ps.
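
The 11.2ps figure is consistent with a phase-shift step of 1/56 of the VCO period, which is the assumption used in the sketch below. It also includes a small check that a fractional divider value lands on a 1/8 increment. Both helper names are illustrative.

```python
def phase_step_ps(f_vco_mhz: float) -> float:
    """Phase-shift increment in ps, assuming a step of 1/56 of the VCO period."""
    vco_period_ps = 1e6 / f_vco_mhz
    return vco_period_ps / 56


def is_eighth_step(divider: float) -> bool:
    """True when a fractional divider value is a multiple of 1/8."""
    return (divider * 8).is_integer()


if __name__ == "__main__":
    print(round(phase_step_ps(1600.0), 1), "ps per step at 1,600MHz")
    print(is_eighth_step(10.125), is_eighth_step(10.1))
```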



# **Configurable Logic Block**

Every Configurable Logic Block (CLB) in the UltraScale architecture contains 8 LUTs and 16 flip-flops. Each LUT can be configured as either one 6-input LUT with one output, or as two 5-input LUTs with separate outputs but common inputs. Each LUT output can optionally be registered in a flip-flop. In addition to the LUTs and flip-flops, the CLB contains arithmetic carry logic and multiplexers to create wider logic functions.

Each CLB contains one slice. There are two types of slices: SLICEL and SLICEM. LUTs in the SLICEM can be configured as 64-bit RAM, as 32-bit shift registers (SRL32), or as two SRL16s. CLBs in the UltraScale architecture have increased routing and connectivity compared to CLBs in previous-generation Xilinx devices. They also have additional control signals to enable superior register packing, resulting in overall higher device utilization.

### Interconnect

Various length vertical and horizontal routing resources in the UltraScale architecture that span 1, 2, 4, 5, 12, or 16 CLBs ensure that all signals can be transported from source to destination with ease, providing support for the next generation of wide data buses to be routed across even the highest capacity devices while simultaneously improving quality of results and software run time.

### **Block RAM**

Every UltraScale architecture-based device contains a number of 36Kb block RAMs, each with two completely independent ports that share only the stored data. Each block RAM can be configured as one 36Kb RAM or two independent 18Kb RAMs. Each memory access, read or write, is controlled by the clock. Connections in every block RAM column enable signals to be cascaded between vertically adjacent block RAMs, providing an easy method to create large, fast memory arrays, and FIFOs with greatly reduced power consumption.

All inputs, data, address, clock enables, and write enables are registered. The input address is always clocked (unless address latching is turned off), retaining data until the next operation. An optional output data pipeline register allows higher clock rates at the cost of an extra cycle of latency. During a write operation, the data output can reflect either the previously stored data or the newly written data, or it can remain unchanged. Block RAM sites that remain unused in the user design are automatically powered down to reduce total power consumption. There is an additional pin on every block RAM to control the dynamic power gating feature.



### **Programmable Data Width**

Each port can be configured as  $32K \times 1$ ;  $16K \times 2$ ;  $8K \times 4$ ;  $4K \times 9$  (or 8);  $2K \times 18$  (or 16);  $1K \times 36$  (or 32); or  $512 \times 72$  (or 64). Whether configured as block RAM or FIFO, the two ports can have different aspect ratios without any constraints. Each block RAM can be divided into two completely independent 18Kb block RAMs that can each be configured to any aspect ratio from  $16K \times 1$  to  $512 \times 36$ . Everything described previously for the full 36Kb block RAM also applies to each of the smaller 18Kb block RAMs. Only in simple dual-port (SDP) mode can data widths of greater than 18 bits (18Kb RAM) or 36 bits (36Kb RAM) be accessed. In this mode, one port is dedicated to read operation, the other to write operation. In SDP mode, one side (read or write) can be variable, while the other is fixed to 32/36 or 64/72. Both sides of the dual-port 36Kb RAM can be of variable width.
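
The aspect ratios listed above all describe the same storage array. The sketch below multiplies depth by width for each configuration: the narrow ones (x1, x2, x4) use 32Kb of data bits, while the x9, x18, x36, and x72 configurations use the full 36Kb because each byte gains one extra bit. The constant and function names are illustrative.

```python
K = 1024

# (depth, width) pairs for the 36Kb block RAM configurations listed in the text.
BRAM36_CONFIGS = [
    (32 * K, 1), (16 * K, 2), (8 * K, 4),
    (4 * K, 9), (2 * K, 18), (1 * K, 36), (512, 72),
]


def capacity_kb(depth: int, width: int) -> int:
    """Total bits addressed by a configuration, expressed in Kb."""
    return depth * width // K


if __name__ == "__main__":
    for depth, width in BRAM36_CONFIGS:
        print(f"{depth:>6} x {width:<2} = {capacity_kb(depth, width)}Kb")
```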

#### **Error Detection and Correction**

Each 64-bit-wide block RAM can generate, store, and utilize eight additional Hamming code bits and perform single-bit error correction and double-bit error detection (ECC) during the read process. The ECC logic can also be used when writing to or reading from external 64- to 72-bit-wide memories.
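
The eight check bits follow from the standard sizing rule for a single-error-correcting, double-error-detecting (SECDED) Hamming code: the smallest r with 2^r ≥ m + r + 1 gives single-bit correction for m data bits, and one further overall parity bit adds double-bit detection. The sketch below applies that rule; it does not model the actual encoder in the block RAM.

```python
def secded_check_bits(data_bits: int) -> int:
    """Check bits needed by an extended Hamming (SECDED) code for data_bits of data."""
    r = 0
    # Hamming bound for single-error correction: 2**r >= data_bits + r + 1.
    while (1 << r) < data_bits + r + 1:
        r += 1
    return r + 1  # one extra overall-parity bit for double-error detection


if __name__ == "__main__":
    print(secded_check_bits(64))
```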

#### **FIFO Controller**

Each block RAM can be configured as a 36Kb FIFO or an 18Kb FIFO. The built-in FIFO controller for single-clock (synchronous) or dual-clock (asynchronous or multirate) operation increments the internal addresses and provides four handshaking flags: full, empty, programmable full, and programmable empty. The programmable flags allow the user to specify the FIFO counter values that make these flags go active. The FIFO width and depth are programmable with support for different read port and write port widths on a single FIFO. A dedicated cascade path allows for easy creation of deeper FIFOs.
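
The four handshaking flags can be illustrated with a small behavioral model. This is a software sketch only: the thresholds, their exact comparison (at-or-above for programmable full, at-or-below for programmable empty), and the class name `FifoModel` are assumptions, and clock-domain crossing in dual-clock mode is not modeled.

```python
from collections import deque


class FifoModel:
    """Behavioral FIFO with full, empty, and programmable threshold flags."""

    def __init__(self, depth: int, prog_full_at: int, prog_empty_at: int):
        self.depth = depth
        self.prog_full_at = prog_full_at
        self.prog_empty_at = prog_empty_at
        self._words = deque()

    @property
    def full(self) -> bool:
        return len(self._words) == self.depth

    @property
    def empty(self) -> bool:
        return not self._words

    @property
    def prog_full(self) -> bool:
        return len(self._words) >= self.prog_full_at

    @property
    def prog_empty(self) -> bool:
        return len(self._words) <= self.prog_empty_at

    def write(self, word) -> None:
        if self.full:
            raise OverflowError("write to a full FIFO")
        self._words.append(word)

    def read(self):
        if self.empty:
            raise IndexError("read from an empty FIFO")
        return self._words.popleft()


if __name__ == "__main__":
    fifo = FifoModel(depth=8, prog_full_at=6, prog_empty_at=2)
    for word in range(6):
        fifo.write(word)
    print(fifo.prog_full, fifo.full, fifo.read())
```

The programmable flags give the producer or consumer early warning, so it can stop or start a burst before the hard full or empty condition is reached.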

### **UltraRAM**

UltraRAM is a high-density, dual-port, synchronous memory block used in some UltraScale+ families. Both of the ports share the same clock and can address all of the 4K x 72 bits. Each port can independently read from or write to the memory array. UltraRAM supports two types of write enable schemes. The first mode is consistent with the block RAM byte write enable mode. The second mode allows gating the data and parity byte writes separately. Multiple UltraRAM blocks can be cascaded together to create larger memory arrays. Dedicated routing in the UltraRAM column enables the entire column height to be connected together. This makes UltraRAM an ideal solution for replacing external memories such as SRAM. Cascadable anywhere from 288Kb to 36Mb, UltraRAM provides the flexibility to fulfill many different memory requirements.
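
The 288Kb and 36Mb figures follow directly from the 4K x 72 organization. The sketch below computes the size of one block and the number of cascaded blocks needed for a given capacity, using binary units (1Kb = 1,024 bits); the helper name is illustrative.

```python
K = 1024
URAM_BITS = 4 * K * 72  # one UltraRAM block: 4K words of 72 bits


def uram_blocks_needed(capacity_bits: int) -> int:
    """Smallest number of cascaded UltraRAM blocks that holds capacity_bits."""
    return -(-capacity_bits // URAM_BITS)  # ceiling division


if __name__ == "__main__":
    print(URAM_BITS // K, "Kb per block")
    print(uram_blocks_needed(36 * K * K), "blocks for 36Mb")
```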

### **Error Detection and Correction**

Each 64-bit-wide UltraRAM can generate, store and utilize eight additional Hamming code bits and perform single-bit error correction and double-bit error detection (ECC) during the read process.



### **Clock Management**

The PS in Zynq UltraScale+ MPSoCs is equipped with five phase-locked loops (PLLs), providing flexibility in configuring the clock domains within the PS. There are four primary clock domains of interest within the PS. These include the APU, the RPU, the DDR controller, and the I/O peripherals (IOP). The frequencies of all of these domains can be configured independently under software control.

#### **Power Domains**

The Zynq UltraScale+ MPSoC contains four separate power domains. When they are connected to separate power supplies, they can be completely powered down independently of each other without consuming any dynamic or static power. The processing system includes:

- Full Power Domain (FPD)
- Low Power Domain (LPD)
- Battery Power Domain (BPD)

In addition to these three Processing System power domains, the PL can also be completely powered down if connected to separate power supplies.

The Full Power Domain (FPD) consists of the following major blocks:

- Application Processing Unit (APU)
- DMA (FP-DMA)
- Graphics Processing Unit (GPU)
- Dynamic Memory Controller (DDRC)
- High-Speed I/O Peripherals

The Low Power Domain (LPD) consists of the following major blocks:

- Real-Time Processing Unit (RPU)
- DMA (LP-DMA)
- Platform Management Unit (PMU)
- Configuration Security Unit (CSU)
- Low-Speed I/O Peripherals
- Static Memory Interfaces

The Battery Power Domain (BPD) is the lowest power domain of the Zynq UltraScale+ MPSoC processing system. In this mode, the entire PS is powered off except the Real-Time Clock (RTC) and battery-backed RAM (BBRAM).
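
The domain membership described above can be captured in a simple lookup table, which is handy when reasoning about what remains available after a domain is switched off. The sketch below is illustrative bookkeeping only; the names are hypothetical and it does not represent any PMU firmware interface.

```python
# Block membership of the three PS power domains, as listed in this section.
PS_POWER_DOMAINS = {
    "FPD": (
        "Application Processing Unit (APU)",
        "DMA (FP-DMA)",
        "Graphics Processing Unit (GPU)",
        "Dynamic Memory Controller (DDRC)",
        "High-Speed I/O Peripherals",
    ),
    "LPD": (
        "Real-Time Processing Unit (RPU)",
        "DMA (LP-DMA)",
        "Platform Management Unit (PMU)",
        "Configuration Security Unit (CSU)",
        "Low-Speed I/O Peripherals",
        "Static Memory Interfaces",
    ),
    "BPD": (
        "Real-Time Clock (RTC)",
        "Battery-backed RAM (BBRAM)",
    ),
}


def domain_of(block: str) -> str:
    """Return the power domain that contains the named block."""
    for domain, blocks in PS_POWER_DOMAINS.items():
        if block in blocks:
            return domain
    raise KeyError(block)


def blocks_still_powered(powered_down: set) -> list:
    """List the PS blocks whose domain is not in powered_down."""
    return [block
            for domain, blocks in PS_POWER_DOMAINS.items()
            if domain not in powered_down
            for block in blocks]


if __name__ == "__main__":
    print(domain_of("Platform Management Unit (PMU)"))
    print(blocks_still_powered({"FPD", "LPD"}))
```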

#### **Power Examples**

Power for the Zynq UltraScale+ MPSoCs varies depending on the utilization of the PL resources, and the frequency of the PS and PL. To estimate power, use the Xilinx Power Estimator (XPE) at:

http://www.xilinx.com/products/design\_tools/logic\_design/xpe.htm



# **Ordering Information**

Table 12 shows the speed and temperature grades available in the different device families.

Table 12: Speed Grade and Temperature Grade

| Device<br>Family | Devices | Commercial (C)<br>0°C to +85°C | Extended (E)<br>0°C to +100°C | Extended (E)<br>0°C to +110°C | Industrial (I)<br>-40°C to +100°C |
|------------------|---------|--------------------------------|-------------------------------|-------------------------------|-----------------------------------|
| Zynq UltraScale+ | CG Devices | | -2E (0.85V) | | -2I (0.85V) |
| | | | | -2LE<sup>(1)(2)</sup> (0.85V or 0.72V) | |
| | | | -1E (0.85V) | | -1I (0.85V) |
| | | | | | -1LI<sup>(2)</sup> (0.85V or 0.72V) |
| | ZU2EG<br>ZU3EG | | -2E (0.85V) | | -2I (0.85V) |
| | | | | -2LE<sup>(1)(2)</sup> (0.85V or 0.72V) | |
| | | | -1E (0.85V) | | -1I (0.85V) |
| | | | | | -1LI<sup>(2)</sup> (0.85V or 0.72V) |
| | ZU4EG<br>ZU5EG<br>ZU6EG<br>ZU7EG<br>ZU9EG<br>ZU11EG<br>ZU15EG<br>ZU17EG<br>ZU19EG | | -3E (0.90V) | | |
| | | | -2E (0.85V) | | -2I (0.85V) |
| | | | | -2LE<sup>(1)(2)</sup> (0.85V or 0.72V) | |
| | | | -1E (0.85V) | | -1I (0.85V) |
| | | | | | -1LI<sup>(2)</sup> (0.85V or 0.72V) |
| | EV Devices | | -3E (0.90V) | | |
| | | | -2E (0.85V) | | -2I (0.85V) |
| | | | | -2LE<sup>(1)(2)</sup> (0.85V or 0.72V) | |
| | | | -1E (0.85V) | | -1I (0.85V) |
| | | | | | -1LI<sup>(2)</sup> (0.85V or 0.72V) |

#### Notes:

1. In -2LE speed/temperature grade, devices can operate for a limited time with junction temperature of 110°C. Timing parameters adhere to the same speed file at 110°C as they do below 110°C, regardless of operating voltage (nominal at 0.85V or low voltage at 0.72V). Operation at 110°C Tj is limited to 1% of the device lifetime and can occur sequentially or at regular intervals as long as the total time does not exceed 1% of device lifetime.
2. In Zynq UltraScale+ MPSoCs, when operating the PL at low voltage (0.72V), the PS operates at nominal voltage (0.85V).

The ordering information shown in Figure 3 applies to all packages in the Zynq UltraScale+ MPSoCs.