| Details | |
|---|---|
| Product Status | Active |
| Architecture | MCU, FPGA |
| Core Processor | Quad ARM® Cortex®-A53 MPCore™ with CoreSight™, Dual ARM® Cortex™-R5 with CoreSight™, ARM Mali™-400 MP2 |
| Flash Size | - |
| RAM Size | 256KB |
| Peripherals | DMA, WDT |
| Connectivity | CANbus, EBI/EMI, Ethernet, I<sup>2</sup>C, MMC/SD/SDIO, SPI, UART/USART, USB OTG |
| Speed | 500MHz, 600MHz, 1.2GHz |
| Primary Attributes | Zynq® UltraScale+™ FPGA, 599K+ Logic Cells |
| Operating Temperature | -40°C ~ 100°C (TJ) |
| Package / Case | 900-BBGA, FCBGA |
| Supplier Device Package | 900-FCBGA (31x31) |
| Purchase URL | https://www.e-xfl.com/product-detail/xilinx/xczu9eg-l1ffvc900i |

# **Programmable Logic (PL)**

## **Configurable Logic Blocks (CLB)**

- Look-up tables (LUT)
- Flip-flops
- Cascadable adders

### **36Kb Block RAM**

- True dual-port
- Up to 72 bits wide
- Configurable as dual 18Kb

### **UltraRAM**

- 288Kb dual-port
- 72 bits wide
- Error checking and correction

### **DSP Blocks**
- 27 x 18 signed multiply
- 48-bit adder/accumulator
- 27-bit pre-adder

### **Programmable I/O Blocks**

- Supports LVCMOS, LVDS, and SSTL
- 1.0V to 3.3V I/O
- Programmable I/O delay and SerDes

## JTAG Boundary-Scan

- IEEE Std 1149.1 Compatible Test Interface

### **PCI Express**

- Supports Root complex and End Point configurations
- Supports up to Gen4 speeds
- Up to five integrated blocks in select devices

## 100G Ethernet MAC/PCS

- IEEE Std 802.3 compliant
- CAUI-10 (10x 10.3125Gb/s) or CAUI-4 (4x 25.78125Gb/s)
- RSFEC (IEEE Std 802.3bj) in CAUI-4 configuration
- Up to four integrated blocks in select devices

### Interlaken

- Interlaken spec 1.2 compliant
- 64/67 encoding
- 12 x 12.5Gb/s or 6 x 25Gb/s
- Up to four integrated blocks in select devices

## Video Encoder/Decoder (VCU)

- Available in EV devices
- Accessible from either PS or PL
- Simultaneous encode and decode
- H.264 and H.265 support

### **System Monitor in PL**

- On-chip voltage and temperature sensing
- 10-bit 200KSPS ADC with up to 17 external inputs

# **Feature Summary**

Table 1: Zynq UltraScale+ MPSoC: CG Device Feature Summary

| | ZU2CG | ZU3CG | ZU4CG | ZU5CG | ZU6CG | ZU7CG | ZU9CG |
|---|---|---|---|---|---|---|---|
| Application Processing Unit | Dual-core ARM Cortex-A53 MPCore with CoreSight; NEON & Single/Double Precision Floating Point; 32KB/32KB L1 Cache, 1MB L2 Cache | | | | | | |
| Real-Time Processing Unit | Dual-core ARM Cortex-R5 with CoreSight; Single/Double Precision Floating Point; 32KB/32KB L1 Cache, and TCM | | | | | | |
| Embedded and External Memory | 256KB On-Chip Memory w/ECC; External DDR4; DDR3; DDR3L; LPDDR4; LPDDR3; External Quad-SPI; NAND; eMMC | | | | | | |
| General Connectivity | 214 PS I/O; UART; CAN; USB 2.0; I2C; SPI; 32b GPIO; Real Time Clock; WatchDog Timers; Triple Timer Counters | | | | | | |
| High-Speed Connectivity | 4 PS-GTR; PCIe Gen1/2; Serial ATA 3.1; DisplayPort 1.2a; USB 3.0; SGMII | | | | | | |
| System Logic Cells | 103,320 | 154,350 | 192,150 | 256,200 | 469,446 | 504,000 | 599,550 |
| CLB Flip-Flops | 94,464 | 141,120 | 175,680 | 234,240 | 429,208 | 460,800 | 548,160 |
| CLB LUTs | 47,232 | 70,560 | 87,840 | 117,120 | 214,604 | 230,400 | 274,080 |
| Distributed RAM (Mb) | 1.2 | 1.8 | 2.6 | 3.5 | 6.9 | 6.2 | 8.8 |
| Block RAM Blocks | 150 | 216 | 128 | 144 | 714 | 312 | 912 |
| Block RAM (Mb) | 5.3 | 7.6 | 4.5 | 5.1 | 25.1 | 11.0 | 32.1 |
| UltraRAM Blocks | 0 | 0 | 48 | 64 | 0 | 96 | 0 |
| UltraRAM (Mb) | 0 | 0 | 14.0 | 18.0 | 0 | 27.0 | 0 |
| DSP Slices | 240 | 360 | 728 | 1,248 | 1,973 | 1,728 | 2,520 |
| CMTs | 3 | 3 | 4 | 4 | 4 | 8 | 4 |
| Max. HP I/O<sup>(1)</sup> | 156 | 156 | 156 | 156 | 208 | 416 | 208 |
| Max. HD I/O<sup>(2)</sup> | 96 | 96 | 96 | 96 | 120 | 48 | 120 |
| System Monitor | 2 | 2 | 2 | 2 | 2 | 2 | 2 |
| GTH Transceiver 16.3Gb/s<sup>(3)</sup> | 0 | 0 | 16 | 16 | 24 | 24 | 24 |
| GTY Transceivers 32.75Gb/s | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Transceiver Fractional PLLs | 0 | 0 | 8 | 8 | 12 | 12 | 12 |
| PCIe Gen3 x16 and Gen4 x8 | 0 | 0 | 2 | 2 | 0 | 2 | 0 |
| 150G Interlaken | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 100G Ethernet w/ RS-FEC | 0 | 0 | 0 | 0 | 0 | 0 | 0 |

1. HP = High-performance I/O with support for I/O voltage from 1.0V to 1.8V.
2. HD = High-density I/O with support for I/O voltage from 1.2V to 3.3V.
3. GTH transceivers in the SFVC784 package support data rates up to 12.5Gb/s. See Table 2.
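
The Block RAM (Mb) row of Table 1 can be cross-checked against the Block RAM Blocks row. The sketch below is only an arithmetic check, assuming that every block is a 36Kb block RAM and that 1 Mb is counted as 1024 Kb; neither assumption is stated in the table itself, and the function name is chosen here for illustration.

```python
# Cross-check of the "Block RAM (Mb)" row in Table 1.
# Assumptions (not stated in the table): 36Kb per block, 1 Mb = 1024 Kb.

BLOCK_RAM_KB = 36  # size of one block RAM in Kb

# (Block RAM Blocks, Block RAM (Mb)) copied from Table 1
TABLE_1 = {
    "ZU2CG": (150, 5.3),
    "ZU3CG": (216, 7.6),
    "ZU4CG": (128, 4.5),
    "ZU5CG": (144, 5.1),
    "ZU6CG": (714, 25.1),
    "ZU7CG": (312, 11.0),
    "ZU9CG": (912, 32.1),
}


def block_ram_mb(blocks: int) -> float:
    """Total block RAM capacity in Mb for a given number of 36Kb blocks."""
    return blocks * BLOCK_RAM_KB / 1024


for device, (blocks, listed_mb) in TABLE_1.items():
    computed = round(block_ram_mb(blocks), 1)
    print(f"{device}: {blocks} blocks -> {computed} Mb (table lists {listed_mb} Mb)")
```

Under these assumptions, every computed value rounds to the figure listed in the table.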
Table 2: Zynq UltraScale+ MPSoC: CG Device-Package Combinations and Maximum I/Os

| Package<sup>(1)(2)(3)(4)(5)</sup> | Package Dimensions (mm) | ZU2CG | ZU3CG | ZU4CG | ZU5CG | ZU6CG | ZU7CG | ZU9CG |
|---|---|---|---|---|---|---|---|---|
| | | HD, HP<br>GTH, GTY | HD, HP<br>GTH, GTY | HD, HP<br>GTH, GTY | HD, HP<br>GTH, GTY | HD, HP<br>GTH, GTY | HD, HP<br>GTH, GTY | HD, HP<br>GTH, GTY |
| SBVA484<sup>(6)</sup> | 19x19 | 24, 58<br>0, 0 | 24, 58<br>0, 0 | | | | | |
| SFVA625 | 21x21 | 24, 156<br>0, 0 | 24, 156<br>0, 0 | | | | | |
| SFVC784<sup>(7)</sup> | 23x23 | 96, 156<br>0, 0 | 96, 156<br>0, 0 | 96, 156<br>4, 0 | 96, 156<br>4, 0 | | | |
| FBVB900 | 31x31 | | | 48, 156<br>16, 0 | 48, 156<br>16, 0 | | 48, 156<br>16, 0 | |
| FFVC900 | 31x31 | | | | | 48, 156<br>16, 0 | | 48, 156<br>16, 0 |
| FFVB1156 | 35x35 | | | | | 120, 208<br>24, 0 | | 120, 208<br>24, 0 |
| FFVC1156 | 35x35 | | | | | | 48, 312<br>20, 0 | |
| FFVF1517 | 40x40 | | | | | | 48, 416<br>24, 0 | |

1. Go to Ordering Information for package designation details.
2. FB/FF packages have 1.0mm ball pitch. SB/SF packages have 0.8mm ball pitch.
3. All device package combinations bond out 4 PS-GTR transceivers.
4. All device package combinations bond out 214 PS I/O except ZU2CG and ZU3CG in the SBVA484 and SFVA625 packages, which bond out 170 PS I/Os.
5. Packages with the same last letter and number sequence, e.g., A484, are footprint compatible with all other UltraScale devices with the same sequence. The footprint compatible devices within this family are outlined.
6. All 58 HP I/O pins are powered by the same $V_{CCO}$ supply.
7. GTH transceivers in the SFVC784 package support data rates up to 12.5Gb/s.
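
Footnote 5 describes footprint compatibility in terms of the last letter and number sequence of the package name. A minimal sketch of how that rule could be applied to a subset of Table 2 follows. The data structure, the function names, and the assumption that the footprint key is everything after the first three characters of the package name (so that `SBVA484` gives `A484`) are illustrative choices made here, not part of the data sheet.

```python
from typing import NamedTuple


class PackageIO(NamedTuple):
    hd: int   # maximum HD I/O
    hp: int   # maximum HP I/O
    gth: int  # GTH transceivers
    gty: int  # GTY transceivers


# Subset of Table 2 (CG devices), copied from the rows above.
CG_PACKAGES = {
    "ZU4CG": {
        "SFVC784": PackageIO(96, 156, 4, 0),
        "FBVB900": PackageIO(48, 156, 16, 0),
    },
    "ZU7CG": {
        "FBVB900": PackageIO(48, 156, 16, 0),
        "FFVC1156": PackageIO(48, 312, 20, 0),
        "FFVF1517": PackageIO(48, 416, 24, 0),
    },
    "ZU9CG": {
        "FFVC900": PackageIO(48, 156, 16, 0),
        "FFVB1156": PackageIO(120, 208, 24, 0),
    },
}


def footprint(package: str) -> str:
    """Footprint key per footnote 5, e.g. 'SBVA484' -> 'A484'.

    Assumes the key starts at the fourth character of the package name.
    """
    return package[3:]


def shared_footprints(dev_a: str, dev_b: str) -> list[str]:
    """Footprints available for both devices, as a sorted list."""
    a = {footprint(p) for p in CG_PACKAGES[dev_a]}
    b = {footprint(p) for p in CG_PACKAGES[dev_b]}
    return sorted(a & b)


print(shared_footprints("ZU4CG", "ZU7CG"))  # ['B900']
print(shared_footprints("ZU4CG", "ZU9CG"))  # []
```

Note that `B900` and `C900` are different footprints under this rule, which is why ZU4CG and ZU9CG share none in this subset even though both have 31x31 packages.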
Table 4: Zynq UltraScale+ MPSoC: EG Device-Package Combinations and Maximum I/Os

| Package<sup>(1)(2)(3)(4)(5)</sup> | Package Dimensions (mm) | ZU2EG | ZU3EG | ZU4EG | ZU5EG | ZU6EG | ZU7EG | ZU9EG | ZU11EG | ZU15EG | ZU17EG | ZU19EG |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| | | HD, HP<br>GTH, GTY | HD, HP<br>GTH, GTY | HD, HP<br>GTH, GTY | HD, HP<br>GTH, GTY | HD, HP<br>GTH, GTY | HD, HP<br>GTH, GTY | HD, HP<br>GTH, GTY | HD, HP<br>GTH, GTY | HD, HP<br>GTH, GTY | HD, HP<br>GTH, GTY | HD, HP<br>GTH, GTY |
| SBVA484<sup>(6)</sup> | 19x19 | 24, 58<br>0, 0 | 24, 58<br>0, 0 | | | | | | | | | |
| SFVA625 | 21x21 | 24, 156<br>0, 0 | 24, 156<br>0, 0 | | | | | | | | | |
| SFVC784<sup>(7)</sup> | 23x23 | 96, 156<br>0, 0 | 96, 156<br>0, 0 | 96, 156<br>4, 0 | 96, 156<br>4, 0 | | | | | | | |
| FBVB900 | 31x31 | | | 48, 156<br>16, 0 | 48, 156<br>16, 0 | | 48, 156<br>16, 0 | | | | | |
| FFVC900 | 31x31 | | | | | 48, 156<br>16, 0 | | 48, 156<br>16, 0 | | 48, 156<br>16, 0 | | |
| FFVB1156 | 35x35 | | | | | 120, 208<br>24, 0 | | 120, 208<br>24, 0 | | 120, 208<br>24, 0 | | |
| FFVC1156 | 35x35 | | | | | | 48, 312<br>20, 0 | | 48, 312<br>20, 0 | | | |
| FFVB1517 | 40x40 | | | | | | | | 72, 416<br>16, 0 | | 72, 572<br>16, 0 | 72, 572<br>16, 0 |
| FFVF1517 | 40x40 | | | | | | 48, 416<br>24, 0 | | 48, 416<br>32, 0 | | | |
| FFVC1760 | 42.5x42.5 | | | | | | | | 96, 416<br>32, 16 | | 96, 416<br>32, 16 | 96, 416<br>32, 16 |
| FFVD1760 | 42.5x42.5 | | | | | | | | | | 48, 260<br>44, 28 | 48, 260<br>44, 28 |
| FFVE1924 | 45x45 | | | | | | | | | | 96, 572<br>44, 0 | 96, 572<br>44, 0 |

1. Go to Ordering Information for package designation details.
2. FB/FF packages have 1.0mm ball pitch. SB/SF packages have 0.8mm ball pitch.
3. All device package combinations bond out 4 PS-GTR transceivers.
4. All device package combinations bond out 214 PS I/O except ZU2EG and ZU3EG in the SBVA484 and SFVA625 packages, which bond out 170 PS I/Os.
5. Packages with the same last letter and number sequence, e.g., A484, are footprint compatible with all other UltraScale devices with the same sequence. The footprint compatible devices within this family are outlined.
6. All 58 HP I/O pins are powered by the same $V_{CCO}$ supply.
7. GTH transceivers in the SFVC784 package support data rates up to 12.5Gb/s.

Table 6: Zynq UltraScale+ MPSoC: EV Device-Package Combinations and Maximum I/Os

| Package<sup>(1)(2)(3)(4)</sup> | Package Dimensions (mm) | ZU4EV | ZU5EV | ZU7EV |
|---|---|---|---|---|
| | | HD, HP<br>GTH, GTY | HD, HP<br>GTH, GTY | HD, HP<br>GTH, GTY |
| SFVC784<sup>(5)</sup> | 23x23 | 96, 156<br>4, 0 | 96, 156<br>4, 0 | |
| FBVB900 | 31x31 | 48, 156<br>16, 0 | 48, 156<br>16, 0 | 48, 156<br>16, 0 |
| FFVC1156 | 35x35 | | | 48, 312<br>20, 0 |
| FFVF1517 | 40x40 | | | 48, 416<br>24, 0 |

1. Go to Ordering Information for package designation details.
2. FB/FF packages have 1.0mm ball pitch. SF packages have 0.8mm ball pitch.
3. All device package combinations bond out 4 PS-GTR transceivers.
4. Packages with the same last letter and number sequence, e.g., C784, are footprint compatible with all other UltraScale devices with the same sequence. The footprint compatible devices within this family are outlined.
5. GTH transceivers in the SFVC784 package support data rates up to 12.5Gb/s.

# **Zynq UltraScale+ MPSoCs**

A comprehensive device family, Zynq UltraScale+ MPSoCs offer single-chip, all programmable, heterogeneous multiprocessors that provide designers with software, hardware, interconnect, power, security, and I/O programmability.
The range of devices in the Zynq UltraScale+ MPSoC family allows designers to target cost-sensitive as well as high-performance applications from a single platform using industry-standard tools. While each Zynq UltraScale+ MPSoC contains the same PS, the PL, Video hard blocks, and I/O resources vary between the devices.

Table 7: Zynq UltraScale+ MPSoC Device Features

| | CG Devices | EG Devices | EV Devices |
|---|---|---|---|
| APU | Dual-core ARM Cortex-A53 | Quad-core ARM Cortex-A53 | Quad-core ARM Cortex-A53 |
| RPU | Dual-core ARM Cortex-R5 | Dual-core ARM Cortex-R5 | Dual-core ARM Cortex-R5 |
| GPU | - | Mali-400MP2 | Mali-400MP2 |
| VCU | - | - | H.264/H.265 |

The Zynq UltraScale+ MPSoCs are able to serve a wide range of applications including:

- Automotive: Driver assistance, driver information, and infotainment
- Wireless Communications: Support for multiple spectral bands and smart antennas
- Wired Communications: Multiple wired communications standards and context-aware network services
- Data Centers: Software Defined Networks (SDN), data pre-processing, and analytics
- Smarter Vision: Evolving video-processing algorithms, object detection, and analytics
- Connected Control/M2M: Flexible/adaptable manufacturing, factory throughput, quality, and safety

The UltraScale MPSoC architecture provides processor scalability from 32 to 64 bits with support for virtualization, the combination of soft and hard engines for real-time control, graphics/video processing, waveform and packet processing, next-generation interconnect and memory, advanced power management, and technology enhancements that deliver multi-level security, safety, and reliability.

Xilinx offers a large number of soft IP for the Zynq UltraScale+ MPSoC family. Stand-alone and Linux device drivers are available for the peripherals in the PS and the PL.
Xilinx's Vivado® Design Suite, SDK™, and PetaLinux development environments enable rapid product development for software, hardware, and systems engineers. The ARM-based PS also brings a broad range of third-party tools and IP providers in combination with Xilinx's existing PL ecosystem.

The Zynq UltraScale+ MPSoC family delivers unprecedented processing, I/O, and memory bandwidth in the form of an optimized mix of heterogeneous processing engines embedded in a next-generation, high-performance, on-chip interconnect with appropriate on-chip memory subsystems. The heterogeneous processing and programmable engines, which are optimized for different application tasks, enable the Zynq UltraScale+ MPSoCs to deliver the extensive performance and efficiency required to address next-generation smarter systems while retaining backwards compatibility with the original Zynq-7000 All Programmable SoC family. The UltraScale MPSoC architecture also incorporates multiple levels of security, increased safety, and advanced power management, which are critical requirements of next-generation smarter systems. Xilinx's embedded UltraFast™ design methodology fully exploits the

# **Processing System**

## **Application Processing Unit (APU)**

The key features of the APU include:

- 64-bit quad-core ARM Cortex-A53 MPCores.
  Features associated with each core include:
  - ARM v8-A Architecture
  - Operating target frequency: up to 1.5GHz
  - Single and double precision floating point: 4 SP / 2 DP FLOPs
  - NEON Advanced SIMD support with single and double precision floating point instructions
  - A64 instruction set in 64-bit operating mode, A32/T32 instruction set in 32-bit operating mode
  - Level 1 cache (separate instruction and data, 32KB each for each Cortex-A53 CPU)
    - 2-way set-associative Instruction Cache with parity support
    - 4-way set-associative Data Cache with ECC support
  - Integrated memory management unit (MMU) per processor core
  - TrustZone for secure mode operation
  - Virtualization support
- Ability to operate in single processor, symmetric quad processor, and asymmetric quad-processor modes
- Integrated 16-way set-associative 1MB Unified Level 2 cache with ECC support
- Interrupts and Timers
  - Generic interrupt controller (GIC-400)
  - ARM generic timers (4 timers per CPU)
  - One watchdog timer (WDT)
  - One global timer
  - Two triple timers/counters (TTC)
- Little and big endian support
  - Big endian support in BE8 mode
- CoreSight debug and trace support
  - Embedded Trace Macrocell (ETM) for instruction trace
  - Cross trigger interface (CTI) enabling hardware breakpoints and triggers
- ACP interface to PL for I/O coherency and Level 2 cache allocation
- ACE interface to PL for full coherency
- Power island gating on each processor core
- Optional eFUSE disable per core

## Xilinx Memory Protection Unit (XMPU)

- Region based memory protection unit
- Up to 16 regions
- Each region supports address alignment of 1MB or 4KB
- Regions can overlap; the higher region number has priority
- Each region can be independently enabled or disabled
- Each region has a start and end address

## **Graphics Processing Unit (GPU)**

- Supports OpenGL ES 1.1 & 2.0
- Supports OpenVG 1.1
- Operating target frequency: up to 667MHz
- Single Geometry Processor and two Pixel processor
- Pixel Fill Rate: 2
Mpixel/sec/MHz
- Triangle Rate: 0.11 Mtriangles/sec/MHz
- 64KB Level 2 Cache (read-only)
- 4X and 16X Anti-aliasing Support
- ETC1 texture compression to reduce external memory bandwidth
- Extensive texture format support
  - RGBA 8888, 565, 1556
  - Mono 8, 16
  - YUV format support
- Automatic load balancing across different graphics shader engines
- 2D and 3D graphic acceleration
- Up to 4K texture input and 4K render output resolutions
- Each geometry processor and pixel processor supports 4KB page MMU
- Power island gating on each GPU engine and shared cache
- Optional eFUSE disable

## **Dynamic Memory Controller (DDRC)**

- DDR3, DDR3L, DDR4, LPDDR3, LPDDR4
- Target data rate: Up to 2400Mb/s DDR4 operation in -1 speed grade
- 32-bit and 64-bit bus width support for DDR4, DDR3, DDR3L, or LPDDR3 memories, and 32-bit bus width support for LPDDR4 memory
- ECC support (using extra bits)
- Up to a total DRAM capacity of 32GB
- Full duplex flow control with recognition of incoming pause frames and hardware generation of transmitted pause frames
- 802.1Q VLAN tagging with recognition of incoming VLAN and priority tagged frames
- Supports IEEE Std 1588 v2

### SD/SDIO 3.0 Controller

In addition to secure digital (SD) devices, this controller also supports eMMC 4.51.
- Host mode support only
- Built-in DMA
- 1/4-Bit SD Specification, version 3.0
- 1/4/8-Bit eMMC Specification, version 4.51
- Supports primary boot from SD Card and eMMC (Managed NAND)
- High speed, default speed, and low-speed support
- 1 and 4-bit data interface support
- Low speed clock 0-400KHz
- Default speed 0-25MHz
- High speed clock 0-50MHz
- High speed Interface
  - SD UHS-1: 208MHz
  - eMMC HS200: 200MHz
- Memory, I/O, and SD cards
- Power control modes
- Data FIFO interface up to 512B

#### **UART**

- Programmable baud rate generator
- 6, 7, or 8 data bits
- 1, 1.5, or 2 stop bits
- Odd, even, space, mark, or no parity
- Parity, framing, and overrun error detection
- Line break generation and detection
- Automatic echo, local loopback, and remote loopback channel modes
- Modem control signals: CTS, RTS, DSR, DTR, RI, and DCD (from EMIO only)

#### SPI

- Full-duplex operation offers simultaneous receive and transmit
- 128B deep read and write FIFO
- Master or slave SPI mode
- Up to 3 chip select lines
- Multi-master environment
- Identifies an error condition if more than one master detected
- Selectable master clock reference
- Software can poll for status or be interrupt driven

#### **I2C**

- 128-bit buffer size
- Both normal (100kHz) and fast bus data rates (400kHz)
- Master or slave mode
- Normal or extended addressing
- I2C bus hold for slow host service

#### **GPIO**

- Up to 128 GPIO bits
- Up to 78-bits from MIO and 96-bits from EMIO
- Each GPIO bit can be dynamically programmed as input or output
- Independent reset values for each bit of all registers
- Interrupt request generation for each GPIO signal
- Single Channel (Bit) write capability for all control registers, including the data output register, direction control register, and interrupt clear register
- Read back in output mode

#### CAN

- Conforms to the ISO 11898-1, CAN 2.0A, and CAN 2.0B standards
- Both standard (11-bit identifier) and extended (29-bit identifier) frames
- Bit rates up to
1Mb/s
- Transmit and Receive message FIFO with a depth of 64 messages
- Watermark interrupts for TXFIFO and RXFIFO
- Automatic re-transmission on errors or arbitration loss in normal mode
- Acceptance filtering of 4 acceptance filters
- Sleep Mode with automatic wake-up
- Snoop Mode
- 16-bit timestamping for receive messages
- Both internal generated reference clock and external reference clock input from MIO
- Guarantee clock sampling edge between 80 to 83% at 24MHz reference clock input
- Optional eFUSE disable per port

#### **USB 2.0**

- Two USB controllers (configurable as USB 2.0 or USB 3.0)
- Host, device and On-The-Go (OTG) modes
- High Speed, Full Speed, and Low Speed
- Up to 12 endpoints
- 8-bit ULPI External PHY Interface
- The USB host controller registers and data structures are compliant to Intel xHCI specifications.
- 64-bit AXI master port with built-in DMA
- Power management features: hibernation mode

## **Static Memory Interfaces**

The static memory interfaces support external static memories.
- ONFI 3.1 NAND flash support with up to 24-bit ECC
- 1-bit SPI, 2-bit SPI, 4-bit SPI (Quad-SPI), or two Quad-SPI (8-bit) serial NOR flash
- 8-bit eMMC interface supporting managed NAND flash

#### NAND ONFI 3.1 Flash Controller

- ONFI 3.1 compliant
- Supports chip select reduction per ONFI 3.1 spec
- SLC NAND for boot/configuration and data storage
- ECC options based on SLC NAND
  - 1, 4, or 8 bits per 512+spare bytes
  - 24 bits per 1024+spare bytes
- Maximum throughput as follows
  - Asynchronous mode (SDR) 24.3MB/s
  - Synchronous mode (NV-DDR) 112MB/s (for 100MHz flash clock)
- 8-bit SDR NAND interface
- 2 chip selects
- Programmable access timing
- 1.8V and 3.3V I/O
- Built-in DMA for improved performance

### **Quad-SPI Controller**

- 4 bytes (32-bit) and 3 bytes (24-bit) address width
- Maximum SPI Clock at Master Mode at 150MHz
- Single, Dual-Parallel, and Dual-Stacked mode
- 32-bit AXI Linear Address Mapping Interface for read operation
- Up to 2 chip select signals
- Write Protection Signal
- Hold signals
- 4-bit bidirectional I/O signals
- x1/x2/x4 Read speed required
- x1 write speed required only
- 64 byte Entry FIFO depth to improve QSPI read efficiency
- Built-in DMA for improved performance

# Video Encoder/Decoder (VCU)

Zynq UltraScale+ MPSoCs include a Video codec (encoder/decoder) available in the devices designated with the EV suffix. The VCU is located in the PL and can be accessed from either the PL or PS.
- Simultaneous Encode and Decode through separate cores
- H.264 high profile level 5.2 (4Kx2K-60)
- H.265 (HEVC) main, main10 profile, level 5.1, high Tier, up to 4Kx2K-60 rate
- 8 and 10 bit encoding
- 4:2:0 and 4:2:2 chroma sampling
- 8Kx4K-15 rate
- Multi-stream up to total of 4Kx2K-60 rate
- Low Latency mode
- Can share the PS DRAM or use dedicated DRAM in the PL
- Clock/power management
- OpenMax Linux drivers

### **Out-of-Band Signaling**

The transceivers provide out-of-band (OOB) signaling, often used to send low-speed signals from the transmitter to the receiver while high-speed serial data transmission is not active. This is typically done when the link is in a powered-down state or has not yet been initialized. This benefits PCIe and SATA/SAS and QPI applications.

# **Integrated Interface Blocks for PCI Express Designs**

The MPSoC PL includes integrated blocks for PCIe technology that can be configured as an Endpoint or Root Port, compliant to the PCI Express Base Specification Revision 3.1 for Gen3 and lower data rates and compatible with the PCI Express Base Specification Revision 4.0 (rev 0.5) for Gen4 data rates. The Root Port can be used to build the basis for a compatible Root Complex, to allow custom chip-to-chip communication via the PCI Express protocol, and to attach ASSP Endpoint devices, such as Ethernet Controllers or Fibre Channel HBAs, to the MPSoC.

This block is highly configurable to system design requirements and can operate 1, 2, 4, 8, or 16 lanes at up to 2.5Gb/s, 5.0Gb/s, 8.0Gb/s, or 16Gb/s data rates. For high-performance applications, advanced buffering techniques of the block offer a flexible maximum payload size of up to 1,024 bytes. The integrated block interfaces to the integrated high-speed transceivers for serial connectivity and to block RAMs for data buffering. Combined, these elements implement the Physical Layer, Data Link Layer, and Transaction Layer of the PCI Express protocol.
Xilinx provides a light-weight, configurable, easy-to-use LogiCORE™ IP wrapper that ties the various building blocks (the integrated block for PCIe, the transceivers, block RAM, and clocking resources) into an Endpoint or Root Port solution. The system designer has control over many configurable parameters: link width and speed, maximum payload size, MPSoC logic interface speeds, reference clock frequency, and base address register decoding and filtering.

# **Integrated Block for Interlaken**

Some UltraScale architecture-based devices include integrated blocks for Interlaken. Interlaken is a scalable chip-to-chip interconnect protocol designed to enable transmission speeds from 10Gb/s to 150Gb/s. The Interlaken integrated block in the UltraScale architecture is compliant to revision 1.2 of the Interlaken specification with data striping and de-striping across 1 to 12 lanes. Permitted configurations are: 1 to 12 lanes at up to 12.5Gb/s and 1 to 6 lanes at up to 25.78125Gb/s, enabling flexible support for up to 150Gb/s per integrated block. With multiple Interlaken blocks, certain UltraScale architecture-based devices enable easy, reliable Interlaken switches and bridges.

# **Integrated Block for 100G Ethernet**

Compliant to the IEEE Std 802.3ba, the 100G Ethernet integrated blocks in the UltraScale architecture provide low latency 100Gb/s Ethernet ports with a wide range of user customization and statistics gathering. With support for 10 x 10.3125Gb/s (CAUI) and 4 x 25.78125Gb/s (CAUI-4) configurations, the integrated block includes both the 100G MAC and PCS logic with support for IEEE Std 1588v2 1-step and 2-step hardware timestamping.

In UltraScale+ devices, the 100G Ethernet blocks contain a Reed Solomon Forward Error Correction (RS-FEC) block, compliant to IEEE Std 802.3bj, that can be used with the Ethernet block or stand alone in user applications. These families also support OTN mapping mode, in which the PCS can operate without using the MAC.
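
The lane configurations quoted above for Interlaken and 100G Ethernet can be related to their headline rates with simple arithmetic. The sketch below is illustrative only: the function names are invented here, the 64b/66b line code for the 100G Ethernet lanes is taken from the IEEE 802.3 100GBASE-R PCS rather than from this document, and the 64/67 ratio is the Interlaken encoding listed in the feature summary. Framing, FEC, and other protocol overhead are ignored.

```python
def aggregate_gbps(lanes: int, lane_rate_gbps: float) -> float:
    """Raw aggregate line rate across all lanes, in Gb/s."""
    return lanes * lane_rate_gbps


def after_line_coding(raw_gbps: float, data_bits: int, coded_bits: int) -> float:
    """Rate left after removing line-coding overhead (e.g. 64 of every 66 bits)."""
    return raw_gbps * data_bits / coded_bits


# 100G Ethernet: both lane configurations give the same raw rate.
caui_10 = aggregate_gbps(10, 10.3125)   # 103.125 Gb/s
caui_4 = aggregate_gbps(4, 25.78125)    # 103.125 Gb/s
# Assumption: 64b/66b coding, as used by the 100GBASE-R PCS.
ethernet_data = after_line_coding(caui_10, 64, 66)  # 100.0 Gb/s

# Interlaken: 12 lanes at 12.5Gb/s reach the 150Gb/s quoted per block.
interlaken_raw = aggregate_gbps(12, 12.5)            # 150.0 Gb/s
interlaken_data = after_line_coding(interlaken_raw, 64, 67)

print(f"CAUI-10 raw: {caui_10} Gb/s, CAUI-4 raw: {caui_4} Gb/s")
print(f"100G Ethernet after 64b/66b: {ethernet_data} Gb/s")
print(f"Interlaken raw: {interlaken_raw} Gb/s, after 64/67: {interlaken_data:.2f} Gb/s")
```

This makes explicit why the per-lane Ethernet rates are 10.3125Gb/s and 25.78125Gb/s rather than round numbers: the extra 3.125% carries the line-coding bits.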
# **Clock Management**

The clock generation and distribution components in UltraScale architecture-based devices are located adjacent to the columns that contain the memory interfacing and input and output circuitry. This tight coupling of clocking and I/O provides low-latency clocking to the I/O for memory interfaces and other I/O protocols. Within every clock management tile (CMT) resides one mixed-mode clock manager (MMCM), two PLLs, clock distribution buffers and routing, and dedicated circuitry for implementing external memory interfaces.

### **Mixed-Mode Clock Manager**

The mixed-mode clock manager (MMCM) can serve as a frequency synthesizer for a wide range of frequencies and as a jitter filter for incoming clocks. At the center of the MMCM is a voltage-controlled oscillator (VCO), which speeds up and slows down depending on the input voltage it receives from the phase frequency detector (PFD).

Three sets of programmable frequency dividers (D, M, and O) are programmable by configuration and during normal operation via the Dynamic Reconfiguration Port (DRP). The pre-divider D reduces the input frequency and feeds one input of the phase/frequency comparator. The feedback divider M acts as a multiplier because it divides the VCO output frequency before feeding the other input of the phase comparator. D and M must be chosen appropriately to keep the VCO within its specified frequency range. The VCO has eight equally-spaced output phases (0°, 45°, 90°, 135°, 180°, 225°, 270°, and 315°). Each phase can be selected to drive one of the output dividers, and each divider is programmable by configuration to divide by any integer from 1 to 128.

The MMCM has three input-jitter filter options: low bandwidth, high bandwidth, or optimized mode. Low-Bandwidth mode has the best jitter attenuation. High-Bandwidth mode has the best phase offset. Optimized mode allows the tools to find the best setting.
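
The roles of the D, M, and O dividers described above can be summarized as f_VCO = f_in x M / D and f_out = f_VCO / O. The following sketch expresses that relationship; it is not a Xilinx API. The function name is hypothetical, and the VCO limits are passed in by the caller because the valid range is device-specific and not given in this section (the 800MHz and 1,600MHz values in the usage lines are placeholders for illustration, so check the device data sheet).

```python
def mmcm_frequencies(
    f_in_mhz: float,
    d: int,
    m: float,
    o: int,
    vco_min_mhz: float,
    vco_max_mhz: float,
) -> tuple[float, float]:
    """Return (f_vco, f_out) in MHz for pre-divider D, feedback divider M,
    and output divider O.

    vco_min_mhz / vco_max_mhz are caller-supplied limits (device-specific).
    """
    if d < 1:
        raise ValueError("D must be at least 1")
    if not 1 <= o <= 128:
        raise ValueError("output divider O must be an integer from 1 to 128")

    # The PFD compares f_in / D with f_vco / M, so in lock f_vco = f_in * M / D.
    f_vco = f_in_mhz * m / d
    if not vco_min_mhz <= f_vco <= vco_max_mhz:
        raise ValueError(f"VCO frequency {f_vco} MHz is outside the allowed range")

    return f_vco, f_vco / o


# Usage: 100MHz input, D=1, M=12, O=6, with placeholder VCO limits.
f_vco, f_out = mmcm_frequencies(100.0, d=1, m=12, o=6,
                                vco_min_mhz=800.0, vco_max_mhz=1600.0)
print(f"VCO: {f_vco} MHz, output: {f_out} MHz")  # VCO: 1200.0 MHz, output: 200.0 MHz
```

The range check reflects the requirement that D and M be chosen to keep the VCO within its specified frequency range; a combination such as D=1, M=4 with the same input would be rejected under the placeholder limits.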
The MMCM can have a fractional counter in either the feedback path (acting as a multiplier) or in one output path. Fractional counters allow non-integer increments of 1/8 and can thus increase frequency synthesis capabilities by a factor of 8. The MMCM can also provide fixed or dynamic phase shift in small increments that depend on the VCO frequency. At 1,600MHz, the phase-shift timing increment is 11.2ps.

#### **PLL**

With fewer features than the MMCM, the two PLLs in a clock management tile are primarily present to provide the necessary clocks to the dedicated memory interface circuitry. The circuit at the center of the PLLs is similar to the MMCM, with PFD feeding a VCO and programmable M, D, and O counters. There are two divided outputs to the device fabric per PLL as well as one clock plus one enable signal to the memory interface circuitry.

Zynq UltraScale+ MPSoCs are equipped with five additional PLLs in the PS for independently configuring the four primary clock domains with the PS: the APU, the RPU, the DDR controller, and the I/O peripherals.

## **Clock Distribution**

Clocks are distributed throughout Zynq UltraScale+ MPSoCs via buffers that drive a number of vertical and horizontal tracks. There are 24 horizontal clock routes per clock region and 24 vertical clock routes per clock region with 24 additional vertical clock routes adjacent to the MMCM and PLL. Within a clock region, clock signals are routed to the device logic (CLBs, etc.) via 16 gateable leaf clocks.

Several types of clock buffers are available. The BUFGCE and BUFCE\_LEAF buffers provide clock gating at the global and leaf levels, respectively. BUFGCTRL provides glitchless clock muxing and gating capability. BUFGCE\_DIV has clock gating capability and can divide a clock by 1 to 8. BUFG\_GT performs clock division from 1 to 8 for the transceiver clocks. In MPSoCs, clocks can be transferred from the PS to the PL using dedicated buffers.
## **Memory Interfaces**

Memory interface data rates continue to increase, driving the need for dedicated circuitry that enables high performance, reliable interfacing to current and next-generation memory technologies. Every Zynq UltraScale+ MPSoC includes dedicated physical interfaces (PHY) blocks located between the CMT and I/O columns that support implementation of high-performance PHY blocks to external memories such as DDR4, DDR3, QDRII+, and RLDRAM3. The PHY blocks in each I/O bank generate the address/control and data bus signaling protocols as well as the precision clock/data alignment required to reliably communicate with a variety of high-performance memory standards. Multiple I/O banks can be used to create wider memory interfaces.

As well as external parallel memory interfaces, Zynq UltraScale+ MPSoC can communicate to external serial memories, such as Hybrid Memory Cube (HMC), via the high-speed serial transceivers. All transceivers in the UltraScale architecture support the HMC protocol, up to 15Gb/s line rates. UltraScale architecture-based devices support the highest bandwidth HMC configuration of 64 lanes with a single device.

### **Programmable Data Width**

Each port can be configured as $32K \times 1$; $16K \times 2$; $8K \times 4$; $4K \times 9$ (or 8); $2K \times 18$ (or 16); $1K \times 36$ (or 32); or $512 \times 72$ (or 64). Whether configured as block RAM or FIFO, the two ports can have different aspect ratios without any constraints. Each block RAM can be divided into two completely independent 18Kb block RAMs that can each be configured to any aspect ratio from $16K \times 1$ to $512 \times 36$. Everything described previously for the full 36Kb block RAM also applies to each of the smaller 18Kb block RAMs.

Only in simple dual-port (SDP) mode can data widths of greater than 18 bits (18Kb RAM) or 36 bits (36Kb RAM) be accessed. In this mode, one port is dedicated to read operation, the other to write operation.
In SDP mode, one side (read or write) can be variable, while the other is fixed to 32/36 or 64/72. Both sides of the dual-port 36Kb RAM can be of variable width.

### **Error Detection and Correction**

Each 64-bit-wide block RAM can generate, store, and utilize eight additional Hamming code bits and perform single-bit error correction and double-bit error detection (ECC) during the read process. The ECC logic can also be used when writing to or reading from external 64- to 72-bit-wide memories.

### **FIFO Controller**

Each block RAM can be configured as a 36Kb FIFO or an 18Kb FIFO. The built-in FIFO controller for single-clock (synchronous) or dual-clock (asynchronous or multirate) operation increments the internal addresses and provides four handshaking flags: full, empty, programmable full, and programmable empty. The programmable flags allow the user to specify the FIFO counter values that make these flags go active. The FIFO width and depth are programmable, with support for different read port and write port widths on a single FIFO. A dedicated cascade path allows for easy creation of deeper FIFOs.

### **UltraRAM**

UltraRAM is a high-density, dual-port, synchronous memory block used in some UltraScale+ families. Both of the ports share the same clock and can address all of the 4K x 72 bits. Each port can independently read from or write to the memory array. UltraRAM supports two types of write enable schemes. The first mode is consistent with the block RAM byte write enable mode. The second mode allows gating the data and parity byte writes separately.

Multiple UltraRAM blocks can be cascaded together to create larger memory arrays. Dedicated routing in the UltraRAM column enables the entire column height to be connected together. This makes UltraRAM an ideal solution for replacing external memories such as SRAM.
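The UltraRAM geometry described above (4K x 72 bits per block, with blocks cascaded for larger arrays) lends itself to a quick sizing calculation. The following is a minimal sketch, not a vendor tool: the helper names are hypothetical, and it assumes that depth is extended by cascading blocks while wider words are built by placing blocks side by side.

```python
import math

ULTRARAM_DEPTH = 4 * 1024     # 4K addressable words per block (from the text)
ULTRARAM_WIDTH_BITS = 72      # bits per word (from the text)
ULTRARAM_BLOCK_BITS = ULTRARAM_DEPTH * ULTRARAM_WIDTH_BITS  # 294,912 bits = 288Kb


def cascade_capacity_kb(num_blocks):
    """Total capacity in Kb (1Kb = 1,024 bits) of num_blocks cascaded blocks."""
    if num_blocks < 1:
        raise ValueError("at least one block is required")
    return num_blocks * ULTRARAM_BLOCK_BITS // 1024


def blocks_needed(depth_words, width_bits=ULTRARAM_WIDTH_BITS):
    """Blocks needed for a depth_words x width_bits memory.

    Assumption: depth grows by cascading blocks, width by parallel blocks.
    """
    rows = math.ceil(depth_words / ULTRARAM_DEPTH)
    columns = math.ceil(width_bits / ULTRARAM_WIDTH_BITS)
    return rows * columns


if __name__ == "__main__":
    print(cascade_capacity_kb(1))           # capacity of a single block in Kb
    n = blocks_needed(64 * 1024, 72)        # e.g. replacing a 64K x 72 SRAM
    print(n, cascade_capacity_kb(n))
```

For example, replacing a 64K x 72 external SRAM would take 16 cascaded blocks under these assumptions, for 4,608Kb (4.5Mb) in total.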
Cascadable anywhere from 288Kb to 36Mb, UltraRAM provides the flexibility to fulfill many different memory requirements.

### **Error Detection and Correction**

Each 64-bit-wide UltraRAM can generate, store, and utilize eight additional Hamming code bits and perform single-bit error correction and double-bit error detection (ECC) during the read process.

In FPGAs and the MPSoC PL, sensor outputs and up to 17 user-allocated external analog inputs are digitized using a 10-bit 200 kilo-sample-per-second (kSPS) ADC, and the measurements are stored in registers that can be accessed via internal FPGA (DRP), JTAG, PMBus, or I2C interfaces. The I2C interface and PMBus allow the on-chip monitoring to be easily accessed by the System Manager/Host before and after device configuration.

The System Monitor in the MPSoC PS uses a 10-bit, 1 mega-sample-per-second (MSPS) ADC to digitize the sensor inputs. The measurements are stored in registers and are accessed via the Advanced Peripheral Bus (APB) interface by the processors and the PMU in the PS.

# **Packaging**

The UltraScale architecture-based devices are available in a variety of organic flip-chip and lidless flip-chip packages supporting different quantities of I/Os and transceivers. Maximum supported performance can depend on the style of package and its material. Always refer to the specific device data sheet for performance specifications by package type.

In flip-chip packages, the silicon device is attached to the package substrate using a high-performance flip-chip process. Decoupling capacitors are mounted on the package substrate to optimize signal integrity under simultaneous switching of outputs (SSO) conditions.
## **System-Level Features**

Several functions span both the PS and PL and include:

- Reset Management
- Clock Management
- Power Domains
- PS Boot and Device Configuration
- Hardware and Software Debug Support

### **Reset Management**

The reset management function provides the ability to reset the entire device or individual units within it. The PS supports these reset functions and signals:

- External and internal power-on reset signal
- Warm reset
- Watchdog timer reset
- User resets to PL
- Software, watchdog timer, or JTAG provided resets
- Security violation reset (locked down reset)

### **Clock Management**

The PS in Zynq UltraScale+ MPSoCs is equipped with five phase-locked loops (PLLs), providing flexibility in configuring the clock domains within the PS. There are four primary clock domains of interest within the PS. These include the APU, the RPU, the DDR controller, and the I/O peripherals (IOP). The frequencies of all of these domains can be configured independently under software control.

### **Power Domains**

The Zynq UltraScale+ MPSoC contains four separate power domains. When they are connected to separate power supplies, they can be completely powered down independently of each other without consuming any dynamic or static power. The processing system includes:

- Full Power Domain (FPD)
- Low Power Domain (LPD)
- Battery Powered Domain (BPD)

In addition to these three Processing System power domains, the PL can also be completely powered down if connected to separate power supplies.
The Full Power Domain (FPD) consists of the following major blocks:

- Application Processing Unit (APU)
- DMA (FP-DMA)
- Graphics Processing Unit (GPU)
- Dynamic Memory Controller (DDRC)
- High-Speed I/O Peripherals

The Low Power Domain (LPD) consists of the following major blocks:

- Real-Time Processing Unit (RPU)
- DMA (LP-DMA)
- Platform Management Unit (PMU)
- Configuration Security Unit (CSU)
- Low-Speed I/O Peripherals
- Static Memory Interfaces

The Battery Power Domain (BPD) is the lowest power domain of the Zynq UltraScale+ MPSoC processing system. In this mode, the entire PS is powered off except for the Real-Time Clock (RTC) and battery-backed RAM (BBRAM).

### **Power Examples**

Power for the Zynq UltraScale+ MPSoCs varies depending on the utilization of the PL resources and the frequency of the PS and PL. To estimate power, use the Xilinx Power Estimator (XPE) at:

http://www.xilinx.com/products/design\_tools/logic\_design/xpe.htm

## **PS Boot and Device Configuration**

Zynq UltraScale+ MPSoCs use a multi-stage boot process that supports both a non-secure and a secure boot. The PS is the master of the boot and configuration process. For a secure boot, the AES-GCM and SHA-3/384 blocks decrypt and authenticate the images, while the 4096-bit RSA block authenticates the image.

Upon reset, the device mode pins are read to determine the primary boot device to be used: NAND, Quad-SPI, SD, eMMC, or JTAG. JTAG can only be used as a non-secure boot source and is intended for debugging purposes. The CSU executes code out of on-chip ROM and copies the first stage boot loader (FSBL) from the boot device to the OCM.

After copying the FSBL to OCM, one of the processors, either the Cortex-A53 or Cortex-R5, executes the FSBL. Xilinx supplies example FSBLs or users can create their own. The FSBL initiates the boot of the PS and can load and configure the PL, or configuration of the PL can be deferred to a later stage.
The FSBL typically loads either a user application or an optional second stage boot loader (SSBL), such as U-Boot. Users can obtain an example SSBL from Xilinx or a third party, or they can create their own SSBL. The SSBL continues the boot process by loading code from any of the primary boot devices or from other sources such as USB, Ethernet, etc. If the FSBL did not configure the PL, the SSBL can do so, or again, the configuration can be deferred to a later stage.

The static memory interface controller (NAND, eMMC, or Quad-SPI) is configured using default settings. To improve device configuration speed, these settings can be modified by information provided in the boot image header. The ROM boot image is not user readable or callable after boot.

## **Hardware and Software Debug Support**

The debug system used in Zynq UltraScale+ MPSoCs is based on the ARM CoreSight architecture. It uses ARM CoreSight components including an embedded trace controller (ETC), an embedded trace Macrocell (ETM) for each Cortex-A53 and Cortex-R5 processor, and a system trace Macrocell (STM). This enables advanced debug features like event trace, debug breakpoints and triggers, cross-trigger, and debug bus dump to memory. The programmable logic can be debugged with the Xilinx Vivado Logic Analyzer.

### **Debug Ports**

Three JTAG ports are available and can be chained together or used separately. When chained together, a single port is used for chip-level JTAG functions, ARM processor code downloads and run-time control operations, PL configuration, and PL debug with the Vivado Logic Analyzer. This enables tools such as the Xilinx Software Development Kit (SDK) and Vivado Logic Analyzer to share a single download cable from Xilinx.

When the JTAG chain is split, one port is used to directly access the ARM DAP interface. This CoreSight interface enables the use of ARM-compliant debug and software development tools such as Development Studio 5 (DS-5™).
The other JTAG port can then be used by the Xilinx FPGA tools for access to the PL, including configuration bitstream downloads and PL debug with the Vivado Logic Analyzer. In this mode, users can download to and debug the PL in the same manner as a stand-alone FPGA.

# **Ordering Information**

Table 12 shows the speed and temperature grades available in the different device families.

Table 12: Speed Grade and Temperature Grade

| Device<br>Family | Devices | Commercial (C)<br>0°C to +85°C | Extended (E)<br>0°C to +100°C | Extended (E)<br>0°C to +110°C | Industrial (I)<br>-40°C to +100°C |
|------------------|---------|--------------------------------|-------------------------------|-------------------------------|-----------------------------------|
| Zynq<br>UltraScale+ | CG<br>Devices | | -2E (0.85V) | | -2I (0.85V) |
| | | | | -2LE<sup>(1)(2)</sup> (0.85V or 0.72V) | |
| | | | -1E (0.85V) | | -1I (0.85V) |
| | | | | | -1LI<sup>(2)</sup> (0.85V or 0.72V) |
| Zynq<br>UltraScale+ | ZU2EG<br>ZU3EG | | -2E (0.85V) | | -2I (0.85V) |
| | | | | -2LE<sup>(1)(2)</sup> (0.85V or 0.72V) | |
| | | | -1E (0.85V) | | -1I (0.85V) |
| | | | | | -1LI<sup>(2)</sup> (0.85V or 0.72V) |
| Zynq<br>UltraScale+ | ZU4EG<br>ZU5EG<br>ZU6EG<br>ZU7EG<br>ZU9EG<br>ZU11EG<br>ZU15EG<br>ZU17EG<br>ZU19EG | | -3E (0.90V) | | |
| | | | -2E (0.85V) | | -2I (0.85V) |
| | | | | -2LE<sup>(1)(2)</sup> (0.85V or 0.72V) | |
| | | | -1E (0.85V) | | -1I (0.85V) |
| | | | | | -1LI<sup>(2)</sup> (0.85V or 0.72V) |
| Zynq<br>UltraScale+ | EV<br>Devices | | -3E (0.90V) | | |
| | | | -2E (0.85V) | | -2I (0.85V) |
| | | | | -2LE<sup>(1)(2)</sup> (0.85V or 0.72V) | |
| | | | -1E (0.85V) | | -1I (0.85V) |
| | | | | | -1LI<sup>(2)</sup> (0.85V or 0.72V) |

#### Notes:

1. In -2LE speed/temperature grade, devices can operate for a limited time with junction temperature of 110°C. Timing parameters adhere to the same speed file at 110°C as they do below 110°C, regardless of operating voltage (nominal at 0.85V or low voltage at 0.72V). Operation at 110°C Tj is limited to 1% of the device lifetime and can occur sequentially or at regular intervals as long as the total time does not exceed 1% of device lifetime.
2. In Zynq UltraScale+ MPSoCs, when operating the PL at low voltage (0.72V), the PS operates at nominal voltage (0.85V).

The ordering information shown in Figure 3 applies to all packages in the Zynq UltraScale+ MPSoCs.
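The speed and temperature grade codes in Table 12 follow a regular pattern, which the sketch below decodes into speed grade, temperature range, and supply voltage options. It is an illustration built only from the table and its notes. The `SpeedGrade` type and `decode_speed_grade` function are hypothetical names, and the sketch does not model which device groups offer which grades (for example, -3E is not listed for CG, ZU2EG, or ZU3EG devices).

```python
from dataclasses import dataclass

# Junction temperature ranges in degrees C, from the Table 12 column headers.
TJ_RANGE_C = {"E": (0, 100), "I": (-40, 100)}
TJ_LIMITED_MAX_C = 110  # applies to -2LE only, per note 1

# Supply voltage options in volts for each grade listed in Table 12.
VOLTAGES_V = {
    "-1E": (0.85,), "-1I": (0.85,), "-1LI": (0.85, 0.72),
    "-2E": (0.85,), "-2I": (0.85,), "-2LE": (0.85, 0.72),
    "-3E": (0.90,),
}


@dataclass(frozen=True)
class SpeedGrade:
    code: str
    speed: int                  # numeric speed grade (1, 2, or 3)
    low_voltage_capable: bool   # "L" grades can also run at 0.72V
    temperature_grade: str      # "E" (extended) or "I" (industrial)
    tj_range_c: tuple           # (min, max) junction temperature
    voltages_v: tuple           # allowed supply voltages
    time_limited_at_tj_max: bool  # True when note 1 restricts time at max Tj


def decode_speed_grade(code):
    """Decode a Table 12 grade code such as '-2LE' into its attributes."""
    if code not in VOLTAGES_V:
        raise ValueError(f"{code!r} is not listed in Table 12")
    body = code[1:]                        # drop the leading '-'
    temperature_grade = body[-1]
    low, high = TJ_RANGE_C[temperature_grade]
    limited = code == "-2LE"
    if limited:
        high = TJ_LIMITED_MAX_C            # 110 C for at most 1% of lifetime
    return SpeedGrade(
        code=code,
        speed=int(body[0]),
        low_voltage_capable="L" in body,
        temperature_grade=temperature_grade,
        tj_range_c=(low, high),
        voltages_v=VOLTAGES_V[code],
        time_limited_at_tj_max=limited,
    )


if __name__ == "__main__":
    for grade in ("-1LI", "-2LE", "-3E"):
        print(decode_speed_grade(grade))
```

Keeping the voltage table explicit, rather than deriving it from the code letters, makes the 0.90V exception for -3E visible and causes unlisted combinations to be rejected instead of silently accepted.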