



Welcome to **E-XFL.COM** 

**Embedded - System On Chip (SoC):** The Heart of Modern Embedded Systems

Embedded - System On Chip (SoC) refers to an integrated circuit that consolidates all the essential components of a computer system into a single chip. This includes a microprocessor, memory, and other peripherals, all packed into one compact and efficient package. SoCs are designed to provide a complete computing solution, optimizing both space and power consumption, making them ideal for a wide range of embedded applications.

What are **Embedded - System On Chip (SoC)**?

**System On Chip (SoC)** integrates multiple functions of a computer or electronic system onto a single chip. Unlike traditional multi-chip solutions. SoCs combine a central

| Details                 |                                                                            |
|-------------------------|----------------------------------------------------------------------------|
| Product Status          | Active                                                                     |
| Architecture            | MCU, FPGA                                                                  |
| Core Processor          | Quad ARM® Cortex®-A53 MPCore™ with CoreSight™                              |
| Flash Size              | -                                                                          |
| RAM Size                | 256KB                                                                      |
| Peripherals             | DMA, WDT                                                                   |
| Connectivity            | EBI/EMI, Ethernet, I <sup>2</sup> C, MMC/SD/SDIO, SPI, UART/USART, USB OTG |
| Speed                   | 1.5GHz                                                                     |
| Primary Attributes      | FPGA - 2800K Logic Elements                                                |
| Operating Temperature   | 0°C ~ 100°C (TJ)                                                           |
| Package / Case          | 2912-BBGA, FCBGA                                                           |
| Supplier Device Package | 2912-FBGA, FC (55x55)                                                      |
| Purchase URL            | https://www.e-xfl.com/product-detail/intel/1sx280lh2f55e1vg                |
|                         |                                                                            |

Email: info@E-XFL.COM

Address: Room A, 16/F, Full Win Commercial Centre, 573 Nathan Road, Mongkok, Hong Kong



# **Contents**

| 1. | Intel® Stratix® 10 GX/SX Device Overview                                       | 3  |
|----|--------------------------------------------------------------------------------|----|
|    | 1.1. Intel Stratix 10 Family Variants                                          | 4  |
|    | 1.1.1. Available Options                                                       |    |
|    | 1.2. Innovations in Intel Stratix 10 FPGAs and SoCs                            | 6  |
|    | 1.3. FPGA and SoC Features Summary                                             | 8  |
|    | 1.4. Intel Stratix 10 Block Diagram                                            |    |
|    | 1.5. Intel Stratix 10 FPGA and SoC Family Plan                                 | 11 |
|    | 1.6. HyperFlex Core Architecture                                               | 15 |
|    | 1.7. Heterogeneous 3D SiP Transceiver Tiles                                    | 16 |
|    | 1.8. Intel Stratix 10 Transceivers                                             | 17 |
|    | 1.8.1. PMA Features                                                            | 18 |
|    | 1.8.2. PCS Features                                                            | 20 |
|    | 1.9. PCI Express Gen1/Gen2/Gen3 Hard IP                                        | 21 |
|    | 1.10. Interlaken PCS Hard IP                                                   | 21 |
|    | 1.11. 10G Ethernet Hard IP                                                     |    |
|    | 1.12. External Memory and General Purpose I/O                                  | 22 |
|    | 1.13. Adaptive Logic Module (ALM)                                              |    |
|    | 1.14. Core Clocking                                                            | 24 |
|    | 1.15. Fractional Synthesis PLLs and I/O PLLs                                   |    |
|    | 1.16. Internal Embedded Memory                                                 | 25 |
|    | 1.17. Variable Precision DSP Block                                             |    |
|    | 1.18. Hard Processor System (HPS)                                              |    |
|    | 1.18.1. Key Features of the Intel Stratix 10 HPS                               |    |
|    | 1.19. Power Management                                                         |    |
|    | 1.20. Device Configuration and Secure Device Manager (SDM)                     |    |
|    | 1.21. Device Security                                                          |    |
|    | 1.22. Configuration via Protocol Using PCI Express                             |    |
|    | 1.23. Partial and Dynamic Reconfiguration                                      |    |
|    | 1.24. Fast Forward Compile                                                     |    |
|    | 1.25. Single Event Upset (SEU) Error Detection and Correction                  |    |
|    | 1.26. Document Revision History for the Intel Stratix 10 GX/SX Device Overview | 36 |



Intel's 14-nm Intel® Stratix $^{\mathbb{R}}$  10 GX FPGAs and SX SoCs deliver 2X the core performance and up to 70% lower power over previous generation high-performance FPGAs.

Featuring several groundbreaking innovations, including the all new HyperFlex  $^{\text{\tiny{TM}}}$  core architecture, this device family enables you to meet the demand for ever-increasing bandwidth and processing performance in your most advanced applications, while meeting your power budget.

With an embedded hard processor system (HPS) based on a quad-core 64-bit ARM® Cortex®-A53, the Intel Stratix 10 SoC devices deliver power efficient, application-class processing and allow designers to extend hardware virtualization into the FPGA fabric. Intel Stratix 10 SoC devices demonstrate Intel's commitment to high-performance SoCs and extend Intel's leadership in programmable devices featuring an ARM-based processor system.

Important innovations in Intel Stratix 10 FPGAs and SoCs include:

- All new HyperFlex core architecture delivering 2X the core performance compared to previous generation high-performance FPGAs
- Industry leading Intel 14-nm Tri-Gate (FinFET) technology
- Heterogeneous 3D System-in-Package (SiP) technology
- Monolithic core fabric with up to 5.5 million logic elements (LEs)
- Up to 96 full duplex transceiver channels on heterogeneous 3D SiP transceiver tiles
- Transceiver data rates up to 28.3 Gbps chip-to-chip/module and backplane performance
- M20K (20 kbit) internal SRAM memory blocks
- Fractional synthesis and ultra-low jitter LC tank based transmit phase locked loops (PLLs)
- Hard PCI Express<sup>®</sup> Gen3 x16 intellectual property (IP) blocks
- Hard 10GBASE-KR/40GBASE-KR4 Forward Error Correction (FEC) in every transceiver channel
- Hard memory controllers and PHY supporting DDR4 rates up to 2666 Mbps per pin
- Hard fixed-point and IEEE 754 compliant hard floating-point variable precision digital signal processing (DSP) blocks with up to 10 TFLOPS compute performance with a power efficiency of 80 GFLOPS per Watt
- Quad-core 64-bit ARM Cortex-A53 embedded processor running up to 1.5 GHz in SoC family variants
- Programmable clock tree synthesis for flexible, low power, low skew clock trees

S10-OVERVIEW | 2018.08.08



Common to all Intel Stratix 10 family variants is a high-performance fabric based on the new HyperFlex core architecture that includes additional Hyper-Registers throughout the interconnect routing and at the inputs of all functional blocks. The core fabric also contains an enhanced logic array utilizing Intel's adaptive logic module (ALM) and a rich set of high performance building blocks including:

- · M20K (20 kbit) embedded memory blocks
- Variable precision DSP blocks with hard IEEE 754 compliant floating-point units
- Fractional synthesis and integer PLLs
- Hard memory controllers and PHY for external memory interfaces
- General purpose IO cells

To clock these building blocks, Intel Stratix 10 devices use programmable clock tree synthesis, which uses dedicated clock tree routing to synthesize only those branches of the clock trees required for the application. All devices support in-system, finegrained partial reconfiguration of the logic array, allowing logic to be added and subtracted from the system while it is operating.

All family variants also contain high speed serial transceivers, containing both the physical medium attachment (PMA) and the physical coding sublayer (PCS), which can be used to implement a variety of industry standard and proprietary protocols. In addition to the hard PCS, Intel Stratix 10 devices contain multiple instantiations of PCI Express hard IP that supports Gen1/Gen2/Gen3 rates in x1/x2/x4/x8/x16 lane configurations, and hard 10GBASE-KR/40GBASE-KR4 FEC for every transceiver. The hard PCS, FEC, and PCI Express IP free up valuable core logic resources, save power, and increase your productivity.



### 1.1.1. Available Options

Figure 1. Sample Ordering Code and Available Options for Intel Stratix 10 Devices



### 1.2. Innovations in Intel Stratix 10 FPGAs and SoCs

Intel Stratix 10 FPGAs and SoCs deliver many significant improvements over the previous generation high-performance Stratix V FPGAs.

Table 1. Key Features of Intel Stratix 10 Devices Compared to Stratix V Devices

| Feature             | Stratix V FPGAs                                               | Intel Stratix 10 FPGAs and SoCs                                         |
|---------------------|---------------------------------------------------------------|-------------------------------------------------------------------------|
| Process technology  | 28-nm TSMC (planar transistor)                                | 14 nm Intel Tri-Gate (FinFET)                                           |
| Hard processor core | None                                                          | Quad-core 64-bit ARM Cortex-A53 (SoC only)                              |
| Core architecture   | Conventional core architecture with conventional interconnect | HyperFlex core architecture with<br>Hyper-Registers in the interconnect |
| Core performance    | 500 MHz                                                       | 1 GHz                                                                   |
| Power dissipation   | 1x                                                            | As low as 0.3x                                                          |
|                     |                                                               | continued                                                               |



| Feature                                      | Stratix V FPGAs                                                              | Intel Stratix 10 FPGAs and SoCs                                                                                                              |
|----------------------------------------------|------------------------------------------------------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------|
| Logic density                                | 952 KLE (monolithic)                                                         | 5,500 KLE (monolithic)                                                                                                                       |
| Embedded memory (M20K)                       | 52 Mbits                                                                     | 229 Mbits                                                                                                                                    |
| 18x19 multipliers                            | 3,926<br>Note: Multiplier is 18x18 in<br>Stratix V devices.                  | 11,520 Note: Multiplier is 18x19 in Intel Stratix 10 devices.                                                                                |
| Floating point DSP capability                | Up to 1 TFLOP, requires soft floating point adder and multiplier             | Up to 10 TFLOPS, hard IEEE 754 compliant single precision floating point adder and multiplier                                                |
| Maximum transceivers                         | 66                                                                           | 96                                                                                                                                           |
| Maximum transceiver data rate (chip-to-chip) | 28.05 Gbps                                                                   | 28.3 Gbps L-Tile<br>28.3 Gbps H-Tile                                                                                                         |
| Maximum transceiver data rate (backplane)    | 12.5 Gbps                                                                    | 12.5 Gbps L-Tile<br>28.3 Gbps H-Tile                                                                                                         |
| Hard memory controller                       | None                                                                         | DDR4 @ 1333 MHz/2666 Mbps<br>DDR3 @ 1067 MHz/2133 Mbps                                                                                       |
| Hard protocol IP                             | PCIe Gen3 x8 (up to 4 instances)                                             | PCIe Gen3 x16 (up to 4 instances)<br>SR-IOV (4 physical functions / 2k<br>virtual functions) on H-Tile devices<br>10GBASE-KR/40GBASE-KR4 FEC |
| Core clocking and PLLs                       | Global, quadrant and regional clocks supported by fractional-synthesis fPLLs | Programmable clock tree synthesis supported by fractional synthesis fPLLs and integer IO PLLs                                                |
| Register state readback and writeback        | Not available                                                                | Non-destructive register state<br>readback and writeback for ASIC<br>prototyping and other applications                                      |

These innovations result in the following improvements:

- **Improved Core Logic Performance**: The HyperFlex core architecture combined with Intel's 14-nm Tri-Gate technology allows Intel Stratix 10 devices to achieve 2X the core performance compared to the previous generation
- **Lower Power**: Intel Stratix 10 devices use up to 70% lower power compared to the previous generation, enabled by 14-nm Intel Tri-Gate technology, the HyperFlex core architecture, and optional power saving features built into the architecture
- Higher Density: Intel Stratix 10 devices offer over five times the level of integration, with up to 5,500K logic elements (LEs) in a monolithic fabric, over 229 Mbits of embedded memory blocks (M20K), and 11,520 18x19 multipliers
- **Embedded Processing**: Intel Stratix 10 SoCs feature a Quad-Core 64-bit ARM Cortex-A53 processor optimized for power efficiency and software compatible with previous generation Arria and Cyclone SoC devices
- Improved Transceiver Performance: With up to 96 transceiver channels implemented in heterogeneous 3D SiP transceiver tiles, Intel Stratix 10 GX and SX devices support data rates up to 28.3 Gbps chip-to-chip and 28.3 Gbps across the backplane with signal conditioning circuits capable of equalizing over 30 dB of system loss
- Improved DSP Performance: The variable precision DSP block in Intel Stratix 10 devices features hard fixed and floating point capability, with up to 10 TeraFLOPS IEEE754 single-precision floating point performance





| Feature                                 | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                     |
|-----------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Power management                        | SmartVID controlled core voltage, standard power devices     0.85-V fixed core voltage, low static power devices available     Intel Quartus® Prime Pro Edition integrated power analysis                                                                                                                                                                                                                                                                       |
| High performance monolithic core fabric | HyperFlex core architecture with Hyper-Registers throughout the interconnect routing and at the inputs of all functional blocks  Monolithic fabric minimizes compile times and increases logic utilization  Enhanced adaptive logic module (ALM)  Improved multi-track routing architecture reduces congestion and improves compile times  Hierarchical core clocking architecture with programmable clock tree synthesis  Fine-grained partial reconfiguration |
| Internal memory blocks                  | M20K—20-Kbit with hard ECC support     MLAB—640-bit distributed LUTRAM                                                                                                                                                                                                                                                                                                                                                                                          |
| Variable precision DSP blocks           | IEEE 754-compliant hard single-precision floating point capability Supports signal processing with precision ranging from 18x19 up to 54x54 Native 27x27 and 18x19 multiply modes 64-bit accumulator and cascade for systolic FIRs Internal coefficient memory banks Pre-adder/subtractor improves efficiency Additional pipeline register increases performance and reduces power                                                                              |
| Phase locked loops (PLL)                | <ul> <li>Fractional synthesis PLLs (fPLL) support both fractional and integer modes</li> <li>Fractional mode with third-order delta-sigma modulation</li> <li>Precision frequency synthesis</li> <li>Integer PLLs adjacent to general purpose I/Os, support external memory, and LVDS interfaces, clock delay compensation, zero delay buffering</li> </ul>                                                                                                     |
| Core clock networks                     | 1 GHz fabric clocking     667 MHz external memory interface clocking, supports 2666 Mbps DDR4 interface     800 MHz LVDS interface clocking, supports 1600 Mbps LVDS interface     Programmable clock tree synthesis, backwards compatible with global, regional and peripheral clock networks     Clocks only synthesized where needed, to minimize dynamic power                                                                                              |



| SoC Subsystem Feature           |                            | Description                                                                       |  |
|---------------------------------|----------------------------|-----------------------------------------------------------------------------------|--|
|                                 | NAND flash controller      | 1 ONFI 1.0, 8- and 16-bit support                                                 |  |
|                                 | General-purpose I/O (GPIO) | Maximum of 48 software programmable GPIO                                          |  |
|                                 | Timers                     | 4 general-purpose timers     4 watchdog timers                                    |  |
| Secure Device<br>Manager        | Security                   | Secure boot     Advanced Encryption Standard (AES) and authentication (SHA/ECDSA) |  |
| External<br>Memory<br>Interface | External Memory Interface  | Hard Memory Controller with DDR4 and DDR3, and LPDDR3                             |  |

## 1.4. Intel Stratix 10 Block Diagram

Figure 2. Intel Stratix 10 FPGA and SoC Architecture Block Diagram



HPS: Quad ARM Cortex-A53 Hard Processor System

SDM: Secure Device Manager

EMIB: Embedded Multi-Die Interconnect Bridge

# 1.5. Intel Stratix 10 FPGA and SoC Family Plan

<sup>(1)</sup> The number of 27x27 multipliers is one-half the number of 18x19 multipliers.

S10-OVERVIEW | 2018.08.08



| Intel Stratix 10     | Interconnects |              | PLLs  |          | Hard IP                |  |
|----------------------|---------------|--------------|-------|----------|------------------------|--|
| GX/SX Device<br>Name | Maximum GPIOs | Maximum XCVR | fPLLs | I/O PLLs | PCIe Hard IP<br>Blocks |  |
| GX 2800/<br>SX 2800  | 1160          | 96           | 32    | 24       | 4                      |  |
| GX 4500/<br>SX 4500  | 1640          | 24           | 8     | 34       | 1                      |  |
| GX 5500/<br>SX 5500  | 1640          | 24           | 8     | 34       | 1                      |  |

Table 6. Intel Stratix 10 GX/SX FPGA and SoC Family Package Plan, part 1

Cell legend: General Purpose I/Os, High-Voltage I/Os, LVDS Pairs, Transceivers (2) (3) (4) (5) (6) (7)

| Intel Stratix 10 GX/SX<br>Device Name | F1152<br>HF35<br>(35x35 mm²) | F1760<br>NF43<br>(42.5x42.5 mm <sup>2</sup> ) | F1760<br>NF43<br>(42.5x42.5 mm <sup>2</sup> ) |
|---------------------------------------|------------------------------|-----------------------------------------------|-----------------------------------------------|
| GX 400/<br>SX 400                     | 392, 8, 192, 24              |                                               |                                               |
| GX 650/<br>SX 650                     | 392, 8, 192, 24              | 400, 16, 192, 48                              |                                               |
| GX 850/<br>SX 850                     |                              |                                               | 688, 16, 336, 48                              |
| GX 1100/<br>SX 1100                   |                              |                                               | 688, 16, 336, 48                              |
| GX 1650/<br>SX 1650                   |                              |                                               | 688, 16, 336, 48                              |
| GX 2100/<br>SX 2100                   |                              |                                               | 688, 16, 336, 48                              |
| GX 2500/<br>SX 2500                   |                              |                                               | 688, 16, 336, 48                              |
| GX 2800/                              |                              |                                               | 688, 16, 336, 48                              |

<sup>(2)</sup> All packages are ball grid arrays with 1.0 mm pitch.

<sup>(3)</sup> High-Voltage I/O pins are used for 3 V and 2.5 V interfacing.

<sup>(4)</sup> Each LVDS pair can be configured as either a differential input or a differential output.

<sup>(5)</sup> High-Voltage I/O pins and LVDS pairs are included in the General Purpose I/O count. Transceivers are counted separately.

<sup>(6)</sup> Each package column offers pin migration (common circuit board footprint) for all devices in the column.

<sup>(7)</sup> Intel Stratix 10 GX devices are pin migratable with Intel Stratix 10 SX devices in the same package.



| Intel Stratix 10 GX/SX<br>Device Name | F1152<br>HF35<br>(35x35 mm²) | F1760<br>NF43<br>(42.5x42.5 mm <sup>2</sup> ) | F1760<br>NF43<br>(42.5x42.5 mm <sup>2</sup> ) |
|---------------------------------------|------------------------------|-----------------------------------------------|-----------------------------------------------|
| SX 2800                               |                              |                                               |                                               |
| GX 4500/<br>SX 4500                   |                              |                                               |                                               |
| GX 5500/<br>SX 5500                   |                              |                                               |                                               |

### Table 7. Intel Stratix 10 GX/SX FPGA and SoC Family Package Plan, part 2

Cell legend: General Purpose I/Os, High-Voltage I/Os, LVDS Pairs, Transceivers (2) (3) (4) (5) (6) (7)

| Intel Stratix 10<br>GX/SX Device Name | F2112<br>NF48<br>(47.5x47.5 mm²) | F2397<br>UF50<br>(50x50 mm²) | F2912<br>HF55<br>(55x55 mm²) |
|---------------------------------------|----------------------------------|------------------------------|------------------------------|
| GX 400/<br>SX 400                     |                                  |                              |                              |
| GX 650/<br>SX 650                     |                                  |                              |                              |
| GX 850/<br>SX 850                     | 736, 16, 360, 48                 |                              |                              |
| GX 1100/<br>SX 1100                   | 736, 16, 360, 48                 |                              |                              |
| GX 1650/<br>SX 1650                   |                                  | 704, 32, 336, 96             |                              |
| GX 2100/<br>SX 2100                   |                                  | 704, 32, 336, 96             |                              |
| GX 2500/<br>SX 2500                   |                                  | 704, 32, 336, 96             | 1160, 8, 576, 24             |
| GX 2800/<br>SX 2800                   |                                  | 704, 32, 336, 96             | 1160, 8, 576, 24             |
| GX 4500/<br>SX 4500                   |                                  |                              | 1640, 8, 816, 24             |
| GX 5500/<br>SX 5500                   |                                  |                              | 1640, 8, 816, 24             |



## 1.6. HyperFlex Core Architecture

Intel Stratix 10 FPGAs and SoCs are based on a monolithic core fabric featuring the new HyperFlex core architecture. The HyperFlex core architecture delivers 2X the clock frequency performance and up to 70% lower power compared to previous generation high-end FPGAs. Along with this performance breakthrough, the HyperFlex core architecture delivers a number of advantages including:

- Higher Throughput—Leverages 2X core clock frequency performance to obtain throughput breakthroughs
- **Improved Power Efficiency**—Uses reduced IP size, enabled by HyperFlex, to consolidate designs which previously spanned multiple devices into a single device, thereby reducing power by up to 70% versus previous generation devices
- Greater Design Functionality—Uses faster clock frequency to reduce bus widths and reduce IP size, freeing up additional FPGA resources to add greater functionality
- **Increased Designer Productivity**—Boosts performance with less routing congestion and fewer design iterations using Hyper-Aware design tools, obtaining greater timing margin for more rapid timing closure

In addition to the traditional user registers found in the Adaptive Logic Modules (ALM), the HyperFlex core architecture introduces additional bypassable registers everywhere throughout the fabric of the FPGA. These additional registers, called Hyper-Registers are available on every interconnect routing segment and at the inputs of all functional blocks.

Figure 3. Bypassable Hyper-Register



The Hyper-Registers enable the following key design techniques to achieve the 2X core performance increases:

- Fine grain Hyper-Retiming to eliminate critical paths
- Zero latency Hyper-Pipelining to eliminate routing delays
- Flexible Hyper-Optimization for best-in-class performance

By implementing these techniques in your design, the Hyper-Aware design tools automatically make use of the Hyper-Registers to achieve maximum core clock frequency.



Within each transceiver tile, the transceivers are arranged in four banks of six PMA-PCS groups. A wide variety of bonded and non-bonded data rate configurations are possible within each bank, and within each tile, using a highly configurable clock distribution network.

### 1.8.1. PMA Features

PMA channels are comprised of transmitter (TX), receiver (RX), and high speed clocking resources.

Intel Stratix 10 device features provide exceptional signal integrity at data rates up to 28.3 Gbps. Clocking options include ultra-low jitter LC tank-based (ATX) PLLs with optional fractional synthesis capability, channel PLLs operating as clock multiplier units (CMUs), and fractional synthesis PLLs (fPLLs).

- **ATX PLL**—can be configured in integer mode, or optionally, in a new fractional synthesis mode. Each ATX PLL spans the full frequency range of the supported data rate range providing a stable, flexible clock source with the lowest jitter.
- **CMU PLL**—when not being used as a transceiver, select PMA channels can be configured as channel PLLs operating as CMUs to provide an additional master clock source within the transceiver bank.
- **fPLL**—In addition, dedicated fPLLs are available with precision frequency synthesis capabilities. fPLLs can be used to synthesize multiple clock frequencies from a single reference clock source and replace multiple reference oscillators for multiprotocol and multi-rate applications.

On the receiver side, each PMA has an independent channel PLL that allows analog tracking for clock-data recovery. Each PMA also has advanced equalization circuits that compensate for transmission losses across a wide frequency spectrum.

- Variable Gain Amplifier (VGA)—to optimize the receiver's dynamic range
- **Continuous Time Linear Equalizer (CTLE)**—to compensate for channel losses with lowest power dissipation
- **Decision Feedback Equalizer (DFE)**—to provide additional equalization capability on backplanes even in the presence of crosstalk and reflections
- **On-Die Instrumentation (ODI)**—to provide on-chip eye monitoring capabilities (Eye Viewer). This capability helps to optimize link equalization parameters during board bring-up and supports in-system link diagnostics and equalization margin testing



| Feature                                                              | Capability                                                                                                                                                                          |
|----------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Digitally Assisted Analog<br>CDR                                     | Superior jitter tolerance with fast lock time                                                                                                                                       |
| On-Die Instrumentation—<br>Eye Viewer and Jitter Margin<br>Tool      | Simplify board bring-up, debug, and diagnostics with non-intrusive, high-resolution eye monitoring (Eye Viewer). Also inject jitter from transmitter to test link margin in system. |
| Dynamic Reconfiguration                                              | Allows for independent control of each transceiver channel Avalon memory-mapped interface for the most transceiver flexibility.                                                     |
| Multiple PCS-PMA and PCS-<br>Core to FPGA fabric interface<br>widths | 8-, 10-, 16-, 20-, 32-, 40-, or 64-bit interface widths for flexibility of deserialization width, encoding, and reduced latency                                                     |

#### 1.8.2. PCS Features

Intel Stratix 10 PMA channels interface with core logic through configurable and bypassable PCS interface layers.

The PCS contains multiple gearbox implementations to decouple the PMA and PCS interface widths. This feature provides the flexibility to implement a wide range of applications with 8, 10, 16, 20, 32, 40, or 64-bit interface width between each transceiver and the core logic.

The PCS also contains hard IP to support a variety of standard and proprietary protocols across a wide range of data rates and encoding schemes. The Standard PCS mode provides support for 8B/10B encoded applications up to 12.5 Gbps. The Enhanced PCS mode supports 64B/66B and 64B/67B encoded applications up to 17.4 Gbps. The enhanced PCS mode also includes an integrated 10GBASE-KR/40GBASE-KR4 Forward Error Correction (FEC) circuit. For highly customized implementations, a PCS Direct mode provides an interface up to 64 bits wide to allow for custom encoding and support for data rates up to 28.3 Gbps.

For more information about the PCS-Core interface or the double rate transfer mode, refer to the *Intel Stratix 10 L- and H-Tile Transceiver PHY User Guide*, and the *Intel Stratix 10 E-Tile Transceiver PHY User Guide*.

**Table 9.** Transceiver PCS Features

| PCS Protocol<br>Support                         | Data Rate (Gbps) | Transmitter Data Path                                                                                                                                        | Receiver Data Path                                                                                                                                                                              |  |
|-------------------------------------------------|------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|
| Standard PCS                                    | 1 to 12.5        | Phase compensation FIFO, byte serializer, 8B/10B encoder, bit-slipper, channel bonding                                                                       | Rate match FIFO, word-aligner, 8B/10B decoder, byte deserializer, byte ordering                                                                                                                 |  |
| PCI Express<br>Gen1/Gen2 x1,<br>x2, x4, x8, x16 | 2.5 and 5.0      | Same as Standard PCS plus PIPE 2.0 interface to core                                                                                                         | Same as Standard PCS plus PIPE 2.0 interface to core                                                                                                                                            |  |
| PCI Express Gen3<br>x1, x2, x4, x8,<br>x16      | 8.0              | Phase compensation FIFO, byte serializer, encoder, scrambler, bit-slipper, gear box, channel bonding, and PIPE 3.0 interface to core, auto speed negotiation | Rate match FIFO (0-600 ppm mode), word-aligner, decoder, descrambler, phase compensation FIFO, block sync, byte deserializer, byte ordering, PIPE 3.0 interface to core, auto speed negotiation |  |
| CPRI                                            | 0.6144 to 9.8    | Same as Standard PCS plus deterministic latency serialization                                                                                                | Same as Standard PCS plus deterministic latency deserialization                                                                                                                                 |  |
|                                                 | continued        |                                                                                                                                                              |                                                                                                                                                                                                 |  |



| PCS Protocol<br>Support | Data Rate (Gbps) | Transmitter Data Path                                                                                                         | Receiver Data Path                                                                               |
|-------------------------|------------------|-------------------------------------------------------------------------------------------------------------------------------|--------------------------------------------------------------------------------------------------|
| Enhanced PCS            | 2.5 to 17.4      | FIFO, channel bonding, bit-slipper, and gear box                                                                              | FIFO, block sync, bit-slipper, and gear box                                                      |
| 10GBASE-R               | 10.3125          | FIFO, 64B/66B encoder, scrambler,<br>FEC, and gear box                                                                        | FIFO, 64B/66B decoder, descrambler, block sync, FEC, and gear box                                |
| Interlaken              | 4.9 to 17.4      | FIFO, channel bonding, frame<br>generator, CRC-32 generator,<br>scrambler, disparity generator, bit-<br>slipper, and gear box | FIFO, CRC-32 checker, frame sync,<br>descrambler, disparity checker, block<br>sync, and gear box |
| SFI-S/SFI-5.2           | 11.3             | FIFO, channel bonding, bit-slipper, and gear box                                                                              | FIFO, bit-slipper, and gear box                                                                  |
| IEEE 1588               | 1.25 to 10.3125  | FIFO (fixed latency), 64B/66B encoder, scrambler, and gear box                                                                | FIFO (fixed latency), 64B/66B decoder, descrambler, block sync, and gear box                     |
| SDI                     | up to 12.5       | FIFO and gear box                                                                                                             | FIFO, bit-slipper, and gear box                                                                  |
| GigE                    | 1.25             | Same as Standard PCS plus GigE state machine                                                                                  | Same as Standard PCS plus GigE state machine                                                     |
| PCS Direct              | up to 28.3       | Custom                                                                                                                        | Custom                                                                                           |

#### **Related Information**

Intel Stratix 10 L- and H-Tile Transceiver PHY User Guide

### 1.9. PCI Express Gen1/Gen2/Gen3 Hard IP

Intel Stratix 10 devices contain embedded PCI Express hard IP designed for performance, ease-of-use, increased functionality, and designer productivity.

The PCI Express hard IP consists of the PHY, Data Link, and Transaction layers. It also supports PCI Express Gen1/Gen2/Gen3 end point and root port, in x1/x2/x4/x8/x16 lane configurations. The PCI Express hard IP is capable of operating independently from the core logic (autonomous mode). This feature allows the PCI Express link to power up and complete link training in less than 100 ms, while the rest of the device is still in the process of being configured. The hard IP also provides added functionality, which makes it easier to support emerging features such as Single Root I/O Virtualization (SR-IOV) and optional protocol extensions.

The PCI Express hard IP has improved end-to-end data path protection using Error Checking and Correction (ECC). In addition, the hard IP supports configuration of the device via protocol (CvP) across the PCI Express bus at Gen1/Gen2/Gen3 rates.

### 1.10. Interlaken PCS Hard IP

Intel Stratix 10 devices have integrated Interlaken PCS hard IP supporting rates up to 17.4 Gbps per lane.

The Interlaken PCS hard IP is based on the proven functionality of the PCS developed for Intel's previous generation FPGAs, which has demonstrated interoperability with Interlaken ASSP vendors and third-party IP suppliers. The Interlaken PCS hard IP is present in every transceiver channel in Intel Stratix 10 devices.



### 1.11. 10G Ethernet Hard IP

Intel Stratix 10 devices include IEEE 802.3 10-Gbps Ethernet (10GbE) compliant 10GBASE-R PCS and PMA hard IP. The scalable 10GbE hard IP supports multiple independent 10GbE ports while using a single PLL for all the 10GBASE-R PCS instantiations, which saves on core logic resources and clock networks.

The integrated serial transceivers simplify multi-port 10GbE systems compared to 10 GbE Attachment Unit Interface (XAUI) interfaces that require an external XAUI-to-10G PHY. Furthermore, the integrated transceivers incorporate signal conditioning circuits, which enable direct connection to standard 10G XFP and SFP+ pluggable optical modules. The transceivers also support backplane Ethernet applications and include a hard 10GBASE-KR/40GBASE-KR4 Forward Error Correction (FEC) circuit that can be used for both 10G and 40G applications. The integrated 10G Ethernet hard IP and 10G transceivers save external PHY cost, board space and system power. The 10G Ethernet PCS hard IP and 10GBASE-KR FEC are present in every transceiver channel.

## 1.12. External Memory and General Purpose I/O

Intel Stratix 10 devices offer substantial external memory bandwidth, with up to ten 72-bit wide DDR4 memory interfaces running at up to 2666 Mbps.

This bandwidth is provided along with the ease of design, lower power, and resource efficiencies of hardened high-performance memory controllers. The external memory interfaces can be configured up to a maximum width of 144 bits when using either hard or soft memory controllers.

Figure 8. Hard Memory Controller



S10-OVERVIEW | 2018.08.08



Each I/O bank contains 48 general purpose I/Os and a high-efficiency hard memory controller capable of supporting many different memory types, each with different performance capabilities. The hard memory controller is also capable of being bypassed and replaced by a soft controller implemented in the user logic. The I/Os each have a hardened double data rate (DDR) read/write path (PHY) capable of performing key memory interface functionality such as:

- Read/write leveling
- FIFO buffering to lower latency and improve margin
- Timing calibration
- · On-chip termination

The timing calibration is aided by the inclusion of hard microcontrollers based on Intel's Nios® II technology, specifically tailored to control the calibration of multiple memory interfaces. This calibration allows the Intel Stratix 10 device to compensate for any changes in process, voltage, or temperature either within the Intel Stratix 10 device itself, or within the external memory device. The advanced calibration algorithms ensure maximum bandwidth and robust timing margin across all operating conditions.

#### **Table 10.** External Memory Interface Performance

The listed speeds are for the 1-rank case.

| Interface     | Controller Type | Performance |
|---------------|-----------------|-------------|
| DDR4          | Hard            | 2666 Mbps   |
| DDR3          | Hard            | 2133 Mbps   |
| QDRII+        | Soft            | 1,100 Mtps  |
| QDRII+ Xtreme | Soft            | 1,266 Mtps  |
| QDRIV         | Soft            | 2,133 Mtps  |
| RLDRAM III    | Soft            | 2400 Mbps   |
| RLDRAM II     | Soft            | 533 Mbps    |

In addition to parallel memory interfaces, Intel Stratix 10 devices support serial memory technologies such as the Hybrid Memory Cube (HMC). The HMC is supported by the Intel Stratix 10 high-speed serial transceivers, which connect up to four HMC links, with each link running at data rates of 15 Gbps (HMC short reach specification).

Intel Stratix 10 devices also feature general purpose I/Os capable of supporting a wide range of single-ended and differential I/O interfaces. LVDS rates up to 1.6 Gbps are supported, with each pair of pins having both a differential driver and a differential input buffer. This enables configurable direction for each LVDS pair.

## 1.13. Adaptive Logic Module (ALM)

Intel Stratix 10 devices use a similar adaptive logic module (ALM) as the previous generation Arria 10 and Stratix V FPGAs, allowing for efficient implementation of logic functions and easy conversion of IP between the devices.

The ALM block diagram shown in the following figure has eight inputs with a fracturable look-up table (LUT), two dedicated embedded adders, and four dedicated registers.





Figure 9. Intel Stratix 10 FPGA and SoC ALM Block Diagram

Key features and capabilities of the ALM include:

- High register count with 4 registers per 8-input fracturable LUT, operating in conjunction with the new HyperFlex architecture, enables Intel Stratix 10 devices to maximize core performance at very high core logic utilization
- Implements select 7-input logic functions, all 6-input logic functions, and two independent functions consisting of smaller LUT sizes (such as two independent 4-input LUTs) to optimize core logic utilization

The Intel Quartus Prime software leverages the ALM logic structure to deliver the highest performance, optimal logic utilization, and lowest compile times. The Intel Quartus Prime software simplifies design reuse as it automatically maps legacy designs into the Intel Stratix 10 ALM architecture.

## 1.14. Core Clocking

Core clocking in Intel Stratix 10 devices makes use of programmable clock tree synthesis.

This technique uses dedicated clock tree routing and switching circuits, and allows the Intel Quartus Prime software to create the exact clock trees required for your design. Clock tree synthesis minimizes clock tree insertion delay, reduces dynamic power dissipation in the clock tree and allows greater clocking flexibility in the core while still maintaining backwards compatibility with legacy global and regional clocking schemes.



The DSP blocks can be configured to support signal processing with precision ranging from 18x19 up to 54x54. A pipeline register has been added to increase the maximum operating frequency of the DSP block and reduce power consumption.

Figure 10. DSP Block: Standard Precision Fixed Point Mode



Figure 11. DSP Block: High Precision Fixed Point Mode





Figure 12. DSP Block: Single Precision Floating Point Mode



Each DSP block can be independently configured at compile time as either dual 18x19 or a single 27x27 multiply accumulate. With a dedicated 64-bit cascade bus, multiple variable precision DSP blocks can be cascaded to implement even higher precision DSP functions efficiently.

In floating point mode, each DSP block provides one single precision floating point multiplier and adder. Floating point additions, multiplications, mult-adds and mult-accumulates are supported.

The following table shows how different precisions are accommodated within a DSP block, or by utilizing multiple blocks.

**Table 12.** Variable Precision DSP Block Configurations

| Multiplier Size                 | DSP Block Resources                                                                      | Expected Usage                  |
|---------------------------------|------------------------------------------------------------------------------------------|---------------------------------|
| 18x19 bits                      | 1/2 of Variable Precision DSP Block                                                      | Medium precision fixed point    |
| 27x27 bits                      | 1 Variable Precision DSP Block                                                           | High precision fixed point      |
| 19x36 bits                      | 1 Variable Precision DSP Block with external adder                                       | Fixed point FFTs                |
| 36x36 bits                      | 2 Variable Precision DSP Blocks with external adder                                      | Very high precision fixed point |
| 54x54 bits                      | 4 Variable Precision DSP Blocks with external adder                                      | Double Precision floating point |
| Single Precision floating point | 1 Single Precision floating point adder, 1 Single<br>Precision floating point multiplier | Floating point                  |



Complex multiplication is very common in DSP algorithms. One of the most popular applications of complex multipliers is the FFT algorithm. This algorithm has the characteristic of increasing precision requirements on only one side of the multiplier. The Variable Precision DSP block supports the FFT algorithm with proportional increase in DSP resources as the precision grows.

Table 13. Complex Multiplication With Variable Precision DSP Block

| Complex Multiplier<br>Size | DSP Block Resources             | FFT Usage              |
|----------------------------|---------------------------------|------------------------|
| 18x19 bits                 | 2 Variable Precision DSP Blocks | Resource optimized FFT |
| 27x27 bits                 | 4 Variable Precision DSP Blocks | Highest precision FFT  |

For FFT applications with high dynamic range requirements, the Intel FFT IP Core offers an option of single precision floating point implementation with resource usage and performance similar to high precision fixed point implementations.

Other features of the DSP block include:

- Hard 18-bit and 25-bit pre-adders
- Hard floating point multipliers and adders
- 64-bit dual accumulator (for separate I, Q product accumulations)
- Cascaded output adder chains for 18- and 27-bit FIR filters
- Embedded coefficient registers for 18- and 27-bit coefficients
- Fully independent multiplier outputs
- Inferability using HDL templates supplied by the Intel Quartus Prime software for most modes

The Variable Precision DSP block is ideal to support the growing trend towards higher bit precision in high performance DSP applications. At the same time, it can efficiently support the many existing 18-bit DSP applications, such as high definition video processing and remote radio heads. With the Variable Precision DSP block architecture and hard floating point multipliers and adders, Intel Stratix 10 devices can efficiently support many different precision levels up to and including floating point implementations. This flexibility can result in increased system performance, reduced power consumption, and reduce architecture constraints on system algorithm designers.

## 1.18. Hard Processor System (HPS)

The Intel Stratix 10 SoC Hard Processor System (HPS) is Intel's industry leading third generation HPS. Leveraging the performance of Intel's 14-nm Tri-Gate technology, Intel Stratix 10 SoC devices more than double the performance of previous generation SoCs with an integrated quad-core 64-bit ARM Cortex-A53. The HPS also enables system-wide hardware virtualization capabilities by adding a system memory management unit. These architecture improvements ensure that Intel Stratix 10 SoCs will meet the requirements of current and future embedded markets, including wireless and wireline communications, data center acceleration, and numerous military applications.



| Feature                             | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |
|-------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Communication Interface Controllers | Three 10/100/1000 Ethernet media access controls (MAC) with integrated DMA  — Supports RGMII and RMII external PHY Interfaces  — Option to support other PHY interfaces through FPGA logic  • GMII  • MII  • RMII (requires MII to RMII adapter)  • RGMII (requires GMII to RGMII adapter)  • SGMII (requires GMII to SGMII adapter)  • SGMII (requires GMII to SGMII adapter)  — Supports IEEE 1588-2002 and IEEE 1588-2008 standards for precision networked clock synchronization  — Supports IEEE 802.1Q VLAN tag detection for reception frames  — Supports Ethernet AVB standard  • Two USB On-the-Go (OTG) controllers with DMA  — Dual-Role Device (device and host functions)  • High-speed (480 Mbps)  • Full-speed (12 Mbps)  • Low-speed (1.5 Mbps)  • Supports USB 1.1 (full-speed and low-speed)  — Integrated descriptor-based scatter-gather DMA  — Support for external ULPI PHY  — Up to 16 bidirectional endpoints, including control endpoint  — Up to 16 bidirectional endpoints, including control endpoint  — Up to 16 host channels  — Support speneric root hub  — Configurable to OTG 1.3 and OTG 2.0 modes  • Five I²C controllers (three can be used by EMAC for MIO to external PHY)  — Support both 100Kbps and 400Kbps modes  — Support Master and Slave operating mode  • Two UART 16550 compatible  — Programmable baud rate up to 115.2Kbaud  • Four serial peripheral interfaces (SPI) (2 Master, 2 Slaves)  — Full and Half duplex |
| Timers and I/O                      | Timers  — 4 general-purpose timers  — 4 watchdog timers  4 8 HPS direct I/O allow HPS peripherals to connect directly to I/O  Up to three IO48 banks may be assigned to HPS for HPS DDR access                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                         |
| Interconnect to Logic Core          | <ul> <li>FPGA-to-HPS Bridge         <ul> <li>Allows IP bus masters in the FPGA fabric to access to HPS bus slaves</li> <li>Configurable 32-, 64-, or 128-bit AMBA AXI interface</li> </ul> </li> <li>HPS-to-FPGA Bridge         <ul> <li>Allows HPS bus masters to access bus slaves in FPGA fabric</li> <li>Configurable 32-, 64-, or 128-bit AMBA AXI interface allows high-bandwidth HPS master transactions to FPGA fabric</li> </ul> </li> <li>HPS-to-SDM and SDM-to-HPS Bridges         <ul> <li>Allows the HPS to reach the SDM block and the SDM to bootstrap the HPS</li> </ul> </li> <li>Light Weight HPS-to-FPGA Bridge         <ul> <li>Light weight 32-bit AXI interface suitable for low-latency register accesses from HPS to soft peripherals in FPGA fabric</li> </ul> </li> <li>FPGA-to-HPS SDRAM Bridge         <ul> <li>Up to three AMBA AXI interfaces supporting 32, 64, or 128-bit data paths</li> </ul> </li> </ul>                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |