Intel Xeon Sapphire Rapids: How To Go Monolithic with Tiles (2024)

One of the critical deficits Intel has to its competition in its server platform is core count – other companies are enabling more cores by one of two routes: smaller cores, or individual chiplets connected together. At its Architecture Day 2021, Intel has disclosed features about its next-gen Xeon Scalable platform, one of which is the move to a tiled architecture. Intel is set to combine four tiles/chiplets through its fast embedded bridges, leading to better CPU scalability at higher core counts. As part of the disclosure, Intel also expanded on its new Advanced Matrix Extension (AMX) technology, CXL 1.1 support, DDR5, PCIe 5.0, and an Accelerator Interfacing Architecture that may lead to custom Xeon CPUs in the future.

What is Sapphire Rapids?

Built on an Intel 7 process, Sapphire Rapids (SPR) will be Intel’s next-generation Xeon Scalable server processor for its Eagle Stream platform. Using its latest Golden Cove processor cores which we detailed last week, Sapphire Rapids will bring together a number of key technologies for Intel: Acceleration Engines, native half-precision FP16 support, DDR5, 300-Series Optane DC Persistent Memory, PCIe 5.0, CXL 1.1, a wider and faster UPI, its newest bridging technology (EMIB), new QoS and telemetry, HBM, and workload specialized acceleration.

Set to launch in 2022, Sapphire Rapids will be Intel’s first modern CPU product to take advantage of a multi-die architecture that aims to minimizelatency and maximizebandwidth due to its Embedded Multi-Die Interconnect Bridge technology. This allows for more high-performance cores (Intel hasn’t said how many just quite yet), with the focus on ‘metrics that matter for its customer base, such as node performance and data center performance’. Intel is calling SPR the ‘Biggest Leap in DC Capabilities in a Decade’.

The headline benefits are easy to rattle off. PCIe 5.0 is an upgrade over the previous generation Ice Lake PCIe 4.0, and we move from six 64-bit memory controllers of DDR4 to eight 64-bit memory controllers of DDR5. But the bigger improvements are in the cores, the accelerators, and the packaging.

Golden Cove: A High-Performance Core with AMX and AIA

By using the same core design on its enterprise platform Sapphire Rapids and consumer platform Alder Lake, there are some of the same synergies we saw back in the early 2000s when Intel did the same thing. We covered Golden Cove in detail in our Alder Lake architecture deep dive, however here’s a quick recap:

The new core, according to Intel, will over a +19% IPC gain in single-thread workloads compared to Cypress Cove, which was Intel’s backport of Ice Lake. This comes down to some big core changes, including:

  • 16B → 32B length decode
  • 4-wide → 6-wide decode
  • 5K → 12K branch targets
  • 2.25K → 4K μop cache
  • 5 → 6 wide allocation
  • 10 → 12 execution ports
  • 352 → 512-entry reorder buffer

The goal of any core is to process more things faster, and the newest generation tries to do it better than before. A lot of Intel’s changes make sense, and those wanting the deeper details are encouraged to read our deep dive.

There are some major differences between the consumer version of this core in Alder Lake and the server version in Sapphire Rapids. The most obvious one is that the consumer version does not have AVX-512, whereas SPR will have it enabled. SPR also has a 2 MB private L2 cache per core, whereas the consumer model has 1.25 MB. Beyond this, we’re talking about Advanced Matrix Extensions (AMX) and a new Accelerator Interface Architecture (AIA).

So far in Intel’s CPU cores we have scalar operation (normal) and vector operation (AVX, AVX2, AVX-512). The next stage up from that is a dedicated matrix solver, or something akin to a tensor core in a GPU. This is what AMX does, by adding a new expandable register file with dedicated AMX instructions in the form of TMUL instructions.

AMX uses eight 1024-bit registers for basic data operators, and through memory references, the TMUL instructions will operate on tiles of data using those tile registers. The TMUL is supported through a dedicated Engine Coprocessor built into the core (of which each core has one), and the basis behind AMX is that TMUL is only one such co-processor. Intel has designed AMX to be wider-ranging than simply this – in the event that Intel goes deeper with its silicon multi-die strategy, at some point we could see custom accelerators being enabled through AMX.

Intel confirmed that we shouldn’t see any frequency dips worse than AVX – there are new fine-grained power controllers per core for when vector and matrix instructions are invoked.

This feeds quite nicely into discussing AIA, the new accelerator interface. Typically when using add-in accelerator cards, commands must navigate between kernel and user space, set up memory, and direct any virtualization between multiple hosts. The way Intel is describing its new Acceleration Engine interface is akin to talking to a PCIe device as if it were simply an accelerator on board to the CPU, even though it’s attached through PCIe.

Initially, Intel will have two capable AIA bits of hardware.

Intel Quick Assist Technology (QAT) is one we’ve seen before, as it showcased inside special variants of Skylake Xeon’s chipset (that required a PCIe 3.0 x16 link) as well as an add-in PCIe card – this version will support up to 400 Gb/s symmetric cryptography, or up to 160 Gb/s compression plus 160 Gb/s decompression simultaneously, double the previous version.

The other is Intel’s Data Streaming Accelerator (DSA). Intel has haddocumentation about DSAon the web since 2019, stating that it is a high-performance data copy and transformation accelerator for streaming data from storage and memory or to other parts of the system through a DMA remapping hardware unit/IOMMU. DSA has been a request from specific hyperscaler customers, who are looking to deploy it within their own internal cloud infrastructure, and Intel is keen to point out that some customers will use DSA, some will use Intel’s new Infrastructure Processing Unit, while some will use both, depending on what level of integration or abstraction they are interested in. Intel told us that DSA is an upgrade over the Crystal Beach DMA engine which was present on the Purley (SKL+CLX) platforms.

On top of all this, Sapphire Rapids also supports AVX512_FP16 instructions for half-precision, mostly for AI workloads as part of its DLBoost strategy (Intel was quite quiet on DLBoost during the event). These FP16 commands can also be used as part of AMX, alongside INT8 and BF16 support. Intel now also supports CLDEMOTE for cache-line management.

A Side Word about CXL

Throughout the presentations of Sapphire Rapids, Intel has been keen to highlight it will support CXL 1.1 at launch. CXL is a connectivity standard designed to handle much more than what PCIe does – aside from simply acting as a data transfer from host to device, CXL has three branches to support, known as IO, Cache, and Memory. As defined in the CXL 1.0 and 1.1 standards, these three form the basis of a new way to connect a host with a device.

Naturally it was our expectation that all CXL 1.1 devices would support all three of these standards. It wasn’t until Hot Chips, several days later, that we learned Sapphire Rapids is only supporting part of the CXL standard, specifically CXL.io and CXL.cache, but CXL.memory would not be part of SPR. We're not sure to what extent this means SPR isn't CXL 1.1 compliant, or what it means for CXL 1.1 devices - without CXL.mem, as per the diagram above, all Intel loses is Type-2 support. Perhaps this is more of an indication that the market around CXL is better served by CXL 2.0, which will no doubt come in a later product.

In the next page, we look at Intel's new tiled architecture for Sapphire Rapids.

Intel Xeon Sapphire Rapids: How To Go Monolithic with Tiles (2024)

FAQs

What is the next generation Intel Xeon CPU Sapphire Rapids? ›

Sapphire Rapids (SPR) is the next-generation Xeon® Processor with increased core count, greater than 100MB shared L3 cache, 8 DDR5 channels, 32GT/s PCIe/CXL lanes, 16GT/s UPI lanes and integrated accelerators supporting cryptography, compression and data streaming. The processor is made up of 4 die (Fig. 2.2.

How big are the Intel Sapphire Rapids tiles? ›

Multi-chiplet chip with four tiles linked by 2.5D Embedded Multi-die Interconnect Bridges. Each tile is a 400mm2 system on a chip, providing both compute cores and I/O.

How many transistors does Intel Xeon Sapphire Rapids have? ›

Previously known by the code name Ponte Vecchio, these GPUs pack over 100 billion transistors, and can support up to 128GB of HBM2e memory, delivering up to 52 teraflops peak FP64 performance.

Is Intel Sapphire Rapids 7nm? ›

More About Intel's New 7nm Chip

This chip will be the successor of the previous Sapphire Rapids 10nm chip, offering Optane DC DIMMs and greater overall performance.

Is a Intel Xeon better then a i7? ›

Longevity (under heavy load) – Xeon processors are qualified to handle heavier, more intensive loads day in and day out. For the serious workstation user, this can translate to better longevity over i7 counterparts.

Did Intel discontinue Xeon? ›

In April 2024, Intel announced at its Vision event that the Xeon Scalable brand would be retired, beginning with 6th generation Xeon processors codenamed Sierra Forest and Granite Rapids that will now be referred to as "Xeon 6" processors. This change brings greater emphasis on processor generation numbers.

Is Sapphire Rapids any good? ›

But 'Sapphire Rapids' does have some big plusses. In single threaded workflows it appears to have a lead over Threadripper Pro, which could make a real difference in some CAD/BIM applications. Better single threaded performance should also boost 3D frame rates in CPU-limited applications.

How many memory channels does Sapphire Rapids have? ›

It provides eight channels of DDR5 per CPU with a maximum memory speed of 4800 MHz. Compared to the 3rd generation, it results in up to 50% more aggregated bandwidth as the Ice Lake generation supports eight channels using DDR4 3200 MHz.

What is the frequency of Sapphire Rapids? ›

Sapphire Rapids idles at 800 MHz and exhibits very slow clock speed ramp behavior. It reaches an intermediate 2 GHz clock speed after 35 ms. Afterwards, it takes over 1.6 seconds before boosting to 3.1 GHz for about 8 ms, and then reaches its maximum boost clock of 3.8 GHz.

What is the fastest Xeon chip? ›

Intel's Biggest and Fastest Chip Ever

The Xeon W-3175X is a behemoth processor. Using Intel's biggest x86 Skylake silicon design, it has a full 28 cores and 56 threads. These cores are rated at a 3.1 GHz base frequency, with the chip having a peak turbo frequency of 4.5 GHz.

What is the difference between Sapphire Rapids and emerald rapids? ›

Emerald Rapids includes 5600MT/s of DDR5 memory bandwidth, an improvement from 4800MT/s in Sapphire Rapids. The chip will also make possible for the first time some real world implementations of the Compute Express Link 1.1 interface (CXL), Singhal said.

Is Intel Xeon old? ›

No, this is not true. Xeon processors are still alive. Intel just released their Emeral Rapids Xeon series a month ago.

What is the next generation of Sapphire Rapids? ›

The follow-up to Intel's 4th Gen Sapphire Rapids CPU lineup comes in the form of the 5th Gen Xeon family, codenamed Emerald Rapids. The Emerald Rapids-SP CPUs are already sampling and are on schedule to deliver in Q4 2023. These chips will offer higher-quality silicon with volume validation in progress.

What is after Intel Sapphire Rapids? ›

Emerald Rapids is launching less than a year after Sapphire Rapids. At the same time, Intel is moving to the 288 E-Core Sierra Forest in 2024 and Granite Rapids in 2024.

What is the latest Intel Xeon processor 2024? ›

Intel announced its newest data center processor family, the Xeon 6, at Computex 2024. The new processors come in either Performance-core (P-core) or Efficient-core (E-core) variants to serve applications requiring the highest levels of performance and those focused on hardware density and power conservation.

What comes after Intel Sapphire Rapids? ›

Emerald Rapids is launching less than a year after Sapphire Rapids. At the same time, Intel is moving to the 288 E-Core Sierra Forest in 2024 and Granite Rapids in 2024.

What is the latest generation of Intel Xeon processors? ›

5th Gen Intel® Xeon® processors are our most sustainability-enhancing data center processors ever.

What is the codename for Intel Xeon 4th generation? ›

4th Gen Intel Xeon Scalable Processors Codename Sapphire Rapids.

When did Xeon Skylake come out? ›

Skylake is Intel's codename for its sixth generation Core microprocessor family that was launched on August 5, 2015, succeeding the Broadwell microarchitecture.

References

Top Articles
Roblox chat bypasser| Choose your desired combination of Roblox chat characters from a huge range available here
Season of Discovery Phase 4 Tank Tier List
Omega Pizza-Roast Beef -Seafood Middleton Menu
Play FETCH GAMES for Free!
Best Pizza Novato
Le Blanc Los Cabos - Los Cabos – Le Blanc Spa Resort Adults-Only All Inclusive
CLI Book 3: Cisco Secure Firewall ASA VPN CLI Configuration Guide, 9.22 - General VPN Parameters [Cisco Secure Firewall ASA]
Amtrust Bank Cd Rates
Recent Obituaries Patriot Ledger
Puretalkusa.com/Amac
41 annonces BMW Z3 occasion - ParuVendu.fr
Devourer Of Gods Resprite
A.e.a.o.n.m.s
Dr. med. Uta Krieg-Oehme - Lesen Sie Erfahrungsberichte und vereinbaren Sie einen Termin
Webcentral Cuny
Foxy Brown 2025
Craigslist Appomattox Va
Sizewise Stat Login
Program Logistics and Property Manager - Baghdad, Iraq
Graphic Look Inside Jeffrey Dahmer
Maxpreps Field Hockey
Shreveport City Warrants Lookup
Keyn Car Shows
EVO Entertainment | Cinema. Bowling. Games.
Evil Dead Rise Showtimes Near Sierra Vista Cinemas 16
Plasma Donation Racine Wi
Broken Gphone X Tarkov
Utexas Baseball Schedule 2023
One Credit Songs On Touchtunes 2022
Craigslist Albany Ny Garage Sales
Craigslist Hamilton Al
Merge Dragons Totem Grid
Www Craigslist Com Brooklyn
Adam Bartley Net Worth
Trap Candy Strain Leafly
Wilson Tire And Auto Service Gambrills Photos
Vérificateur De Billet Loto-Québec
Brauche Hilfe bei AzBilliards - Billard-Aktuell.de
Thothd Download
Conan Exiles Tiger Cub Best Food
Hawkview Retreat Pa Cost
Kjccc Sports
Wolf Of Wallstreet 123 Movies
Lorton Transfer Station
Jimmy John's Near Me Open
St Anthony Hospital Crown Point Visiting Hours
Headlining Hip Hopper Crossword Clue
Craigslist Free Cats Near Me
303-615-0055
Grandma's Portuguese Sweet Bread Recipe Made from Scratch
Vcuapi
Costco Gas Price Fort Lauderdale
Latest Posts
Article information

Author: Kimberely Baumbach CPA

Last Updated:

Views: 6344

Rating: 4 / 5 (61 voted)

Reviews: 84% of readers found this page helpful

Author information

Name: Kimberely Baumbach CPA

Birthday: 1996-01-14

Address: 8381 Boyce Course, Imeldachester, ND 74681

Phone: +3571286597580

Job: Product Banking Analyst

Hobby: Cosplaying, Inline skating, Amateur radio, Baton twirling, Mountaineering, Flying, Archery

Introduction: My name is Kimberely Baumbach CPA, I am a gorgeous, bright, charming, encouraging, zealous, lively, good person who loves writing and wants to share my knowledge and understanding with you.