share article

Tachyum unveils 2nm Prodigy processor with breakthrough performance

News

US-based technology company Tachyum has developed the Universal Processor, the 2nm Prodigy Ultimate, that combines the functions of a CPU, GPGPU and TPU into a single homogeneous processor architecture that 25.8x higher AI rack performance and 10x lower power than competing products. The processor delivers breakthrough performance and efficiency for a wide range of applications, including Hyperscale, HPC and AI data centres.

The 2nm Prodigy is the first ever chip to exceed 1,000 PFLOPs on inference, compared to the current performance of a competitor that delivers 50 PFLOPs.

“With tape-out funding now secured after a long wait, the world’s first Universal Processor can proceed to production, designed to overcome the inherent limitations of today’s data centres,” said Dr. Radoslav Danilak, founder and CEO of Tachyum. “The distinct markets addressed by Prodigy are the AI, server, and HPC markets, requiring fast and efficient chips. Tachyum’s Prodigy Premium and Ultimate will supercharge workloads with superior performance at a lower cost than any other solution on the market.”

The global competition in AI continues to accelerate, with China and the United States leading the race. Current AI models demonstrate massive computational scales — for instance, ChatGPT 4 features approximately 1.8 trillion parameters, while human brains contain an estimated 150 trillion synapses. Emerging systems such as BaGauLu reach 174 trillion parameters, but the ultimate breakthrough is expected to come from models trained on the collective knowledge of humanity, exceeding 100 000 000 trillion (1020) parameters. Traditional large-scale AI solutions could cost over $8 trillion and require more than 276 gigawatts of power. In contrast, the Tachyum solution is projected to achieve comparable capabilities at an estimated cost of $78 billion and a power requirement of just 1 gigawatt — making it accessible to multiple companies and nations.

In addition to open-sourcing all software, Tachyum is making its memory technology available, using standard components, allowing 10x increase of DIMM-based memory bandwidth available for licensing by memory or processor companies, including JEDEC adoption to achieve high adoption and low cost. In 2023, Tachyum announced licensable Tachyum AI (TAI) data types, and its Tachyum Processing Unit (TPU) core is available for licensing. Tachyum is in the process of making the Instruction Set Architecture (ISA) open.

Tachyum has continually upgraded its Prodigy design to address ever-changing requirements in server, AI and HPC markets with up to 5x integer performance, up to 16x higher AI performance, 8x DRAM bandwidth, 4x chip-to-chip and I/O bandwidth, 4x scalability by supporting 16 sockets, and 2x power efficiency, with lower cost per core.

The Prodigy chip was upgraded to 2nm to significantly reduce power consumption. Reducing chiplet die size improves cost despite expensive 2nm wafers. Each chiplet in the Prodigy package integrates 256 high-performance custom 64-bit cores. The power consumption reduction is critical, as multiple chiplets occupy a single package. Backed by a recent $220 million investment, the 2nm Prodigy is being readied for tape-out.

Multiple Prodigy SKUs cover a wide range of performance and applications, including big AI, exascale supercomputing, HPC, digital currency, cloud/hyperscale, big data analytics, and databases. Prodigy Ultimate integrates 1,024 high-performance cores, 24 DDR5 17.6GT/s memory controllers and 128 PCIE 7.0 lanes. The Prodigy Premium comes with 16 DRAM channels, and 512 to 128 cores scalable to 16 socket systems. Entry-level Prodigy comes with 8 or 4 DRAM controllers and 128 to 32 cores.

Prodigy features, scaleability, and price segmentation ensure rapid market penetration. Tachyum provides out-of-the-box native system software, operating systems, compilers, libraries, many applications, and AI infrastructure frameworks. It also allows running unmodified Intel/AMD x86 binaries and mixing them with native applications. This ensures that Tachyum systems can be operational by customers from day one.

The Prodigy Universal Processor delivers orders of magnitude higher AI performance, 3x the performance of the best x86 processors, and 6x HPC performance of the fastest GPGPU. Eliminating the need for expensive dedicated AI hardware and dramatically increasing server utilization, Prodigy reduces data center CAPEX and OPEX significantly while delivering unprecedented performance, power, and economics.

https://x.com/tachyum

Share this article

Related Posts

View Latest Magazine

Subscribe today

Member Login