AMD Announces New GPUs, Development Platform, Rack Scale Architecture

AMD issued a raft of news at their Advancing AI 2025 event this week, an update on the company’s response to NVIDIA’s 90-plus percent market share dominance in the GPU and AI markets. And the company offered a sneak peak at what to expect from their next generation of EPYC CPUs and Instinct GPUs. Here’s […]

AI Inference: NVIDIA Reports Blackwell Surpasses 1000 TPS/User Barrier with Llama 4 Maverick

NVIDIA said it has achieved a record for large language model inference, announcing that an NVIDIA DGX B200 node with eight Blackwell GPUs achieved more than 1,000 tokens ….

News Bytes 20250428: TSMC’s A14 Fab, Intel’s New CEO: ‘We Need to Change’

Good end-of-April to you! Here’s a quick (8:05) review of recent doings in the world of HPC-AI, including: the Salishan Conference takes on HPC-AI divergence and FugakuNext zettascale, TSMC’s A14 fab, Intel’s 18A fab and AmKor CoWoS packaging in Arizona, Intel’s earnings, new AI compute and Lip Bu Tan’s warning ….

NVIDIA to Manufacture AI Supercomputers in U.S.

NVIDIA today said it is working with manufacturing partners to design and build factories that will produce NVIDIA AI supercomputers — i.e., “AI factories” — entirely in the United States… NVIDIA said that within four years, it plans to produce up to half a trillion dollars worth of AI infrastructure in the U.S. through partnerships ….

DARPA Taps Cerebras and Ranovus for Military and Commercial Platform

AI compute company Cerebras Systems said it has been awarded a new contract from the Defense Advanced Research Projects Agency (DARPA) to develop a system combining their wafer scale technology with wafer scale co-packaged optics of Ottawa-based Ranovus to deliver ….

@HPCpodcast: Dr. Ian Cutress on the State of Advanced Chips, the GPU Landscape and AI Compute, Global Chip Manufacturing and GTC Expectations

…. we welcome special guest and high-powered chip industry analyst Dr. Ian Cutress, Chief Analyst at More Than Moore and host of the popular YouTube ….

Axelera AI Wins EuroHPC Grant of up to €61.6M for AI Chiplet Development

AI hardware maker Axelera AI has unveiled Titania, which the company described as a high-performance, low-power and scalable AI inference chiplet. Part of the EuroHPC Joint Undertaking’s effort to develop a ….

Fluidstack and Eclairion to Deliver 18K GPU Supercomputer in France

London-based AI cloud platform Fluidstack and Eclairion, a French maker of modular, high-density data centers, have partnered to build what the companies said is Europe’s largest GPU supercomputer that they will deliver in 2025 for Mistral AI ….

TSMC to Invest $100B in 3 New U.S. Fabs, Packaging, R&D

TSMC (TWSE: 2330, NYSE: TSM) today announced its intention to expand its investment in advanced semiconductor manufacturing in the United States by an additional $100 billion. Building on the company’s ongoing $65 billion investment in its advanced chip fabs in Phoenix, TSMC’s total investment in the U.S. is expected to reach US$165 billion. The expansion […]

d-Matrix Launches New Chiplet Connectivity Platform to Address Exploding Compute Demand for Generative AI

Today, d-Matrix, a leader in high-efficiency AI-compute and inference processors, announced Jayhawk, an Open Domain-Specific Architecture (ODSA) Bunch of Wires (BoW) based chiplet platform for energy efficient die-die connectivity over organic substrates. Building on the back of the Nighthawk chiplet platform launched in 2021, the 2nd generation Jayhawk silicon platform further builds the scale-out chiplet based inference compute platform. d-Matrix customers will be able to use the inference compute platforms to manage Generative AI applications and Large Language Model transformer applications with a 10-20X improvement in performance.