AI compute Archives - insideAI News

AMD Announces New GPUs, Development Platform, Rack Scale Architecture

June 14, 2025 by staff

AMD issued a raft of news at their Advancing AI 2025 event this week, an update on the company’s response to NVIDIA’s 90-plus percent market share dominance in the GPU and AI markets. And the company offered a sneak peek at what to expect from their next generation of EPYC CPUs and Instinct GPUs. Here’s […]

Filed Under: Enterprise AI, Recent News, Secondary Feature Tagged With: AI, AI compute, AI news, AMD, EPYC, gpus, MI350, Weekly Featured Newsletter Post

AI Inference: NVIDIA Reports Blackwell Surpasses 1000 TPS/User Barrier with Llama 4 Maverick

May 23, 2025 by staff

NVIDIA said it has achieved a record for large language model inference, announcing that an NVIDIA DGX B200 node with eight Blackwell GPUs achieved more than 1,000 tokens ….

Filed Under: Recent News, Secondary Feature Tagged With: AI, AI compute, AI inference, artificial intelligence, Llama, Llama 4, Llama models, Nvidia, weekly

News Bytes 20250428: TSMC’s A14 Fab, Intel’s New CEO: ‘We Need to Change’

April 29, 2025 by staff

Good end-of-April to you! Here’s a quick (8:05) review of recent doings in the world of HPC-AI, including: the Salishan Conference takes on HPC-AI divergence and FugakuNext zettascale, TSMC’s A14 fab, Intel’s 18A fab and Amkor CoWoS packaging in Arizona, Intel’s earnings, new AI compute and Lip-Bu Tan’s warning ….

Filed Under: Recent News, Secondary Feature Tagged With: AI, AI compute, artificial intelligence, gpus, HPC-AI, Intel, TSMC, TSMC fabs

NVIDIA to Manufacture AI Supercomputers in U.S.

April 14, 2025 by staff

NVIDIA today said it is working with manufacturing partners to design and build factories that will produce NVIDIA AI supercomputers — i.e., “AI factories” — entirely in the United States… NVIDIA said that within four years, it plans to produce up to half a trillion dollars worth of AI infrastructure in the U.S. through partnerships ….

Filed Under: Recent News, Secondary Feature Tagged With: AI compute, AI data center, AI factory, AI supercomputer, Amkor, Foxconn, gpus, Nvidia, NVIDIA H100, Nvidia H20, SPIL, TSMC, Wistron

DARPA Taps Cerebras and Ranovus for Military and Commercial Platform

April 2, 2025 by staff

AI compute company Cerebras Systems said it has been awarded a new contract from the Defense Advanced Research Projects Agency (DARPA) to develop a system combining their wafer scale technology with wafer scale co-packaged optics of Ottawa-based Ranovus to deliver ….

Filed Under: Recent News, Secondary Feature Tagged With: AI, AI compute, Cerebras, DARPA, HPC-AI, optical interconnect, photonics, Ranovus, wafer scale engine

@HPCpodcast: Dr. Ian Cutress on the State of Advanced Chips, the GPU Landscape and AI Compute, Global Chip Manufacturing and GTC Expectations

March 13, 2025 by staff

…. we welcome special guest and high-powered chip industry analyst Dr. Ian Cutress, Chief Analyst at More Than Moore and host of the popular YouTube ….

Filed Under: Podcast, Recent News, Secondary Feature Tagged With: AI compute, AMD, CoolIT, gpus, GTC2025, Ian Cutress, Intel, Nvidia, TSMC

Axelera AI Wins EuroHPC Grant of up to €61.6M for AI Chiplet Development

March 6, 2025 by staff

AI hardware maker Axelera AI has unveiled Titania, which the company described as a high-performance, low-power and scalable AI inference chiplet. Part of the EuroHPC Joint Undertaking’s effort to develop a ….

Filed Under: Recent News, Secondary Feature Tagged With: AI chips, AI compute, Axelera AI, EuroHPC JU

Fluidstack and Eclairion to Deliver 18K GPU Supercomputer in France

March 5, 2025 by staff

London-based AI cloud platform Fluidstack and Eclairion, a French maker of modular, high-density data centers, have partnered to build what the companies said is Europe’s largest GPU supercomputer that they will deliver in 2025 for Mistral AI ….

Filed Under: Recent News, Secondary Feature Tagged With: AI cluster, AI compute, AI supercomputer, Eclairion, Fluidstack, gpus, Mistral AI

TSMC to Invest $100B in 3 New U.S. Fabs, Packaging, R&D

March 4, 2025 by staff

TSMC (TWSE: 2330, NYSE: TSM) today announced its intention to expand its investment in advanced semiconductor manufacturing in the United States by an additional $100 billion. Building on the company’s ongoing $65 billion investment in its advanced chip fabs in Phoenix, TSMC’s total investment in the U.S. is expected to reach US$165 billion. The expansion […]

Filed Under: Main Feature, Recent News Tagged With: AI compute, AMD, chip fabs, gpus, Intel, Nvidia, TSMC

d-Matrix Launches New Chiplet Connectivity Platform to Address Exploding Compute Demand for Generative AI

January 24, 2023 by Editorial Team

Today, d-Matrix, a leader in high-efficiency AI-compute and inference processors, announced Jayhawk, an Open Domain-Specific Architecture (ODSA) Bunch of Wires (BoW) based chiplet platform for energy-efficient die-to-die connectivity over organic substrates. Building on the Nighthawk chiplet platform launched in 2021, the second-generation Jayhawk silicon platform extends the company’s scale-out, chiplet-based inference compute platform. d-Matrix customers will be able to use the inference compute platforms to manage Generative AI applications and Large Language Model transformer applications with a 10-20X improvement in performance.

Filed Under: Data Science, Enterprise AI, Google News Feed, Machine Learning, News / Analysis, Uncategorized Tagged With: AI compute, generative AI, Weekly Newsletter Articles

