The UALink Consortium today announced the ratification of the UALink 200G 1.0 Specification, which defines a low-latency, high-bandwidth interconnect for communication between accelerators and switches in AI computing pods. The UALink 1.0 Specification enables 200G per lane scale-up connection for ….
Multiverse Says It Compresses Llama Models by 80%
Multiverse Computing today released two new AI models compressed by CompactifAI, Multiverse’s AI compressor: 80 percent compressed versions of Llama 3.1-8B and Llama 3.3-70B. Both models have 60 percent fewer parameters than the original models, 84 percent greater energy efficiency ….
MLCommons Releases MLPerf Inference v5.0 Benchmark Results
Today, MLCommons announced new results for its MLPerf Inference v5.0 benchmark suite, which delivers machine learning (ML) system performance benchmarking. The organization said the results highlight that the AI community is focusing on generative AI ….
DARPA Taps Cerebras and Ranovus for Military and Commercial Platform
AI compute company Cerebras Systems said it has been awarded a new contract from the Defense Advanced Research Projects Agency (DARPA) to develop a system combining its wafer-scale technology with the wafer-scale co-packaged optics of Ottawa-based Ranovus to deliver ….
News Bytes 20250401: Oracle’s Big AMD GPU Buy, ASML in China, Quantum in the News, the Award Winning Hoefler
Happy April Fools’ Day to you! Here’s a quick (8:42) review of recent news from the world of HPC-AI, including: AMD MI355X to debut with a 30,000-GPU Oracle cluster; in the face of U.S. trade policy, ASML to open repair hub in China; EuroHPC-JU’s ….
AI Factory: AMD in $4.9 Billion Acquisition of ZT Systems
On the heels of winning a 30,000 GPU cluster deal with Oracle, AMD (NASDAQ: AMD) today announced a move into the AI infrastructure arena, reflecting a trend toward integration, or verticalization, of chip makers and the AI factory, defined by at least one company (data center company Vertiv) as the AI “unit of compute.” […]
Fluidstack to Deploy Exascale GPU Clusters in Europe with NVIDIA, Borealis Data Center and Dell
Fluidstack, an AI cloud platform, announced it is deploying and managing exascale clusters across Iceland and Europe in collaboration with Borealis Data Center, Dell Technologies and NVIDIA. “Our mission has ….
Report: Oracle to Deploy AI Cluster with 30,000 AMD MI355X Accelerators
Nvidia, hot off the festivities of its big GTC conference last week in San Jose, dominates the GPU market with a roughly 95 percent share. But the question remains whether and how other chip suppliers will make inroads ….
Survey: Big AI Investments at Odds with Lack of Testing in Generative AI Development
A new vendor survey reports a significant disconnect between major investments in generative AI and the adoption of quality assurance practices within the software development lifecycle (SDLC). The “State of Digital ….
Torsten Hoefler Wins ACM Prize in Computing for Contributions to AI and HPC
ACM, the Association for Computing Machinery, today named Torsten Hoefler, a professor at ETH Zurich, the recipient of the ACM Prize in Computing for fundamental contributions to high-performance computing and the ongoing AI revolution. Hoefler, who was a guest on ….