Google today announced GA on the Google Cloud Platform of three products built on custom silicon built for inference and agentic workloads: – Ironwood, Google’s seventh generation Tensor Processing Unit, will be generally available in the coming weeks. The company said it is built for large-scale model training and complex reinforcement learning, as well as […]
AI Inference: NVIDIA Reports Blackwell Surpasses 1000 TPS/User Barrier with Llama 4 Maverick
NVIDIA said it has achieved a record for large language model inference, announcing that an NVIDIA DGX B200 node with eight Blackwell GPUs achieved more than 1,000 tokens ….
MLCommons Releases MLPerf Inference v5.0 Benchmark Results
Today, MLCommons announced new results for its MLPerf Inference v5.0 benchmark suite, which delivers machine learning (ML) system performance benchmarking. The rorganization said the esults highlight that the AI community is focusing on generative AI ….
Blaize Received Approval to List its Common Stock and Warrants on Nasdaq
WASHINGTON & EL DORADO HILLS, Calif., Jan 13, 2024 – Blaize, Inc., a provider of artificial intelligence-enabled edge computing solutions, and acquisition company BurTech today announced that they expect to complete their previously announced business combination on January 12, 2025. The combined company will be named “Blaize Holdings, Inc.” and its common stock and warrants […]








