DeepSeek Archives - insideAI News

SambaNova Reports Fastest DeepSeek-R1 671B with High Efficiency

February 18, 2025 by staff

Palo Alto, CA – Generative AI company SambaNova announced last week that DeepSeek-R1 671B is running today on SambaNova Cloud at 198 tokens per second (t/s), “achieving speeds and efficiency that no other platform can match,” the company said. DeepSeek-R1 has reduced AI training costs by 10X, but its widespread adoption has been hindered by […]

Filed Under: Recent News, Secondary Feature Tagged With: AI, DeepSeek, generative AI, LLMs, SambaNova

Cerebras Reports Fastest DeepSeek R1 Distill Llama 70B Inference

February 3, 2025 by staff

Cerebras Systems today announced what it said is record-breaking performance for DeepSeek-R1-Distill-Llama-70B inference, achieving more than 1,500 tokens per second – 57 times faster than GPU-based solutions. Cerebras said this speed enables instant reasoning capabilities ….

Filed Under: Recent News Tagged With: AI, AI models, artificial intelligence, Cerebras, DeepSeek

News Bytes Podcast 20250203: DeepSeek Lessons, Intel Reroutes GPU Roadmap, LANL and OpenAI for National Security, Nuclear Reactors for Google Data Centers

February 3, 2025 by staff

Happy February to one and all! The HPC-AI world was upended last week by AI benchmark numbers from DeepSeek, as the dust settles we offer a brief commentary on what, at this stage, it may mean ….

Filed Under: Recent News Tagged With: DeepSeek, Google, Intel, Kairos Power, Lawrence Livermore National Laboratory, LLNL, Nvidia, OpenAI, rack scale

SambaNova Reports Fastest DeepSeek-R1 671B with High Efficiency

Cerebras Reports Fastest DeepSeek R1 Distill Llama 70B Inference

News Bytes Podcast 20250203: DeepSeek Lessons, Intel Reroutes GPU Roadmap, LANL and OpenAI for National Security, Nuclear Reactors for Google Data Centers

Sponsored Guest Articles

Generative AI’s Accuracy Depends on an Enterprise Storage-driven RAG Architecture

White Papers

Powering Innovation: IDC Spotlight: Private AI Infrastructure in the Enterprise

Featured RSS Feed

More News from insideHPC