AI Inference: NVIDIA Reports Blackwell Surpasses 1000 TPS/User Barrier with Llama 4 Maverick
NVIDIA said it has achieved a record large language model (LLM) inference speed, announcing that an NVIDIA DGX B200 node with eight NVIDIA Blackwell GPUs achieved more than 1,000 tokens per second (TPS) per user on the 400-billion-parameter Llama 4 Maverick model. NVIDIA said the model is the largest and most powerful in the Llama 4 […]
Cloudera Releases AI-Powered Unified Data Visualization for On-Prem Environments
Santa Clara, California – May 20, 2025: Hybrid data platform company Cloudera announced the latest release of Cloudera Data Visualization that extends its AI capabilities for on-premises environments. The offering is an AI tool designed to democratize insights across the data lifecycle, enabling data engineers, business analysts, and data scientists to seamlessly communicate, collaborate, and […]
Report: $15B OpenAI Data Center in Texas Will House up to 400,000 Blackwells
Amid reports of hundreds of billions in AI data center investments in the Middle East, development of AI factories in the U.S. continues apace. The Wall Street Journal reported today that OpenAI has secured ….
NVIDIA Announces DGX Cloud Lepton for GPU Access across Multi-Cloud Platforms
At the Computex conference in Taiwan, NVIDIA today announced NVIDIA DGX Cloud Lepton, an AI platform with a compute marketplace that connects developers building agentic and physical AI applications ….
DDN Teams With NVIDIA on AI Data Platform Reference Design
AI and data intelligence vendor DDN today announced the NVIDIA AI Data Platform reference design, built in collaboration with NVIDIA to support organizations’ generative AI strategies by simplifying how unstructured data is stored, accessed ….
Duos Edge AI Confirms EDC Deployment Goal in 2025
JACKSONVILLE, Fla., May 15, 2025 — Duos Technologies Group, Inc. (Nasdaq: DUOT), through its operating subsidiary Duos Edge AI, Inc., a provider of adaptive Edge Data Center (EDC) solutions, today announced it is on pace to have 15 Edge Data Centers under contract by the end of 2025. The additional deployments are contributing to addressing the […]
Cognichip out of Stealth with $33M in Funding for Artificial Chip Intelligence
SAN FRANCISCO — May 15th, 2025 – Cognichip, the Artificial Chip Intelligence (ACI) company for semiconductor design, today announced its launch out of stealth with $33 million in seed funding. Led by Lux Capital and Mayfield, with support from FPV and Candou Ventures, Cognichip said it is the first physics-informed foundational model (PIFM) for semiconductors, designed […]
Openlayer Raises $14.5 Million Series A
San Francisco – May 14, 2025 – Today, Openlayer, a platform for evaluation and governance of AI systems at the enterprise level, announced a $14.5 million Series A round led by Race Capital with participation from NXTP, KPN Ventures, Mindset, Y Combinator, Quiet Capital, and Telefonica. The company will focus the new investment to expand […]
Saudi Arabia Unveils AI Deals with NVIDIA, AMD, Cisco, AWS
Saudi Arabia announced AI development deals with four leading U.S. technology firms: chip companies NVIDIA and AMD, networking company Cisco, and cloud platform Amazon Web Services. The partnerships were formed through the kingdom’s ….
IBM and Oracle Expand Agentic AI and Hybrid Cloud Partnership
IBM is working with Oracle to bring watsonx, IBM’s portfolio of AI products, to Oracle Cloud Infrastructure. Leveraging OCI’s native AI services is designed “to fuel a new era of multi-agentic, AI-driven productivity and efficiency across the enterprise,” the companies said. “AI delivers the most impactful value …