Milvus 2.3 Massively Accelerates AI-Powered Applications With NVIDIA GPU Support 

Zilliz, the inventor of the open-source vector database Milvus, announced the launch of Milvus 2.3, featuring NVIDIA GPU support for greater flexibility and dramatic improvements in real-time workload performance. 

NVIDIA Hopper GPUs Expand Reach as Demand for AI Grows

NVIDIA and key partners today announced the availability of new products and services featuring the NVIDIA H100 Tensor Core GPU — the powerful GPU for AI — to address rapidly growing demand for generative AI training and inference. 

Deci delivers breakthrough inference performance on Intel’s 4th Gen Sapphire Rapids CPU

Deci, the deep learning company building the next generation of AI, announced a breakthrough performance on Intel’s newly released 4th Gen Intel® Xeon® Scalable processors, code-named Sapphire Rapids. By optimizing the AI models which run on Intel’s new hardware, Deci enables AI developers to achieve GPU-like inference performance on CPUs in production for both Computer Vision and Natural Language Processing (NLP) tasks.

CoreWeave Among First Cloud Providers to Offer NVIDIA HGX H100 Supercomputers Set to Transform AI Landscape

CoreWeave, a specialized cloud provider built for large-scale GPU-accelerated workloads, announced it is among the first to offer cloud instances with NVIDIA HGX H100 supercomputing. CoreWeave, Amazon, Google, Microsoft and Oracle are the first cloud providers included in the launch of this groundbreaking AI platform.

Oracle and NVIDIA Partner to Speed AI Adoption for Enterprises

Expanding their longstanding alliance, Oracle and NVIDIA today announced a multi-year partnership to help customers solve business challenges with accelerated computing and AI. The collaboration aims to bring the full NVIDIA accelerated computing stack — from GPUs to systems to software — to Oracle Cloud Infrastructure (OCI).

NVIDIA Hopper in Full Production

NVIDIA today announced that the NVIDIA H100 Tensor Core GPU is in full production, with global tech partners planning in October to roll out the first wave of products and services based on the groundbreaking NVIDIA Hopper™ architecture.

Exxact Partners with Run:ai to Offer Maximal Utilization in GPU Clusters for AI Workloads

Exxact Corporation; a leading provider of high-performance computing (HPC), artificial intelligence (AI), and data center solutions; now offers Run:ai in their solutions. This groundbreaking Kubernetes-based orchestration tool incorporates an AI-dedicated, high-performant super-scheduler tailored for managing GPU resources in AI clusters.

AMAX Launches GPU Servers Powered by Intel’s Newest Data Center GPU Flex Series for AI, Gaming, & Media Streaming

AMAX, a leading provider of turnkey rack-scale High Performance Computing (HPC) solutions, Deep Learning/AI applications and server appliance manufacturing, announces the new AceleMax X-122-Flex server solution featuring Intel’s next-generation Data Center GPU Flex Series, (formerly code-named Arctic Sound-M), providing the capability of a graphics processing (GPU) solution handling high density and complex workloads targeted towards media delivery, cloud gaming, AI, metaverse, and other emerging visual cloud use cases.

Introducing Vultr Talon with NVIDIA GPUs — Cloud Platform Breakthrough Makes Accelerated Computing Efficient and Affordable

Vultr®, a leading independent provider of cloud infrastructure, announced that Vultr Talon, powered by NVIDIA GPUs and NVIDIA AI Enterprise software, is now available in beta. A breakthrough cloud-based platform, Vultr Talon offers affordable accelerated computing by enabling GPU sharing, so multiple workloads can efficiently run on a single NVIDIA GPU.

The Looming Datacenter Paradigm Shift

In this contributed article, Jonathan Friedmann, CEO & Co-Founder of Speedata, discusses a paradigm shift that is already underway. CPU’s leading role in datacenters is no longer a given, and as database workloads grow increasingly varied, market forces will point more and more in a clear direction: the development of new, dedicated, accelerated solutions. Data workloads are only going to grow more diverse and intricate from here. The same holds true for the chips that will process them.