NVIDIA today announced that the NVIDIA H100 Tensor Core GPU is in full production, with global tech partners planning in October to roll out the first wave of products and services based on the groundbreaking NVIDIA Hopper™ architecture.
Exxact Partners with Run:ai to Offer Maximal Utilization in GPU Clusters for AI Workloads
Exxact Corporation; a leading provider of high-performance computing (HPC), artificial intelligence (AI), and data center solutions; now offers Run:ai in their solutions. This groundbreaking Kubernetes-based orchestration tool incorporates an AI-dedicated, high-performant super-scheduler tailored for managing GPU resources in AI clusters.
AMAX Launches GPU Servers Powered by Intel’s Newest Data Center GPU Flex Series for AI, Gaming, & Media Streaming
AMAX, a leading provider of turnkey rack-scale High Performance Computing (HPC) solutions, Deep Learning/AI applications and server appliance manufacturing, announces the new AceleMax X-122-Flex server solution featuring Intel’s next-generation Data Center GPU Flex Series, (formerly code-named Arctic Sound-M), providing the capability of a graphics processing (GPU) solution handling high density and complex workloads targeted towards media delivery, cloud gaming, AI, metaverse, and other emerging visual cloud use cases.
Introducing Vultr Talon with NVIDIA GPUs — Cloud Platform Breakthrough Makes Accelerated Computing Efficient and Affordable
Vultr®, a leading independent provider of cloud infrastructure, announced that Vultr Talon, powered by NVIDIA GPUs and NVIDIA AI Enterprise software, is now available in beta. A breakthrough cloud-based platform, Vultr Talon offers affordable accelerated computing by enabling GPU sharing, so multiple workloads can efficiently run on a single NVIDIA GPU.
The Looming Datacenter Paradigm Shift
In this contributed article, Jonathan Friedmann, CEO & Co-Founder of Speedata, discusses a paradigm shift that is already underway. CPU’s leading role in datacenters is no longer a given, and as database workloads grow increasingly varied, market forces will point more and more in a clear direction: the development of new, dedicated, accelerated solutions. Data workloads are only going to grow more diverse and intricate from here. The same holds true for the chips that will process them.
AWS Announces General Availability of Amazon EC2 DL1 Instances
Amazon Web Services, Inc. (AWS), an Amazon.com, Inc. company (NASDAQ: AMZN), announced general availability of Amazon Elastic Compute Cloud (Amazon EC2) DL1 instances, a new instance type designed for training machine learning models. DL1 instances are powered by Gaudi accelerators from Habana Labs (an Intel company) to provide up to 40% better price performance for training machine learning models than the latest GPU-powered Amazon EC2 instances.
Building the GPU-accelerated Datacenter
[Sponsored Post] To learn about GPU-accelerated datacenters, join the upcoming live webinar from Thinkmate and PNY. You’ll hear about the future of the datacenter, why the GPU is crucial, the technology behind GPU acceleration, and what sort of options exist for different industries or types of organizations.
Yandex Finds Better Way to Train ML Models Over the Internet
A new proposal from tech giant Yandex overcomes a major hurdle in the advancement of machine learning by bringing the process to the masses, so that anyone with a home computer can help train a large neural network.
TensorRT 8 Provides Leading Enterprises Fast AI Inference Performance
NVIDIA today launched TensorRT™ 8, the eighth generation of the company’s AI software, which slashes inference time in half for language queries — enabling developers to build the world’s best-performing search engines, ad recommendations and chatbots and offer them from the cloud to the edge.
Walmart Innovates at the Retail Edge with AI
Through the combination of world-class software from Malong Technologies and a powerful processing platform Dell Technologies and NVIDIA, Walmart is able to deliver groundbreaking AI acceleration at the store edge.