Sponsored Guest Article

  • Generative AI’s Accuracy Depends on an Enterprise Storage-driven RAG Architecture

    [SPONSORED POST] In this sponsored article, Eric Herzog, CMO of Infinidat, suggests that as part of a transformative effort to bring one’s company into the AI-enhanced future, it’s an opportunity to leverage intelligent automation with RAG to create better, more accurate and timely responses. Further, to optimize your storage systems for this enhancement, look for industry-leading performance, 100% availability and cyber storage resilience. They make you RAG-ready. 

White Papers

Powering Innovation: IDC Spotlight: Private AI Infrastructure in the Enterprise

AI is rapidly transforming industries, becoming a critical driver of innovation. As AI’s influence expands, organizations are increasingly turning to private AI solutions to maintain control over their data, ensure regulatory compliance, and customize AI models to meet specific needs. According to IDC’s Spotlight report Powering Innovation: Private AI Infrastructure in the Enterprise, experienced organizations […]

From complexity to clarity: Harnessing the power of AI/ML and risk-informed strategies to streamline clinical data management

In today’s fast-paced world, driven by demands for speed and efficiency, the field of clinical development has undergone a remarkable transformation. The way trials are being conducted has changed significantly with decentralized clinical trials (DCT) becoming mainstream and the collection of clinical data from wearables and other remote-monitoring devices becoming common practice. While these advances […]

Find out what drives today’s most successful platform businesses

In ‘The enterprise guide to platform thinking: What it can do for your business’, we explore how to achieve platform success and realize the full spectrum of business, operational, and customer value platforms can deliver. Inside, we dive into: — How platform thinking has evolved over the last five years, and the role that cultural, […]

Editor’s Choice

  • @HPCpodcast Industry View: A Deep Dive into High-Density Data Center Cooling and Efficiency Strategies with DDC Solutions

    https://orionx.net/wp-content/uploads/2025/01/096@HPCpodcast_IV_DDC_Chris-Orlando_Datacenter-Air-Liquid-Cooling_20250117.mp3 In this “Industry View” episode of the @HPCpodcast, Chris Orlando of DDC Solutions discusses the rapidly changing landscape of high density data center cooling, monitoring, safety and compliance, hybrid liquid-air solutions and other strategies for data center power efficiency. DDC’s cabinet technology, DCIM real-time monitoring and dynamic management software are designed to provide a single cabinet solution that exceeds AI, HPC, GPU and other compute-intensive demands.   Chris, Founder and Chief Strategy Officer at DDC, shares his extensive knowledge of HPC-AI data center cooling, which has rapidly emerged as a critical industry challenge that is growing more acute as HPC and AI combine to generate greater power demands. This episode is part of the @HPCpodcast’s Industry View feature, which takes on major issues in the world of AI, HPC and other advanced technologies through the lens of industry leaders. You can find our podcasts at insideHPC’s @HPCpodcast page, on Twitter, on the OrionX.net blog page, on iTunes and Google. Here’s the OrionX.net podcast page, and the RSS feed. We’re also available on Spotify and iTunes.

  • Nvidia at CES: Omniverse Blueprint for Industry, Generative Physical AI, Access to Blackwells, Cosmos Model for Physical AI

    Nvidia issued its anticipated raft of news at CES this week, here’s an overview of announcements for the HPC-AI sector: ‘Mega’ Omniverse Blueprint for Industrial Robot Fleet Digital Twins The company said Mega is an omniverse framework for next-gen industrial AI and robot simulation through software-defined testing and optimization of factories and warehouses. Citing facts and figures – there are 10 million factories, nearly 200,000 warehouses and 40 million miles of highways – the company said this vast industrial network of production facilities and distribution centers is still laboriously and manually designed, operated and optimized. Mega is a blueprint designed for developing, testing and optimizing physical AI and robot fleets at scale in a digital twin before deployment into real-world facilities. This is for advanced warehouses and factories that use fleets of autonomous mobile robots, robotic arm manipulators and humanoids working alongside people. Nvidia said Mega offers a reference architecture of accelerated computing, AI, Nvidia Isaac and Nvidia Omniverse technologies for developing and testing digital twins for AI-powered robot brains that drive robots, video analytics AI agents, equipment and more for handling enormous complexity and scale. The new framework brings software-defined capabilities to physical facilities, enabling continuous development, testing, optimization and deployment. Nvidia Omniverse with Generative Physical AI Nvidia announced generative AI models and blueprints that expand its Omniverse integration into physical AI applications, such as robotics, autonomous vehicles and vision AI. The company said Accenture, Altair, Ansys, Cadence, Foretellix, Microsoft and Neural Concept are among the first to integrate Omniverse into their software products and services. Industrial automation company Siemens also announced the availability of Teamcenter Digital Reality Viewer — the first Siemens Xcelerator application powered by NVIDIA Omniverse libraries. “Physical AI will revolutionize the $50 trillion manufacturing and logistics industries. Everything that moves — from cars and trucks to factories and warehouses — will be robotic and embodied by AI,” said Jensen Huang, founder and CEO at NVIDIA. “NVIDIA’s Omniverse digital twin operating system and Cosmos physical AI serve as the foundational libraries for digitalizing the world’s physical industries.” Nvidia said the USD Code and USD Search NVIDIA NIM microservices are now generally available, they are designed to let developers use text prompts to generate or search for OpenUSD assets. A new NVIDIA Edify SimReady generative AI model unveiled today can automatically label existing 3D assets with attributes like physics or materials, enabling developers to process 1,000 3D objects in minutes instead of over 40 hours manually, according to the company. Project DIGITS With Grace Blackwell 10 Superchip Debuts as AI Supercomputer Nvidia announced Project DIGITS, which the company called a personal AI supercomputer designed to provide AI researchers, data scientists and students access to Grace Blackwell platform, announced last March at the company’s GTC conference. Project DIGITS offers a petaflop of GB10 Superchip computing performance for prototyping, fine-tuning and running large AI models. The company said users can develop and run inference on models using their desktop system, then deploy the models on accelerated cloud or data center infrastructure. The company said Project DIGITS delivers GB10 performance using only a standard electrical outlet. Each Project DIGITS features 128GB of unified, coherent memory and up to 4TB of NVMe storage. With the supercomputer, developers can run up to 200-billion-parameter large language models. In addition, using Nvidia ConnectX networking, two Project DIGITS AI supercomputers can be linked to run up to 405-billion-parameter models. Cosmos World Foundation Model Platform for Physical AI Development Nvidia announced Cosmos, a platform comprised of generative foundation models, tokenizers, guardrails and an accelerated video processing pipeline built for development of physical AI systems such as autonomous vehicles (AVs) and robots. Cosmos models will be available under an open model license to help accelerate the work of the robotics and AV community. Developers can preview the first models on the NVIDIA API catalog, or download the family of models and fine-tuning framework from the NVIDIA NGC catalog or Hugging Face. Robotics and automotive companies, including 1X, Agile Robots, Agility, Figure AI, Foretellix, Fourier, Galbot, Hillbot, IntBot, Neura Robotics, Skild AI, Virtual Incision, Waabi and XPENG, along with ridesharing giant Uber, are among the first to adopt Cosmos. “The ChatGPT moment for robotics is coming. Like large language models, world foundation models are fundamental to advancing robot and AV development, yet not all developers have the expertise and resources to train their own,” said Huang. “We created Cosmos to democratize physical AI and put general robotics in reach of every developer.” All Nvidia CES-related announcements and blogs can be found here.

  • The insideAI News IMPACT 50 List for Q4 2024

    The team here at insideAI News is deeply entrenched in keeping the pulse of the big data ecosystem of companies from around the globe. We’re in close contact with the movers and shakers making waves in the technology areas of big data, data science, machine learning, AI and deep learning. Our in-box is filled each day with new announcements, commentaries, and insights about what’s driving the success of our industry so we’re in a unique position to publish our quarterly IMPACT 50 List.

Sponsored Guest Articles

  • Webinar: Getting Started with Llama 3 on AMD Radeon and Instinct GPUs

    [Sponsored Post] This webinar: “Getting Started with Llama 3 on AMD Radeon and Instinct GPUs” provides a guide to installing Hugging Face transformers, Meta’s Llama 3 weights, and the necessary dependencies for running Llama locally on AMD systems with ROCm™ 6.0.

  • Optimizing Performance and Cost Savings for Elastic on Pure Storage

    [SPONSORED POST] Organizations can now confidently embrace Elastic, enhance their hot tier storage, and seamlessly manage historical data with cost-efficient capacity-optimized storage. Pure Storage not only meets the demands of the modern data landscape but also empowers organizations to simplify their Elastic architecture, reflecting the industry trend towards a more streamlined and efficient approach.

  • Crafting Precision Content Using Large Language Models 

    [SPONSORED POST] Our friends over at DAC recently explored the potential of LLMs to interpret or generate text considering numerous parameters with extraordinary precision. They have demonstrated their ability to evaluate unstructured text blocks and grade them along various scales to great success. Additionally, they have been able to generate specific, tailored content using a complex series of instructions that can be tuned with precision to the same set of scales.