Red Hat, Nvidia Launch Co-Engineered AI Factory Platform for Enterprise Deployments -- ADTmag

By John K. Waters
February 25, 2026

Key Takeaways

Red Hat and Nvidia are packaging AIOps into a single “factory” stack by combining Red Hat AI Enterprise with NVIDIA AI Enterprise for end-to-end, production-scale deployments.
The focus is scaling inference efficiently—with integrated serving and optimization components (including vLLM and NVIDIA TensorRT-LLM) plus observability to help control performance and total cost.
It’s positioned as hybrid-by-design and enterprise-ready—supported across on-prem, cloud, and edge environments, with OEM partners (Cisco, Dell, Lenovo, Supermicro), and security features rooted in Red Hat Enterprise Linux.

Red Hat on Tuesday introduced the Red Hat AI Factory with NVIDIA, a co-engineered software platform that combines Red Hat AI Enterprise with NVIDIA AI Enterprise to help organizations deploy and operate AI systems at scale.

The companies said the offering is designed to support a shift among enterprises from pilot projects to production deployments, as more organizations try to run higher-density AI workloads and “agentic” applications that can increase demand for inference computing and associated infrastructure.

Red Hat, the enterprise software company best known for Red Hat Enterprise Linux and its open-source-based platforms for hybrid cloud, automation, and application development, said the platform is intended to give IT operations teams a single foundation to manage both conventional enterprise workloads and the additional layers of an AI stack. The company said it provides “Day 0” support for NVIDIA hardware architectures, referring to the availability of software support at the time new NVIDIA systems ship.

According to Red Hat, the platform is supported on AI infrastructure from systems vendors including Cisco, Dell Technologies, Lenovo, and Supermicro, and can be deployed on-premises, in the cloud, or at the edge.

Chris Wright, Red Hat’s chief technology officer, said in a blog post that enterprises are moving toward “industrial-scale” AI production that requires new approaches to managing compute infrastructure and software.

Red Hat said the platform includes capabilities for AI inference, model tuning and customization, and deployment and management of AI agents, with a focus on security. It also said customers can access pre-configured models delivered as NVIDIA NIM microservices, including the IBM Granite model family, as well as NVIDIA’s Nemotron and Cosmos open models. Red Hat added that model alignment and tuning can be performed using NVIDIA NeMo.

On performance, Red Hat said the platform integrates components such as vLLM, NVIDIA TensorRT-LLM, and NVIDIA Dynamo to help meet service-level objectives for inference workloads, and includes built-in observability features to improve utilization and lower the total cost of ownership.

The companies said the system also provides GPU resource pooling and orchestration features, including automatic checkpointing for longer-running jobs, and incorporates security features based on Red Hat Enterprise Linux. Red Hat said NVIDIA DOCA microservices can be used to add what it described as a zero-trust architecture and runtime security.

Red Hat said the Red Hat AI Factory with NVIDIA is available now.

About the Author

John K. Waters is the editor in chief of a number of Converge360.com sites, with a focus on high-end development, AI and future tech. He's been writing about cutting-edge technologies and the culture of Silicon Valley for more than two decades, and he's written more than a dozen books. He also co-scripted the documentary film Silicon Valley: A 100 Year Renaissance, which aired on PBS.
