Datadog Expands AI Observability with GPU Monitoring Launch

DDOG
April 22, 2026

Datadog announced on April 22, 2026 that its new GPU Monitoring solution is now generally available worldwide, offering unified visibility into GPU fleet health, cost, and performance and linking telemetry directly to the workloads that consume GPU resources.

The launch addresses a critical pain point for AI‑heavy organizations, where GPU instances can account for 14 % of compute costs. By providing per‑instance, per‑device visibility into utilization, memory, power, and thermals, Datadog enables faster troubleshooting of slow workloads and reduces over‑provisioning that drives waste.

GPU Monitoring complements Datadog’s existing LLM Observability offering, creating a seamless path from model latency spikes to the underlying GPU metrics. This end‑to‑end visibility supports customers in managing AI spend at scale and aligns with Datadog’s strategy to capture AI‑native customers and consolidate its platform.

The product launch fits into Datadog’s FY2026 guidance of $4.06 billion to $4.10 billion in revenue, an 18‑20 % year‑over‑year growth target, and precedes the company’s Q1 2026 earnings call on May 7. The expansion into GPU monitoring is expected to strengthen the AI segment of Datadog’s revenue mix and support its broader observability strategy.

"While these companies can see their costs climbing, they can’t chargeback GPU spend across business units, see workload context or identify clear next steps for improvement," said Yanbing Li, Chief Product Officer. "Smartly managing AI spend becomes a board‑level conversation when capacity is misallocated, training and inference workloads stall, and costs escalate. We all know managing GPU costs is a huge problem we need to solve, but most companies are experimenting with solutions and it is still very difficult to get a single view of what is happening across the stack. GPU Monitoring fixes that with efficiency and reliability that we haven’t seen before," Li added. Kai Huang, Head of Product at Hyperbolic, noted that the solution “has made it easy for us to stay on top of our multi‑tenant GPU infrastructure. We get per‑instance, per‑device visibility into core utilization, memory, power and thermals right out of the box with no extra setup. The dashboards are rich out of the gate and simple to customize, and standing up isolated views per customer takes minutes.”

Analysts have responded positively to the launch, highlighting Datadog’s strong AI focus and its platform consolidation strategy. Some analysts have expressed concerns about customer concentration risks, but overall sentiment remains supportive of the company’s AI‑centric growth trajectory.

The content on EveryTicker is for informational purposes only and should not be construed as financial or investment advice. We are not financial advisors. Consult with a qualified professional before making any investment decisions. Any actions you take based on information from this site are solely at your own risk.