NVIDIA has released a public beta of its XR AI platform, giving developers the tools to create multimodal AI agents for augmented reality and extended reality devices. The launch marks a deeper push into spatial computing for the chipmaker.
What the Beta Lets Developers Do
The XR AI beta is designed for building agents that process and respond to several types of input (text, voice, gestures, and visual cues) within AR and XR environments. A developer could, for example, build an AI assistant that sees what a user points at, hears a spoken command, and answers with both audio and a holographic overlay.
NVIDIA says the beta runs across its AI infrastructure, including the Omniverse platform and the NVIDIA AI Enterprise suite. For now, the company is focusing on the developer experience: getting the software into the hands of people building spatial apps before the technology becomes mainstream.
Why Spatial Computing Needs Multimodal Agents
Current AR and XR devices rely heavily on hand tracking and gaze detection. Adding an AI layer that understands natural language and visual context could make those interactions feel less clunky. The multimodal approach is meant to bridge the gap between a user’s physical actions and what they intend the software to do.
NVIDIA’s beta arrives as competitors like Meta and Apple push their own spatial computing hardware. But NVIDIA isn’t selling headsets; it’s selling the backend. The company’s bet is that developers will need a powerful, GPU-accelerated stack to run real-time AI on those devices. The XR AI beta is a step toward locking that ecosystem in place.
What Comes Next
The public beta is available now through NVIDIA’s developer portal. The company hasn’t said when a full production release will arrive, but early testers are expected to provide feedback over the coming months. For now, the question is how quickly spatial app builders will adopt the toolkit, and whether the multimodal agents it produces will make AR and XR feel less like a gimmick and more like a utility.




