Telecommunications companies are shifting from renting graphics processing units to buying AI services through a token-based billing model from NVIDIA. The move marks a significant change in how carriers access computing power for tasks like network optimization and customer service chatbots.
What the token-metered model offers
Under NVIDIA’s system, telcos pay per token — a unit of compute tied to an AI task — rather than reserving entire GPUs for hours or days. That lets carriers scale up or down without overpaying for idle hardware. The model is similar to how cloud services charge per API call or per hour of processing, but NVIDIA ties it directly to its own AI software stack.
The shift away from GPU rentals means telcos no longer need to manage their own clusters or negotiate long-term contracts for hardware. Instead, they can tap into NVIDIA’s infrastructure as needed, paying only for what they use. That’s a big change for an industry that historically builds its own data centers.
Why telcos are moving now
Demand for AI in telecom keeps climbing. Carriers use machine learning for everything from predicting network congestion to automating customer support. But GPUs are expensive and hard to get. Renting them from cloud providers or NVIDIA itself has been the workaround, but that still requires predicting needs months in advance.
Token metering smooths out that uncertainty. A telco can run a burst of AI jobs during a holiday traffic spike and then cut back immediately, without wasting money on idle hardware. The model also fits the uneven workloads many carriers see — heavy during peak hours, light overnight.
Global token billing trends
NVIDIA isn’t the only one pushing token-based pricing. Across the tech industry, companies are moving from flat subscriptions or per-hour rates to granular, consumption-based billing. AI model providers like OpenAI charge per token. Cloud platforms have per-inference pricing. The telecom pivot aligns with that broader trend toward paying for exactly what you use.
For telcos, the appeal is clearer cost control. Instead of a big upfront capital expense for GPUs, they get an operating expense that varies with demand. That frees up budget for other investments, like 5G expansion or edge computing. It also lowers the barrier to trying new AI applications — a carrier can test a new fraud-detection model for a few hundred tokens before committing.
What’s still unknown
NVIDIA hasn’t disclosed specific token pricing for telcos, and it’s unclear how the model will compare to current GPU rental rates. Carriers that own their own hardware may not rush to switch. And the token system ties them even more tightly to NVIDIA’s software, raising questions about vendor lock-in. The next few quarters will show whether the model catches on beyond early adopters.
