Thursday, May 21, 2026

Non-RT RIC, AI-RAN, and the AI Grid: Three Different Bets on the Future of the RAN


I have been asked a few times lately what the difference is between the Non-Real Time RIC and AI-RAN. The question itself tells you something. Both sit under the broad "AI in the RAN" umbrella, marketed aggressively by the same vendors, debated in the same conference sessions. But they are fundamentally different in architecture, ambition, and business model. And neither is quite the same as what NVIDIA formally branded the AI Grid at GTC 2026 — which is where the most important and most misread opportunity actually sits.

The Non-RT RIC: the pragmatic bet

The Non-RT RIC is an O-RAN defined software layer that sits in the Service Management and Orchestration layer above the RAN, not inside it. Control loops over one second. rApps for energy saving, traffic steering, slice assurance, automated optimization. Think of it as the evolution of Self-Organizing Networks, re-platformed on open interfaces with a proper application model and a genuinely lower barrier to entry — cloud-native and OSS skills are sufficient. No RAN silicon expertise required.

This is precisely why the early commercial traction is here, not in AI-RAN. AT&T is deploying Ericsson's SMO and Non-RT RIC to replace two legacy C-SON systems. TELUS has launched an RIC platform alongside its Open RAN rollout. Swisscom is deploying one for multi-technology network management. These are not trials. These are production decisions.

AI-RAN: real performance gains, speculative revenue

AI-RAN embeds AI natively into the RAN stack itself .The AI-RAN Alliance — founded in February 2024, now at 109 member companies — defines it across three working groups: AI-for-RAN, AI-and-RAN, and AI-on-RAN.

AI-for-RAN is the most mature: using AI to optimize the RAN itself — the scheduler, link adaptation, beamforming, interference management. T-Mobile and Ericsson have been trialing an AI-driven scheduler and link adaptation engine on a live 5G Advanced network since Q2 2025, targeting commercial deployment in Q3 2026. Nokia and NVIDIA, backed by a $1 billion equity partnership, are testing GPU-accelerated AI-RAN with BT, Elisa, NTT DOCOMO, and Vodafone.

AI-and-RAN is where the narrative gets more ambitious — and more speculative. The idea is that RAN sites become shared compute infrastructure, running both network workloads and enterprise AI workloads on the same hardware. The tower becomes a distributed AI compute node. New revenue streams. Operators escape the utility trap.

AI-on-RAN is the monetization layer for the above. The commercial mechanisms are still being defined. That tells you where the maturity is.

The AI Grid: follow NVIDIA's sequencing, not its marketing

At GTC 2026, NVIDIA formally introduced the AI Grid as a reference design — geographically distributed AI infrastructure, using the telco footprint to run inference workloads closer to users. The numbers are interesting: early Comcast benchmarks showed inference cost reductions of up to 76% versus centralized deployments. HPE, SpectroCloud, and others have already announced implementations aligned to the reference architecture.

I have used this concept in my own work for years to describe the evolution from isolated MEC deployments into a coherent, programmable distributed inference fabric. Good to see NVIDIA put a formal architecture behind it. But the marketing obscures a critical sequencing question.

NVIDIA's own GTC announcements noted that many operators are starting by lighting up existing wired edge sites — central offices and mobile switching offices — as AI Grids they can monetize today. The cell site layer is a later phase. AT&T's CTO Igal Elbaz has been direct about questioning the value of pushing compute all the way to the far edge to save one or two milliseconds of latency. T-Mobile's SVP of network infrastructure defined her AI edge strategy as what is at a data center at a mobile switching office. Verizon's CTO has flagged the cost and complexity of far-edge GPU deployments.

These are the three largest US operators. They are not being conservative for the sake of it. The economics are straightforward: central offices and mobile switching offices already have power, cooling, connectivity, and physical security. They aggregate traffic from hundreds of cell sites. The sub-500ms latency threshold that NVIDIA's own reference design targets is achievable from a well-positioned CO. It does not require a GPU at the tower — not for the use cases that have a business case today.

I have seen this movie before with MEC. The industry led with its most ambitious architectural vision, ran the infrastructure investment ahead of the demand, and recovered slowly. The AI Grid does not have to repeat that pattern.

What to actually do

Start with the Non-RT RIC. The contracts are being signed, the ecosystem is opening, the business case is defensible.

On AI-RAN, wait for AI-for-RAN where your vendors have credible near-term roadmaps. Treat AI-and-RAN at the cell site as a long term speculative option — worth tracking, too early to fund at scale.

On the AI Grid, follow NVIDIA's own sequencing rather than the brochure. Central offices and mobile switching offices first. Build the orchestration and service layer from there outward. Expand to the far edge when the use cases and economics justify it — not because a GPU manufacturer's demand forecast requires it.

The cell site AI Grid is a compelling long-term vision. The central office AI Grid is deployable today. In this industry, deployable usually wins.


Monday, March 23, 2026

From AI-Native to Agentic-Native Networks

Recent announcements at NVIDIA's GTC 2026—including major pushes into agentic AI frameworks like OpenClaw, NemoClaw, and agentic systems for reasoning, planning, and autonomous action—have reinforced several convictions I've held about the trajectory of AI-native infrastructure, especially in telecom and networked industries. We're seeing the emergence of two distinct paradigms:

  • AI-Native networks that observe, detect, optimize, and predict in real time. These systems augment human decision-making, providing powerful assistance in planning, deploying, and managing both physical and virtual infrastructure.
  • Agentic-Native networks, by contrast, eliminate the human-in-the-loop entirely. When equipped with real-time data access, transactional capabilities, and fulfillment capacity, they execute at the speed permitted only by the slowest link in the supply chain.

This second model doesn't just accelerate execution—it fundamentally reprices time itself as a competitive asset.

As Jordi Visser articulates in his insightful piece "The Repricing of Time: Equity in the Age of Agents", agentic AI compresses competitive cycles dramatically. Velocity of execution no longer merely helps fulfill a plan faster; it redefines the playing field. When capabilities can be reconfigured almost overnight through model iterations or agent orchestration, durable moats erode. What once took decades to build—layered expertise, entrenched positions, regulatory barriers—can now be challenged or leapfrogged in months.

In this environment, equity behaves more like a call option on execution speed than on long-duration stability. "Execution speed replaces installed base. Iteration cadence replaces headcount." The advantage shifts decisively toward those who can pivot, adapt, innovate, and execute rapidly.

This dynamic hits telecom particularly hard.

Most operators are desperate to escape the "utility trench"—the low-margin, commodity perception that has trapped connectivity providers for years. They aspire to new revenue streams beyond pipes and bandwidth.

From my own experience modeling, teaching, and advising organizations on this challenge (see earlier pieces on innovation micro-strategies, telco relevance and growth, and the lean telco), there is no single silver bullet. No grand transformation program that magically reinvents the business.

Instead, the path forward involves thousands of micro-services and experiments: create, test, fail fast, pivot, scale the winners, and launch repeatedly. The era of one-size-fits-all offerings is over.

Agentic-native networks offer exactly the infrastructure to make this high-velocity approach viable at scale. They enable rapid creation, iteration, value capture, and deployment—turning velocity, flawless execution, and clear strategic vision into the new currency that outcompetes inertia, legacy systems, and eroding differentiation.

For telecom leaders, the message from GTC 2026 is clear: agentic AI can free up resources and help accelerate innovation at scale. Those who embrace this shift—building or partnering for agentic capabilities—will be the ones that don't just survive the repricing of time, but help define the next era of networked value creation.