
NVIDIA’s annual developer conference (San Jose, March 16–19) has become a bellwether for data center physical infrastructure (DCPI). This year was no exception. NVIDIA DSX took center stage — a full-stack platform for designing, building, and operating AI factories that now counts over 200 partners in its ecosystem. Several major DCPI vendors—including ABB, Eaton, Mitsubishi Electric, Schneider Electric, Siemens, Trane Technologies, and Vertiv—unveiled co-designed solutions in a tightly choreographed wave of announcements. It was a concrete expression of what CEO Jensen Huang declared in his keynote: “this conference is going to cover every single layer of the five-layer cake of artificial intelligence, from land, power and shell the infrastructure to chips, to the platforms, the models, and, of course, the most important, and ultimately what’s going to take get this industry taken off, is all of the applications.”

A Factory for Designing Factories

Among the DSX components, what particularly stood out was the Omniverse DSX Blueprint, a now generally available platform for modeling data center layouts, power topologies, and thermal behavior, using simulation-ready 3D models contributed by infrastructure partners in OpenUSD format. It is an ambitious vision at a time when most data center design still relies on traditional CAD and BIM applications and digital twin adoption is still in its infancy. This is NVIDIA being characteristically visionary: anticipating what will eventually become a necessity, even if today it can look like overkill.

The industry is moving from adding capacity in the teens of gigawatts a year to potentially 100GW+ in a decade or less. At that scale, without AI-assisted tools in design, construction, and commissioning, it is hard to see how projects come online at the pace required—particularly given well-known skilled labor shortages. Just as semiconductor design has become fundamentally dependent on AI tools, data center design at gigawatt scale may have no choice but to follow the same path. The Omniverse Blueprint is NVIDIA’s bet on removing the barriers to building AI factories at scale.
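To put that ramp in perspective, the implied growth rate can be worked out with a short sketch. The inputs are assumptions chosen for illustration only: 15 GW a year stands in for "the teens", 100 GW a year is taken as the end point, and the horizon is a full ten years. The function name is hypothetical.

```python
def implied_annual_growth(start_gw: float, end_gw: float, years: int) -> float:
    """Compound annual growth rate needed to go from start_gw to end_gw in `years`."""
    return (end_gw / start_gw) ** (1 / years) - 1


# Assumed inputs (not figures from the text): 15 GW/year today, 100 GW/year in 10 years.
rate = implied_annual_growth(start_gw=15.0, end_gw=100.0, years=10)
print(f"Implied growth in annual capacity additions: {rate:.1%}")
```

Under these assumptions, annual capacity additions would need to grow by roughly a fifth every year for a decade. A shorter horizon or a lower starting point pushes the required rate higher still.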

But while the Omniverse Blueprint captures the imagination, the conversations dominating the show floor among DCPI vendors were far more immediate. Five topics in particular stood out: the growing heterogeneity of inferencing cluster racks, the fast-approaching 800 VDC transition, the ramp-up of liquid cooling designs, the potential commoditization within the MGX ecosystem, and (as in any data center discussion) power availability.

NVIDIA’s Vera Rubin DSX AI Factory Reference Design

The End of the One-Rack Era

For the past two NVIDIA generations, data center designers could plan around a single workhorse rack. The Hopper and then Blackwell platforms offered a largely homogeneous building block: one compute rack architecture, scaled across rows and halls, with relatively uniform power and cooling profiles. GTC 2026 broke that pattern decisively.

NVIDIA introduced not one but a number of rack configurations under the Vera Rubin umbrella. The NVL72 remains the flagship—72 Rubin GPUs and 36 Vera CPUs in a fully liquid-cooled, fanless, cableless enclosure exceeding 200 kW per rack. Alongside it, a CPX rack adds Rubin CPX accelerators to the Vera Rubin superchip trays, optimized for inference performance. A Vera CPU-only rack targets inference and data preprocessing without GPU acceleration. And the LPX rack with Groq’s LPUs debuts third-party silicon within NVIDIA’s own reference design.

This is a big departure. And it is also entirely expected. A single architecture serving every workload was only tenable while AI infrastructure was synonymous with large-scale training. As workloads diversify into a variety of fine-tuning and inferencing agentic AI applications, infrastructure must follow suit. Henry Ford was able to offer the Model T alone for only so long.

For DCPI vendors, the implications are immediate. Heterogeneous clusters mean managing mixed rack densities, uneven heat loads, and varying liquid cooling requirements coexisting on the same row. This is a design and operational challenge that will demand far more flexibility from infrastructure solutions than the relatively uniform AI halls of the Hopper and Blackwell era.

High Voltage, High Stakes

For the biggest disruption in data center power architecture in decades, 800 VDC power distribution received remarkably little attention in NVIDIA’s official channels. Absent from Jensen’s keynote and with no significant announcements since the technical blog and whitepaper released alongside last year’s OCP Global Summit—an event we covered in a previous blog—NVIDIA’s messaging on the architecture has been sparse.

Among vendors, however, the picture could not have been more different. 800 VDC was the talk of the town. Multiple vendors showcased equipment and prototypes, and many dedicated sessions explored everything from semiconductor building blocks to rack-level power delivery and facility integration. Vendors like Delta Electronics, Texas Instruments, and STMicroelectronics focused their marquee March 16 announcements squarely on 800 VDC developments, an unusual departure from the lockstep of similarly themed announcements that has become the norm at GTC.

Schneider Electric’s Jim Simonelli session at GTC draws interest from audience

Such advancements are important and necessary, but many questions about the 800 VDC topology remain unanswered. In his GTC session entitled “A Safe, Efficient, and Scalable Approach to 800 VDC Architecture,” Eaton’s J.P. Buzzell referenced an OCP white paper expected in the coming weeks. The draft should bring more clarity to the architecture, but there is still a long way to go before engineers can fully specify an 800 VDC data hall. And even once the specification matures, supply chains for components will need to be stood up and safety guidelines codified before broad deployment can begin.

45 Degrees of Separation

Much like 800 VDC, another infrastructure shift that made waves in an earlier NVIDIA keynote received little airtime at GTC. At CES in January, Jensen highlighted the move toward 45°C warm-water inlet temperatures, a significant departure from the designs more commonly deployed today. At GTC, the topic drew little more than Jensen’s brief nod to Vera Rubin’s 45°C specification.

NVIDIA remains committed to 45°C, but there is no sign of it doubling down or rushing to get there. The convergence toward 45°C architectures will take longer to play out. Facility-side infrastructure needs to be adapted, but operators might remain reluctant to optimize the cooling system if doing so carries any risk of reducing accelerator performance. In an age of highly constrained compute, every token counts. And the imperative to maximize throughput trumps facility-level efficiency optimization.

The water temperature debate, however, was far from the only liquid cooling story at GTC. On the show floor, the direction of travel for CDU capacity was unmistakable. As pod architectures scale and per-rack thermal loads climb, vendors responded with a new class of multi-megawatt CDUs. These are a step change from capacities that dominated the market just a year ago, and we expect this upward trend to continue as next-generation pod architectures push thermal envelopes further still.

One interesting product on the exhibition floor was a direct-current CDU that can be connected straight to the 800 VDC bus. It is a thoughtful choice that adds flexibility for operators designing next-generation whitespace, even if we expect most large units to be housed in mechanical galleries in the grey space, where traditional AC power distribution is likely to remain the standard for the foreseeable future. Either way, the convergence of power and cooling design choices is becoming impossible to ignore.

MGX and the March Toward Standardization

The growing specificity of NVIDIA’s reference architectures—from rack dimensions and cooling requirements to power topologies and simulation-ready digital models—raises an uncomfortable question for DCPI vendors: as NVIDIA defines more of the design, what room is left for differentiation?

The “MGX wall” on the show floor—displaying components from dozens of vendors side by side within the standardized MGX ecosystem—made this tension visible. By standardizing interfaces, form factors, and performance specifications across the infrastructure stack, MGX makes it easier for operators to mix and match components from multiple suppliers. That is a win for deployment speed and supply chain resilience. But it also compresses the space in which vendors can compete on anything other than price and availability—the classic hallmarks of a commoditizing market.

Quick disconnects from multiple vendors showcased at the “MGX wall”

Not all vendors will be affected equally. Those with deep system integration expertise, intelligent controls, service capabilities, or engineering and quality differentiation in mission-critical components will find ways to stay above the commoditization line. But for vendors whose value proposition rests primarily on the physical product itself, the tightening of NVIDIA’s specifications around their equipment is a trend worth watching closely.

Unlocking the Grid

Perhaps the most consequential launch at GTC came not from the chip announcements but from DSX Flex, NVIDIA’s software layer for connecting AI factories to grid services and orchestrating dynamic power adjustment. With NVIDIA’s order book continuing to grow, the math is simple: the gap between the power needed to energize forecast chip shipments and the pace of grid upgrades is too large to ignore. And the only near-term path to more power is not launching data centers into space, but tapping into existing grid capacity when it is not being used.

This was a point I raised directly with Jensen during the event. His response was unequivocal: data centers must change their relationship with the grid and be willing to accept less stringent SLAs in exchange for faster access to capacity. AI workloads will need to flex around supply constraints rather than demanding always-on, fully firm power. In a world where tokens per watt is becoming the defining metric for AI factory economics, accessing these watts and making the most of them becomes make-or-break. Startups like Emerald AI and Phaidra are building the technology to support this, but unlocking it at scale requires more than engineering ingenuity. It depends on the willpower and aligned incentives of the primary gatekeepers involved: utilities, grid operators, and their regulators.
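The idea of flexing around supply constraints can be illustrated with a minimal sketch. It is not DSX Flex’s interface or any vendor’s method: the function, the data class, and every number below are hypothetical placeholders, chosen only to show how a site that accepts non-firm power would trade a small amount of curtailed load for access to capacity.

```python
from dataclasses import dataclass


@dataclass
class HourResult:
    served_mw: float      # load actually powered in this hour
    curtailed_mw: float   # load the site had to shed or defer


def flex_to_grid(demand_mw: list[float], available_mw: list[float]) -> list[HourResult]:
    """Cap each hour's load at whatever the grid can supply in that hour."""
    results = []
    for demand, available in zip(demand_mw, available_mw):
        served = min(demand, available)
        results.append(HourResult(served_mw=served, curtailed_mw=demand - served))
    return results


# Hypothetical six-hour window: a flat 100 MW load against varying grid headroom.
demand = [100.0] * 6
available = [120.0, 110.0, 90.0, 80.0, 100.0, 130.0]

hours = flex_to_grid(demand, available)
curtailed_share = sum(h.curtailed_mw for h in hours) / sum(demand)
print(f"Share of requested energy curtailed: {curtailed_share:.1%}")
```

In this made-up window the site gives up a small share of its requested energy during two constrained hours and runs at full load the rest of the time. A real orchestration layer would also decide which workloads to slow or defer, which this sketch does not attempt.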

What This Means for the DCPI Market

Dell’Oro Group’s latest DCPI market update, released during GTC week, showed the market reached $10.9 billion in 4Q 2025—up 20% year-over-year—with synchronized backlog surges across vendors in power and cooling. The AI supercycle continues to drive record investment, and GTC 2026 did nothing to dampen expectations. The tone was one of confident optimism—about the trajectory of AI, the scale of compute still to be built, and the opportunities ahead for data center vendors.
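As a quick back-of-the-envelope check, the two figures above imply the size of the year-ago quarter. The sketch below assumes the 20% growth rate is exact, although the published figure is rounded, so the result is approximate. The function name is hypothetical.

```python
def prior_period_value(current: float, yoy_growth: float) -> float:
    """Back out the year-ago value from the current value and year-over-year growth."""
    return current / (1 + yoy_growth)


current_bn = 10.9   # 4Q 2025 market size in $ billions, from the text
growth = 0.20       # year-over-year growth, from the text (treated as exact)

prior_bn = prior_period_value(current_bn, growth)
increase_bn = current_bn - prior_bn
print(f"Implied 4Q 2024: ${prior_bn:.1f}B, increase: ${increase_bn:.1f}B")
```

On these inputs, the market was around $9.1 billion in the year-ago quarter, meaning about $1.8 billion of quarterly revenue was added in twelve months.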

Regardless of whether that optimism proves fully warranted, GTC 2026 left little doubt: the DCPI market is entering its most consequential chapter yet. Stay tuned as we continue to track these shifts—and connect with us at Dell’Oro Group to discuss these trends as they unfold.

Vendor Press Releases

Accelsius:

Accelsius Makes NVIDIA GTC Debut with NeuCool® IR150, the Industry’s First Integrated Rack for Two-Phase Liquid Cooling

Delta Electronics:

Eaton:

Eaton collaborates with NVIDIA to unveil the Eaton Beam Rubin DSX platform to address the nearly $7 trillion data center buildout market from grid to chip

Foxconn:

Hon Hai Technology Group (Foxconn) Accelerates AI At NVIDIA GTC With Vera Rubin NVL72, Humanoids, Modular Data Center

Flex:

Hitachi:

Hitachi Unveils 800 VDC Power Supply Simulation, Enabled by the NVIDIA Omniverse DSX Blueprint, to Accelerate Gigawatt-Scale AI Factories

LiteOn:

LITEON Showcases Next-Generation 800 VDC and NVIDIA Vera Rubin Platform Solutions at NVIDIA GTC 2026 Empowering Enterprises to Deploy Megawatt-Scale AI Data Centers based on NVIDIA MGX

Schneider Electric:

Schneider Electric teams with NVIDIA to develop validated blueprints to design, simulate, build, operate and maintain gigawatt-scale AI Factories

STMicroelectronics:

STMicroelectronics expands 800 VDC AI datacenter power conversion portfolio with new 12V and 6V architectures in collaboration with NVIDIA

Texas Instruments:

TI unveils complete 800 VDC power architecture for future generation AI data centers with NVIDIA

Trane Technologies:

Trane Technologies Optimizes Industry‑First Thermal Management Reference Design for AI Factories, Introduces Two New Designs

Vertiv:

Vertiv brings converged physical infrastructure to NVIDIA Vera Rubin DSX AI factories

A few months after Upscale AI introduced SkyHammer—its clean-slate, open-standards scale-up platform designed to make XPUs “behave like a single coherent machine”—the firm is now extending its vision for open AI networking infrastructure into the scale-out domain, where clusters expand horizontally across multiple racks and, increasingly, across multiple data centers. To that end, Upscale AI is announcing a strategic partnership with NVIDIA aimed at accelerating the deployment of open, scale-out AI networking infrastructure for next-generation data centers.

The collaboration brings together NVIDIA’s Spectrum-X Ethernet switch silicon and Upscale AI’s AI-optimized, SONiC-based networking software to deliver interoperable, high-performance Ethernet fabrics designed for large-scale AI workloads.

As enterprises and neocloud providers expand AI clusters, networking has emerged as a critical bottleneck. The partnership focuses on enabling these customers to deploy scalable, low-latency networking systems that support heterogeneous environments spanning compute, accelerators, memory, and storage.

Open Infrastructure for Heterogeneous AI Environments

As part of the initiative, Upscale AI has joined the NVIDIA Partner Network. The partnership is intended to give customers greater flexibility in how they design and procure AI infrastructure, including deploying Ethernet switching powered by NVIDIA Spectrum silicon in heterogeneous, multi-vendor environments. This collaboration reflects a step toward more interoperable Ethernet infrastructure for AI deployments, while maintaining operational consistency at scale.

Focus on AI-Optimized SONiC

A core element of Upscale AI’s approach is its AI-optimized implementation of SONiC, the open-source network operating system widely used in hyperscale environments.

At Dell’Oro Group, we expect SONiC adoption in AI back-end networks to accelerate much faster than what we have historically observed in front-end networks. This faster uptake will be driven by several tailwinds on both the demand and supply sides.

On the demand side, a growing number of rapidly expanding AI model builders and neocloud providers are evaluating SONiC to diversify vendors, reduce platform lock-in, and gain greater control over their network infrastructure. Vendor diversification also helps mitigate risk, especially as supply availability tightens.

On the supply side, an expanding ecosystem of established vendors and new entrants is supporting the SONiC ecosystem. We expect SONiC-based switch sales in AI scale-out networks to grow at more than 50% CAGR (2025-2030), exceeding $10 B by 2030.
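To give a sense of what a trajectory like this looks like, the sketch below backs out a starting point and a year-by-year path. It assumes exactly 50% growth and exactly $10 B in 2030, whereas the forecast is stated as "more than" and "exceeding", so the implied 2025 base is an illustration and not a published figure. The function name is hypothetical.

```python
def implied_base(end_value: float, cagr: float, years: int) -> float:
    """Starting value that compounds to end_value at the given CAGR over `years`."""
    return end_value / (1 + cagr) ** years


# Illustrative inputs: exactly $10 B in 2030 and exactly 50% CAGR over 2025-2030.
base_2025 = implied_base(end_value=10.0, cagr=0.50, years=5)

# Year-by-year path in $ billions, rounded for display.
path = {2025 + n: round(base_2025 * 1.5 ** n, 2) for n in range(6)}
for year, value in path.items():
    print(year, value)
```

Five years of 50% growth multiplies the starting value about 7.6 times, so on these assumptions the 2025 base would be in the neighborhood of $1.3 B.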

Addressing a Critical Gap with Fully Integrated AI Infrastructure for Enterprise and Neocloud Customers

Historically, SONiC adoption has been spearheaded by hyperscalers. However, deploying and operating an open-source network operating system like SONiC demands substantial in-house engineering expertise and integration effort, capabilities many smaller cloud providers and enterprises lack. In addition, SONiC’s broader ecosystem support (turnkey distributions, enterprise-grade tooling, and vendor-backed support) has lagged that of proprietary network operating systems, limiting SONiC adoption beyond hyperscale environments.

Upscale AI plans to bridge this gap by delivering fully integrated solutions that combine hardware, software, and lifecycle services targeted at organizations building medium and large-scale AI environments.

While the first wave of AI has been driven primarily by large AI model builders—namely hyperscalers—the second wave is expected to be led by other cloud providers, including neocloud providers, as well as large enterprises. Together, these customer segments are projected to account for the majority of the Ethernet data center switch sales in scale-out networks by 2030.

Stitching Together an Open Fabric for AI

SkyHammer was step one. Scale-out is step two. Upscale AI is stitching together an open networking story—from the scale-up interconnect that makes XPUs act like one system, to the Ethernet fabric that lets AI environments grow horizontally while preserving multi-vendor flexibility. The NVIDIA partnership helps validate that direction and accelerates the scale-out side of the roadmap, reinforcing Upscale AI’s broader goal: open, interoperable AI networking infrastructure from pod to cluster.

As 2025 comes to a close, we reflect on several remarkable milestones achieved by the data center switching market this year, and what 2026 may have in store for us.

Looking back at 2025, several clear inflection points reshaped the market:

Ethernet overtakes InfiniBand in AI back-end networking: Supported by strong tailwinds on both the supply and demand sides, 2025 marked a decisive turning point for AI back-end networks, as Ethernet surpassed InfiniBand in market adoption. This shift is particularly striking given that just two years ago, InfiniBand accounted for nearly 80% of the data center switch sales in AI back-end networks.

Overall Ethernet Data Center Switch sales nearly doubled compared with 2022: The rapid adoption of Ethernet in AI back-end deployments propelled total Ethernet data center switch sales to an all-time high in 2025, nearly doubling annual revenues compared with 2022 levels.

800 Gbps well surpassed 20 M ports within just three years of shipments: As a point of reference, it took 400 Gbps six to seven years to achieve the same milestone.

The vendor landscape shifted meaningfully toward AI-exposed players: Vendors with greater exposure to AI back-end networking significantly outperformed the broader market in 2025. Companies such as Accton, Celestica and NVIDIA were among the primary beneficiaries of this shift, reflecting how AI-driven demand is reshaping competitive dynamics. Arista maintained the leading position in the Total Ethernet Data Center Switching market.

Looking ahead to 2026, questions are emerging around whether the pace of investment can be sustained after such an extraordinary year. While skepticism around AI returns on investment is growing, we believe the industry is still in the early innings of a multi-year AI investment cycle. Based on the latest capital expenditure outlooks from the large hyperscalers (Google, Amazon, Microsoft, Meta, Oracle and others), we expect another strong year of AI-related investment in 2026, which should continue to drive robust spending across the networking portion of the infrastructure stack.

Networking is becoming increasingly critical, as it plays a central role in addressing some of the most challenging scaling bottlenecks in AI deployments—including power availability and compute demand. Below are some of the inflection points expected for 2026:

Demand remains exceptionally strong in AI back-end networking. We continue to expect strong double-digit growth in AI networking spending, driven by ongoing scale-out of AI clusters. The integration of co-packaged optics could further accelerate market growth, as optics could easily add multiple billions of dollars to the market size.

Supply constraints remain the primary risk to our forecast. We expect demand to continue to outpace supply, with shortages in chips, memory, and other critical components representing the main caveats to our outlook. As a result, the market remains supply-constrained rather than demand-constrained, a challenging dynamic, but ultimately a more favorable one than the reverse.

Scale-up emerges as a new battlefield for Ethernet. After securing a leading position in the scale-out segment of AI back-end networks, Ethernet is now expanding into scale-up, where NVLink has historically dominated. In this space, Ethernet will compete not only with NVLink but also with UALink, another alternative to NVLink. We anticipate 2026 will be a year full of vendor announcements targeting both Ethernet and UALink opportunities in scale-up. Scale-up represents what could be the largest total addressable market expansion the industry has ever seen.

1.6 Tbps switches expected to ship in volume in 2026. The year will mark the first volume deployments of 1.6 Tbps switches, driven by the insatiable demand for high bandwidth in AI clusters. The 1.6 Tbps ramp is expected to be even faster than that of 800 Gbps, surpassing 5 M ports within one to two years of shipments.

Co-packaged optics (CPO) expected to ramp on both InfiniBand and Ethernet switches. After many years of development and debate, 2026 is expected to see the initial volume ramp of CPO on both InfiniBand and Ethernet switches. On the demand side, major hyperscalers are actively trialing the technology. On the supply side, while NVIDIA is leading the way, we expect other vendors to follow shortly.

Vendor diversity set to increase in 2026. As AI clusters continue to scale, vendor diversity, spanning both incumbent vendors and new entrants, will become increasingly important to ensure risk mitigation and supply availability. We believe that no single vendor can meet the full demand for AI infrastructure. As a result, we expect SONiC adoption to accelerate in both scale-up and scale-out deployments, as it will be critical in enabling this broader vendor ecosystem.

In summary, as we look ahead to 2026, the AI-driven data center landscape is set to continue its rapid evolution. From Ethernet’s rise in AI back-end networks and the emergence of scale-up as a new battlefield, to the adoption of 1.6 Tbps switches, co-packaged optics, and a more diverse vendor ecosystem, the infrastructure supporting AI is expanding in both scale and complexity. While supply constraints and ROI questions remain challenges, the industry is clearly in the early innings of a multi-year AI journey. Networking, in particular, will play a pivotal role in enabling the next phase of AI growth, making 2026 an exciting year for both innovation and investment.

The hyperscale AI infrastructure buildout is entering a more mature phase. After several years of rapid regional expansion driven by resilience, redundancy, and data sovereignty, hyperscalers are now focused on scaling AI compute and supporting infrastructure efficiently. As we move into 2026, the cycle is increasingly defined by capex discipline and execution risk, even as absolute investment levels remain historically high.

Accelerated Servers Remain the Core Spending Driver

Spending on high-end accelerated servers rose sharply in 2025 and continues to anchor AI infrastructure investment heading into 2026. These platforms pull through demand for GPUs and custom accelerators, HBM, high-capacity SSDs, and high-speed NICs and networks used in large AI clusters. While frontier model training remains important, a growing share of deployments is now driven by inference workloads, as hyperscalers scale AI services to millions of users globally.

This shift meaningfully expands infrastructure requirements, as inference workloads require higher availability, geographic distribution, and tighter latency guarantees than centralized training clusters.

GPUs Continue to Dominate Component Revenue

High-end GPUs will remain the largest contributor to component market revenue growth in 2026, even as hyperscalers deploy more custom accelerators to optimize cost, power efficiency, and workload-specific performance at scale. NVIDIA is expected to begin shipping the Vera Rubin platform in 2H26, which increases system complexity through higher compute and networking density and optional Rubin CPX inference GPU configurations, materially boosting component attach rates.

AMD is positioning to gain share with its MI400 rack-scale platform, supported by recently announced wins at OpenAI and Oracle. Despite growing competition, GPUs continue to command outsized revenue due to higher ASPs and broader ecosystem support.

Near-Edge Infrastructure Becomes Critical for Inference

As AI inference demand accelerates, hyperscalers will need to increase investment in near-edge data centers to meet latency, reliability, and regulatory requirements. These facilities—located closer to population centers than centralized hyperscale regions—are essential for real-time, user-facing AI services such as copilots, search, recommendation engines, and enterprise applications.

Near-edge deployments typically favor smaller but highly dense accelerated clusters, with strong requirements for high-speed networking, local storage, and redundancy. While these sites do not approach the power scale of centralized AI campuses, their sheer number and geographic dispersion represent a meaningful incremental capex requirement heading into 2026. In contrast, far-edge deployments remain more use-case dependent and are unlikely to see material growth until ecosystems and application demand further mature.

Networking and CPUs Transition Unevenly

The x86 CPU and NIC markets tied to general-purpose servers are expected to decelerate in 2026 following short-term inventory digestion. In contrast, demand for high-speed networking remains tightly linked to accelerated compute growth. Even as inference workloads outpace training, inference accelerators continue to rely on scale-out fabrics to support utilization, redundancy, and ultra-low latency.

Supply Chains Tighten as Component Costs Rise

AI infrastructure supply chains are becoming increasingly constrained heading into 2026. Memory vendors are prioritizing production of higher-margin HBM, limiting capacity for conventional DRAM and NAND used in AI servers. As a result, memory and storage prices are rising sharply, increasing system-level costs for accelerated platforms.

Beyond memory, longer lead times for advanced substrates, optics, and high-speed networking components are adding further volatility to the supply chain. In parallel, tariff uncertainty and evolving trade policy introduce additional supply-chain risk, potentially elevating component pricing over the medium term.

Capex Remains Elevated, but ROI Scrutiny Intensifies

The US hyperscale cloud service providers continue to raise capex guidance, reinforcing the continuity of the multi-year AI investment cycle into 2026. Accelerated computing, greenfield data center builds, near-edge expansion, and competitive pressures remain strong tailwinds. Changes in depreciation treatment provide levers to optimize cash flow and support near-term investment levels.

However, infrastructure investment has outpaced revenue growth, increasing scrutiny around capex intensity, depreciation, and long-term returns. While cash flow timing can be managed, underlying ROI depends on successful AI monetization, increasing the risk of margin pressure if revenue growth lags infrastructure deployment.

A year of continuous shifts within the sector — and familiar debates beyond it

As we close out 2025, the milestones of the past twelve months underscore just how quickly the industry is shifting beneath our feet. DeepSeek’s breakthrough reshaped assumptions about compute efficiency and cost; NVIDIA’s announcement of Blackwell Ultra signaled yet another leap in accelerator performance; the White House’s AI Action Plan formalized the policy stakes around national compute capacity; Stargate’s Abilene facility began operating at unprecedented scale, becoming a symbol of the AI‑era mega‑campus; and debates around AI circular investments highlighted both the ambition and fragility of capital flows into frontier infrastructure, to name only a few of the key milestones of the past year.

These developments set the stage for a year that will balance continuity with disruption. For vendors and operators, 2026 will bring meaningful shifts in technologies, architectures, and competitive dynamics. Yet from the outside, the narrative may feel familiar. The same themes that began surfacing more prominently in recent years — and defined public debate throughout 2025 — will continue to dominate headlines, even as the underlying infrastructure evolves at a far faster pace.

What we’re not predicting — because everyone else already is

Power scarcity remains the defining constraint, with power availability continuing to be the single most important determinant of site selection for data center projects. Speculation about an AI‑driven investment bubble is expected to intensify, as trillions of dollars in critical infrastructure are deployed amid lingering uncertainty about long‑term monetization models. And public visibility of the sector will keep rising, bringing sharper community pushback, permitting resistance, and societal concerns ranging from energy affordability to the impact of AI on jobs, as well as growing scrutiny over the safe and responsible use of AI, particularly among young people. These pressures intensify all the more because the industry lacks coherent, accessible, and positive messaging about its value to communities and the broader economy.

Because these forces are so obvious and so deeply embedded in the industry’s trajectory, we will not include them among our predictions. Instead, this outlook focuses on the emerging dynamics that will shape vendors, operators, and the broader ecosystem in ways both expected and unexpected.

The easy ones: our highest-confidence expectations for 2026

These trends are already well underway, with early signals evident throughout 2025, reinforcing a trajectory that leaves little doubt about their momentum heading into 2026.

1. Consolidation and partnerships accelerate

The complexity of gigawatt‑scale data centers is pushing vendors to work together more closely, driving a surge in strategic partnerships that combine expertise across power, cooling, controls, and integration. Expect more joint reference architectures, co‑engineered solutions, and collaborative designs that extend well beyond any single vendor’s historical domain. We anticipate at least ten additional partnership announcements in 2026 as vendors align to meet the growing demands of AI‑era infrastructure.

In parallel, consolidation will continue as vendors with differentiated capabilities become acquisition targets — particularly in high-priority areas such as liquid cooling, solid-state power electronics, and global design and service expertise. These acquisitions will further accelerate the shift toward full-stack delivery models, with integrated chip-to-rack, rack-to-row, and row-to-hall solutions becoming a defining competitive strategy. We expect no fewer than five acquisitions or take-private transactions crossing the $1 billion threshold, underscoring the intensifying race to secure critical capabilities across the DCPI stack.

2. Real builds matter more than bold visions (and vanished ones)

Multi‑billion‑dollar and multi‑gigawatt campus announcements might continue to dominate headlines, but the center of gravity will shift toward execution rather than ideation. Operators will focus on translating these bold visions into reality — securing power, navigating permitting, sequencing construction, and commissioning facilities on time.

Source: OpenAI, “OpenAI, Oracle, and SoftBank expand Stargate with five new AI data center sites”

With the running backlog of public announcements now exceeding 70 GW of stated capacity, a meaningful share of these projects is likely to remain “braggerwatts” — aspirational declarations that never progress past land options, concept designs, or early‑stage filings. As economic, regulatory, and power‑availability constraints sharpen, attention will shift back to credible projects with clear pathways to completion and well‑defined delivery plans.

Today, several sites are on trajectories that suggest they could eventually cross the fabled 1 GW capacity threshold, but none have reached that milestone yet. By the end of 2026, however, we expect at least five sites worldwide to surpass 1 GW of operational capacity.

3. Divergence grows before convergence returns

Despite efforts toward convergence, 2026 is likely to bring even greater architectural divergence across power and cooling: a proliferation of design pathways rather than a narrowing of them. This is being fueled by rapid technological shifts that show no signs of slowing.

On the power side, even as clarity improves around 400 Vdc and 800 Vdc rack architectures, vendors will diversify rather than narrow their portfolios — developing new families of DC circuit breakers, power shelves, hybrid and supercapacitor‑based energy storage, and MV switchgear integrated with solid-state electronics in preparation for deployments expected in 2028/29.

Cooling will see similar diversification. The market will serve as a testing ground for novel technologies, including two‑phase direct liquid cooling (DLC), CDU‑less single‑phase DLC, and a wide variety of cold‑plate architectures, which are expected to gain momentum and broaden the range of solutions in the ecosystem.

In this environment, initiatives like the Open Compute Project (and its collaborations with ASHRAE, Current/OS, and others) will become even more important in steering the industry, offering reference frameworks and shared direction to help channel innovation while reducing unnecessary fragmentation.

Watch closely: trends gaining momentum — but not yet locked in

Early signals suggest these trends could gain real traction — but timing, economics, and scale remain uncertain.

4. “Micro‑mega” edge AI deployments are on the rise

As compute density within a single rack skyrockets, many AI workloads will be able to operate on one — or just a handful — of cabinets. These compact yet powerful clusters will increasingly sit alongside conventional compute to support hybrid workloads. Expect a wave of megawatt-class, ultra-dense AI racks for enterprise post-training and inference — small-scale AI factories — embedded within colocation sites, enterprise campuses, or telco edge facilities.

What makes this shift noteworthy is what it reveals about broader AI adoption: AI is moving beyond pilots and proofs‑of‑concept and into day‑to‑day business operations, requiring right‑sized, high‑density compute footprints placed directly where data and decision‑making occur.

Architecturally, this marks a meaningful shift. Instead of concentrating accelerated compute solely in hyperscale campuses or purpose‑built training clusters, enterprises and colocators will increasingly deploy AI directly into existing facilities. This proximity to business‑critical workflows will drive demand for modular, pre‑engineered AI systems that can be “dropped in” with minimal disruption, along with managed AI‑infrastructure services that oversee monitoring, lifecycle management, and performance optimization.

5. Air cooling strikes back

The novelty of liquid cooling has dominated industry discourse for the past three years, pushing vendors and operators to rapidly adapt — bringing new products to market, redesigning systems to accommodate liquid infrastructure, and upskilling operational teams to support deployments at scale. But as AI deployments move beyond frontier‑model training clusters and into enterprise environments, high‑density AI racks will more frequently appear in facilities not originally designed for liquid cooling.

This shift will prompt a resurgence in advanced air‑cooling solutions. Expect a proliferation of 40–80 kW air‑cooled racks supported by extremely high‑performance thermal systems, paired with 60–150 kW liquid‑cooled racks equipped with liquid‑to‑air sidecars. The result: hybrid thermal profiles within the same facility, introducing complex challenges for operators managing uneven heat loads and airflow dynamics.
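As a rough illustration of why racks in the 40–80 kW range call for such capable air-side systems, the sketch below applies the standard sensible-heat relation, airflow = power / (air density × specific heat × temperature rise). The 15 K inlet-to-outlet temperature rise, the sea-level air properties, and the function name are assumptions made for illustration; they are not figures from this outlook, and real designs depend on site conditions and equipment limits.

```python
# Rough airflow needed to carry a rack's heat away with air alone.
# Assumptions (not from the article): sea-level air properties, a 15 K
# inlet-to-outlet temperature rise, and all rack power ending up as heat in air.

AIR_DENSITY_KG_M3 = 1.2       # approximate density of air near 20 °C at sea level
AIR_CP_J_PER_KG_K = 1005.0    # specific heat of dry air at constant pressure
M3S_TO_CFM = 2118.88          # cubic metres per second -> cubic feet per minute


def airflow_m3_per_s(rack_kw: float, delta_t_k: float = 15.0) -> float:
    """Volumetric airflow from Q = P / (rho * cp * dT)."""
    if rack_kw < 0 or delta_t_k <= 0:
        raise ValueError("rack_kw must be >= 0 and delta_t_k must be > 0")
    watts = rack_kw * 1000.0
    return watts / (AIR_DENSITY_KG_M3 * AIR_CP_J_PER_KG_K * delta_t_k)


if __name__ == "__main__":
    # Lower and upper ends of the air-cooled range mentioned above.
    for kw in (40, 80):
        flow = airflow_m3_per_s(kw)
        print(f"{kw} kW rack: {flow:.2f} m^3/s (~{flow * M3S_TO_CFM:,.0f} CFM)")
```

Under these assumptions a 40 kW rack needs roughly 2.2 m³/s of air, and the requirement scales linearly with rack power, which is one way to see why mixed-density halls complicate airflow management.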

Far from being overshadowed by liquid cooling, air‑cooling systems are poised for incremental growth as operators seek flexible, retrofit‑friendly approaches to support heterogeneous rack densities across mixed‑use sites.

6. Immersion cooling re-emerges in modular form

After the hype cycle of recent years, immersion cooling is beginning to find its footing in more targeted, pragmatic applications. Rather than competing head‑on with DLC for hyperscale AI clusters, immersion vendors are shifting toward modular, compact systems that deliver differentiated value.

We expect growing traction in edge, telecom, and industrial environments, where immersion’s sealed‑bath architecture offers advantages in reliability, environmental isolation, and minimal site modification. These deployments will remain modest in scale, but meaningful in carving out a sustainable niche beyond today’s supercomputing and crypto segments.

To be clear, immersion cooling is not poised to displace DLC or become a dominant cooling technology. However, it is finally entering a phase where use‑cases align with its strengths — enabling vendors to build viable businesses around modular, ready‑to‑deploy immersion clusters that “drop in” alongside traditional IT and support workloads that benefit from simplified thermal management and rapid deployment.

7. Europe and China wake up — but in very different ways

Europe and China are both poised for stronger AI‑driven data‑center momentum in 2026, but their trajectories could not be more different. In power‑constrained Europe, growth will increasingly hinge on inference deployments located closer to population centers, to minimize network latency (even if compute latency remains the bigger challenge for AI services). This shift toward user‑proximate infrastructure will steer investment toward distributed, high‑density nodes rather than massive gigawatt-scale training campuses. Within this landscape of smaller facilities, a growing cohort of start‑up model builders will prioritize hyper‑efficient architectures that can extract maximum utility from these distributed fleets, for both inference and selective training workflows.

China, by contrast, faces no shortage of power. Its constraint is access to the latest generation of advanced accelerators. We expect operators to continue building at scale using a mix of domestic silicon and whatever Western supply remains available — iterating rapidly as local manufacturers improve capability generation by generation. Over the next few years, this mix‑and‑match strategy will help China bridge the gap until it achieves greater semiconductor self‑sufficiency, resulting in substantial expansion of AI data‑center capacity even under export controls.

The long shots: unlikely swings with outsized impact

Three low-probability but transformative developments, if they emerge, could reshape the data center landscape far more than their probability suggests.

8. U.S. government tightens regulation of the data center industry

A push in Washington to encourage investment in advanced cooling technologies — including a proposed bill aimed at accelerating liquid‑cooling adoption — could have unintended consequences. While well‑intentioned, efforts to steer technological choices risk drawing the federal government more directly into data center design decisions, increasing oversight and potentially making infrastructure requirements more rigid at a time when flexibility is essential.

We do not expect sweeping regulation to materialize in 2026. The current administration has closely aligned itself with AI as a pillar of economic competitiveness and will be wary of stymieing data center buildout, especially given its role in supporting GDP growth. Moreover, political attention will be dominated largely by the mid‑term elections, leaving little bandwidth for complex industry‑specific legislation.

However, affordability and household cost pressures are set to become highly charged political themes — and in that environment, data centers may attract negative scrutiny. As utilities grapple with rising demand and public concern around bills, the industry could face a wave of unfavorable headlines and heightened calls for transparency. To mitigate reputational risk, operators will need to invest more heavily in public engagement, clear messaging, and proactive demonstration of their contributions to reliability, economic growth, and community well‑being.

9. The first critical failure from a liquid-cooling leak hits the headlines

The early wave of liquid-cooled deployments often moved faster than the industry’s collective design and operational expertise. Many systems were installed without fully accounting for the nuances of coolant management, materials compatibility, monitoring, and routine maintenance — conditions that naturally elevate leak risk. Throughout 2025, we saw scattered reports of cluster-level shutdowns tied to liquid-handling failures, but nothing approaching the scale or societal visibility of a major cloud outage.

While we still believe high-profile failures are possible, their broader impact will likely be limited. Despite growing enterprise adoption, most AI systems are not yet embedded deeply enough in critical business processes to trigger widespread disruption. As a result, even a significant leak-related outage is unlikely to spark the kind of global headlines seen after the AWS blackout — though it may accelerate industry efforts around standards, training, instrumentation, and risk-mitigation practices.

10. The GPU secondary market skyrockets

As hyperscalers and neocloud providers refresh their fleets, early generations of GPUs, notably Ampere- and Hopper-based accelerators, will increasingly face retirement to make room for newer, more efficient architectures. This raises a key question already weighing on investors: what is the real depreciation timeline for AI hardware on hyperscaler balance sheets?

We expect most older GPUs to shift into lower‑complexity inference workloads or the training of smaller, less compute‑intensive models. We believe it is still too early for widespread scrapping of full data centers built on these platforms, a scenario that would flood the secondary market with GPUs looking for another productive life elsewhere.

Enterprise IT environments and colocation providers will see growing volumes of these second‑hand GPUs entering their ecosystems, often at attractive price points. Integrating these “intruders” into general‑purpose, lower‑density compute environments will introduce new operational and thermal challenges. Operators will need to manage concentrated heat loads, non‑uniform rack densities, and power profiles that differ from their conventional estate.

The bubbling question we can’t avoid — even if we tried

Speculation about an AI “bubble” has increasingly dominated media narratives throughout 2025, and the conversation is unlikely to quiet down in 2026. It is true that many AI‑adjacent companies are trading at lofty valuations, buoyed by optimism around future adoption and monetization, an optimism that may not prove durable. There is a meaningful possibility that equity markets enter correction territory in 2026, bringing P/E ratios closer to historical norms.

Yet even in a cooling market environment, we do not expect the data‑center buildout to slow materially. Hyperscalers continue to generate ample cash flow to support aggressive infrastructure expansion, and their balance sheets remain low‑leveraged, giving them capacity to secure additional capital if needed. Strategic imperatives will outweigh short‑term market pressure: these companies are locked in a race to establish AI hegemony — or risk being left behind.

In other words, financial markets may wobble, but the underlying drivers of AI infrastructure investment remain intact. The bubble debate will rage on, but the buildout will continue.

Looking ahead: embracing another year of acceleration and uncertainty

As with every prediction cycle, only time will reveal which of these dynamics take hold and which fade into the background. What is certain, however, is that 2026 will yet again challenge our assumptions. The pace of AI‑driven infrastructure evolution shows no signs of slowing, and the industry will continue navigating a rare combination of technological disruption, supply‑chain reinvention, and unprecedented demand for capacity.

While we avoid grand year‑end platitudes, it is fair to say that much will change — and much will stay the same. Power will remain the currency of competitiveness, AI will continue to push infrastructure to its limits, and operators and vendors alike will be forced to adapt faster than ever. At Dell’Oro Group, we look forward to tracking, analyzing, and interpreting these shifts as they unfold.

Here’s to a 2026 that will undoubtedly keep all of us in the data‑center world busy — and to the insights that the next twelve months will bring!
