Why are automated threat discovery and opportunistic drive-by attacks, such as threat actors using internet-scanning tools to find exposed VNC clients or unencrypted Modbus gateways, so dangerous for a neocloud? The underlying architecture that keeps AI training clusters from melting down relies entirely on a highly heterogeneous, often flat operational technology (OT) network. A flat OT network is an unsegmented network architecture where all devices share a single broadcast domain; essentially, everything can talk to everything. Because of this lack of internal boundaries, compromising just one low-level device grants an attacker unimpeded lateral access to the rest of the facility.
In this post, we’ll explain:
The hidden vulnerabilities created by rapid neocloud commissioning, where the race for speed-to-power often leaves critical devices exposed on flat networks.
Why downtime is the ultimate business risk, and how a single thermal stall could destroy multi-million dollar GPU clusters.
How to shift your defense to orchestrated risk reduction by adopting a zero-trust architecture and microsegmentation.
When you are delivering gigawatt facilities in record time to meet these contracts, commissioning moves at a breakneck pace. This speed inherently creates a massive weak point: the lack of time to verify that the build is actually correct and secure. Are your critical cooling devices actually segmented on the right OT VLAN, or were they accidentally left on a flat network where everything talks to everything? Were the default credentials on your BMS controllers, VNC clients, and Modbus gateways actually rolled during commissioning, or are they still using the highly vulnerable factory settings?
Neocloud cooling chains, which capture 100% of a GPU rack's thermal load, and power chains rely on a mix of modern IP-native devices and decades-old serial signaling technology (Modbus RTU, BACnet MS/TP) bridged by gateways. Aspirational network architecture diagrams suggest strict segmentation, but the reality on the ground is often a single, flat OT VLAN where everything talks to everything. If an opportunistic attacker compromises a single poorly secured device, such as a Modbus gateway, a variable frequency drive (VFD), or an environmental sensor, they instantly gain lateral movement across the entire facility. Once in, the attacker's final step in the kill chain is destruction: overwriting registers with garbage data or changing device configurations to cause physical disruptions.
Neoclouds operate with extreme customer concentration, often having fewer than 10 meaningful customers, with the top three accounting for up to 70-80% of revenue. Unlike traditional enterprise IT, AI training is physically constrained; a network or thermal stall can not only disrupt thousands of interconnected GPUs running a simultaneous training job, but permanently destroy them. A thermal event can melt chips in minutes, or a pipe burst could drop thousands of gallons of water onto racks worth millions of dollars.
Neocloud operations teams are incredibly lean, sometimes running a 100 MW facility with eight to 15 engineers. They need an automated way to double-check these fast-paced builds. By leveraging a comprehensive CPS protection platform, lean teams can instantly validate network segmentation and identify non-vulnerability risks, like dangerous misconfigurations and weak passwords, before an opportunistic attacker can exploit them. The ultimate buyer's pain in the neocloud space is customer retention, and downtime caused by an unchecked OT deployment will absolutely cost you your $100M+ contracts.
Because vulnerability management and patching are unviable as a primary defense in the age of Mythos, neocloud operators must shift their strategy to orchestrated risk reduction and compensating controls.
The immediate path forward is to embrace a zero-trust architecture through microsegmentation and secure remote access. By leveraging asset intelligence to identify exactly what every device does, from the BMS head-end controller down to the serial RS-485 flow meters hiding behind gateways, neoclouds can prioritize risk based on operational impact rather than just technical severity scores. Segmenting the flat OT VLAN ensures that even if AI-driven tools uncover a new vulnerability, the reachable attack paths are restricted, containing the blast radius before it can bring down a GPU cluster.
In the highly competitive neocloud market, speed-to-power got you the contract. But in the era of automated exploit discovery, engineered operational resilience is what will keep it.
Interested in learning about Claroty's Cybersecurity Solutions?
Life, uninterrupted
We maximize your availability, strengthen your insurability, and support compliance to ensure operational resilience.