QumulusAI Signs More Than $124 Million in AI Inference Infrastructure Agreements

Staff Report From Georgia CEO

Monday, June 15th, 2026

QumulusAI, a vertically integrated AI cloud infrastructure company, today announced it has secured more than $124 million in three-year customer subscriptions with Hyperbolic and another leading AI inference platform. These agreements support deployments totaling 1,280 NVIDIA Blackwell GPUs and include nearly $21.9 million in combined upfront customer commitments.

The customers operate some of the industry's largest inference platforms for open-source AI models, powering deep-research agents, automated coding systems, and other asynchronous AI applications that require high-throughput, low-latency, and cost-efficient compute infrastructure. The agreements establish long-term recurring revenue for QumulusAI and validate growing demand for infrastructure purpose-built for the AI inference workloads behind these applications.

Under the GPU-as-a-service agreements, QumulusAI will provision 160 Lenovo and Supermicro bare-metal servers equipped with NVIDIA B300 and B200 Blackwell GPUs, respectively. Cisco Nexus One will power the cluster fabric for both deployments, delivering secure, high-performance AI networking. Rather than deploying off-the-shelf reference builds, QumulusAI has designed the deployment to reduce AI inference costs by approximately 20% compared to standard configurations by rightsizing CPU core counts, system memory, and local storage to the exact demands of large-scale open-source inference.

"AI infrastructure can no longer be built using one-size-fits-all designs," said Mike Maniscalco, CEO of QumulusAI. "Inference workloads have very different performance and economic requirements than model training environments. By tuning infrastructure to the workload itself, we can improve utilization, reduce costs, and accelerate deployment timelines for customers operating at production scale."

"As AI adoption expands, organizations need access to infrastructure that can be deployed quickly, scaled efficiently, and aligned to the economics of production AI," added Maniscalco. "These agreements demonstrate the value of a more flexible approach to AI infrastructure."

One of the agreements is with Hyperbolic, an AI cloud platform focused on providing scalable GPU compute infrastructure for AI startups, research teams, and enterprises. Hyperbolic gives AI builders flexible access to reliable, cost-efficient compute for training, fine-tuning, and inference workloads, helping teams move faster from experimentation to production.

"AI teams need infrastructure that supports every stage of the AI lifecycle, from training and fine-tuning to production inference," said Jasper Zhang, CEO of Hyperbolic. "QumulusAI's workload-optimized infrastructure gives us the performance, efficiency, and scalability we need as we continue expanding reliable GPU compute for customers building AI at scale."