Service	Configurations	On-demand price (1x GPU)	Spot price (1x GPU)
GPU Instances	1x, 2x, 4x, 8x with NVLink	$6.11/h	$2.14/h
Instant clusters	16x–128x with InfiniBand	$6.11/h	—
Serverless, Continuous	1x, 2x, 4x, 8x with auto-scaling	$6.72/h	$2.35/h
Serverless, Jobs	1x, 2x, 4x, 8x with auto-scaling	$6.72/h	$2.35/h

Specifications

NVIDIA® HGX™ B200

Each 1x instance contains:

180 GB: GPU VRAM

170 GB: GPU RAM

30: CPU threads

1.8 TB/s: NVLink bandwidth

The NVIDIA HGX™ B200 propels the data center into a new era of accelerating computing and generative AI, integrating NVIDIA Blackwell Tensor Core GPUs with a high-speed interconnect to accelerate AI performance at scale. Configurations of eight GPUs deliver unparalleled generative AI acceleration alongside a remarkable 1.4 terabytes (TB) of GPU memory and 64 terabytes per second (TB/s) of memory bandwidth for 15X faster real-time trillion-parameter-model inference, 12X lower cost, and 12X less energy. This extraordinary combination positions HGX B200 as a premier accelerated x86 scale-up platform designed for the most demanding generative AI, data analytics, and high-performance computing (HPC) workloads. HGX B200 supports advanced networking options—at speeds up to 400 gigabits per second (Gb/s)—delivering the highest AI performance with NVIDIA Quantum-2 InfiniBand and the Spectrum™-X Ethernet platform. HGX B200 with NVIDIA® BlueField®-3 data processing units (DPUs) enable cloud networking, composable storage, zero-trust security, and GPU compute elasticity in hyperscale AI clouds.

Verda at a glance

Full-stack AI cloud with high-performance hardware, transparent pricing, and developer autonomy

Full-stack AI

One platform for the full AI lifecycle — from rapid prototyping to foundation training and scalable inference.

Vertically integrated

Predictable cost, performance, and reliability — from end-to-end ownership of the stack, data centers to managed services.

On-demand access

Instant, self-service access to the latest compute on the market at any scale — no negotiations, no contracts.

Reliable

Historical uptime of over 99.9% with sensible SLAs and fair compensation for service disruptions.

World-class support

Proactive support from our team of ML craftsmen and infrastructure engineers.

Secure & compliant

SOC 2 Type II and ISO 27001/27017/27018/27701 certified. GDPR compliant. Powered by 100% renewable energy.

“Verda is the perfect mix of being nimble and having production-grade reliability for low-latency service like ours. Our startup times and compute costs both dropped significantly.”

Lars Vågnes

Founder & CEO

“Having direct contact between our engineering teams enables us to move incredibly fast. Being able to deploy any model at scale is exactly what we need in this fast moving industry.”

Iván de Prado

Head of AI

“Our entire language model journey is powered by Verda's clusters, from deployment to training. We can focus on achieving exceptional results without worrying about hardware issues.”

José Pombal

AI Research Scientist

Built in Europe, trusted globally

B200 across the full AI lifecycle

From rapid prototyping to foundation training and scalable inference — on a single full-stack AI cloud

Start building Talk to an expert