GPUs

NVIDIA® HGX™ B200

GPU instances & clusters and serverless services

  • On-demand, self-service access starting at $2.14/h

Pricing

Service Configurations On-demand price (1x GPU) Spot price (1x GPU)
GPU Instances 1x, 2x, 4x, 8x with NVLink $6.11/h $2.14/h
Instant clusters 16x–128x with InfiniBand $6.11/h
Serverless, Continuous 1x, 2x, 4x, 8x with auto-scaling $6.72/h $2.35/h
Serverless, Jobs 1x, 2x, 4x, 8x with auto-scaling $6.72/h $2.35/h
NVIDIA HGX™ B200
Specifications

NVIDIA® HGX™ B200

Each 1x instance contains:

180 GB
GPU VRAM
170 GB
GPU RAM
30
CPU threads
1.8 TB/s
NVLink bandwidth

The NVIDIA HGX™ B200 propels the data center into a new era of accelerating computing and generative AI, integrating NVIDIA Blackwell Tensor Core GPUs with a high-speed interconnect to accelerate AI performance at scale. Configurations of eight GPUs deliver unparalleled generative AI acceleration alongside a remarkable 1.4 terabytes (TB) of GPU memory and 64 terabytes per second (TB/s) of memory bandwidth for 15X faster real-time trillion-parameter-model inference, 12X lower cost, and 12X less energy. This extraordinary combination positions HGX B200 as a premier accelerated x86 scale-up platform designed for the most demanding generative AI, data analytics, and high-performance computing (HPC) workloads. HGX B200 supports advanced networking options—at speeds up to 400 gigabits per second (Gb/s)—delivering the highest AI performance with NVIDIA Quantum-2 InfiniBand and the Spectrum™-X Ethernet platform. HGX B200 with NVIDIA® BlueField®-3 data processing units (DPUs) enable cloud networking, composable storage, zero-trust security, and GPU compute elasticity in hyperscale AI clouds.

Verda at a glance

Full-stack AI cloud with high-performance hardware, transparent pricing, and developer autonomy

Full-stack AI

One platform for the full AI lifecycle — from rapid prototyping to foundation training and scalable inference.

Vertically integrated

Predictable cost, performance, and reliability — from end-to-end ownership of the stack, data centers to managed services.

On-demand access

Instant, self-service access to the latest compute on the market at any scale — no negotiations, no contracts.

Reliable

Historical uptime of over 99.9% with sensible SLAs and fair compensation for service disruptions.

World-class support

Proactive support from our team of ML craftsmen and infrastructure engineers.

Secure & compliant

SOC 2 Type II and ISO 27001/27017/27018/27701 certified. GDPR compliant. Powered by 100% renewable energy.

Built in Europe, trusted globally

B200 across the full AI lifecycle

From rapid prototyping to foundation training and scalable inference — on a single full-stack AI cloud