Full-stack AI
One platform for the full AI lifecycle — from rapid prototyping to foundation training and scalable inference.
AI Cloud
From rapid prototyping to foundation training to scalable inference
Compute
GB300 NVL72
New · 1x tray to 2+ racks · NVLink v5
GPU instances, clusters, and serverless services
| Service | Configurations | On-demand price (1x GPU) | Spot price (1x GPU) |
|---|---|---|---|
| GPU Instances | 1x, 2x, 4x, 8x with NVLink | $6.11/h | $2.14/h |
| Instant clusters | 16x–128x with InfiniBand | $6.11/h | — |
| Serverless, Continuous | 1x, 2x, 4x, 8x with auto-scaling | $6.72/h | $2.35/h |
| Serverless, Jobs | 1x, 2x, 4x, 8x with auto-scaling | $6.72/h | $2.35/h |
Each 1x instance contains:
The NVIDIA HGX™ B200 propels the data center into a new era of accelerated computing and generative AI, integrating NVIDIA Blackwell Tensor Core GPUs with a high-speed interconnect to accelerate AI performance at scale. Configurations of eight GPUs deliver unparalleled generative AI acceleration alongside a remarkable 1.4 terabytes (TB) of GPU memory and 64 terabytes per second (TB/s) of memory bandwidth for 15X faster real-time trillion-parameter-model inference, 12X lower cost, and 12X less energy. This extraordinary combination positions HGX B200 as a premier accelerated x86 scale-up platform designed for the most demanding generative AI, data analytics, and high-performance computing (HPC) workloads. HGX B200 supports advanced networking options at speeds up to 400 gigabits per second (Gb/s), delivering the highest AI performance with NVIDIA Quantum-2 InfiniBand and the Spectrum™-X Ethernet platform. HGX B200 with NVIDIA® BlueField®-3 data processing units (DPUs) enables cloud networking, composable storage, zero-trust security, and GPU compute elasticity in hyperscale AI clouds.
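As a rough illustration of how the per-GPU prices in the pricing table above translate into a bill, here is a minimal Python sketch. It assumes that cost scales linearly with GPU count and hours, which the table's "1x GPU" column suggests but does not state. The function name `estimate_cost` and the dictionary layout are hypothetical and not part of any Verda API or SDK.

```python
# Hourly prices per GPU in USD, copied from the pricing table above.
# A spot price of None means the table lists no spot price for that service.
PRICES_PER_GPU_HOUR = {
    "GPU Instances": {"on_demand": 6.11, "spot": 2.14},
    "Instant clusters": {"on_demand": 6.11, "spot": None},
    "Serverless, Continuous": {"on_demand": 6.72, "spot": 2.35},
    "Serverless, Jobs": {"on_demand": 6.72, "spot": 2.35},
}


def estimate_cost(service, gpus, hours, tier="on_demand"):
    """Estimate cost in USD.

    ASSUMPTION: price scales linearly with GPU count and hours; actual
    billing granularity, discounts, and storage or network charges are
    not modeled here.
    """
    price = PRICES_PER_GPU_HOUR[service][tier]
    if price is None:
        raise ValueError(f"No {tier} price listed for {service}")
    return round(price * gpus * hours, 2)


if __name__ == "__main__":
    # Compare on-demand and spot pricing for an 8x instance running for one day.
    on_demand = estimate_cost("GPU Instances", gpus=8, hours=24)
    spot = estimate_cost("GPU Instances", gpus=8, hours=24, tier="spot")
    print(f"8x GPUs for 24 h: on-demand ${on_demand:.2f}, spot ${spot:.2f}")
```

Requesting a spot estimate for Instant clusters raises a `ValueError`, mirroring the empty spot cell in the table.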
Full-stack AI cloud with high-performance hardware, transparent pricing, and developer autonomy
Predictable cost, performance, and reliability through end-to-end ownership of the stack, from data centers to managed services.
Instant, self-service access to the latest compute on the market at any scale — no negotiations, no contracts.
Historical uptime of over 99.9% with sensible SLAs and fair compensation for service disruptions.
Proactive support from our team of ML craftsmen and infrastructure engineers.
SOC 2 Type II and ISO 27001/27017/27018/27701 certified. GDPR compliant. Powered by 100% renewable energy.
“Verda is the perfect mix of being nimble and having production-grade reliability for low-latency service like ours. Our startup times and compute costs both dropped significantly.”
“Having direct contact between our engineering teams enables us to move incredibly fast. Being able to deploy any model at scale is exactly what we need in this fast moving industry.”
“Our entire language model journey is powered by Verda's clusters, from deployment to training. We can focus on achieving exceptional results without worrying about hardware issues.”
Built in Europe, trusted globally
From rapid prototyping to foundation training and scalable inference — on a single full-stack AI cloud