Verda Blog

Multi-Head Latent Attention: Benefits in Memory and Computation
NEW AI research

FLUX on B200 vs H100: Real-Time Image Inference with WaveSpeedAI
NEW AI research

DeepSeek-V3 + SGLang: Inference Optimization

AI research
DeepSeek + SGLang: Multi-Head Latent Attention

AI research
Multi Data Center Training: Prime Intellect

AI research