NVIDIA is partnering with the University of Wisconsin-Madison to bring GPU-accelerated analytics to DuckDB through the open-source Sirius engine.
DuckDB has seen rapid adoption among organizations such as DeepSeek, Microsoft, and Databricks thanks to its simplicity, speed, and flexibility. Because analytics workloads are highly amenable to massive parallelism, GPUs have emerged as the natural next step, offering higher performance, greater throughput, and lower total cost of ownership (TCO) compared with CPU-based databases. However, this growing demand for GPU acceleration is hindered by the challenge of building a database system from the ground up.
This challenge is addressed by the jointly developed Sirius, a composable GPU-native execution backend for DuckDB that reuses DuckDB's advanced subsystems while accelerating query execution on GPUs. Sirius delivers this GPU acceleration through NVIDIA CUDA-X libraries.
This blog post outlines the Sirius architecture and demonstrates how it achieved record-breaking performance on ClickBench, a widely used analytics benchmark.
Sirius: A GPU-native SQL engine


Sirius is a GPU-native SQL engine that provides drop-in acceleration for DuckDB and, in the future, other data systems.
The team recently published an article detailing the Sirius architecture and demonstrating state-of-the-art performance on TPC-H at SF100.
Implemented as a DuckDB extension, Sirius requires no modifications to DuckDB’s codebase and only minimal changes to the user-facing interface. At the execution boundary, Sirius consumes query plans in the universal Substrait format, ensuring compatibility with other data systems. To minimize engineering effort and maximize reliability, Sirius is built on well-established NVIDIA libraries:
- NVIDIA cuDF: High-performance, columnar-oriented relational operators (e.g., joins, aggregations, projections) natively designed for GPUs.
- NVIDIA RAPIDS Memory Manager (RMM): An efficient GPU memory allocator, reducing fragmentation and allocation overheads.
Sirius builds its GPU-native execution engine and buffer management on top of these high-performance libraries, while reusing DuckDB’s advanced subsystems, including its query parser, optimizer, and scan operators, where appropriate. This combination of mature ecosystems gave Sirius a head start, enabling it to break the ClickBench record with minimal engineering effort.
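As a minimal illustration of these two building blocks, the following Python sketch (using the cudf and rmm packages directly, not Sirius itself; the data and pool size are illustrative) sets up an RMM pool allocator and runs a join plus aggregation through cuDF, the same kind of relational work Sirius offloads to the GPU:

```python
import rmm
import cudf

# Route GPU allocations through an RMM pool to reduce allocation
# overhead and fragmentation (1 GiB initial pool, grows on demand).
rmm.reinitialize(pool_allocator=True, initial_pool_size=2**30)

orders = cudf.DataFrame({"cust_id": [1, 2, 1, 3], "amount": [10.0, 25.5, 7.25, 3.0]})
customers = cudf.DataFrame({"cust_id": [1, 2, 3], "region": ["US", "EU", "US"]})

# Hash join and group-by aggregation execute entirely on the GPU.
joined = orders.merge(customers, on="cust_id")
per_region = joined.groupby("region")["amount"].sum().reset_index()
print(per_region)
```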


As illustrated in Figure 2, the process begins when Sirius receives an already-optimized query plan in DuckDB’s internal format, ensuring that DuckDB’s robust logical and physical optimizations are preserved. For table scans, Sirius invokes DuckDB’s scan functionality, which provides features such as min-max filtering, zone skipping, and on-the-fly decompression; these operations efficiently load the relevant data into host memory.
Next, the result of the table scan is converted from DuckDB’s native format into the Sirius data format (closely aligned with Apache Arrow), which is then transferred to GPU memory. In benchmarks like ClickBench, Sirius can cache frequently accessed tables on the GPU, accelerating repeated query execution.
The Sirius format can be mapped directly to a cudf::table for zero-copy interoperability, enabling all remaining SQL operators (aggregations, projections, and joins) to execute at GPU speed through cuDF primitives. Once computation completes, results are transferred back to the CPU, converted to DuckDB’s expected output format, and returned to the user, offering both raw speed and a seamless, familiar analytics experience.
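A similar CPU-to-GPU round trip can be approximated today with off-the-shelf pieces. The sketch below is an analogy to the flow in Figure 2, not Sirius code; the file name and TPC-H-style column names are illustrative. It scans a table with DuckDB, hands the Arrow-formatted result to cuDF for GPU execution, and brings the result back to the host:

```python
import duckdb
import cudf

# 1. CPU side: DuckDB parses, optimizes, and scans the data.
arrow_tbl = duckdb.sql("SELECT * FROM 'lineitem.parquet'").arrow()

# 2. Transfer: Arrow's columnar layout maps cleanly onto GPU memory.
gdf = cudf.DataFrame.from_arrow(arrow_tbl)

# 3. GPU side: filter, project, and aggregate with cuDF primitives.
result = (
    gdf[gdf["l_quantity"] > 24]
    .groupby("l_returnflag")["l_extendedprice"]
    .sum()
    .reset_index()
)

# 4. Back to the host as Arrow, ready to return to the caller.
print(result.to_arrow())
```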
Hitting #1 on ClickBench
Sirius running on an NVIDIA GH200 Grace Hopper Superchip instance from Lambda Labs ($1.5/hour) was evaluated against the top five systems on ClickBench. The comparison systems ran on CPU-only instances: AWS c6a.metal ($7.3/hour), AWS c8g.metal-48xl ($7.6/hour), and AWS c7a.metal-48xl ($9.8/hour). Hot-run execution time and relative runtime are reported, following the ClickBench methodology, where lower values indicate better performance and 1.0 represents the best possible score. Figure 3 shows the geometric mean of the relative runtime across all benchmark queries. In the ClickBench runs, Sirius achieved the lowest relative runtime on cheaper hardware, resulting in at least 7.2x better cost-efficiency under this setup. Note that these benchmark results were obtained at the time of evaluation and are subject to change in the future.
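For reference, the headline number is the geometric mean of per-query runtimes relative to the fastest system, which can be computed as in this sketch (the small additive constant that damps the effect of near-zero queries is our reading of the ClickBench methodology, assumed to be 10 ms here):

```python
from math import prod

def clickbench_score(times_s, best_times_s, offset_s=0.010):
    """Geometric mean of per-query relative runtimes.

    times_s:      this system's hot-run time per query (seconds)
    best_times_s: the fastest system's time per query (seconds)
    offset_s:     small additive constant so near-zero queries
                  don't dominate the ratios (assumed 10 ms).
    """
    ratios = [(t + offset_s) / (b + offset_s) for t, b in zip(times_s, best_times_s)]
    return prod(ratios) ** (1.0 / len(ratios))

# A system matching the best time on every query scores exactly 1.0.
print(clickbench_score([0.02, 1.5, 0.3], [0.02, 1.5, 0.3]))  # -> 1.0
```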


Figure 4 shows the hot-run query performance of Sirius and the top two systems on ClickBench: Umbra and DuckDB. Sirius achieved the lowest relative runtime on most queries, driven by efficient GPU computation through cuDF. For example, in q4, q5, and q18, Sirius shows substantial performance gains on commonly used operators such as filtering, projection, and aggregation.
A few queries, however, reveal opportunities for further improvement. For instance, q23 is bottlenecked by the "contains" operation on string columns, q24 and q26 by top-N operators, and q27 by aggregation over very large inputs. Future versions of Sirius will include continual improvements to these operators.
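For context, the two patterns called out above map to standard cuDF primitives, sketched here on toy data rather than the actual benchmark tables:

```python
import cudf

hits = cudf.DataFrame({
    "URL": ["https://google.com/a", "https://example.org/b", "https://google.com/c"],
    "Views": [120, 45, 300],
})

# q23-style predicate: substring "contains" on a string column.
google_hits = hits[hits["URL"].str.contains("google", regex=False)]

# q24/q26-style top-N: order by a measure and keep only the first N rows.
top2 = hits.nlargest(2, "Views")
```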


Figure 5 takes a closer look at one of the most complex ClickBench queries, the regular expression query (q28). When implemented naively, regular expression matching on GPUs can produce massive kernels with high register pressure and complex control flow, resulting in severe performance degradation.
To address this, Sirius leverages cuDF’s JIT-compiled string transformation framework for user-defined functions. Figure 5 compares the performance of the JIT approach to cuDF’s precompiled API (cudf::strings::replace_with_backrefs), showing a 13x speedup.
The JIT-compiled kernel achieves 85% warp occupancy, compared to only 32% for the precompiled version, demonstrating better GPU utilization. By decomposing the regular expression into standard string operations such as character comparisons and substring operations, the cuDF JIT framework can fuse these operations into a single kernel, improving data locality and reducing register pressure.
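The decomposition idea can be illustrated with cuDF’s Python API (a simplified analogy; Sirius’s JIT path operates at the C++/kernel level). Both routes below extract the host name from a URL: one through a full regular expression, and one through primitive split/get operations of the kind a fusing JIT can compile into a single kernel:

```python
import cudf

urls = cudf.Series([
    "https://www.example.com/page?id=1",
    "http://blog.example.org/post/42",
])

# Regex route: one large pattern with a capture group.
domains_regex = urls.str.extract(r"^https?://([^/]+)/")[0]

# Decomposed route: the same result from simple string primitives
# (split on '/' and take the host component at index 2).
domains_simple = urls.str.split("/").list.get(2)
```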


What’s next for Sirius
Looking ahead, NVIDIA and the University of Wisconsin-Madison are collaborating on foundational, shareable building blocks for GPU data processing, guided by the modular, interoperable, composable, extensible (MICE) principles described in the Composable Codex. Our priority areas are:
- Advanced GPU memory management: Developing robust strategies to manage GPU memory efficiently, including seamless spilling of data beyond physical GPU limits to maintain performance and scale (see the sketch after this list).
- GPU file readers and intelligent I/O prefetching: Plugging in GPU-native file readers with smart prefetching to speed up data loading, minimize stalls, and reduce I/O bottlenecks.
- Pipeline-oriented execution model: Evolving Sirius’s core to a fully composable pipeline architecture that streamlines data flows across GPUs, host, and disk, efficiently overlapping computation and communication while enabling plug-and-play interoperability with open standards.
- Scalable multi-node, multi-GPU architecture: Expanding Sirius’s capability to scale out efficiently across multiple nodes and GPUs, unlocking petabyte-scale data processing.
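On the memory-management front, today’s RAPIDS stack already hints at what spilling beyond physical GPU limits can look like. A minimal sketch, assuming a recent cuDF/RMM release where these options are available:

```python
import rmm
import cudf

# Unified (managed) memory lets allocations exceed physical GPU capacity;
# the driver migrates pages between device and host on demand.
rmm.reinitialize(managed_memory=True)

# cuDF's built-in spilling moves cold device buffers to host memory
# under pressure, trading bandwidth for the ability to keep scaling.
cudf.set_option("spill", True)
```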
By investing in these MICE-compliant components, Sirius aims to make GPU analytics engines easier to build, integrate, and extend, not only for Sirius itself but for the entire open-source analytics ecosystem.
Join Sirius
Sirius is open source under the permissive Apache 2.0 license. Led by NVIDIA and the University of Wisconsin-Madison, the project welcomes contributions from researchers and practitioners who share the mission of driving the GPU era in data analytics.
We invite you to:
