Establishing a Scalable Sparse Ecosystem with the Universal Sparse Tensor

Sparse tensors are vectors, matrices, and higher-dimensional generalizations with many zeros. They’re crucial in fields such as scientific computing, signal processing, and deep learning because of their efficiency in storage, computation, and power. Despite their advantages, handling sparse tensors manually or through existing libraries is often cumbersome, error-prone, nonportable, and doesn’t scale with the combinatorial explosion of sparsity patterns, data types, operations, and targets.

Research largely focuses on sparse storage formats: data structures that compactly store nonzeros and permit efficient operations that avoid redundancies such as x+0=x and x*0=0. This allows scaling to larger problem sizes or solving the same sizes with fewer resources. No single sparse format is perfect; the best choice depends on the nonzero distribution, the operations, and the target architecture.

The Universal Sparse Tensor (UST) decouples a tensor’s sparsity from its in-memory storage representation. The UST uses a domain-specific language (DSL) to describe how a tensor should be represented in memory, so developers focus only on the sparsity of a tensor. Compile-time or runtime inspection of the chosen format for the operands in sparsity-agnostic polymorphic operations then decides between dispatching to an optimized handwritten library or, when no such solution exists, falling back to automatic sparse code generation. The UST has its roots in sparse compiler technology; see, for instance, Compiler Support for Sparse Matrix Computations, Sparse Tensor Algebra Compilation, and Compiler Support for Sparse Tensor Computations in MLIR.

This post focuses on how developers can use the UST to define common, but also less common, sparse storage formats, each tailored to specific properties of the sparse tensors that occur in their application. Interoperability with, for instance, SciPy, CuPy, and PyTorch sparse tensors maps common formats like COO, CSR, and DIA to the corresponding DSL of the UST. However, developers can also define their own novel sparsity formats via the DSL. Combined with dispatch or code generation, this enables the design of novel sparse formats without explicit coding, as will be shown in future posts.

The tensor format DSL

The tensor format DSL maps tensor dimensions (logical tensor indices) to storage levels (physical memory indices) using an invertible function that defines how each level should be stored. It consists of:

1. An ordered sequence of dimension specifications, such as (i, j), each of which contains a dimension expression to provide a named reference to each dimension.

2. An ordered sequence of level specifications, such as (i : dense, j : compressed), each of which contains a level expression, which defines what is stored in each level, and a required level type, which defines how the level is stored, including:

  • A set of level properties, such as unique and ordered
  • A required level format, such as:
    • dense: The underlying storage only involves indexing, similar to a dense array
    • compressed: At that level, a positions array defines the locations of each stored level and a coordinates array stores the indices within each stored level
    • singleton: A compressed variant where each coordinate belongs directly to the parent at the previous level
    • range: A dense variant where coordinate values are based on a compression at a previous level

The positions and coordinates arrays are stored as jagged (variable-sized) arrays, indexed first by level, namely pos[l], and then by content, where the positions use the convention that two consecutive elements define a half-open interval from pos[l][i] up to pos[l][i+1]. Where unused, jagged arrays are left empty. After all levels, a single values array contains the linearized numerical values of all stored elements. What makes the UST universal is that the DSL above can be easily extended to include even more storage formats.
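To make these conventions concrete, the following is a minimal Python sketch (illustrative only, not the actual UST implementation) of the jagged pos/crd arrays and the flat values array, including the half-open interval convention, for CSR-like storage of a small hypothetical 2×3 matrix:

```python
# Illustrative sketch of UST-style storage; not the real UST data structures.
# CSR-like storage of the hypothetical 2x3 matrix [[5, 0, 7], [0, 0, 2]].

pos = [[], [0, 2, 3]]     # level 0 is dense, so pos[0] is left empty
crd = [[], [0, 2, 2]]     # level 1: column coordinates of the stored entries
values = [5.0, 7.0, 2.0]  # linearized values of all stored elements

def row_entries(i):
    """Yield (column, value) pairs of row i via the half-open interval
    [pos[1][i], pos[1][i+1]) into crd[1] and values."""
    for p in range(pos[1][i], pos[1][i + 1]):
        yield crd[1][p], values[p]

print(list(row_entries(0)))  # [(0, 5.0), (2, 7.0)]
print(list(row_entries(1)))  # [(2, 2.0)]
```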

UST examples

The UST examples in this section show that the number of possibilities for storing dense and sparse tensors grows rapidly with the dimensionality.

Scalars

A scalar is a 0-dimensional (0D) tensor, so there are no dimensions and thus no levels. As such, the values array in the UST storage consists of a single “dense” element that contains the scalar value (Figure 1).

Scalar stored as a single element in the values array.
Figure 1. UST storage of a scalar with value 1

Vectors

A vector is a 1-dimensional (1D) tensor that can be interpreted visually as either a row or a column vector. A sample sparse row vector is shown in Figure 2.

A sample vector with five nonzeros.
Figure 2. A sample sparse row vector

Two common ways of storing a vector, dense or compressed, are shown below:

(i) -> (i : dense)         # dense vector
(i) -> (i : compressed)    # sparse vector

Figure 3 shows the UST storage of the sample vector using these two formats. Dense storage uses zero padding to get filled levels in the values array; the positions and coordinates arrays are unused. Sparse storage uses the positions array to define the half-open interval [0,5), which spans the coordinates and values of all stored elements.

Two side-by-side images: UST storage of a dense vector (left) and sparse vector (right).
Figure 3. UST storage of a dense vector (left) and sparse vector (right)
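Both encodings can be sketched in Python for a hypothetical length-12 vector with five nonzeros (the exact figure data is not reproduced here; the data is chosen so that the element at index 8 is -1, as in the text):

```python
# Hypothetical length-12 vector with five nonzeros (not the exact figure data).
v = [3, 0, 1, 0, 0, 0, 0, 0, -1, 4, 0, 2]

# (i) -> (i : dense): the values array is just the zero-padded vector itself.
dense_values = list(v)

# (i) -> (i : compressed): pos holds the single half-open interval [0, nnz),
# crd holds the index of each stored element, values holds the nonzeros.
crd = [i for i, x in enumerate(v) if x != 0]
pos = [0, len(crd)]
values = [v[i] for i in crd]

print(pos)     # [0, 5]
print(crd)     # [0, 2, 8, 9, 11]
print(values)  # [3, 1, -1, 4, 2]
```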

It’s also possible to map a single dimension to two levels to obtain blocked vector storage. An example that defines block size 4 is shown below, which essentially maps a 1D dimension space, named i, into a 2D level space (i/4,i%4).

(i) -> (i / 4 : compressed, i % 4 : dense)    # blocked vector

Figure 4 shows the UST storage of the sample vector. Zero padding is used to get completely filled blocks. The positions array at the first level denotes that two dense blocks are stored using [0,2). Arrays at the second level are unused. As shown in red, the element with, for example, index 8 and value -1 is mapped to inter-block level index 2 and intra-block level index 0.

Blocked vector storage.
Figure 4. UST storage in the blocked vector format
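The blocked mapping can be sketched as follows, again for a hypothetical length-12 vector whose element at index 8 is -1; with nonzeros in blocks 0 and 2 only, the compressed first level stores two blocks:

```python
v = [3, 0, 1, 0, 0, 0, 0, 0, -1, 4, 0, 2]  # hypothetical data; v[8] == -1
B = 4  # block size

# First level (compressed): store only the blocks containing a nonzero.
stored = sorted({i // B for i, x in enumerate(v) if x != 0})
pos = [0, len(stored)]   # two stored blocks: the half-open interval [0, 2)
crd = stored             # block indices of the stored blocks

# Second level (dense): each stored block is kept whole, zero padding included.
values = [v[b * B + k] for b in stored for k in range(B)]

# Dimension index i maps to level indices (i // B, i % B).
print(8 // B, 8 % B)  # 2 0  (inter-block index 2, intra-block index 0)
print(values)         # [3, 0, 1, 0, -1, 4, 0, 2]
```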

Matrices

Sparse matrix storage (2D) has been studied extensively, and many widely adopted sparse storage formats exist, each with a unique name. The flexibility of the UST makes it possible to define most of them, thereby unifying many variations under a single umbrella.

Using just the level formats dense/compressed combined with index permutations, the tensor format DSL of a d-dimensional tensor already gives rise to 2^d × d! different combinations. For d=2, this evaluates to eight different matrix formats.

(i, j) -> (i : dense, j : dense)              # Dense (row-major)
(i, j) -> (j : dense, i : dense)              # Dense (col-major)
(i, j) -> (i : dense, j : compressed)         # CSR
(i, j) -> (j : dense, i : compressed)         # CSC
(i, j) -> (i : compressed, j : compressed)    # DCSR
(i, j) -> (j : compressed, i : compressed)    # DCSC
(i, j) -> (i : compressed, j : dense)         # CROW (less common)
(i, j) -> (j : compressed, i : dense)         # CCOL (less common)

Dense storage using either row-major or column-major order is straightforward. Figure 5 shows the CSR format of a sample 4×5 matrix with six nonzero elements. Because level 0 is dense, arrays pos[0] and crd[0] are unused. Instead, the i-index is used as an index into level 1 to find the compressed storage of that row.

There, two consecutive positions pos[1][i] and pos[1][i+1] define a half-open interval of locations into the coordinates array used to reconstruct the j-index of each stored value. (Given this concise way of encoding, the size of the pos array must always be one more than the actual dimension size.) This is the last level, so the actual numerical values can be found at the same positions in the values array.

A sample matrix together with CSR representation.
Figure 5. Sample matrix and UST storage in CSR
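The same layout is exposed by SciPy’s CSR arrays, where indptr plays the role of the positions array pos[1] and indices the role of the coordinates array crd[1]. A sketch with a hypothetical 4×5 matrix with six nonzeros (the figure’s exact values are not reproduced here):

```python
import numpy as np
from scipy.sparse import csr_array

# Hypothetical 4x5 matrix with six nonzeros.
A = np.array([[1, 0, 2, 0, 0],
              [0, 0, 0, 3, 0],
              [0, 0, 0, 0, 0],
              [4, 0, 5, 0, 6]])
S = csr_array(A)

print(S.indptr)   # positions: [0 2 3 3 6], one more entry than the 4 rows
print(S.indices)  # column coordinates: [0 2 3 0 2 4]
print(S.data)     # values: [1 2 3 4 5 6]
```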

Looking at other level types, the COO format is expressed using the DSL shown below. Here, the compressed level uses the nonunique property to indicate that the same index can appear multiple times. The second level uses the singleton level format to indicate that no positions array is required, since each index is associated directly with its parent.

(i, j) -> (i : compressed(nonunique), j : singleton)    # COO

Storing the hypersparse matrix in Figure 6 in COO yields the given UST format. Coloring is used to indicate the storage relation of element a22=3. Like the previous formats, the UST COO format corresponds one-to-one with the COO format found in the literature, apart from the somewhat unusual initial positions array that defines the number of stored elements.

A sample matrix together with COO representation.
Figure 6. Sample hypersparse matrix and UST storage in COO
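SciPy’s COO arrays follow this scheme directly: the row array holds the (possibly repeating) coordinates of the compressed level, and the col array is the singleton level, pairing one-to-one with row. A sketch with a hypothetical hypersparse 4×5 matrix where a22=3, as in the text:

```python
import numpy as np
from scipy.sparse import coo_array

# Hypothetical hypersparse 4x5 matrix with a22 = 3.
A = np.zeros((4, 5))
A[0, 1] = 7
A[2, 2] = 3
A[2, 4] = 5
S = coo_array(A)

print(S.row)   # [0 2 2]  row index 2 repeats (nonunique compressed level)
print(S.col)   # [1 2 4]  one column per stored entry (singleton level)
print(S.data)  # [7. 3. 5.]
```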

The range level format can be combined with level expressions involving addition/subtraction to define diagonal and antidiagonal storage of elements. Here, there is a choice to use either the row index or the column index into the actually stored dense diagonal, giving rise to four variants.

(i, j) -> (j-i : compressed, i : range)    # DIA-I
(i, j) -> (j-i : compressed, j : range)    # DIA-J
(i, j) -> (i+j : compressed, i : range)    # ANTI-DIA-I
(i, j) -> (i+j : compressed, j : range)    # ANTI-DIA-J

The difference between using either a row or a column index only becomes apparent for nonsquare matrices. For the sample 5×7 matrix in Figure 7, Figures 8 and 9 show the DIA-I and DIA-J formats, respectively. Coloring is used to relate stored diagonals back to the matrix diagonals. The format uses zero padding inside and outside the range to obtain completely filled levels (where, unlike inside padding, outside padding doesn’t correspond to elements inside the original matrix index space).

A sample matrix with nonzeros clustered in three diagonals.
Figure 7. Sample matrix with three diagonals almost filled

Figure 8 shows the DIA-I format.

Diagonal storage with index I.
Figure 8. Sample matrix and UST storage in DIA-I

Figure 9 shows the DIA-J format for the same matrix. Choosing the index of the larger dimension results in more zero padding, so the DIA-I format would be preferred in this case.

Diagonal storage with index J.
Figure 9. Sample matrix and UST storage in DIA-J
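SciPy’s DIA format corresponds to the DIA-J variant: offsets holds the coordinates of the compressed j-i level, and each row of data is one diagonal indexed by the column index j, with zero padding where the diagonal falls outside the matrix. A sketch with a hypothetical 3×4 matrix:

```python
import numpy as np
from scipy.sparse import dia_array

# Hypothetical 3x4 matrix with two diagonals, at offsets j - i = 0 and 1.
A = np.array([[1, 5, 0, 0],
              [0, 2, 6, 0],
              [0, 0, 3, 7]])
S = dia_array(A)

print(S.offsets)  # [0 1]: coordinates of the compressed (j - i) level
print(S.data)     # each row is one diagonal, indexed by column j:
                  # [[1 2 3 0]
                  #  [0 5 6 7]]  (zeros are padding outside the range)
```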

Many more storage variations can be obtained with blocking, where 2D indices map into either 3D levels (strips) or 4D levels (blocks). For the latter, even though the literature typically only mentions BSR and BSC to define row-major or column-major order between the blocks, the tensor format DSL of the UST more precisely also distinguishes between row-major and column-major format within each stored block, giving rise to the following four variants for 2×3 blocks.

(i, j) -> (i / 2 : compressed, j / 3 : dense,
           i % 2 : dense,      j % 3 : dense)     # BSR-Row(2,3)
(i, j) -> (i / 2 : compressed, j / 3 : dense,
           j % 3 : dense,      i % 2 : dense)     # BSR-Col(2,3)
(i, j) -> (j / 3 : compressed, i / 2 : dense,
           i % 2 : dense,      j % 3 : dense)     # BSC-Row(2,3)
(i, j) -> (j / 3 : compressed, i / 2 : dense,
           j % 3 : dense,      i % 2 : dense)     # BSC-Col(2,3)

The 4×9 matrix in Figure 10 is used to illustrate block storage. Block B11 is colored red to highlight where it actually resides. For instance, entry a25=16 can be found by mapping the dimension indices (2,5) to the level indices (1, 1, 0, 2): block B11 indexed by (0,2). Because compressed row-major storage of blocks makes this the fourth stored block (after B00, B02, and B10), while dense row-major storage within the block makes this the third entry there, this eventually translates to the element at index 20 of the values array.

A sample matrix together with BSR representation.
Figure 10. Sample block matrix and UST storage in BSR-Row(2,3)
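The index arithmetic in this example can be checked with a small sketch (the block ranks follow the stored order B00, B02, B10, B11 given in the text):

```python
# Map dimension indices (i, j) to BSR-Row(2,3) level indices.
def bsr_row_levels(i, j, bm=2, bn=3):
    return (i // bm, j // bn, i % bm, j % bn)

print(bsr_row_levels(2, 5))  # (1, 1, 0, 2): block B11, intra-block (0, 2)

# B11 is the fourth stored block (rank 3, after B00, B02, B10), and
# row-major order inside a 2x3 block puts (0, 2) at flat offset 0*3 + 2 = 2.
rank, offset = 3, 0 * 3 + 2
print(rank * (2 * 3) + offset)  # 20: index into the values array
```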

These examples should provide an impression of the expressiveness of the tensor format DSL in describing sparse matrix storage formats. The next section explores higher dimensions.

Tensors

As previously discussed, just using dense/compressed level types with index permutations in the UST tensor format DSL gives rise to 2^d × d! different ways of storing tensors. For d=3, d=4, d=5, and d=6, this yields 48, 384, 3,840, and 46,080 combinations, respectively, far too many to list here. Adding the other level types increases this number even further.

To provide a few examples of storing 3D tensors: the dimension permutations of the UST allow for six different direct ways of storing 3D dense tensors, varying from the classic row-major to column-major order and everything in between. Storing a 3D tensor with all levels compressed gives rise to the CSF (compressed sparse fiber) format.

(i, j, k) -> (i : compressed, j : compressed, k : compressed)  # CSF

Consider the 3D 4×3×5 tensor in Figure 11, which is visualized on paper by showing the underlying 2D 3×5 matrices in a row for i=0, i=1, i=2 (empty), and i=3.

Visualization of 3D tensor.
Figure 11. Sample 3D tensor, visualized as matrices

Representing this tensor using the CSF format as a UST yields the storage shown in Figure 12. The coloring indicates how the stored element a320=8 is found. This format is suitable for very sparse tensors where the nonzeros are uniformly distributed over all dimensions.

CSF storage with seven rows
Figure 12. UST storage of the sample tensor as CSF
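A CSF encoding can be sketched in pure Python from sorted (i, j, k, value) tuples. The entries below are hypothetical (only a320=8 is taken from the text); note how the empty slice i=2 simply never appears at the compressed top level:

```python
# Build CSF storage (all three levels compressed) from (i, j, k, value) tuples.
# Hypothetical entries for a 4x3x5 tensor; a320 = 8 as in the text, slice i=2 empty.
entries = [(0, 0, 1, 5), (0, 2, 4, 2), (1, 1, 3, 7), (3, 2, 0, 8)]

def to_csf(entries):
    pos = [[0], [0], [0]]
    crd = [[], [], []]
    values = []
    last_i, last_ij = None, None
    for i, j, k, v in sorted(entries):
        if i != last_i:            # new i: open a fresh fiber at level 1
            crd[0].append(i)
            pos[1].append(pos[1][-1])
            last_i, last_ij = i, None
        if (i, j) != last_ij:      # new (i, j): open a fresh fiber at level 2
            crd[1].append(j)
            pos[1][-1] += 1
            pos[2].append(pos[2][-1])
            last_ij = (i, j)
        crd[2].append(k)           # innermost coordinates and values
        pos[2][-1] += 1
        values.append(v)
    pos[0].append(len(crd[0]))
    return pos, crd, values

pos, crd, values = to_csf(entries)
print(pos)     # [[0, 3], [0, 2, 3, 4], [0, 1, 2, 3, 4]]
print(crd)     # [[0, 1, 3], [0, 2, 1, 2], [1, 4, 3, 0]]
print(values)  # [5, 2, 7, 8]
```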

A 3D tensor can be regarded as a batched set of matrices, a popular concept in deep learning. Placing the batch dimension at different levels can have interesting consequences for the uniformity of sparsity within the batch. Consider the 3D 4×5×5 tensor of Figure 13, visualized as a batch of 5×5 matrices.

Sample 3D tensor
Figure 13. Sample tensor with elements clustered around diagonals

Given the diagonal nature of the matrices, it makes sense to store this tensor in a batched DIA format. The most natural way is keeping the batch dimension variable i as the outermost dense level, followed by diagonal storage of the matrices.

(i, j, k) -> (i : dense, k-j : compressed, j : range) # batched DIA-I

This nonuniform batched DIA-I storage is shown in Figure 14, where each batch can have its own diagonal structure. Color relates one particular diagonal back to its stored location. Zero padding occurs, both inside and outside the stored diagonals, in order to get completely filled dense levels. The coordinates array denotes where the diagonals reside (for instance, at -1, 0, +1 for the first matrix, and -1 and 0 for the last matrix).

Nonuniform batched diagonal storage.
Figure 14. UST storage as nonuniform batched DIA-I

It’s possible to make a subtle change in the tensor format DSL by permuting the batch variable below the diagonal computation:

(i, j, k) -> (k-j : compressed, i : dense, j : range)  # uniform
                                                       # batched DIA-I

Now the metadata for the diagonals is stored just once, but at the expense of forcing a uniform diagonal nonzero pattern on all matrices in the batch, which may require additional zero padding, as can be seen in Figure 15.

Uniform batched diagonal storage.
Figure 15. UST storage as uniform batched DIA-I
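The trade-off can be quantified with a small sketch: given hypothetical per-matrix diagonal offsets, uniform storage must pad every matrix out to the union of all offsets in the batch:

```python
# Hypothetical diagonal offsets (k - j) per matrix in a batch of four.
per_matrix = [[-1, 0, 1], [0, 1], [-1, 0, 1], [-1, 0]]

# Nonuniform batched DIA: each matrix stores only its own diagonals.
nonuniform = sum(len(offs) for offs in per_matrix)

# Uniform batched DIA: one shared offset list (the union) applies to every matrix.
shared = sorted({o for offs in per_matrix for o in offs})
uniform = len(shared) * len(per_matrix)

print(nonuniform)  # 10 stored diagonals
print(shared)      # [-1, 0, 1]
print(uniform)     # 12 stored diagonals (the extra 2 are pure zero padding)
```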

Learn more

This post has illustrated the breadth of tensor storage formats encompassed by the UST. Stay tuned for the integration of the UST with polymorphic operations that dispatch to optimized libraries or automatically generated code. This approach establishes a clean, easy-to-use, and scalable sparse ecosystem, facilitating the introduction of novel storage schemes without explicit coding.

Want to learn more? Sign up for the NVIDIA GTC 2026 session, Accelerating GPU Scientific Computing with nvmath-python, for a demo of the UST and exciting product updates.


