Accelerating AI-Powered Chemistry and Materials Science Simulations with NVIDIA ALCHEMI Toolkit-Ops

Machine learning interatomic potentials (MLIPs) are transforming the landscape of computational chemistry and materials science. MLIPs enable atomistic simulations that combine the fidelity of computationally expensive quantum chemistry with the scaling power of AI.

Yet developers working at this intersection face a persistent challenge: the lack of a robust, Pythonic toolbox for GPU-accelerated atomistic simulation. For use cases such as running many simultaneous GPU-accelerated simulations, robust and well-supported tools are either missing from the current software ecosystem or fragmented across several open source projects.

Over the past few years, the available software for running atomistic simulations with MLIPs has been CPU-centric. Core operations such as neighbor identification, dispersion corrections, long-range interactions, and their associated gradient calculations have traditionally supported only CPU computation, which often struggles to deliver the speed that contemporary research demands. High-throughput simulations of small- to medium-sized atomic systems quickly become bottlenecked by inefficient GPU usage in hybrid workflows, where the model is GPU-accelerated in PyTorch but the simulation tooling is serial and CPU-based.

While developers have attempted to implement these operations directly in PyTorch over the years, the general-purpose design of PyTorch leaves performance on the table for the specialized spatial and force calculation operations required in atomistic simulation. This fundamental mismatch between PyTorch capabilities and the demands of atomistic modeling raises a key question: What is needed to bridge this gap?

NVIDIA ALCHEMI (AI Lab for Chemistry and Materials Innovation), announced at Supercomputing 2024, provides chemistry and materials science developers and researchers with domain-specialized toolkits and NVIDIA NIM microservices optimized for NVIDIA accelerated computing platforms. It is a collection of high-performance, batched, GPU-accelerated tools built specifically to enable atomistic simulations in chemistry and materials science research at the machine learning framework level.

NVIDIA ALCHEMI delivers capabilities across three integrated layers:

ALCHEMI Toolkit-Ops: A collection of GPU-accelerated, batched common operations for AI-enabled atomistic simulation tasks, such as neighbor list construction, DFT-D3 dispersion corrections, and long-range electrostatics.
ALCHEMI Toolkit: A collection of GPU-accelerated simulation building blocks, including geometry optimizers, integrators, and data structures, that enable large-scale, batched simulations leveraging AI.
ALCHEMI NIM microservices: A scalable layer of cloud‑ready, domain‑specific microservices for chemistry and materials science, enabling deployment and orchestration on NVIDIA‑accelerated platforms.

This post introduces NVIDIA ALCHEMI Toolkit-Ops, the accelerated batched common operations layer of ALCHEMI. ALCHEMI Toolkit-Ops uses NVIDIA Warp to accelerate and batch common operations in AI-driven atomistic modeling. These operations are exposed through a modular, PyTorch-accessible API (with a JAX API targeted for a future release) that enables rapid iteration and integration with existing and future atomistic simulation packages.

Figure 1 shows the accelerated batched common operations for atomistic simulations included in this initial release of ALCHEMI Toolkit-Ops. This beta release includes two versions of neighbor lists (naive and cell), DFT-D3 dispersion correction, and long-range Coulombic (Ewald and Particle Mesh Ewald) functions.

Figure 1. NVIDIA ALCHEMI Toolkit-Ops is a collection of modules developed specifically to support GPU-accelerated batched operations (one GPU, many systems) for MLIPs and molecular dynamics engines

The graphic presents ALCHEMI Toolkit-Ops as a core set of features for atomistic simulation, made available through a modular plug-and-play API. These include GPU-accelerated batched kernels such as neighbor lists, DFT-D3 corrections, and long-range electrostatics, aimed at developers, researchers, and ISVs working on AI-driven chemical and materials discovery.
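The "one GPU, many systems" idea in the Figure 1 caption can be illustrated with a minimal, CPU-only sketch. It is not the ALCHEMI Toolkit-Ops API: the function names `batch_systems` and `batched_naive_neighbors`, the flat-list layout, and the toy coordinates are all assumptions made for illustration. The sketch concatenates several small systems into one flat coordinate list with a per-atom system index, then runs a single pair search that never pairs atoms from different systems.

```python
import math


def batch_systems(systems):
    """Concatenate per-system coordinates into one flat list.

    Returns the flat positions and, for each atom, the index of the
    system it came from (hypothetical layout, for illustration only).
    """
    positions, batch_idx = [], []
    for sys_id, coords in enumerate(systems):
        positions.extend(coords)
        batch_idx.extend([sys_id] * len(coords))
    return positions, batch_idx


def batched_naive_neighbors(positions, batch_idx, cutoff):
    """Naive pair search over the flat list; pairs never cross systems."""
    pairs = []
    n = len(positions)
    for i in range(n):
        for j in range(i + 1, n):
            if batch_idx[i] != batch_idx[j]:
                continue  # atoms belong to different systems
            if math.dist(positions[i], positions[j]) < cutoff:
                pairs.append((i, j))
    return pairs


# Two toy systems with made-up coordinates (arbitrary length units).
trimer = [(0.0, 0.0, 0.0), (0.96, 0.0, 0.0), (-0.24, 0.93, 0.0)]
dimer = [(0.0, 0.0, 0.0), (1.1, 0.0, 0.0)]

positions, batch_idx = batch_systems([trimer, dimer])
pairs = batched_naive_neighbors(positions, batch_idx, cutoff=1.2)
print(batch_idx)
print(pairs)
```

Atoms 0 and 3 sit at the same coordinates but belong to different systems, so they are never paired. On a GPU, the same layout lets one kernel launch cover every system in the batch instead of looping over systems one at a time.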

Figure 2 compares the performance of the accelerated kernels in ALCHEMI Toolkit-Ops with popular kernel-accelerated models such as MACE (cuEquivariance) and TensorNet (Warp), demonstrating fully parallelized performance and scalability. The blue MLIP baseline allows comparison with advanced features such as neighbor lists and dispersion corrections (DFT-D3). Test systems consisted of ammonia clusters of increasing size packed into various cells using Packmol. Timing results were averaged over 20 runs on an NVIDIA H100 80 GB GPU. The DFT-D3 benchmark does not include 6 Å results because of the long-range nature of D3.

Figure 2. Benchmarks showing the speed of ALCHEMI Toolkit neighbor lists (both naive O(N²) and cell list O(N) implementations) and DFT-D3 compared with the computational cost of popular kernel-accelerated MLIPs

The figure contains two logarithmic plots. The neighbor list panel shows that the cell-based algorithm scales efficiently: the time per atom decreases significantly as the system size grows to 128K atoms, outperforming both the MLIP baseline and the naive algorithm. The DFT-D3 panel shows scaling with the number of atoms, again compared with an MLIP baseline. Batched DFT-D3 calculations achieve the same scaling efficiency as running a single, larger system with an equivalent total number of atoms.
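The difference between the naive O(N²) and cell list O(N) approaches named in the Figure 2 caption can be sketched in plain Python. This is a minimal CPU illustration under stated assumptions, not the Warp kernels in ALCHEMI Toolkit-Ops: it ignores periodic boundaries, the function names `naive_neighbors` and `cell_list_neighbors` are hypothetical, and the coordinates are random test data. The naive version tests every pair of atoms. The cell list version bins atoms into cubic cells with an edge length equal to the cutoff, so each atom is compared only with atoms in its own cell and the 26 adjacent cells.

```python
import math
import random
from collections import defaultdict
from itertools import product


def naive_neighbors(positions, cutoff):
    """O(N^2): test every unordered pair of atoms against the cutoff."""
    pairs = set()
    for i in range(len(positions)):
        for j in range(i + 1, len(positions)):
            if math.dist(positions[i], positions[j]) < cutoff:
                pairs.add((i, j))
    return pairs


def cell_list_neighbors(positions, cutoff):
    """Roughly O(N) at fixed density: bin atoms into cells of edge `cutoff`,
    then search only the 27 cells surrounding each atom."""
    cells = defaultdict(list)
    for idx, p in enumerate(positions):
        cells[tuple(math.floor(c / cutoff) for c in p)].append(idx)

    pairs = set()
    for (cx, cy, cz), members in cells.items():
        for dx, dy, dz in product((-1, 0, 1), repeat=3):
            # .get avoids creating empty cells while iterating
            for j in cells.get((cx + dx, cy + dy, cz + dz), ()):
                for i in members:
                    if i < j and math.dist(positions[i], positions[j]) < cutoff:
                        pairs.add((i, j))
    return pairs


# Random test data: 300 atoms in a 20 x 20 x 20 box (arbitrary units).
random.seed(0)
atoms = [tuple(random.uniform(0.0, 20.0) for _ in range(3)) for _ in range(300)]

# Both algorithms must return the same set of neighbor pairs.
assert naive_neighbors(atoms, 5.0) == cell_list_neighbors(atoms, 5.0)
```

Because two atoms closer than the cutoff can differ by at most one cell index along each axis, searching the 27 surrounding cells is sufficient. The work per atom then depends on the local density rather than on the total atom count, which is consistent with the falling time per atom that Figure 2 reports for the cell list at large system sizes.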

ALCHEMI Toolkit-Ops is designed to integrate seamlessly with the broader PyTorch-based atomistic simulation ecosystem. We’re excited to announce in-progress integrations with leading open source tools within the chemistry and materials science community: TorchSim, MatGL, and AIMNet Central.

Accelerating AI-Powered Chemistry and Materials Science Simulations with NVIDIA ALCHEMI Toolkit-Ops

TorchSim

MatGL

AIMNet Central

System and package requirements

Installation

Feature highlights

Neighbor lists

Capabilities

API example

DFT-D3 dispersion corrections

Capabilities

API example

Limitations

Long-range electrostatic interactions

Capabilities

API example

Acknowledgments


