Open-Source AI Tool Upgrades Speed Up LLM and Diffusion Models on NVIDIA RTX PCs

AI developer activity on PCs is exploding, driven by the rising quality of small language models (SLMs) and diffusion models, akin to FLUX.2, GPT-OSS-20B, and Nemotron 3 Nano. At the identical time, AI PC frameworks, including ComfyUI, llama.cpp, Ollama, and Unsloth are making functional advances, doubling in popularity over the past yr because the variety of developers using PC-class models has grown tenfold. Developers aren’t any longer experimenting with generative AI workflows—they’re constructing the next-generation software stack on NVIDIA GPUs, from the info center to NVIDIA RTX AI PCs.

At CES 2026, NVIDIA is announcing several latest updates for the AI PC developer ecosystem, including:

Acceleration for the highest open source tools on PC, llama.cpp, and Ollama for SLMs, together with ComfyUI for diffusion models.
Optimizations to the highest open source models for NVIDIA GPUs, including the brand new LTX-2 audio-video model.
A collection of tools to speed up agentic AI workflows on RTX PCs and NVIDIA DGX Spark.