Making Robot Perception More Efficient on NVIDIA Jetson Thor

Building autonomous robots requires robust, low-latency visual perception for depth estimation, obstacle recognition, localization, and navigation in dynamic environments. These capabilities demand heavy compute. NVIDIA Jetson platforms offer powerful GPUs for deep learning, but increasing AI complexity and the need for real-time performance can result in GPU oversubscription. Relying solely on the GPU for all perception tasks can create bottlenecks, increase power consumption, and cause thermal challenges, especially in the power-sensitive and thermally constrained environments common in mobile robotics.

The NVIDIA Jetson platform addresses these challenges by combining powerful GPUs with dedicated hardware accelerators. Jetson devices like NVIDIA Jetson AGX Orin and NVIDIA Jetson Thor include specialized hardware accelerators designed to execute image processing and computer vision tasks with high efficiency, freeing the GPU for more demanding deep learning workloads. The NVIDIA Vision Programming Interface (VPI) unlocks the full potential of these diverse hardware accelerators.

In this post, we explore the advantages of using these accelerators and explain how developers can use VPI to unlock the full potential of the Jetson platform. We will walk you through the development of a low-latency, low-power perception application for stereo disparity using these accelerators. To begin, we will develop a single stereo camera pipeline, and then move on to a multi-stream pipeline with eight stereo cameras running at 30 FPS on Jetson Thor T5000, about 10x faster than Jetson AGX Orin 64 GB.

Before we jump into development, let’s quickly review which accelerators are available on the Jetson platform, their advantages, what applications they can unlock, and how VPI can help.

What accelerators does Jetson offer beyond the GPU?

Jetson devices have powerful GPUs for deep learning, but increasing AI complexity demands careful management of GPU cycles. Jetson offers specialized engines for computer vision (CV) workloads. While the GPU is powerful and versatile, these engines, when combined with the GPU, offer significant computational benefits. VPI simplifies access to these engines, making experimentation and load balancing easy.

Figure 1. Vision Programming Interface (VPI) for Jetson developers

Let’s take a closer look at each accelerator to understand its purpose and advantages.

Programmable Vision Accelerator (PVA): 

The PVA is a programmable digital signal processing (DSP) engine with a 1024‑bit single-instruction, multiple-data (SIMD) unit and native memory with flexible direct memory access (DMA), optimized for vision and image processing with high performance per watt. It runs asynchronously alongside the CPU, GPU, and other accelerators, and is available on all Jetson SKUs except NVIDIA Jetson Nano.

Through VPI, developers can access ready‑to‑use algorithms like AprilTag detection, object tracking, and stereo disparity estimation. For custom implementation of algorithms, the PVA SDK, now available to Jetson developers, provides C/C++ APIs and tools for developing vision algorithms directly on the PVA.

Optical Flow Accelerator (OFA): 

The OFA is a fixed-function hardware accelerator for computing optical flow and stereo disparity. The OFA can operate in two modes: in stereo disparity mode, it estimates a disparity map by processing the rectified left and right views from a stereo camera pair; in optical flow mode, it estimates 2D motion vectors between two consecutive frames.

Video and Image Compositor (VIC): 

The VIC is a fixed-function, power-efficient hardware accelerator in Jetson devices that is specialized for low-level image processing tasks, such as rescaling, remapping, warping, color space conversion, and noise reduction.
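
For instance, offloading a format or color space conversion to the VIC takes just a few lines with the VPI Python API used later in this post. The following is a minimal sketch, assuming an RGBA input frame read from a hypothetical frame.png; the same submission pattern applies to other VIC-backed operations such as Rescale and Remap.

import vpi
import numpy as np
from PIL import Image

# Wrap an existing camera frame (hypothetical file) as a VPI image
frame = vpi.asimage(np.asarray(Image.open('frame.png').convert('RGBA')))

# Submit a grayscale conversion to the VIC backend; the GPU stays free for deep learning
stream = vpi.Stream()
gray = frame.convert(vpi.Format.Y8_ER_BL, backend=vpi.Backend.VIC, stream=stream)
stream.sync()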

What use cases benefit from these accelerators?

Below are some scenarios where developers may consider going beyond the GPU for their specific application needs:

  • GPU-oversubscribed applications: As a best practice, developers should prioritize deep learning (DL) workloads for the GPU, and offload computer vision tasks to the PVA, OFA, or VIC using VPI (see the sketch after this list). For instance, DeepStream’s Multi‑Object Tracker can run 12 video streams on Orin AGX with the GPU alone, but by load balancing with the PVA it can support 16 streams.
  • Power‑sensitive applications: In use cases like sentry mode or activity monitoring, offloading most computation to low‑power accelerators (PVA, OFA, VIC) can provide maximum efficiency.
  • Industrial applications with thermal limits: In high‑heat environments, distributing workloads across all accelerators reduces throttling and helps maintain latency and throughput within thermal budgets.
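
As a sketch of what this load balancing can look like, the snippet below keeps the GPU out of the loop entirely: format conversion runs on the VIC and stereo disparity runs on the OFA, PVA, and VIC. It reuses only the VPI Python calls shown in the tutorial later in this post; the input image files are hypothetical placeholders.

import vpi
import numpy as np
from PIL import Image

stream = vpi.Stream()

# Hypothetical rectified stereo pair loaded from disk
left = vpi.asimage(np.asarray(Image.open('left.png').convert('RGBA')))
right = vpi.asimage(np.asarray(Image.open('right.png').convert('RGBA')))

# Preprocessing on the VIC rather than the GPU
left = left.convert(vpi.Format.Y8_ER_BL, backend=vpi.Backend.VIC, stream=stream)
right = right.convert(vpi.Format.Y8_ER_BL, backend=vpi.Backend.VIC, stream=stream)

# Stereo disparity on the OFA, PVA, and VIC; the GPU remains free for DL inference
disparity = vpi.stereodisp(left, right,
                           backend=vpi.Backend.OFA | vpi.Backend.PVA | vpi.Backend.VIC,
                           stream=stream)
stream.sync()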

How to use VPI to unlock all the accelerators

VPI provides a unified, versatile framework that gives developers seamless access to accelerators on platforms ranging from Jetson modules to workstations and PCs with discrete GPUs.

Now let’s look at an example that brings it all together.

Example: stereo vision pipeline 

Modern robotics stacks often depend on passive stereo systems for 3D perception of the surrounding world. Consequently, computing stereo disparity maps is an essential step toward building a complex perception stack. Here we will look at a sample pipeline that a developer can use to produce stereo disparity and confidence maps. Below, we show how to build a low-latency, energy-efficient pipeline with all of the accelerators available via VPI.

Figure 2. Schematic of the stereo vision pipeline deployed across multiple accelerators on Jetson. PVA = Programmable Vision Accelerator. VIC = Video and Image Compositor. OFA = Optical Flow Accelerator.
  • Preprocessing on CPU: The preprocessing step can run on the CPU because it only happens once. This step computes a rectification map that will be used to correct lens distortion in the stereo camera frames.
  • Remap on VIC: This step undistorts and aligns camera frames using a precomputed rectification map, ensuring both optical axes are level and parallel. VPI supports polynomial and fisheye distortion models and lets developers define custom warp maps. See the Remap documentation for details.
  • Stereo disparity on OFA: The rectified image pairs are inputs to the semi-global matching (SGM) algorithm. In practice, SGM alone can be noisy and produce erroneous disparity values. A confidence map can be created to improve the result by discarding disparity estimates that correspond to low confidence values. For more details on SGM and the supported parameters, refer to the stereo disparity documentation.
  • Confidence map on PVA: VPI supports three confidence map modes: ABSOLUTE, RELATIVE, and INFERENCE. ABSOLUTE and RELATIVE require two OFA passes (left/right disparity) plus a PVA cross‑check, while INFERENCE uses a single OFA pass followed by a CNN on the PVA (two convolution layers plus two non-linear activation layers). Skipping confidence computation is fastest but produces noisy disparity maps, whereas RELATIVE and INFERENCE improve both disparity quality and confidence.

VPI’s unified memory architecture eliminates unnecessary data copies across engines, and its asynchronous stream/event model lets developers schedule workloads and sync points upfront. Hardware‑managed scheduling enables parallel execution across engines, freeing the CPU and hiding latency with an efficient streaming pipeline.

Building a high-performance stereo disparity pipeline using VPI

Getting started with Python APIs

This tutorial walks through a basic stereo disparity pipeline without remap using the VPI Python API. 

Prerequisites:

  • An NVIDIA Jetson device (e.g., Jetson AGX Thor)
  • VPI installed via NVIDIA SDK Manager or apt
  • Python libraries: vpi, numpy, Pillow, opencv-python

In this tutorial, we will:

  • Load left and right stereo images
  • Convert their format for processing
  • Synchronize the streams to ensure data is ready
  • Execute the stereo disparity algorithm
  • Post-process the output and save the result

Setup and initialization

The first step is to import the necessary libraries, parse the input image paths, and create VPIStream objects. A VPIStream acts as a command queue, allowing you to submit tasks for asynchronous execution. We’ll use two streams to demonstrate parallel processing.

import vpi
import numpy as np
from PIL import Image
from argparse import ArgumentParser

# Parse the paths of the left and right input images
parser = ArgumentParser()
parser.add_argument('left', help='Path to the left stereo image')
parser.add_argument('right', help='Path to the right stereo image')
args = parser.parse_args()

# Create two streams for parallel processing
streamLeft = vpi.Stream()
streamRight = vpi.Stream()

streamLeft will handle the left image, and streamRight will handle the right image.

Loading and converting images

VPI’s Python API can work directly with NumPy arrays. We load the images using Pillow and then wrap them with VPI’s asimage function. Next, we convert the images to a format suitable for the stereo disparity algorithm. For this example, we’ll convert from RGBA8 to Y8_ER_BL (8-bit grayscale, block-linear format).

# Load images and wrap them in VPI images
left_img = np.asarray(Image.open(args.left))
right_img = np.asarray(Image.open(args.right))
left = vpi.asimage(left_img)
right = vpi.asimage(right_img)
 
# Convert images to Y8_ER_BL format in parallel on different backends
left = left.convert(vpi.Format.Y8_ER_BL, scale=1, stream=streamLeft, backend=vpi.Backend.VIC)
right = right.convert(vpi.Format.Y8_ER_BL, scale=1, stream=streamRight, backend=vpi.Backend.CUDA)

The left image conversion is submitted to the VIC backend via streamLeft, while the right image conversion is submitted to the NVIDIA CUDA backend on streamRight. This allows the two operations to run in parallel on different hardware units, which is a key advantage of VPI.

Synchronizing and executing stereo disparity

Before we can perform stereo disparity, we must ensure that both images are ready. We use streamLeft.sync() to block the main thread until the left image conversion is complete. Then, we can submit the vpi.stereodisp operation on streamRight.

# Synchronize streamLeft to ensure the left image is ready
streamLeft.sync()
 
# Submit the stereo disparity operation on streamRight
disparityS16 = vpi.stereodisp(left, right, backend=vpi.Backend.OFA|vpi.Backend.PVA|vpi.Backend.VIC, stream=streamRight)

The stereo disparity algorithm is executed on a combination of VPI backends (OFA, PVA, VIC) to take advantage of the specialized hardware. The result is a disparity map in S16 format, representing the horizontal shift between corresponding pixels in the two images.

Post-processing and visualization

The raw disparity map must be post-processed for visualization. The disparity values, which are in Q10.5 fixed-point format, are scaled to a 0–255 range and saved.

# Post-process the disparity map:
# convert Q10.5 S16 values to U8 and scale to 0-255 for visualization
disparityU8 = disparityS16.convert(vpi.Format.U8, scale=255.0/(32*128), stream=streamRight, backend=vpi.Backend.CUDA)

# Make the result accessible on the CPU
disparityU8 = disparityU8.cpu()

# Save with Pillow
d_pil = Image.fromarray(disparityU8)
d_pil.save('./disparity.png')

This final step converts the raw data into a human-readable image, where grayscale intensity represents depth.
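
As a quick sanity check on the scale factor 255.0/(32*128): the S16 output is in Q10.5 fixed point, so one pixel of disparity corresponds to 32 raw units, and with a maximum disparity of 128 the largest raw value (32 × 128 = 4096) maps to 255. The sketch below walks through that arithmetic and shows a hypothetical conversion from disparity to metric depth; the baseline and focal length are assumed placeholder values, not part of this example.

# The S16 output is Q10.5 fixed point: 5 fractional bits, so 1 pixel of disparity = 32 raw units
max_disparity = 128                      # maximum disparity assumed by the scale factor above
scale = 255.0 / (32 * max_disparity)     # maps the largest raw value (4096) to 255

# Example: a raw S16 value of 1504 corresponds to 47 pixels of disparity
raw_value = 1504
disparity_px = raw_value / 32.0          # 47.0 pixels

# Hypothetical depth from disparity (placeholder calibration values)
baseline_m = 0.12                        # stereo baseline in meters (assumed)
focal_px = 800.0                         # focal length in pixels (assumed)
depth_m = baseline_m * focal_px / disparity_px   # roughly 2.04 meters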

Multi-Streaming disparity pipeline using C++ APIs

Advanced robotics applications need high throughput, which VPI enables through parallel multi‑streaming. By combining streamlined APIs with efficient use of hardware accelerators, VPI lets developers build fast, reliable vision pipelines, much like those powering Boston Dynamics’ next‑generation robots.

VPI uses VPIStream objects, which are first-in, first-out (FIFO) command queues for submitting tasks to a backend asynchronously. This allows for parallel execution of operations on different hardware units (asynchronous streams).

For maximum performance in mission-critical applications, VPI’s C++ API is ideal.

The following code snippets are from a C++ benchmark that demonstrates how to build and run a multi-stream stereo disparity pipeline. The SimpleMultiStreamBenchmark C++ app showcases this by pre‑generating synthetic NV12_BL images to avoid runtime overhead, then running multiple streams in parallel and measuring frames-per-second (FPS) throughput. It also supports saving inputs and disparity/confidence maps for debugging. This example pre-generates data to simulate a high-speed, real-time workload.

Setting up resources, object declaration, and initialization

We first declare and initialize all of the objects VPI requires to run this pipeline, per stream. This includes creating streams, input/output images, and stereo payloads. Since we will feed images of type NV12_BL to the stereo algorithm, we allocate that type, plus the Y8_ER image type for intermediate format conversion.

int totalIterations = itersPerStream * numStreams;
std::vector<VPIImage> leftInputs(numStreams), rightInputs(numStreams), confidences(numStreams), leftTmps(numStreams), rightTmps(numStreams);
std::vector<VPIImage> leftOuts(numStreams), rightOuts(numStreams), disparities(numStreams);
std::vector<VPIPayload> stereoPayloads(numStreams);
std::vector<VPIStream> streamsLeft(numStreams), streamsRight(numStreams);
std::vector<VPIEvent> events(numStreams);
int width   = cvImageLeft.cols;
int height  = cvImageLeft.rows;
int vic_pva_ofa = VPI_BACKEND_VIC | VPI_BACKEND_OFA | VPI_BACKEND_PVA;
VPIStereoDisparityEstimatorCreationParams stereoPayloadParams;
VPIStereoDisparityEstimatorParams stereoParams;
CHECK_STATUS(vpiInitStereoDisparityEstimatorCreationParams(&stereoPayloadParams));
CHECK_STATUS(vpiInitStereoDisparityEstimatorParams(&stereoParams));
stereoPayloadParams.maxDisparity = 128;
stereoParams.maxDisparity= 128;
stereoParams.confidenceType  = VPI_STEREO_CONFIDENCE_RELATIVE;

for (int i = 0; i < numStreams; i++)
{
    CHECK_STATUS(vpiImageCreateWrapperOpenCVMat(cvImageLeft, 0, &leftInputs[i]));
    CHECK_STATUS(vpiImageCreateWrapperOpenCVMat(cvImageRight, 0, &rightInputs[i]));
    CHECK_STATUS(vpiStreamCreate(0, &streamsLeft[i]));
    CHECK_STATUS(vpiStreamCreate(0, &streamsRight[i]));
    CHECK_STATUS(vpiImageCreate(width, height, VPI_IMAGE_FORMAT_Y8_ER, 0, &leftTmps[i]));
    CHECK_STATUS(vpiImageCreate(width, height, VPI_IMAGE_FORMAT_NV12_BL, 0, &leftOuts[i]));
    CHECK_STATUS(vpiImageCreate(width, height, VPI_IMAGE_FORMAT_Y8_ER, 0, &rightTmps[i]));
    CHECK_STATUS(vpiImageCreate(width, height, VPI_IMAGE_FORMAT_NV12_BL, 0, &rightOuts[i]));
    CHECK_STATUS(vpiCreateStereoDisparityEstimator(vic_pva_ofa, width, height, VPI_IMAGE_FORMAT_NV12_BL,
    &stereoPayloadParams, &stereoPayloads[i]));
    CHECK_STATUS(vpiEventCreate(0, &events[i]));
}
int outCount = saveOutput ? (numStreams * itersPerStream) : numStreams;
disparities.resize(outCount);
confidences.resize(outCount);
for (int i = 0; i < outCount; i++)
{
    CHECK_STATUS(vpiImageCreate(width, height, VPI_IMAGE_FORMAT_S16, 0, &disparities[i]));
    CHECK_STATUS(vpiImageCreate(width, height, VPI_IMAGE_FORMAT_U16, 0, &confidences[i]));
}

Converting image format

We use VPI’s C API to submit the image format conversion operations for each stream, converting to the NV12_BL input that mimics frames coming from a camera.

for (int i = 0; i < numStreams; i++)
{
    CHECK_STATUS(vpiSubmitConvertImageFormat(streamsLeft[i], VPI_BACKEND_CPU, leftInputs[i], leftTmps[i], NULL));
    CHECK_STATUS(vpiSubmitConvertImageFormat(streamsLeft[i], VPI_BACKEND_VIC, leftTmps[i], leftOuts[i], NULL));
    CHECK_STATUS(vpiEventRecord(events[i], streamsLeft[i]));
    CHECK_STATUS(vpiSubmitConvertImageFormat(streamsRight[i], VPI_BACKEND_CPU, rightInputs[i], rightTmps[i], NULL));
    CHECK_STATUS(vpiSubmitConvertImageFormat(streamsRight[i], VPI_BACKEND_VIC, rightTmps[i], rightOuts[i], NULL));
    CHECK_STATUS(vpiStreamWaitEvent(streamsRight[i], events[i]));
}
for (int i = 0; i < numStreams; i++)
{
    CHECK_STATUS(vpiStreamSync(streamsLeft[i]));
    CHECK_STATUS(vpiStreamSync(streamsRight[i]));
}

As before, we submit the operations to different hardware on the two separate streams. The conversion types are inferred from the formats of the input/output images. This time, we also record a VPIEvent after the left stream’s conversion operations. A VPIEvent is a VPI object that allows one stream to wait for another stream to finish all of the operations submitted at the time of recording. This lets us force the right stream to wait on the left stream’s conversion operation without blocking the calling (main) thread, enabling multiple left and right streams to operate in parallel.

Synchronizing and executing stereo disparity

We use VPI’s C API to submit our stereo disparity operation. We also benchmark our stereo disparity using std::chrono.

auto benchmarkStart = std::chrono::high_resolution_clock::now();
for (int iter = 0; iter < itersPerStream; iter++)
{
    for (int i = 0; i < numStreams; i++)
    {
        int dispIdx = saveOutput ? (i * itersPerStream + iter) : i;
        CHECK_STATUS(vpiSubmitStereoDisparityEstimator(streamsRight[i], vic_pva_ofa, stereoPayloads[i], leftOuts[i],
                                                     rightOuts[i], disparities[dispIdx], confidences[dispIdx],
                                                     &stereoParams));
    }
}
// ====================
// End Benchmarking
for (int i = 0; i < numStreams; i++)
{
    CHECK_STATUS(vpiStreamSync(streamsRight[i]));
}
auto benchmarkEnd = std::chrono::high_resolution_clock::now();

As before, we submit our operation with a confidence map and get a resulting disparity map. We then end our benchmarking timer and record the time taken for conversion and disparity. We explicitly sync all of the streams only after submitting to all of them, so the calling thread isn’t blocked at submission time.

Post-processing and cleanup

We use VPI’s C API and OpenCV interoperability to post-process and save the disparity map. We optionally save the output data for inspection and then clean up the VPI objects after the loop.

// ====================
// Save Outputs
if (saveOutput)
{
    for (int i = 0; i < numStreams * itersPerStream; i++)
    {
        VPIImageData dispData, confData;
        cv::Mat cvDisparity, cvDisparityColor, cvConfidence, cvMask;
        CHECK_STATUS(
        vpiImageLockData(disparities[i], VPI_LOCK_READ, VPI_IMAGE_BUFFER_HOST_PITCH_LINEAR, &dispData));
        vpiImageDataExportOpenCVMat(dispData, &cvDisparity);
        cvDisparity.convertTo(cvDisparity, CV_8UC1, 255.0 / (32 * stereoParams.maxDisparity), 0);
        applyColorMap(cvDisparity, cvDisparityColor, cv::COLORMAP_JET);
        CHECK_STATUS(vpiImageUnlock(disparities[i]));
        std::ostringstream fpStream;
        fpStream << "stream_" << i / itersPerStream << "_iter_" << i % itersPerStream << "_disparity.png";
        imwrite(fpStream.str(), cvDisparityColor);

        // Confidence output (U16 -> scale to 8-bit and save)
        CHECK_STATUS(
        vpiImageLockData(confidences[i], VPI_LOCK_READ, VPI_IMAGE_BUFFER_HOST_PITCH_LINEAR, &confData));
        vpiImageDataExportOpenCVMat(confData, &cvConfidence);
        cvConfidence.convertTo(cvConfidence, CV_8UC1, 255.0 / 65535.0, 0);
        CHECK_STATUS(vpiImageUnlock(confidences[i]));
        std::ostringstream fpStreamConf;
        fpStreamConf << "stream_" << i / itersPerStream << "_iter_" << i % itersPerStream << "_confidence.png";
        imwrite(fpStreamConf.str(), cvConfidence);
    }
}

// ====================
// Clean Up VPI Objects
for (int i = 0; i < numStreams; i++)
{
    CHECK_STATUS(vpiStreamSync(streamsLeft[i]));
    CHECK_STATUS(vpiStreamSync(streamsRight[i]));
    vpiStreamDestroy(streamsLeft[i]);
    vpiStreamDestroy(streamsRight[i]);
    vpiImageDestroy(rightInputs[i]);
    vpiImageDestroy(leftInputs[i]);
    vpiImageDestroy(leftTmps[i]);
    vpiImageDestroy(leftOuts[i]);
    vpiImageDestroy(rightTmps[i]);
    vpiImageDestroy(rightOuts[i]);
    vpiPayloadDestroy(stereoPayloads[i]);
    vpiEventDestroy(events[i]);
}
// Destroy all disparity and confidence images
for (int i = 0; i < (int)disparities.size(); i++)
{
    vpiImageDestroy(disparities[i]);
}
for (int i = 0; i < (int)confidences.size(); i++)
{
    vpiImageDestroy(confidences[i]);
}

Collect benchmarking results

We will now collect and display our benchmarking results.

// totalTime is the elapsed benchmark duration in microseconds
// (derived from benchmarkEnd - benchmarkStart)
double totalTimeSeconds = totalTime / 1000000.0;
double avgTimePerFrame  = totalTimeSeconds / totalIterations;
double throughputFPS    = totalIterations / totalTimeSeconds;

std::cout << "n" << std::string(70, '=') << std::endl;
std::cout << "SIMPLE MULTI-STREAM RESULTS" << std::endl;
std::cout << std::string(70, '=') << std::endl;
std::cout << "Input: RGB8 -> Y8_BL_ER" << std::endl;
std::cout << "Total time: " << totalTimeSeconds << " seconds" << std::endl;
std::cout << "Avg time per frame: " << (avgTimePerFrame * 1000) << " ms" << std::endl;
std::cout << "THROUGHPUT: " << throughputFPS << " FPS" << std::endl;
std::cout << std::string(70, '=') << std::endl;

std::cout << "THROUGHPUT: " << throughputFPS << " FPS" << std::endl;
std::cout << std::string(70, '=') << std::endl;

Review results

Given an image resolution of 960×600 and a maximum disparity of 128, this solution achieves 30 FPS with eight simultaneous streams running stereo disparity estimation, including confidence maps, on Thor T5000 with no load on the GPU. That is about 10x faster than on an Orin AGX 64 GB. The power mode is MAX_N in both cases. Performance is shown in Table 1.

Stereo disparity full pipeline (RELATIVE mode, resolution: 960×600, max disparity: 128)

Number of streams    Orin AGX (64 GB) FPS    Jetson Thor T5000 FPS    Speed-up ratio
1                    22                      122                      5.5
2                    12                      111                      9.5
4                    6                       58                       9.7
8                    3                       29                       9.7

Table 1. Comparison of stereo disparity pipeline in RELATIVE mode on Orin AGX vs. Thor T5000

How Boston Dynamics uses VPI

As a heavy user of the Jetson platform, Boston Dynamics relies on the Vision Programming Interface (VPI) to speed up its perception pipeline.

VPI enables seamless access to Jetson’s specialized hardware accelerators, offering a set of optimized vision algorithms such as AprilTag detection and SGM disparity, feature detectors like ORB and Harris corner, Pyramidal LK tracking, and OFA-powered optical flow. These are core to Boston Dynamics’ perception stack, supporting both prototype testing and system optimization through load balancing. By adopting VPI, engineers can quickly adapt to hardware updates and shorten time‑to‑value.

Takeaways

The hardware advancements in the Jetson Thor platform, together with libraries like VPI, empower developers to design efficient, low-latency solutions for edge-based robotics.

By utilizing the unique features of each available accelerator on Jetson, robotics companies such as Boston Dynamics can achieve sophisticated vision processing that is both efficient and scalable, a key step toward making intelligent, autonomous robots a reality in a variety of real-world applications.

To get started building your own CV applications on Jetson, explore the VPI documentation and samples.


