NVIDIA HGX B200 Reduces Embodied Carbon Emissions Intensity

-


NVIDIA HGX B200 is revolutionizing accelerated computing by unlocking unprecedented performance and energy efficiency. This post shows how HGX B200 is outperforming its predecessor NVIDIA HGX H100 in environmental metrics, including reduced carbon intensity, through a comparison of their recently published product carbon footprint (PCF) summaries. 

Specifically, an evaluation of the PCF data reveals that HGX B200 shows a 24% reduction of embodied carbon emissions across large workloads reminiscent of AI training and inference.

HGX B200 and HGX H100 are accelerated computing platforms composed of eight GPUs each, with high-speed interconnects through NVIDIA NVLink and NVIDIA NVSwitch to speed up AI performance at scale. Each are designed for high-performance computing (HPC) and supporting data analytics workloads. 

How does NVIDIA HGX B200 reduce carbon emissions?

NVIDIA HGX B200 relies on upgraded NVIDIA Blackwell B200 GPUs, that are designed to supply dramatically improved AI performance. B200 GPUs incorporate 180 GB of HBM3E memory—greater than double the memory of the NVIDIA HGX H100 and other performance enhancing features, including a second-generation Transformer Engine that introduces FP4 alongside FP8, fifth‑generation NVLink/NVSwitch (as much as 1.8 TB/s per‑GPU and 14.4 TB/s aggregate bandwidth), to spice up higher throughput at lower precision.

Throughput of the HGX B200 is 2.3x faster (FP16) than the HGX H100. This upgraded compute performance can be more energy efficient. For AI inference, the HGX B200 could be as much as 15x more energy efficient. That’s a 93% reduction in energy for a similar inference workload.

HGX B200 also generates less manufacturing-related emissions per FLOPS of compute. Specifically, the embodied carbon intensity for computing is reduced from 0.66 gCO2e per exaflop with HGX H100 to 0.50 gCO2e per exaflop with HGX B200 (estimated based on FP16 precision). That is an overall 24% decrease between the 2 generations. That is estimated based on PCF data and the whole variety of FLOPS at FP16 precision executed over the lifetime of the product.

Bar graph showing Hardware Embodied Compute Carbon Intensity in gCO2e/exaflop. HGX H100 produces 0.66 gC02e/exaflop and HGX B200 produces 0.50 gCO2e/exaflop of embodied carbon. A 24% decrease occurred between the generations. This is estimated based on PCF data and the total number of FLOPS at FP16 precision executed over the lifetime of the product.
Bar graph showing Hardware Embodied Compute Carbon Intensity in gCO2e/exaflop. HGX H100 produces 0.66 gC02e/exaflop and HGX B200 produces 0.50 gCO2e/exaflop of embodied carbon. A 24% decrease occurred between the generations. This is estimated based on PCF data and the total number of FLOPS at FP16 precision executed over the lifetime of the product.
Figure 1. HGX B200 delivers a 24% reduction in hardware-embodied compute carbon intensity in comparison with HGX H100

Moreover, there’s a decrease within the materials and components—the very best emission producing category for every baseboard—between HGX B200 and HGX H100. Essentially the most notable reductions are for thermal components, ICs, and memory. 

Bar graph showing comparison of material breakdown measured in mgCO2e/exaflop (FP-16 precision). For HGX H100 the numbers are memory 276, ICs 164, thermal components 118, electromechanical components 26, PCBs 4.6, common components 5.3, mechanical components 3.9, and interconnects 2.6. For HGX B200 the numbers are memory 245, ICs 140 , thermal components 60, electromechanical components 10, PCBs 5.5, common components 5.0, mechanical components 1.5, interconnects 1.5. This is estimated based on PCF data and the total number of FLOPS executed over the lifetime of the product.
Bar graph showing comparison of material breakdown measured in mgCO2e/exaflop (FP-16 precision). For HGX H100 the numbers are memory 276, ICs 164, thermal components 118, electromechanical components 26, PCBs 4.6, common components 5.3, mechanical components 3.9, and interconnects 2.6. For HGX B200 the numbers are memory 245, ICs 140 , thermal components 60, electromechanical components 10, PCBs 5.5, common components 5.0, mechanical components 1.5, interconnects 1.5. This is estimated based on PCF data and the total number of FLOPS executed over the lifetime of the product.
Figure 2. HGX B200 delivers a big decrease in emissions for materials and components in quite a few categories in comparison with HGX H100

The mixture of the HGX B200 energy-saving computational power and reduced embodied emissions intensity can result in a considerable lowering of emissions, especially during its use phase for giant workloads reminiscent of AI training and inference. 

Operational carbon impacts 

While this reduced embodied carbon intensity reflects a lower upstream carbon intensity for the HGX B200, the downstream carbon intensity improvement is much more pronounced. 

For instance, the HGX B200 is projected to deliver a 10x improvement in inference efficiency for the DeepSeek-R1 model, which translates to a 90% reduction in operational carbon emissions in comparison with the HGX H100 for operational carbon emissions gCO2e for processing 1 million inference tokens (100 TPS/user). 

Note that this data was calculated based on 2023 IEA emission aspects weighted by regional data center energy consumption. Emission aspects include upstream emissions and T&D losses related to electricity production. 

Bar graph showing operational carbon emissions in kgCO2e/million tokens for DeepSeek-R1 inference. HGX H100 produces 16 kgCO2e/million tokens and HGX B200 produces 1.6 kgCO2e.Bar graph showing operational carbon emissions in kgCO2e/million tokens for DeepSeek-R1 inference. HGX H100 produces 16 kgCO2e/million tokens and HGX B200 produces 1.6 kgCO2e.
Figure 3. A 90% reduction in operational carbon emissions was observed between generations

Methodology and data collection process 

Each of the externally published PCF summaries rely heavily on primary data from suppliers collected for over 90% of each products by weight, including material composition and production energy consumption. 

Secondary sources were integrated with this data, reminiscent of the imec.netzero tool for fabrication-related emissions, in addition to the ecoinvent 3.10 and Sphera LCA databases (Skilled Database 2024 and Extension Database XI: Electronics 2024) for modeling materials, transportation, and energy.

A visual diagram of the process conducted for both product carbon footprint reports. The summaries were both based on a cradle-to-gate analysis comprising an inventory of the emissions generated for raw material extraction and refinement, component manufacturing, and assembly. 
A visual diagram of the process conducted for both product carbon footprint reports. The summaries were both based on a cradle-to-gate analysis comprising an inventory of the emissions generated for raw material extraction and refinement, component manufacturing, and assembly.
Figure 4. The HGX B200 cradle-to-gate product carbon footprint scope

These PCF summaries are aligned with ISO Standard14040 and 14044 on life cycle assessments and were critically reviewed in conformance with ISO Standard 14067 on carbon footprints.

The longer term of sustainable computing

NVIDIA goals to diminish its product carbon footprint with each latest product it produces, while providing groundbreaking advancements in performance. This practice in transparency through detailed PCF summaries will improve understanding of the impacts of accelerated computing.

NVIDIA is committed to working toward publishing additional reliable data on the environmental impacts of NVIDIA products. NVIDIA will proceed to innovate towards a way forward for sustainable computing and AI development, without compromising on performance and scale. 

To learn more, read the Product Carbon Footprint Summary for NVIDIA HGX B200. 



Source link

ASK ANA

What are your thoughts on this topic?
Let us know in the comments below.

0 0 votes
Article Rating
guest
0 Comments
Oldest
Newest Most Voted
Inline Feedbacks
View all comments

Share this article

Recent posts

0
Would love your thoughts, please comment.x
()
x