GPU

AI in Multiple GPUs: How GPUs Communicate

is a component of a series about distributed AI across multiple GPUs: Introduction Before diving into advanced parallelism techniques, we want to know the important thing technologies that enable GPUs to speak with one another. But why...

AI in Multiple GPUs: Point-to-Point and Collective Operations

is an element of a series about distributed AI across multiple GPUs: Part 1: Understanding the Host and Device Paradigm Part 2: Point-to-Point and Collective Operations (this text) Part 3: How GPUs Communicate Part 4: Gradient Accumulation...

AI in Multiple GPUs: Understanding the Host and Device Paradigm

is an element of a series about distributed AI across multiple GPUs: Part 1: Understanding the Host and Device Paradigm (this text) Part 2: Point-to-Point and Collective Operations Part 3: How GPUs Communicate Part 4: Gradient...

Breaking the Hardware Barrier: Software FP8 for Older GPUs

As deep learning models grow larger and datasets expand, practitioners face an increasingly common bottleneck: GPU memory bandwidth. While cutting-edge hardware offers FP8 precision to speed up training and inference, most data scientists and...

Keep AI Costs Under Control

When my team first rolled out an internal assistant powered by GPT, adoption took off fast. Engineers used it for test cases, support agents for summaries, and product managers to draft specs. A number...

Learning Triton One Kernel at a Time: Matrix Multiplication

multiplication is undoubtedly probably the most common operation performed by GPUs. It's the elemental constructing block of linear algebra and shows up across a large spectrum of various fields equivalent to graphics, physics...

NumPy API on a GPU?

Is way forward for Python numerical computation? Late last yr, NVIDIA made a big announcement regarding the longer term of Python-based numerical computing. I wouldn’t be surprised for those who missed it. In spite...

“Chinese AI company, transported data overseas as a substitute of GPU smuggling, and learned model.”

To be able to avoid the US artificial intelligence (AI) chip export control, Chinese AI corporations are reported to coach the AI ​​model in Malaysia by smuggling hard disks with large data. One other...

Recent posts

Popular categories

ASK ANA