In a typical setup, a deep learning model is executed on a dedicated GPU accelerator using input data batches it receives from a CPU host. Ideally, the GPU, the more expensive resource, needs to...
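A minimal sketch of this host-to-device pipeline, assuming PyTorch and a synthetic dataset: CPU worker processes prepare batches while pinned memory and non-blocking copies help keep the GPU busy.

```python
import torch
from torch import nn
from torch.utils.data import DataLoader, TensorDataset

# Synthetic dataset standing in for real input batches (assumption for this sketch).
dataset = TensorDataset(torch.randn(10_000, 128), torch.randint(0, 10, (10_000,)))

# Worker processes prepare batches on the CPU; pinned memory enables fast,
# asynchronous host-to-device transfers so the GPU is not left waiting.
loader = DataLoader(dataset, batch_size=256, num_workers=4, pin_memory=True)

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
model = nn.Sequential(nn.Linear(128, 64), nn.ReLU(), nn.Linear(64, 10)).to(device)
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)
loss_fn = nn.CrossEntropyLoss()

for inputs, targets in loader:
    # non_blocking=True lets the copy overlap with GPU compute from the previous step.
    inputs = inputs.to(device, non_blocking=True)
    targets = targets.to(device, non_blocking=True)
    optimizer.zero_grad()
    loss = loss_fn(model(inputs), targets)
    loss.backward()
    optimizer.step()
```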
Retrieval-Augmented Generation (RAG) is a powerful technique that enhances language models by incorporating external information retrieval. While standard RAG implementations improve response relevance, they often struggle in complex retrieval scenarios. This text explores...
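A minimal sketch of the standard retrieve-then-generate loop; `embed` and `generate` are hypothetical placeholders for an embedding model and an LLM call, and the three-document corpus is purely illustrative.

```python
import numpy as np

def embed(text: str) -> np.ndarray:
    """Placeholder embedding function (assumption); swap in a real embedding model."""
    rng = np.random.default_rng(abs(hash(text)) % (2**32))
    v = rng.standard_normal(384)
    return v / np.linalg.norm(v)

def generate(prompt: str) -> str:
    """Placeholder for an LLM call (assumption)."""
    return f"[LLM answer conditioned on a prompt of {len(prompt)} characters]"

# Tiny illustrative corpus indexed by its embeddings.
corpus = [
    "RAG augments a language model with retrieved documents.",
    "Vector stores index embeddings for similarity search.",
    "Fine-tuning adapts a pre-trained model to a specific task.",
]
index = np.stack([embed(doc) for doc in corpus])

def rag_answer(question: str, top_k: int = 2) -> str:
    # Retrieve: rank documents by cosine similarity to the question embedding.
    scores = index @ embed(question)
    context = "\n".join(corpus[i] for i in np.argsort(scores)[::-1][:top_k])
    # Generate: condition the model on the retrieved context.
    prompt = f"Answer using the context below.\n\nContext:\n{context}\n\nQuestion: {question}"
    return generate(prompt)

print(rag_answer("How does RAG improve a language model's answers?"))
```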
Fine-tuning large language models (LLMs) like Llama 3 involves adapting a pre-trained model to specific tasks using a domain-specific dataset. This process leverages the model's pre-existing knowledge, making it efficient and cost-effective in comparison...
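A condensed sketch of one common approach, parameter-efficient fine-tuning with LoRA via Hugging Face `transformers`, `peft`, and `datasets`; the checkpoint name (gated access), hyperparameters, and the two-example dataset are assumptions for illustration, not a prescribed recipe.

```python
from datasets import Dataset
from peft import LoraConfig, get_peft_model
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer, TrainingArguments)

model_id = "meta-llama/Meta-Llama-3-8B"  # assumed checkpoint; requires access approval
tokenizer = AutoTokenizer.from_pretrained(model_id)
tokenizer.pad_token = tokenizer.eos_token  # Llama tokenizers ship without a pad token
model = AutoModelForCausalLM.from_pretrained(model_id)

# LoRA trains a small set of adapter weights instead of the full model,
# which is what keeps fine-tuning efficient and cost-effective.
model = get_peft_model(model, LoraConfig(
    r=16, lora_alpha=32, lora_dropout=0.05, task_type="CAUSAL_LM",
    target_modules=["q_proj", "v_proj"],
))

# Toy domain-specific dataset (assumption); replace with your own examples.
examples = ["Question: What is RAG?\nAnswer: Retrieval-Augmented Generation.",
            "Question: What is LoRA?\nAnswer: Low-Rank Adaptation."]
train_ds = Dataset.from_dict({"text": examples}).map(
    lambda batch: tokenizer(batch["text"], truncation=True, max_length=512),
    batched=True, remove_columns=["text"],
)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="llama3-finetuned", num_train_epochs=1,
                           per_device_train_batch_size=1, learning_rate=2e-4),
    train_dataset=train_ds,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```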