In my latest post, I how hybrid search will be utilised to significantly improve the effectiveness of a RAG pipeline. RAG, in its basic version, using just semantic search on embeddings, will be...
who has written a children’s book and released it in two versions at the identical time into the market at the identical price. One version has a basic cover design, while the opposite...
is an element of a series about distributed AI across multiple GPUs:
Part 1: Understanding the Host and Device Paradigm (this text)
Part 2: Point-to-Point and Collective Operations
Part 3: How GPUs Communicate
Part 4: Gradient...
“What I cannot create, I don't understand” — attributed to R. Feynman
After Vibe Coding, we appear to have entered the (very area of interest, but much cooler) era of Vibe Proving: DeepMind wins gold...
in some interesting conversations recently about designing LLM-based tools for end users, and one in every of the vital product design questions that this brings up is “what do people find out about...
as a black box. We all know that it learns from data, however the query is it truly learns.
In this text, we are going to construct a tiny Convolutional Neural Network (CNN)...