neural network

YOLOv3 Paper Walkthrough: Even Better, But Not That Much

to be the state-of-the-art object detection algorithm, looked set to become obsolete due to the appearance of other methods like SSD (Single Shot MultiBox Detector), DSSD (Deconvolutional Single Shot Detector), and RetinaNet. Finally,...

Mechanistic Interpretability: Peeking Inside an LLM

Intro: how to examine and manipulate an LLM’s neural network. This is the subject of mechanistic interpretability research, and it could answer many exciting questions. Remember: An LLM is a deep artificial neural...

Teaching a Neural Network the Mandelbrot Set

Introduction: The Mandelbrot set is one of the most beautiful mathematical objects ever discovered, a fractal so intricate that no matter how much you zoom in, you keep finding infinite detail. But what if we asked a...

YOLOv1 Loss Function Walkthrough: Regression for All

In my previous article I explained how YOLOv1 works and how to construct the architecture from scratch with PyTorch. In today’s article, I'm going to cover the loss function used to...

The Machine Learning “Advent Calendar” Day 18: Neural Network Classifier in Excel

Neural Network Regressor, we now move to the classifier version. From a mathematical viewpoint, the two models are very similar. In fact, they differ mainly in the interpretation of the output and the selection...

The Machine Learning “Advent Calendar” Day 17: Neural Network Regressor in Excel

are sometimes presented as black boxes. Layers, activations, gradients, backpropagation… it can feel overwhelming, especially when everything is hidden behind model.fit(). We are going to construct a neural network regressor from scratch using Excel....

Neural Networks Are Blurry, Symbolic Systems Are Fragmented. Sparse Autoencoders Help Us Mix Them.

computers and Artificial Intelligence, we had established institutions designed to reason systematically about human behavior: the court. The legal system is one of humanity’s oldest reasoning engines, where facts and evidence...

Learning Triton One Kernel at a Time: Softmax

In the previous article of this series, operation in all fields of computer science: matrix multiplication. It is heavily used in neural networks to compute the activations of linear layers. However, activations on their...
