Media Summary: We dive into some of the internals of MLPs with multiple layers and scrutinize the statistics of the forward pass ... Building makemore Part 3: Activations & Gradients, BatchNorm. We implement a multilayer perceptron (MLP) character-level language model. In this video we also introduce many basics of ...
Building Makemore Part 3 Activations - Detailed Analysis & Overview
Building makemore Part 3: Activations & Gradients, BatchNorm
We take the 2-layer MLP (with BatchNorm) from the previous video and backpropagate through it manually without using PyTorch ...
We implement a bigram character-level language model, which we will further complexify in follow-up videos into a modern ...
We take the 2-layer MLP from the previous video and make it deeper with a tree-like structure, arriving at a convolutional neural ...