data-science
-
Introduction In the previous blog post we took a look at the backpropagation through time algorithm. We saw how, looking at the unfolded computation graph, backpropagation through time is essentially the same as backpropagating through one long connected feedforward neural net. Now that we have the gradient of the loss with respect to the parameters…
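As a taste of where that post goes, here is a minimal NumPy sketch of backpropagation through time for a vanilla RNN. The tanh activation and the names W_xh, W_hh are illustrative assumptions, not necessarily the series' own implementation:

```python
import numpy as np

# Minimal sketch of backpropagation through time (BPTT) for a vanilla RNN
# with h_t = tanh(W_xh @ x_t + W_hh @ h_{t-1}). Names and shapes are
# illustrative assumptions, not the series' actual code.
def bptt(xs, hs, dhs, W_xh, W_hh):
    """xs:  inputs x_0 .. x_{T-1}
    hs:  hidden states from the forward pass; hs[0] is the initial state
    dhs: dL/dh_t contributions arriving from the output layer at each step
    """
    dW_xh = np.zeros_like(W_xh)
    dW_hh = np.zeros_like(W_hh)
    dh_next = np.zeros_like(hs[0])          # gradient flowing in from step t+1
    for t in reversed(range(len(xs))):
        dh = dhs[t] + dh_next               # output path + recurrent path
        dpre = (1.0 - hs[t + 1] ** 2) * dh  # backprop through tanh
        dW_xh += np.outer(dpre, xs[t])
        dW_hh += np.outer(dpre, hs[t])      # hs[t] is h_{t-1}
        dh_next = W_hh.T @ dpre             # send gradient back to step t-1
    return dW_xh, dW_hh
```

This makes the excerpt's point concrete: the loop simply walks backward over the unfolded steps, treating them like layers of one long feedforward net.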
-
Introduction In the previous blog post, we began the implementation of our recurrent neural net, and went over the equations and code for the forward pass. With this, we can now feed our data through the network, but it's not of much use without being able to train it. In order to train our recurrent…
-
Introduction In the previous blog post we talked about what it means exactly for data to have a “sequential topology”, and prepared our training data to be fed into the recurrent neural net. In this post, we’ll be looking at the design of our simple recurrent neural net, the equations for the forward pass, and…
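For readers skimming this index, below is a minimal sketch of one common vanilla-RNN forward pass. The tanh activation and the names W_xh, W_hh, W_hy are assumptions for illustration; the post derives its own equations:

```python
import numpy as np

# A common vanilla RNN forward pass (illustrative, not the post's exact code):
#   h_t = tanh(W_xh x_t + W_hh h_{t-1} + b_h),   y_t = W_hy h_t + b_y
def forward(xs, h0, W_xh, W_hh, W_hy, b_h, b_y):
    h, hs, ys = h0, [h0], []
    for x in xs:
        h = np.tanh(W_xh @ x + W_hh @ h + b_h)  # update hidden state
        hs.append(h)
        ys.append(W_hy @ h + b_y)               # output at this time step
    return hs, ys
```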
-
Introduction In the previous blog post we looked at how a recurrent neural network differs from a feedforward neural network, and why they are better at sequence processing tasks. In this blog post, we will be preparing our training data, and start writing some code. Our training data is just a list of lowercase names.…
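As a hedged sketch of what preparing such data might look like, here is one way to one-hot encode the characters of a lowercase name. The alphabet and helper name are hypothetical and may differ from the post's actual preprocessing:

```python
import numpy as np

# Hypothetical preprocessing sketch: one-hot encode each character of a
# lowercase name over a 26-letter alphabet. The post's actual encoding
# (e.g. an end-of-name token) may differ.
ALPHABET = "abcdefghijklmnopqrstuvwxyz"
CHAR_TO_INDEX = {c: i for i, c in enumerate(ALPHABET)}

def one_hot_name(name):
    vecs = np.zeros((len(name), len(ALPHABET)))
    for t, ch in enumerate(name):
        vecs[t, CHAR_TO_INDEX[ch]] = 1.0
    return vecs

print(one_hot_name("anna").shape)  # (4, 26): one 26-dim vector per character
```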
-
Introduction In a previous series of blog posts I covered feedforward neural networks. Feedforward neural networks are very powerful, but they are not the only neural network architecture available and they may be ill-suited to certain tasks. Recurrent neural nets are a class of neural nets that are particularly effective for modeling data with a…
-
Introduction This is the fifth installment in a series of blog posts about Support Vector Machines. If you have not read the previous four blog posts, I highly recommend you go back and read them before continuing with this one. Last time we introduced soft-margin SVM, which was a new formulation that could accommodate…
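For reference, the standard primal form of the soft-margin SVM, in the usual textbook notation (assumed here, not necessarily the series' own), is:

$$
\min_{\mathbf{w},\, b,\, \boldsymbol{\xi}} \ \frac{1}{2}\lVert\mathbf{w}\rVert^2 + C\sum_{i=1}^{n}\xi_i
\quad \text{subject to} \quad y_i(\mathbf{w}\cdot\mathbf{x}_i + b) \ge 1 - \xi_i, \quad \xi_i \ge 0.
$$

The slack variables $\xi_i$ let some points violate the margin, with $C$ controlling the trade-off between margin width and violations.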
-
Introduction This is the second installment in a series of blog posts about Support Vector Machines. If you have not read the first blog post, I highly recommend you go back and read it before continuing with this one. Last time we looked at how to define the ideal hyperplane to linearly separate two…
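For reference, the hyperplane in question is the set of points $\mathbf{x}$ with $\mathbf{w}\cdot\mathbf{x} + b = 0$; in the standard hard-margin formulation (assumed here), the ideal one maximizes the margin $2/\lVert\mathbf{w}\rVert$:

$$
\min_{\mathbf{w},\, b} \ \frac{1}{2}\lVert\mathbf{w}\rVert^2
\quad \text{subject to} \quad y_i(\mathbf{w}\cdot\mathbf{x}_i + b) \ge 1.
$$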
-
Introduction This is the first installment in a series of blog posts about Support Vector Machines. In these blog posts, I will give an overview of the mathematics behind, and the implementation of, Support Vector Machines, assuming only some mathematical background and almost no familiarity with machine learning libraries. To best understand these blog posts…
-
The loss function is a crucial component of training neural nets. It gives us a measure of how well our neural net is doing. Let's take a look at the mean squared error loss: Even if you are unfamiliar with the mean squared error loss, it should hopefully be plausible to you that…
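For reference, the mean squared error over $n$ examples, in the usual notation with targets $y_i$ and predictions $\hat{y}_i$ (assumed here), is:

$$
\text{MSE} = \frac{1}{n}\sum_{i=1}^{n}\left(y_i - \hat{y}_i\right)^2.
$$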
-
The sigmoid function, $\sigma(x) = \frac{1}{1 + e^{-x}}$, is a popular activation function for neurons in a neural net. When using backpropagation to compute the gradients of the network's weights and biases, we need the derivative of the activation function. Here is how to find the derivative of the sigmoid function: Let's take a look…
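For reference, the standard derivation (assuming the usual definition of the sigmoid) runs:

$$
\sigma'(x) = \frac{d}{dx}\left(1 + e^{-x}\right)^{-1}
= \frac{e^{-x}}{\left(1 + e^{-x}\right)^2}
= \sigma(x)\bigl(1 - \sigma(x)\bigr).
$$

This identity is what makes backprop through a sigmoid layer cheap: the derivative reuses the activation already computed in the forward pass.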