Go Autograd | Oli's site

For this project, I set myself the challenge of writing a complete autograd and machine learning library from scratch in Go.

Design

To simplify the project, I broke it up into a few submodules: autograd/tensor for ndarray handling, and autograd/module for machine learning primitives. The core autograd library then provides a collection of autograd operations, as well as functions for backprop and SGD.

The core autograd design is strongly inspired by Andrej Karpathy’s micrograd. It is built around a simple Node struct that represents a computation node in the graph:

type Node struct {
	Val       *tensor.Tensor[float32]
	Grad      *tensor.Tensor[float32]
	WantsGrad bool
	Parents   []*Edge
}

type Edge struct {
	Parent   *Node
	Backward func(dy *tensor.Tensor[float32])
}

Backpropagation is then handled by working backwards through the graph, accumulating gradients along the way.

Testing

I figured MNIST is the obvious MVP machine learning model to implement, so the library includes 2D convolutions and pooling as default primitives. Here is an example of a LeNet-style model built in this library:

var eval bool
model := module.Sequential{
    module.Conv2D(1, 32, 3, 3, 1, 1, 1, 1).InitHe(),
    module.ReLU,
    module.Conv2D(32, 64, 3, 3, 1, 1, 1, 1).InitHe(),
    module.ReLU,
    module.MaxPool2D(2, 2, 2, 2),
    module.Conv2D(64, 128, 3, 3, 1, 1, 1, 1).InitHe(),
    module.ReLU,
    module.MaxPool2D(2, 2, 2, 2),
    module.Flatten,
    module.Affine(128*7*7, 256).InitHe(),
    module.ReLU,
    module.Affine(256, 10).InitXavier(),
    module.DoWhen(&eval, module.Softmax),
}

(This model got up to 96% in testing on MNIST, after 10,000 iterations of training.) In practice it doesn’t work particularly well, as we are just using the training data as-is.

Demo

Since the project is pure Go, it can be compiled into WebAssembly, so you can run models in the browser. You can see a demo here. (It probably won’t work on mobile browsers.)