Inspired by the awesome "Instant Neural Graphics Primitives" paper, I had a go at optimising hash tables with gradient descent in my toy ML code. Crop of a fitted image for a 4-layer ReLU thing vs multires hash tables (roughly same parameter count for each):
2
15
139

