01.
LED — how a neural network worked without backpropagation
· #research
In v35 I removed loss.backward() entirely — and unexpectedly the model outperformed BP. Here is how LED (Lateral Error Diffusion) came about, why it worked, and where its limit was.