I am wondering why we are not doing “layer_idx = idx + 1” in Snippet 9.

Original article can be found here (source): Deep Learning on Medium

gorithm, but …d grads_values, which stores cost function derivatives calculated with respect to these parameters. Now you only need to apply the following equations for each layer. This is a very simple optimization algorithm, but I decided to use it because it is a great startin…