fc1 has (3136,512) params and it seems like current implementation always resets and sets 0 to the dead-neuron for out-going layer-512. The implementation is supposed to reset the dead-neurons of the incoming-layer and set 0 the dead-neuron of the outgoing-layers. https://github.com/timoklein/redo/blob/main/src/redo.py#L120