arxiv A Loss Curvature Perspective on Training Instability in Deep Learning