arXiv:1912.00018 [stat.ML]

Keywords: stochastic gradient descent, deep neural networks, heavy-tailed theory, sgd prefers wide minima, assumption