Stochastic Gradient Descent (SGD’s) Frequency Bias and How Adam Fixes It - TrendCloud