Mor Katz
YOU?
Author Swipe
View article: Scaling Laws for Autoregressive Generative Modeling
Scaling Laws for Autoregressive Generative Modeling Open
We identify empirical scaling laws for the cross-entropy loss in four domains: generative image modeling, video modeling, multimodal image$\leftrightarrow$text models, and mathematical problem solving. In all cases autoregressive Transform…