Tim Bula
YOU?
Author Swipe
View article: Scaling Granite Code Models to 128K Context
Scaling Granite Code Models to 128K Context Open
This paper introduces long-context Granite code models that support effective context windows of up to 128K tokens. Our solution for scaling context length of Granite 3B/8B code models from 2K/4K to 128K consists of a light-weight continua…