Colin Flaherty
YOU?
Author Swipe
MEGABYTE: Predicting Million-byte Sequences with Multiscale Transformers Open
Autoregressive transformers are spectacular models for short sequences but scale poorly to long sequences such as high-resolution images, podcasts, code, or books. We proposed Megabyte, a multi-scale decoder architecture that enables end-t…
View article: Human-Level Play in the Game of Diplomacy by Combining Language Models with Strategic Reasoning
Human-Level Play in the Game of Diplomacy by Combining Language Models with Strategic Reasoning Open
Despite much progress in training AI systems to imitate human language, building agents that use language to communicate intentionally with humans in interactive environments remains a major challenge. We introduce CICERO, the first AI age…