Basil Hosmer
Characterizing and Efficiently Accelerating Multimodal Generation Model Inference
Generative artificial intelligence (AI) technology is revolutionizing the computing industry. Not only have its applications broadened to various sectors, but it also poses new system design and optimization opportunities. The technology is ca…
Is Flash Attention Stable?
Training large-scale machine learning models poses distinct system challenges, given both the size and complexity of today's workloads. Recently, many organizations training state-of-the-art Generative AI models have reported cases of inst…
LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding
We present LayerSkip, an end-to-end solution to speed up inference of large language models (LLMs). First, during training we apply layer dropout, with low dropout rates for earlier layers and higher dropout rates for later layers, and an …
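The abstract describes layer dropout with rates that increase with depth. A minimal sketch of that idea, assuming a simple linear rate schedule and treating layers as plain callables (the function names and the schedule are illustrative, not the paper's implementation):

```python
import random

def layer_dropout_rates(num_layers, max_rate=0.2):
    # Linearly increasing drop probability: earliest layer ~0, deepest = max_rate.
    if num_layers == 1:
        return [0.0]
    return [max_rate * i / (num_layers - 1) for i in range(num_layers)]

def forward_with_layer_dropout(x, layers, rates, rng):
    # During training, skip each layer with its depth-dependent probability;
    # deeper layers are skipped more often, so earlier layers learn
    # representations that later layers (or an early exit) can decode.
    for layer, p in zip(layers, rates):
        if rng.random() >= p:  # keep the layer with probability 1 - p
            x = layer(x)
    return x
```

At inference time the schedule is simply ignored (all layers run, or the model exits early), which is what makes a dropout-style training scheme attractive here.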
Generative AI Beyond LLMs: System Implications of Multi-Modal Generation
As the development of large-scale Generative AI models evolves beyond text (1D) generation to include image (2D) and video (3D) generation, processing spatial and temporal information presents unique challenges to quality, performance, and …
Gradient Descent: The Ultimate Optimizer
Working with any gradient-based machine learning algorithm involves the tedious task of tuning the optimizer's hyperparameters, such as its step size. Recent work has shown how the step size can itself be optimized alongside the model para…
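The abstract refers to optimizing the step size alongside the model parameters. A minimal sketch of one classic variant of this idea, hypergradient descent on a 1D quadratic, where the step size alpha gets its own gradient-style update from the correlation of successive gradients (this is an illustrative variant, not necessarily the paper's exact method):

```python
def hypergradient_descent(grad, x, alpha, beta=1e-4, steps=100):
    # Minimize f via gradient descent on x while adapting the step size alpha.
    # The derivative of the loss w.r.t. alpha is -g_t . g_{t-1}, so descending
    # it means *increasing* alpha when successive gradients agree.
    g_prev = grad(x)
    for _ in range(steps):
        g = grad(x)
        alpha += beta * g * g_prev  # hypergradient update on the step size
        x -= alpha * g              # ordinary update on the parameter
        g_prev = g
    return x, alpha
```

On f(x) = x^2 (gradient 2x), starting from x = 5 with a deliberately small alpha = 0.01, the adapted step size grows and the iterate converges faster than it would with the fixed initial step size.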