Exploring foci of:
arXiv (Cornell University)
SALM-Duplex: Efficient and Direct Duplex Modeling for Speech-to-Speech Language Model
May 2025 • Ke Hu, Ehsan Hosseini-Asl, Chen Chen, Edresson Casanova, Subhankar Ghosh, Piotr Żelasko, Zhehuai Chen, Jason Li, Jagadeesh Balam, Boris Ginsburg
Spoken dialogue is an intuitive form of human-computer interaction, yet current speech language models often remain constrained to turn-based exchanges, lacking real-time adaptability such as user barge-in. We propose a novel duplex speech to speech (S2S) architecture featuring continuous user inputs and codec agent outputs with channel fusion that directly models simultaneous user and agent streams. Using a pretrained streaming encoder for user input enables the first duplex S2S model without requiring speech pre…
Main Page
Zohran Mamdani
Weapons (2025 Film)
Squid Game Season 3
Lauren Sánchez
Victoria Mboko
Jeff Bezos
Shefali Jariwala
F1 (Film)
Takahiro Shiraishi
The 1975
Matty Healy
Mira Nair
Kannappa (Film)
Squid Game
Alanis Morissette
Truth And Reconciliation Commission Of Canada
2025 Nba Draft
28 Years Later
Reich Ministry Of Public Enlightenment And Propaganda
Mahmood Mamdani
Rick Hurst
Mutual Aid
Degenerate Art Exhibition
Fuck
Kpop Demon Hunters
Anna Wintour