Ruihua Song
MARS: Optimizing Dual-System Deep Research via Multi-Agent Reinforcement Learning
Large Reasoning Models (LRMs) often exhibit a tendency for overanalysis in simple tasks, where the models excessively utilize System 2-type, deliberate reasoning, leading to inefficient token generation. Furthermore, these models face chal…
Select, Read, and Write: A Multi-Agent Framework of Full-Text-based Related Work Generation
Automatic related work generation (RWG) can save people's time and effort when writing a draft of related work section (RWS) for further revision. However, existing methods for RWG always suffer from shallow comprehension due to taking the…
ETVA: Evaluation of Text-to-Video Alignment via Fine-grained Question Generation and Answering
Precisely evaluating semantic alignment between text prompts and generated videos remains a challenge in Text-to-Video (T2V) Generation. Existing text-to-video alignment metrics like CLIPScore only generate coarse-grained scores without fi…
Two-in-One: Unified Multi-Person Interactive Motion Generation by Latent Diffusion Transformer
Multi-person interactive motion generation, a critical yet under-explored domain in computer character animation, poses significant challenges such as intricate modeling of inter-human interactions beyond individual motions and generating …
BSharedRAG: Backbone Shared Retrieval-Augmented Generation for the E-commerce Domain
Retrieval-Augmented Generation (RAG) systems are important in domains such as e-commerce, which has many long-tail entities and frequently updated information. Most existing works adopt separate modules for retrieval and generation, which ma…
LoVA: Long-form Video-to-Audio Generation
Video-to-audio (V2A) generation is important for video editing and post-processing, enabling the creation of semantics-aligned audio for silent video. However, most existing methods focus on generating short-form audio for short video segm…
Towards Effective and Efficient Continual Pre-training of Large Language Models
Continual pre-training (CPT) has been an important approach for adapting language models to specific domains or tasks. To make the CPT approach more traceable, this paper presents a technical report for continually pre-training Llama-3 (8B…
YuLan: An Open-source Large Language Model
Large language models (LLMs) have become the foundation of many applications, leveraging their extensive capabilities in processing and understanding natural language. While many open-source LLMs have been released with technical reports, …
RoLD: Robot Latent Diffusion for Multi-task Policy Modeling
Modeling generalized robot control policies poses ongoing challenges for language-guided robot manipulation tasks. Existing methods often struggle to efficiently utilize cross-dataset resources or rely on resource-intensive vision-language…
Intelligent Virtual Assistants with LLM-based Process Automation
While intelligent virtual assistants like Siri, Alexa, and Google Assistant have become ubiquitous in modern life, they still face limitations in their ability to follow multi-step instructions and accomplish complex goals articulated in n…
Parrot: Enhancing Multi-Turn Instruction Following for Large Language Models
Humans often interact with large language models (LLMs) in multi-turn interaction to obtain desired answers or more information. However, most existing studies overlook the multi-turn instruction following ability of LLMs, in terms of trai…
User Behavior Simulation with Large Language Model based Agents
Simulating high quality user behavior data has always been a fundamental problem in human-centered applications, where the major difficulty originates from the intricate mechanism of human decision process. Recently, substantial evidences …
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
Message from the Program Chairs: It's hard to believe that we're actually going to be seeing the program come together in Toronto. We're really looking forward to it and to seeing you all there! Most of the work of a program chair is behind th…
Findings of the Association for Computational Linguistics: ACL 2023
Message from the Program Chairs: It's hard to believe that we're actually going to be seeing the program come together in Toronto. We're really looking forward to it and to seeing you all there! Most of the work of a program chair is behind th…
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Message from the Program Chairs: It's hard to believe that we're actually going to be seeing the program come together in Toronto. We're really looking forward to it and to seeing you all there! Most of the work of a program chair is behind th…
Stylistic Retrieval-based Dialogue System with Unparallel Training Data
The ability of a dialog system to express consistent language style during conversations has a direct, positive impact on its usability and on user satisfaction. Although previous studies have demonstrated that style transfer is feasible w…
SogouQ: The First Large-Scale Test Collection with Click Streams Used in a Shared-Task Evaluation
Song, Ruihua; Zhang, Min; Luo, Cheng; Sakai, Tetsuya; Liu, Yiqun; Dou, Zhicheng. Search logs are very precious for information retrieval studies. In this chapter, we will introduce a real Chinese query log dataset, SogouQ, which was released by S…
ScriptWriter: Narrative-Guided Script Generation
It is appealing to have a system that generates a story or scripts automatically from a story-line, even though this is still out of our reach. In dialogue systems, it would also be useful to drive dialogues by a dialogue plan. In this pap…
"Love is as Complex as Math": Metaphor Generation System for Social Chatbot
With the wide adoption of intelligent chatbots in daily life, user demands for such systems have evolved from basic task-solving conversations to more casual and friend-like communication. To meet the user needs and build emotional bond with …
ScriptWriter: Narrative-Guided Script Generation
It is appealing to have a system that generates a story or scripts automatically from a storyline, even though this is still out of our reach. In dialogue systems, it would also be useful to drive dialogues by a dialogue plan. In this pape…
Attitude Detection for One-Round Conversation: Jointly Extracting Target-Polarity Pairs
We tackle Attitude Detection, which we define as the task of extracting the replier's attitude, i.e., a target-polarity pair, from a given one-round conversation. While previous studies considered Target Extraction and Polarity Classificat…
Understanding People Lifestyles: Construction of Urban Movement Knowledge Graph from GPS Trajectory
Technologies are increasingly taking advantage of the explosion in the amount of data generated by social multimedia (e.g., web searches, ad targeting, and urban computing). In this paper, we propose a multi-view learning framework for pre…
A World of Difference: Divergent Word Interpretations Among People
Divergent word usages reflect differences among people. In this paper, we present a novel angle for studying word usage divergence — word interpretations. We propose an approach that quantifies semantic differences in interpretations among…
A World of Difference: Divergent Word Interpretations among People
Divergent word usages reflect differences among people. In this paper, we present a novel angle for studying word usage divergence -- word interpretations. We propose an approach that quantifies semantic differences in interpretations amon…
UniClip: Leveraging Web Search for Universal Clipping of Articles on Mobile
In this paper we address the difficulty of clipping articles from mobile apps. We propose a service called UniClip that allows a user to save the full content of an article by snapping a screenshot part of it. UniClip leverages a huge amou…