Huamin Qu
YOU?
Author Swipe
View article: Branch Explorer: Leveraging Branching Narratives to Support Interactive 360° Video Viewing for Blind and Low Vision Users
Branch Explorer: Leveraging Branching Narratives to Support Interactive 360° Video Viewing for Blind and Low Vision Users Open
360° videos enable users to freely choose their viewing paths, but blind and low vision (BLV) users are often excluded from this interactive experience. To bridge this gap, we present Branch Explorer, a system that transforms 360° videos i…
View article: NeuroSync: Intent-Aware Code-Based Problem Solving via Direct LLM Understanding Modification
NeuroSync: Intent-Aware Code-Based Problem Solving via Direct LLM Understanding Modification Open
Conversational LLMs have been widely adopted by domain users with limited programming experience to solve domain problems. However, these users often face misalignment between their intent and generated code, resulting in frustration and r…
View article: A Study on Using ChatGPT to Help Students With Dyslexia Learn Chinese and English Writing
A Study on Using ChatGPT to Help Students With Dyslexia Learn Chinese and English Writing Open
Purpose This study investigates the impact of large language models (LLMs), for example, ChatGPT, in inclusive education. Design/Approach/Methods We develop a ChatGPT-assisted writing system, namely CHATTING , to support students with dysl…
View article: FundSelector: A visual analysis system for mutual fund selection
FundSelector: A visual analysis system for mutual fund selection Open
Mutual funds are one of the most important and popular investment ways for ordinary investors to maintain and increase the value of their assets. However, it is challenging for ordinary investors to select optimal mutual funds from thousan…
View article: CineVision: An Interactive Pre-visualization Storyboard System for Director-Cinematographer Collaboration
CineVision: An Interactive Pre-visualization Storyboard System for Director-Cinematographer Collaboration Open
Effective communication between directors and cinematographers is fundamental in film production, yet traditional approaches relying on visual references and hand-drawn storyboards often lack the efficiency and precision necessary during p…
View article: RhythmTA: A Visual-Aided Interactive System for ESL Rhythm Training via Dubbing Practice
RhythmTA: A Visual-Aided Interactive System for ESL Rhythm Training via Dubbing Practice Open
English speech rhythm, the temporal patterns of stressed syllables, is essential for English as a second language (ESL) learners to produce natural-sounding and comprehensible speech. Rhythm training is generally based on imitation of nati…
View article: DataWink: Reusing and Adapting SVG-based Visualization Examples with Large Multimodal Models
DataWink: Reusing and Adapting SVG-based Visualization Examples with Large Multimodal Models Open
Creating aesthetically pleasing data visualizations remains challenging for users without design expertise or familiarity with visualization tools. To address this gap, we present DataWink, a system that enables users to create custom visu…
View article: Design Patterns of Human-AI Interfaces in Healthcare
Design Patterns of Human-AI Interfaces in Healthcare Open
Human-AI interfaces play a crucial role in advancing practices and research within the healthcare domain. However, designing such interfaces presents a substantial challenge for designers. In this paper, we propose systematic guidance for …
View article: Multi Layered Autonomy and AI Ecologies in Robotic Art Installations
Multi Layered Autonomy and AI Ecologies in Robotic Art Installations Open
This paper presents Symbiosis of Agents , a large-scale installation by artist Baoyang Chen, integrating AI-driven robotic agents within an immersive, reflective environment, foregrounding the delicate balance between machine agency and ar…
View article: Targeted control of fast prototyping through domain-specific interface
Targeted control of fast prototyping through domain-specific interface Open
Industrial designers have long sought a natural and intuitive way to achieve the targeted control of prototype models -- using simple natural language instructions to configure and adjust the models seamlessly according to their intentions…
View article: PIPE: Physics-Informed Position Encoding for Alignment of Satellite Images and Time Series
PIPE: Physics-Informed Position Encoding for Alignment of Satellite Images and Time Series Open
Multimodal time series forecasting is foundational in various fields, such as utilizing satellite imagery and numerical data for predicting typhoons in climate science. However, existing multimodal approaches primarily focus on utilizing t…
View article: The Jade Gateway to Trust: Exploring How Socio-Cultural Perspectives Shape Trust Within Chinese NFT Communities
The Jade Gateway to Trust: Exploring How Socio-Cultural Perspectives Shape Trust Within Chinese NFT Communities Open
Today's world is witnessing an unparalleled rate of technological transformation. The emergence of non-fungible tokens (NFTs) has transformed how we handle digital assets and value. These tokens have captured the interest of scholars and b…
View article: Dynamic visualization design of visual symbols using interactive genetic algorithm
Dynamic visualization design of visual symbols using interactive genetic algorithm Open
View article: Reflecting on Design Paradigms of Animated Data Video Tools
Reflecting on Design Paradigms of Animated Data Video Tools Open
View article: TangibleNet: Synchronous Network Data Storytelling through Tangible Interactions in Augmented Reality
TangibleNet: Synchronous Network Data Storytelling through Tangible Interactions in Augmented Reality Open
Synchronous data-driven storytelling with network visualizations presents significant challenges due to the complexity of real-time manipulation of network components. While existing research addresses asynchronous scenarios, there is a la…
View article: InterLink: Linking Text with Code and Output in Computational Notebooks
InterLink: Linking Text with Code and Output in Computational Notebooks Open
View article: "You'll Be Alice Adventuring in Wonderland!" Processes, Challenges, and Opportunities of Creating Animated Virtual Reality Stories
"You'll Be Alice Adventuring in Wonderland!" Processes, Challenges, and Opportunities of Creating Animated Virtual Reality Stories Open
Animated virtual reality (VR) stories, combining the presence of VR and the artistry of computer animation, offer a compelling way to deliver messages and evoke emotions. Motivated by the growing demand for immersive narrative experiences,…
View article: DanmuA11y: Making Time-Synced On-Screen Video Comments (Danmu) Accessible to Blind and Low Vision Users via Multi-Viewer Audio Discussions
DanmuA11y: Making Time-Synced On-Screen Video Comments (Danmu) Accessible to Blind and Low Vision Users via Multi-Viewer Audio Discussions Open
By overlaying time-synced user comments on videos, Danmu creates a co-watching experience for online viewers. However, its visual-centric design poses significant challenges for blind and low vision (BLV) viewers. Our formative study ident…
View article: Precision medicine in the prediction of metachronous liver metastasis in rectal cancer: Applications and challenges
Precision medicine in the prediction of metachronous liver metastasis in rectal cancer: Applications and challenges Open
Rectal cancer is a major global health concern, and metachronous liver metastasis (MLM) significantly worsens patient prognosis. Advances in imaging and machine learning have led to the development of radiomics models, particularly those u…
View article: Prompting Generative AI with Interaction-Augmented Instructions
Prompting Generative AI with Interaction-Augmented Instructions Open
The emergence of generative AI (GenAI) models, including large language models and text-to-image models, has significantly advanced the synergy between humans and AI with not only their outstanding capability but more importantly, the intu…
View article: Xavier: Toward Better Coding Assistance in Authoring Tabular Data Wrangling Scripts
Xavier: Toward Better Coding Assistance in Authoring Tabular Data Wrangling Scripts Open
Data analysts frequently employ code completion tools in writing custom scripts to tackle complex tabular data wrangling tasks. However, existing tools do not sufficiently link the data contexts such as schemas and values with the code bei…
View article: Reflection on Data Storytelling Tools in the Generative AI Era from the Human-AI Collaboration Perspective
Reflection on Data Storytelling Tools in the Generative AI Era from the Human-AI Collaboration Perspective Open
Human-AI collaborative tools attract attentions from the data storytelling community to lower the expertise barrier and streamline the workflow. The recent advance in large-scale generative AI techniques, e.g., large language models (LLMs)…
View article: Understanding Screenwriters' Practices, Attitudes, and Future Expectations in Human-AI Co-Creation
Understanding Screenwriters' Practices, Attitudes, and Future Expectations in Human-AI Co-Creation Open
With the rise of AI technologies and their growing influence in the screenwriting field, understanding the opportunities and concerns related to AI's role in screenwriting is essential for enhancing human-AI co-creation. Through semi-struc…
View article: InterLink: Linking Text with Code and Output in Computational Notebooks
InterLink: Linking Text with Code and Output in Computational Notebooks Open
Computational notebooks, widely used for ad-hoc analysis and often shared with others, can be difficult to understand because the standard linear layout is not optimized for reading. In particular, related text, code, and outputs may be sp…
View article: Reflecting on Design Paradigms of Animated Data Video Tools
Reflecting on Design Paradigms of Animated Data Video Tools Open
Animated data videos have gained significant popularity in recent years. However, authoring data videos remains challenging due to the complexity of creating and coordinating diverse components (e.g., visualization, animation, audio, etc.)…
View article: Exploring Spatial Hybrid User Interface for Visual Sensemaking
Exploring Spatial Hybrid User Interface for Visual Sensemaking Open
We built a spatial hybrid system that combines a personal computer (PC) and virtual reality (VR) for visual sensemaking, addressing limitations in both environments. Although VR offers immense potential for interactive data visualization (…
View article: Exploring the impact of robot interaction on learning engagement: a comparative study of two multi-modal robots
Exploring the impact of robot interaction on learning engagement: a comparative study of two multi-modal robots Open
In recent years, there has been a growing interest in using robots within educational environments due to their potential to augment student engagement and motivation. However, current research has not adequately addressed the effectivenes…
View article: Memory Reviver: Supporting Photo-Collection Reminiscence for People with Visual Impairment via a Proactive Chatbot
Memory Reviver: Supporting Photo-Collection Reminiscence for People with Visual Impairment via a Proactive Chatbot Open
Reminiscing with photo collections offers significant psychological benefits but poses challenges for people with visual impairment (PVI). Their current reliance on sighted help restricts the flexibility of this activity. In response, we e…
View article: Automated Constraint Specification for Job Scheduling by Regulating Generative Model with Domain-Specific Representation
Automated Constraint Specification for Job Scheduling by Regulating Generative Model with Domain-Specific Representation Open
Advanced Planning and Scheduling (APS) systems have become indispensable for modern manufacturing operations, enabling optimized resource allocation and production efficiency in increasingly complex and dynamic environments. While algorith…
View article: Unmasking Deceptive Visuals: Benchmarking Multimodal Large Language Models on Misleading Chart Question Answering
Unmasking Deceptive Visuals: Benchmarking Multimodal Large Language Models on Misleading Chart Question Answering Open