Explanipedia

CB-EVO: Contextual Bandit Tuning with Evolutionary Search for Logic Synthesis Open

Fangzhou Liu, Wuqian Tang, Zehua Pei, Ziyang Yu, Haisheng Zheng , et al. · 2025

In logic synthesis, pre-optimization involves applying a sequence of transformations, referred to as a synthesis flow, to reduce the complexity of a circuit’s Boolean logic graph, such as the And-Inverter Graph (AIG). The primary challenge…

Stratified GRPO: Handling Structural Heterogeneity in Reinforcement Learning of LLM Search Agents Open

Mingkang Zhu, Xi Chen, Bei Yu, Hengshuang Zhao, Jiaya Jia · 2025

Large language model (LLM) agents increasingly rely on external tools such as search engines to solve complex, multi-step problems, and reinforcement learning (RL) has become a key paradigm for training them. However, the trajectories of s…

MGM-Omni: Scaling Omni LLMs to Personalized Long-Horizon Speech Open

Chengyao Wang, Zhaolong Zhong, Bohao Peng, Senqiao Yang, Yuqi Liu , et al. · 2025

We present MGM-Omni, a unified Omni LLM for omni-modal understanding and expressive, long-horizon speech generation. Unlike cascaded pipelines that isolate speech synthesis, MGM-Omni adopts a "brain-mouth" design with a dual-track, token-b…

VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning Open

Senqiao Yang, Junyi Li, Xin Lai, Bei Yu, Hengshuang Zhao , et al. · 2025

Recent advancements in vision-language models (VLMs) have improved performance by increasing the number of visual tokens, which are often significantly longer than text tokens. However, we observe that most real-world scenarios do not requ…

Deep-Learning-Based Pre-Layout Parasitic Capacitance Prediction on SRAM Designs Open

Shan Shen, Dingcheng Yang, Yuyang Xie, Chunyan Pei, Wenjian Yu , et al. · 2025

To achieve higher system energy efficiency, SRAM in SoCs is often customized. The parasitic effects cause notable discrepancies between pre-layout and post-layout circuit simulations, leading to difficulty in converging design parameters a…

Rank-DSE: Neural Pareto Comparator of Microarchitecture Design Space Exploration Open

Peng Xu, Su Zheng, Mingzi Wang, Ziyang Yu, Shixin Chen , et al. · 2025

The complexity of microarchitecture design has surged due to the expanding design space and time-intensive verification processes. Existing regression-based machine learning methods struggle with inaccurate estimations because of limited t…

TGDPO: Harnessing Token-Level Reward Guidance for Enhancing Direct Preference Optimization Open

Mingkang Zhu, Xizhang Chen, Z. Wang, Bei Yu, Hengshuang Zhao , et al. · 2025

Recent advancements in reinforcement learning from human feedback have shown that utilizing fine-grained token-level reward models can substantially enhance the performance of Proximal Policy Optimization (PPO) in aligning large language m…

HAPE: Hardware-Aware LLM Pruning For Efficient On-Device Inference Optimization Open

Wenqian Zhao, Lancheng Zou, Zixiao Wang, Xufeng Yao, Bei Yu · 2025

Over the past few years, large language models (LLMs) have demonstrated remarkable performance and versatility across a variety of complex tasks. However, their deployment has been challenged by their substantial model size and computation…

RTime-QA: A Benchmark for Atomic Temporal Event Understanding in Large Multi-modal Models Open

Yuqi Liu, Qin Jin, Ting Qu, Xuan Liu, Yang Du , et al. · 2025

Understanding accurate atomic temporal event is essential for video comprehension. However, current video-language benchmarks often fall short to evaluate Large Multi-modal Models' (LMMs) temporal event understanding capabilities, as they …

UniMoCo: Unified Modality Completion for Robust Multi-Modal Embeddings Open

Jiajun Qin, Yuan Pu, Zhiqiang He, Seunggeun Kim, David Z. Pan , et al. · 2025

Current research has explored vision-language models for multi-modal embedding tasks, such as information retrieval, visual grounding, and classification. However, real-world scenarios often involve diverse modality combinations between qu…

VisionReasoner: Unified Reasoning-Integrated Visual Perception via Reinforcement Learning Open

Yuqi Liu, Tan Qu, Zhisheng Zhong, Bohao Peng, Shu Liu , et al. · 2025

Large vision-language models exhibit inherent capabilities to handle diverse visual perception tasks. In this paper, we introduce VisionReasoner, a unified framework capable of reasoning and solving multiple visual perception tasks within …

Large Language Models for EDA: Future or Mirage? Open

Zhuolun He, Yuan Pu, Haoyuan Wu, Tairu Qiu, Bei Yu · 2025

In this article, we explore the burgeoning intersection of large language models (LLMs) and electronic design automation (EDA). We critically assess whether LLMs represent a transformative future for EDA or merely a fleeting mirage. By org…

HDLdebugger: Streamlining HDL debugging with Large Language Models Open

Xufeng Yao, Haoyang Li, Tszho Chan, Wenyi Xiao, Mingxuan Yuan , et al. · 2025

In the domain of chip design, hardware description languages (HDLs) play a pivotal role. However, due to the inherent complexity of HDLs and the scarcity of high-quality debugging resources, HDL bug fixing remains a challenging and time-co…

DiffPattern-Flex: Efficient Layout Pattern Generation via Discrete Diffusion Open

Zixiao Wang, Wenqian Zhao, Yunheng Shen, Yang Bai, Guojin Chen , et al. · 2025

Recent advancements in layout pattern generation have been dominated by deep generative models. However, relying solely on neural networks for legality guarantees raises concerns in many practical applications. In this paper, we present \t…

G-kway: Multilevel GPU-Accelerated k-way Graph Partitioner using Task Graph Parallelism Open

Wan Luan Lee, Dian-Lun Lin, Shui Jiang, Cheng-Hsiang Chiu, Yibo Lin , et al. · 2025

Graph partitioning is important for the design of many CAD algorithms. However, as the graph size continues to grow, graph partitioning becomes increasingly time-consuming. Recent research has introduced parallel graph partitioners using e…

FlexPose: Pose Distribution Adaptation with Limited Guidance Open

Z. Jane Wang, Junwu Weng, Mengyuan Liu, Bei Yu · 2025

Numerous well-annotated human key-point datasets are publicly available to date. However, annotating human poses for newly collected images is still a costly and time-consuming progress. Pose distributions from different datasets share sim…

Bridging Hotspot Detection and Mask Optimization via Domain-Crossing Masked Layout Modeling Open

Binwu Zhu, Su Zheng, Yuzhe Ma, Bei Yu, Martin D. F. Wong · 2025

With the rapid development of semiconductors, the size of transistors is continuously scaling down. The shrinking circuit size poses great challenges to optical proximity correction (OPC) and hotspot detection (HSD). Recent advancements in…

Ultrasonic Time-of-Flight Diffraction Imaging Enhancement for Pipeline Girth Weld Testing via Time-Domain Sparse Deconvolution and Frequency-Domain Synthetic Aperture Focusing Open

Eryong Wu, Ye Han, Bei Yu, Wei Zhou, Shaohua Tian · 2025

Ultrasonic TOFD imaging, as an important non-destructive testing method, has a wide range of applications in pipeline girth weld inspection and testing. Due to the limited bandwidth of ultrasonic transducers, near-surface defects in the we…

EasyMRC: Efficient Mask Rule Checking via Representative Edge Sampling Open

Jiren Xu, Zhuolun He, Shuo Yin, Yuan Pu, Wenjian Yu , et al. · 2025

The photolithography process is getting more sophisticated with technology node scaling down and VLSI designs becoming complex. As photomask patterns get finer, mask rule checks (MRCs) are inevitable to avoid discrepancies in the layout an…

GraphCAD: Leveraging Graph Neural Networks for Accuracy Prediction Handling Crosstalk-affected Delays Open

Fangzhou Liu, Guannan Guo, Yuyang Ye, Ziyi Wang, Wenjie Fu , et al. · 2025

HeLO: A He terogeneous L ogic O ptimization Framework by Hierarchical Clustering and Graph Learning Open

Yuan Pu, Fangzhou Liu, Zhuolun He, Keren Zhu, Rongliang Fu , et al. · 2025

ML-Based Fine-Grained Modeling of DC Current Crowding in Power Delivery TSVs for Face-to-Face 3D ICs Open

Zheng Yang, Zhen Zhuang, Bei Yu, Tsung-Yi Ho, Martin D. F. Wong , et al. · 2025

Invited: Physical Design for Advanced 3D ICs: Challenges and Solutions Open

Yuxuan Zhao, Lancheng Zou, Bei Yu · 2025

Routing-aware Legal Hybrid Bonding Terminal Assignment for 3D Face-to-Face Stacked ICs Open

Siting Liu, Junyao Zhou, Jiaxi Jiang, Zhuolun He, Ziyi Wang , et al. · 2025

Face-to-face (F2F) stacked 3D IC is a promising alternative for scaling beyond Moore’s Law. In F2F 3D ICs, dies are connected through bonding terminals whose positions can significantly impact routing performance. Further, there exists res…

DeepVerifier: Learning to Update Test Sequences for Coverage-Guided Verification Open

Y. P. Lu, Chen Bai, Yuxuan Zhao, Ziyue Zheng, Yangdi Lyu , et al. · 2025

Verification is critical in ensuring the reliable operation of modern, complex computing systems. However, as processor designs become increasingly sophisticated, conventional static verification techniques struggle to generate high-qualit…

Architect of the Bits World: Masked Autoregressive Modeling for Circuit Generation Guided by Truth Table Open

Haoyuan Wu, Haisheng Zheng, Shoubo Hu, Zhuolun He, Bei Yu · 2025

Logic synthesis, a critical stage in electronic design automation (EDA), optimizes gate-level circuits to minimize power consumption and area occupancy in integrated circuits (ICs). Traditional logic synthesis tools rely on human-designed …

Circuit Representation Learning with Masked Gate Modeling and Verilog-AIG Alignment Open

Haoyuan Wu, Han Zheng, Yuan Pu, Bei Yu · 2025

Understanding the structure and function of circuits is crucial for electronic design automation (EDA). Circuits can be formulated as And-Inverter graphs (AIGs), enabling efficient implementation of representation learning through graph ne…

Ultrasonic TOFD Imaging Enhancement Technology for Pipeline Girth Welds Testing via Time Domain Sparse Deconvolution and Frequency Domain Synthetic Aperture Focusing Open

Eryong Wu, Ye Han, Bei Yu, Wei Zhou · 2025

Ultrasonic TOFD imaging, as an important non-destructive testing method, has a wide range of applications in pipeline girths weld inspection and testing. Due to the limited bandwidth of ultrasonic transducers, near-surface defects in the w…

CMoE: Converting Mixture-of-Experts from Dense to Accelerate LLM Inference Open

Zehua Pei, Lan Zou, Hui-Ling Zhen, X. D. Yu, Wulong Liu , et al. · 2025

Scaling large language models (LLMs) improves performance but dramatically increases inference costs. The feed-forward network (FFN), consuming approximately 70\% of inference compute, represents a critical bottleneck, particularly in larg…

TorchResist: Open-Source Differentiable Resist Simulator Open

Zixiao Wang, Jieqiong Zhou, Su Zheng, Shengyong Yin, Kang Liang , et al. · 2025

Recent decades have witnessed remarkable advancements in artificial intelligence (AI), including large language models (LLMs), image and video generative models, and embodied AI systems. These advancements have led to an explosive increase…

Bei Yu YOU? Author Swipe