Explanipedia

Image2Gcode: Image-to-G-code Generation for Additive Manufacturing Using Diffusion-Transformer Model Open

Wang Ziyue, Jadhav, Yayati, Pák Péter, Farimani, Amir Barati · 2025

Mechanical design and manufacturing workflows conventionally begin with conceptual design, followed by the creation of a computer-aided design (CAD) model and fabrication through material-extrusion (MEX) printing. This process requires con…

Image2Gcode: Image-to-G-code Generation for Additive Manufacturing Using Diffusion-Transformer Model Open

Wang Ziyue, Jadhav, Yayati, Pák Péter, Farimani, Amir Barati · 2025

Mechanical design and manufacturing workflows conventionally begin with conceptual design, followed by the creation of a computer-aided design (CAD) model and fabrication through material-extrusion (MEX) printing. This process requires con…

SAM2S: Segment Anything in Surgical Videos via Semantic Long-term Tracking Open

Liu Hao-feng, Wang Ziyue, Mishra Sudhanshu, Gao Mingqi, Qin, Guanyi , et al. · 2025

Surgical video segmentation is crucial for computer-assisted surgery, enabling precise localization and tracking of instruments and tissues. Interactive Video Object Segmentation (iVOS) models such as Segment Anything Model 2 (SAM2) provid…

SAM2S: Segment Anything in Surgical Videos via Semantic Long-term Tracking Open

Liu Haofeng, Wang Ziyue, Mishra Sudhanshu, Gao, Mingqi, Qin, Guanyi , et al. · 2025

Surgical video segmentation is crucial for computer-assisted surgery, enabling precise localization and tracking of instruments and tissues. Interactive Video Object Segmentation (iVOS) models such as Segment Anything Model 2 (SAM2) provid…

SAM2S: Segment Anything in Surgical Videos via Semantic Long-term Tracking Open

Liu Haofeng, Wang Ziyue, Mishra Sudhanshu, Gao, Mingqi, Qin, Guanyi , et al. · 2025

Surgical video segmentation is crucial for computer-assisted surgery, enabling precise localization and tracking of instruments and tissues. Interactive Video Object Segmentation (iVOS) models such as Segment Anything Model 2 (SAM2) provid…

Cross-platform Clinical Proteomics using the Charité Open Standard for Plasma Proteomics (OSPP) Open

Wang Ziyue · 2025

we present the Charité Open Peptide Standard for plasma proteomics (OSPP), an open resource composed of 211 isotope-labeled peptides, intended to be used as an internal standard for plasma and serum proteomic projects. The OSPP was designe…

Diving into Mitigating Hallucinations from a Vision Perspective for Large Vision-Language Models Open

Association for Computational Linguistics 2025, Gao, Longwen, Li, Peiyi, Li Xinhao, Pang Yan , et al. · 2025

Object hallucination in Large Vision-Language Models (LVLMs) significantly impedes their real-world applicability. As the primary component for accurately interpreting visual information, the choice of visual encoder is pivotal. We hypothe…

MUCAR: Benchmarking Multilingual Cross-Modal Ambiguity Resolution for Multimodal Large Language Models Open

Association for Computational Linguistics 2025, Huang Kaiyu, Kang, Zhaolu, Lai, Yunghwei, Li Peng , et al. · 2025

Multimodal Large Language Models (MLLMs) have demonstrated significant advances across numerous vision-language tasks. Due to their strong performance in image-text alignment, MLLMs can effectively understand image-text pairs with clear me…

DongbaMIE: A Multimodal Information Extraction Dataset for Evaluating Semantic Understanding of Dongba Pictograms Open

Association for Computational Linguistics 2025, Bi Xiaojun, Han Lü, Li Peng, LI Shuo , et al. · 2025

Dongba pictographic is the only pictographic script still in use in the world. Its pictorial ideographic features carry rich cultural and contextual information. However, due to the lack of relevant datasets, research on semantic understan…

Diving into Mitigating Hallucinations from a Vision Perspective for Large Vision-Language Models Open

Wang Weihang, LI XinHao, Wang Ziyue, Pang, Yan, Zhang Jie-lei , et al. · 2025

Object hallucination in Large Vision-Language Models (LVLMs) significantly impedes their real-world applicability. As the primary component for accurately interpreting visual information, the choice of visual encoder is pivotal. We hypothe…

Structure Matters: Revisiting Boundary Refinement in Video Object Segmentation Open

Qin, Guanyi, Wang Ziyue, Shen, Daiyun, Liu Hao-feng, Zhou Hantao , et al. · 2025

Given an object mask, Semi-supervised Video Object Segmentation (SVOS) technique aims to track and segment the object across video frames, serving as a fundamental task in computer vision. Although recent memory-based methods demonstrate p…

Functional Time Series Forecasting of Distributions: A Koopman-Wasserstein Approach Open

Wang Ziyue, Araki Yuko · 2025

We propose a novel method for forecasting the temporal evolution of probability distributions observed at discrete time points. Extending the Dynamic Probability Density Decomposition (DPDD), we embed distributional dynamics into Wasserste…

MedAgent-Pro: Towards Evidence-based Multi-modal Medical Diagnosis via Reasoning Agentic Workflow Open

Wang Ziyue, Wu, Junde, Cai, Linghan, Low, Chang Han, Yang, Xihong , et al. · 2025

In modern medicine, clinical diagnosis relies on the comprehensive analysis of primarily textual and visual data, drawing on medical expertise to ensure systematic and rigorous reasoning. Recent advances in large Vision-Language Models (VL…

EgoLife: Towards Egocentric Life Assistant Open

Yang, Jingkang, Liu Shuai, Guo Hongming, Dong Yuhao, Zhang, Xiamengwei , et al. · 2025

We introduce EgoLife, a project to develop an egocentric life assistant that accompanies and enhances personal efficiency through AI-powered wearable glasses. To lay the foundation for this assistant, we conducted a comprehensive data coll…

<b>Mowing intensity and duration reshape and decouple plant and microbial communities</b> Open

Wang, Ziyue · 2025

We collected 363 peer-reviewed articles related to mowing (3555 data points) and conducted a meta-analysis. The indicators covered the impacts of mowing on plant biomass, richness, and microbial growth and metabolism. The focus was on anal…

<b>Mowing intensity and duration reshape and decouple plant and microbial communities</b> Open

Wang, Ziyue · 2025

We collected 363 peer-reviewed articles related to mowing (3555 data points) and conducted a meta-analysis. The indicators covered the impacts of mowing on plant biomass, richness, and microbial growth and metabolism. The focus was on anal…

Wang, Ziyue YOU? Author Swipe