Runji Wang
YOU?
Author Swipe
View article: Elastic signatures of stability in Ir–Ni–Ta bulk metallic glasses
Elastic signatures of stability in Ir–Ni–Ta bulk metallic glasses Open
The advancement of bulk metallic glasses (BMGs) for extreme-environment applications is hindered by limited understanding of their elastic behavior and structural stability under high pressure. This study presents a comprehensive study of …
View article: DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning
DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning Open
General reasoning represents a long-standing and formidable challenge in artificial intelligence (AI). Recent breakthroughs, exemplified by large language models (LLMs) 1,2 and chain-of-thought (CoT) prompting 3 , have achieved considerabl…
View article: DHGRPO: Domain-Induced, Hierarchical Group Relative Policy Optimization
DHGRPO: Domain-Induced, Hierarchical Group Relative Policy Optimization Open
DHGRPO (Domain-Induced Hierarchical Group Relative Policy Optimization) is a mathematically grounded extension of Group Relative Policy Optimization (GRPO) that mitigates group-level failure modes in preference-based fine-tuning of large l…
View article: DeepSeek-V3 Technical Report
DeepSeek-V3 Technical Report Open
We present DeepSeek-V3, a strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token. To achieve efficient inference and cost-effective training, DeepSeek-V3 adopts Multi-head Latent Attenti…
View article: A Clinical Multi-center Study of Pregnant Women with COVID-19 in Hubei, China
A Clinical Multi-center Study of Pregnant Women with COVID-19 in Hubei, China Open
Background: Coronavirus disease 2019 (COVID-19) has rapidly spread to more than 200 countries. Thus far, reports regarding multi-center data from throughout gestation in women with COVID-19 and newborn outcomes are scarce. Methods: We retr…