Explanipedia

DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning

Daya Guo, Dejian Yang, Haowei Zhang, Junxiao Song, Peiyi Wang, et al. · 2025

General reasoning represents a long-standing and formidable challenge in artificial intelligence (AI). Recent breakthroughs, exemplified by large language models (LLMs) [1,2] and chain-of-thought (CoT) prompting [3], have achieved considerabl…

DHGRPO: Domain-Induced, Hierarchical Group Relative Policy Optimization

DeepSeek-AI, Daya Guo, Dejian Yang, Haowei Zhang, Junxiao Song, et al. · 2025

DHGRPO (Domain-Induced Hierarchical Group Relative Policy Optimization) is a mathematically grounded extension of Group Relative Policy Optimization (GRPO) that mitigates group-level failure modes in preference-based fine-tuning of large l…

Janus-Pro: Unified Multimodal Understanding and Generation with Data and Model Scaling

Xiaokang Chen, Zhiyu Wu, Xingchao Liu, Zizheng Pan, Wenjun Liu, et al. · 2025

In this work, we introduce Janus-Pro, an advanced version of the previous work Janus. Specifically, Janus-Pro incorporates (1) an optimized training strategy, (2) expanded training data, and (3) scaling to larger model size. With these imp…

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Zhiyu Wu, Xiaokang Chen, Zizheng Pan, Xingchao Liu, Wen Liu, et al. · 2024

We present DeepSeek-VL2, an advanced series of large Mixture-of-Experts (MoE) Vision-Language Models that significantly improves upon its predecessor, DeepSeek-VL, through two key major upgrades. For the vision component, we incorporate…

Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation

Chengyue Wu, Xiaokang Chen, Zhiyu Wu, Yiyang Ma, Xingchao Liu, et al. · 2024

In this paper, we introduce Janus, an autoregressive framework that unifies multimodal understanding and generation. Prior research often relies on a single visual encoder for both tasks, such as Chameleon. However, due to the differing le…

A Dual Adaptive Unscented Kalman Filter Algorithm for SINS-Based Integrated Navigation System

Xu Lyu, Ziyang Meng, Chunyu Li, Zhenyu Cai, Yi Huang, et al. · 2024

In this study, the problem of measuring noise pollution distribution by the inertial-based integrated navigation system is effectively suppressed. Based on nonlinear inertial navigation error modeling, a nested dual Kalman filter framewor…

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Damai Dai, Chengqi Deng, Chenggang Zhao, Renyuan Xu, Huazuo Gao, et al. · 2024

In the era of large language models, Mixture-of-Experts (MoE) is a promising architecture for managing computational costs when scaling up model parameters. However, conventional MoE architectures like GShard, which activate the top-$K$ ou…

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

DeepSeek-AI, Xiao Guo Bi, Deli Chen, Guanting Chen, et al. · 2024

The rapid development of open-source large language models (LLMs) has been truly remarkable. However, the scaling law described in previous literature presents varying conclusions, which casts a dark cloud over scaling LLMs. We delve into …

Robust Kalman filters with unknown covariance of multiplicative noise

Xingkai Yu, Ziyang Meng · 2021

In this paper, state and noise covariance estimation problems for linear systems with unknown multiplicative noise are considered. The measurement likelihood is modelled as a mixture of two Gaussian distributions and a Student's t distribut…

Adaptive Kalman Filter for Linear Systems with Additive and Multiplicative Noises

Xingkai Yu · 2021

This manuscript investigates the adaptive Kalman filter problem of linear systems with multiplicative and additive noises. The main contributions are stated in two aspects. Firstly, compared with the estimation problem of linear systems wit…
