Huaimin Wang
YOU?
Author Swipe
View article: From PBFT to the present: a thorough overview of blockchain consensus protocols
From PBFT to the present: a thorough overview of blockchain consensus protocols Open
View article: Empowering Large Language Model Agent through Step-Level Self-Critique and Self-Training
Empowering Large Language Model Agent through Step-Level Self-Critique and Self-Training Open
View article: Towards Understanding Docker Build Faults in Practice: Symptoms, Root Causes, and Fix Patterns
Towards Understanding Docker Build Faults in Practice: Symptoms, Root Causes, and Fix Patterns Open
Docker building is a critical component of containerization in modern software development, automating the process of packaging and converting sources into container images. It is not uncommon to find that Docker build faults (DBFs) occur …
View article: Joint$λ$: Orchestrating Serverless Workflows on Jointcloud FaaS Systems
Joint$λ$: Orchestrating Serverless Workflows on Jointcloud FaaS Systems Open
Existing serverless workflow orchestration systems are predominantly designed for a single-cloud FaaS system, leading to vendor lock-in. This restricts performance optimization, cost reduction, and availability of applications. However, or…
View article: Enhancing Decision-Making for LLM Agents via Step-Level Q-Value Models
Enhancing Decision-Making for LLM Agents via Step-Level Q-Value Models Open
Agents significantly enhance the capabilities of standalone Large Language Models (LLMs) by perceiving environments, making decisions, and executing actions. However, LLM agents still face challenges in tasks that require multiple decision…
View article: Software Engineering for OpenHarmony: A Research Roadmap
Software Engineering for OpenHarmony: A Research Roadmap Open
Mobile software engineering has been a hot research topic for decades. Our fellow researchers have proposed various approaches (with over 7,000 publications for Android alone) in this field that essentially contributed to the great success…
View article: NebulaFL: Effective Asynchronous Federated Learning for JointCloud Computing
NebulaFL: Effective Asynchronous Federated Learning for JointCloud Computing Open
With advancements in AI infrastructure and Trusted Execution Environment (TEE) technology, Federated Learning as a Service (FLaaS) through JointCloud Computing (JCC) is promising to break through the resource constraints caused by heteroge…
View article: Enhancing Decision-Making for LLM Agents via Step-Level Q-Value Models
Enhancing Decision-Making for LLM Agents via Step-Level Q-Value Models Open
Agents significantly enhance the capabilities of standalone Large Language Models (LLMs) by perceiving environments, making decisions, and executing actions. However, LLM agents still face challenges in tasks that require multiple decision…
View article: Online Self-Preferring Language Models
Online Self-Preferring Language Models Open
Aligning with human preference datasets has been critical to the success of large language models (LLMs). Reinforcement learning from human feedback (RLHF) employs a costly reward model to provide feedback for on-policy sampling responses.…
View article: A Transformer-based Model for Assisting Dockerfile Revising
A Transformer-based Model for Assisting Dockerfile Revising Open
Dockerfile plays an important role in the containerized software development process since it specifies the structure and functionality of the built Docker image. Currently, Dockerfile writing and modification still rely on manual operatio…
View article: Optimistic Model Rollouts for Pessimistic Offline Policy Optimization
Optimistic Model Rollouts for Pessimistic Offline Policy Optimization Open
Model-based offline reinforcement learning (RL) has made remarkable progress, offering a promising avenue for improving generalization with synthetic model rollouts. Existing works primarily focus on incorporating pessimism for policy opti…
View article: Optimistic Model Rollouts for Pessimistic Offline Policy Optimization
Optimistic Model Rollouts for Pessimistic Offline Policy Optimization Open
Model-based offline reinforcement learning (RL) has made remarkable progress, offering a promising avenue for improving generalization with synthetic model rollouts. Existing works primarily focus on incorporating pessimism for policy opti…
View article: Development Strategy of Collective Intelligence and Its Industrial Clusters
Development Strategy of Collective Intelligence and Its Industrial Clusters Open
Collective intelligence is an important component of the new generation of artificial intelligence (AI). It plays a decisive role in stimulating and converging innovative forces as well as coupling and integrating large-scale intelligent s…
View article: Uncertainty-Penalized Reinforcement Learning from Human Feedback with Diverse Reward LoRA Ensembles
Uncertainty-Penalized Reinforcement Learning from Human Feedback with Diverse Reward LoRA Ensembles Open
Reinforcement learning from human feedback (RLHF) emerges as a promising paradigm for aligning large language models (LLMs). However, a notable challenge in RLHF is overoptimization, where beyond a certain threshold, the pursuit of higher …
View article: Software Engineering for OpenHarmony: A Research Roadmap
Software Engineering for OpenHarmony: A Research Roadmap Open
Mobile software engineering has been a hot research topic for decades. Our fellow researchers have proposed various approaches (with over 7,000 publications for Android alone) in this field that essentially contributed to the great success…
View article: Ark Filter: A General and Space-Efficient Sketch for Network Flow Analysis
Ark Filter: A General and Space-Efficient Sketch for Network Flow Analysis Open
Sketches are widely deployed to represent network flows to support complex flow analysis. Typical sketches usually employ hash functions to map elements into a hash table or bit array. Such sketches still suffer from potential weaknesses u…
View article: Jump Filter: A Dynamic Sketch for Big Data Governance
Jump Filter: A Dynamic Sketch for Big Data Governance Open
PDF HTML XML Export Cite reminder Jump Filter: A Dynamic Sketch for Big Data Governance DOI: 10.21655/ijsi.1673-7288.00296 Author: Affiliation: Clc Number: Fund Project: Article | Figures | Metrics | Reference | Related | Cited by | Materi…
View article: Intelligent Computing: The Latest Advances, Challenges, and Future
Intelligent Computing: The Latest Advances, Challenges, and Future Open
Computing is a critical driving force in the development of human civilization. In recent years, we have witnessed the emergence of intelligent computing, a new computing paradigm that is reshaping traditional computing and promoting digit…
View article: Intelligent Computing: The Latest Advances, Challenges and Future
Intelligent Computing: The Latest Advances, Challenges and Future Open
Computing is a critical driving force in the development of human civilization. In recent years, we have witnessed the emergence of intelligent computing, a new computing paradigm that is reshaping traditional computing and promoting digit…
View article: pull request id for the dataset of "Pull Request Latency Explained: An Empirical Overview"
pull request id for the dataset of "Pull Request Latency Explained: An Empirical Overview" Open
This is used for research, which includes the pull request id. Researchers need to request the access.
View article: pull request id for the dataset of "Pull Request Latency Explained: An Empirical Overview"
pull request id for the dataset of "Pull Request Latency Explained: An Empirical Overview" Open
This is used for research, which includes the pull request id. Researchers need to request the access.
View article: Understanding and Predicting Docker Build Duration: An Empirical Study of Containerized Workflow of OSS Projects
Understanding and Predicting Docker Build Duration: An Empirical Study of Containerized Workflow of OSS Projects Open
Docker building is a critical component of containerized workflow, which automates the process by which sources are packaged and transformed into container images. If not run properly, Docker builds can bring long durations (i.e., slow bui…
View article: Dynamic Memory-based Curiosity: A Bootstrap Approach for Exploration
Dynamic Memory-based Curiosity: A Bootstrap Approach for Exploration Open
The sparsity of extrinsic rewards poses a serious challenge for reinforcement learning (RL). Currently, many efforts have been made on curiosity which can provide a representative intrinsic reward for effective exploration. However, the ch…
View article: Self-Supervised Exploration via Temporal Inconsistency in Reinforcement Learning
Self-Supervised Exploration via Temporal Inconsistency in Reinforcement Learning Open
Under sparse extrinsic reward settings, reinforcement learning has remained challenging, despite surging interests in this field. Previous attempts suggest that intrinsic reward can alleviate the issue caused by sparsity. In this article, …
View article: Diversifying Message Aggregation in Multi-Agent Communication via Normalized Tensor Nuclear Norm Regularization
Diversifying Message Aggregation in Multi-Agent Communication via Normalized Tensor Nuclear Norm Regularization Open
Aggregating messages is a key component for the communication of multi-agent reinforcement learning (Comm-MARL). Recently, it has witnessed the prevalence of graph attention networks (GAT) in Comm-MARL, where agents can be represented as n…
View article: Trusted Multi-Scale Classification Framework for Whole Slide Image
Trusted Multi-Scale Classification Framework for Whole Slide Image Open
Despite remarkable efforts been made, the classification of gigapixels whole-slide image (WSI) is severely restrained from either the constrained computing resources for the whole slides, or limited utilizing of the knowledge from differen…
View article: Pull request latency explained: an empirical overview
Pull request latency explained: an empirical overview Open
View article: Nuclear Norm Maximization Based Curiosity-Driven Learning
Nuclear Norm Maximization Based Curiosity-Driven Learning Open
To handle the sparsity of the extrinsic rewards in reinforcement learning, researchers have proposed intrinsic reward which enables the agent to learn the skills that might come in handy for pursuing the rewards in the future, such as enco…
View article: Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast
Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast Open
We present an approach to learn voice-face representations from the talking face videos, without any identity labels. Previous works employ cross-modal instance discrimination tasks to establish the correlation of voice and face. These met…
View article: The Development and Prospect of Code Clone
The Development and Prospect of Code Clone Open
The application of code clone technology accelerates code search, improves code reuse efficiency, and assists in software quality assessment and code vulnerability detection. However, the application of code clones also introduces software…