Ruilong Chen
Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation
We propose Kling-Foley, a large-scale multimodal Video-to-Audio generation model that synthesizes high-quality audio synchronized with video content. In Kling-Foley, we introduce multimodal diffusion transformers to model the interactions …
Enhancing Soccer Camera Calibration Through Keypoint Exploitation
Accurate camera calibration is essential for transforming 2D images from camera sensors into 3D world coordinates, enabling precise scene geometry interpretation and supporting sports analytics tasks such as player tracking, offside detect…
SoccerNet 2023 Challenges Results
The SoccerNet 2023 challenges were the third annual video understanding challenges organized by the SoccerNet team. For this third edition, the challenges were composed of seven vision-based tasks split into three main themes. The first th…
A Difference Subgridding Method for Solving Multiscale Electro-Thermal Problems
Because of its lower memory cost and time consumption, a finite-difference subgrid technique can deal effectively with multiscale problems in electromagnetic fields. When it is applied to Maxwell's equations, symmetric elements of the matrix are required;…
A Real Time Vision-Based Smoking Detection Framework on Edge
Smoking is the main cause of fire disasters and pollution at petrol stations, construction sites, and warehouses. Existing solutions based on wearable devices and smoking sensors were costly and made it hard to obtain evidence of smoking in unmanned …
Wildlife surveillance using deep learning methods
Wildlife conservation and the management of human–wildlife conflicts require cost‐effective methods of monitoring wild animal behavior. Still and video camera surveillance can generate enormous quantities of data, which is laborious and ex…
A Deep Learning Framework for Joint Image Restoration and Recognition
Image restoration and recognition are important computer vision tasks representing an inherent part of autonomous systems. These two tasks are often implemented in a sequential manner, in which the restoration process is followed by a reco…
Machine learning methods for autonomous object recognition and restoration in images
Image recognition and image restoration are important tasks in the field of image processing. Image recognition is becoming very popular due to state-of-the-art deep learning methods. However, these models usually require big datasets…
Vehicle logo recognition by spatial-SIFT combined with logistic regression
An efficient recognition framework requires both good feature representation and effective classification methods. This paper proposes such a framework based on a spatial Scale Invariant Feature Transform (SIFT) combined with a logis…