Explanipedia

Deep Residual Learning for Image Recognition Open

Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun · 2016

Actualmente diversas investigaciones se han enfocado en analizar a partir de videos de alta velocidad, características de las descargas eléctricas atmosféricas con el fin de adquirir mejor comprensión del fenómeno, que conlleva entre otros…

MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications Open

Andrew Howard, Menglong Zhu, Bo Chen, Dmitry Kalenichenko, Weijun Wang , et al. · 2017

Computer science Engineering Physics

We present a class of efficient models called MobileNets for mobile and embedded vision applications. MobileNets are based on a streamlined architecture that uses depth-wise separable convolutions to build light weight deep neural networks…

Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning Open

Hoo-Chang Shin, Holger R. Roth, Mingchen Gao, Le Lü, Ziyue Xu , et al. · 2016

Computer science Biology

Remarkable progress has been made in image recognition, primarily due to the availability of large-scale annotated datasets and deep convolutional neural networks (CNNs). CNNs enable learning data-driven, highly representative, hierarchica…

Simple online and realtime tracking Open

Alex Bewley, Zongyuan Ge, Lionel Ott, Fábio Ramos, Ben Upcroft · 2016

Computer science Physics Philosophy

This paper explores a pragmatic approach to multiple object tracking where the main focus is to associate objects efficiently for online and realtime applications. To this end, detection quality is identified as a key factor influencing tr…

R-FCN: Object Detection via Region-based Fully Convolutional Networks Open

Jifeng Dai, Yi Li, Kaiming He, Jian Sun · 2016

Computer science Chemistry

We present region-based, fully convolutional networks for accurate and efficient object detection. In contrast to previous region-based detectors such as Fast/Faster R-CNN that apply a costly per-region subnetwork hundreds of times, our re…

You Only Look Once: Unified, Real-Time Object Detection Open

Joseph Redmon, Santosh Divvala, Ross Girshick, Ali Farhadi · 2016

Computer science Mathematics

We present YOLO, a new approach to object detection. Prior work on object detection repurposes classifiers to perform detection. Instead, we frame object detection as a regression problem to spatially separated bounding boxes and associate…

Res2Net: A New Multi-Scale Backbone Architecture Open

Shanghua Gao, Ming‐Ming Cheng, Kai Zhao, Xinyu Zhang, Ming–Hsuan Yang , et al. · 2019

Computer science Mathematics Physics

Representing features at multiple scales is of great importance for numerous vision tasks. Recent advances in backbone convolutional neural networks (CNNs) continually demonstrate stronger multi-scale representation ability, leading to con…

SECOND: Sparsely Embedded Convolutional Detection Open

Yan Yan, Yuxing Mao, Bo Li · 2018

Computer science Mathematics Economics

LiDAR-based or RGB-D-based object detection is used in numerous applications, ranging from autonomous driving to robot vision. Voxel-based 3D convolutional networks have been used for some time to enhance the retention of information when …

DOTA: A Large-Scale Dataset for Object Detection in Aerial Images Open

Gui-Song Xia, Xiang Bai, Jian Ding, Zhen Zhu, Serge Belongie , et al. · 2018

Computer science Geography Mathematics

Object detection is an important and challenging problem in computer vision. Although the past decade has witnessed major advances in object detection in natural scenes, such successes have been slow to aerial imagery, not only because of …

Deep Learning for Generic Object Detection: A Survey Open

Li Liu, Wanli Ouyang, Xiaogang Wang, Paul Fieguth, Jie Chen , et al. · 2019

Computer science Geography Mathematics

Object detection, one of the most fundamental and challenging problems in computer vision, seeks to locate object instances from a large number of predefined categories in natural images. Deep learning techniques have emerged as a powerful…

MobileNetV2: Inverted Residuals and Linear Bottlenecks Open

Mark Sandler, Andrew Howard, Menglong Zhu, Andrey Zhmoginov, Liang-Chieh Chen · 2018

Computer science

In this paper we describe a new mobile architecture, MobileNetV2, that improves the state of the art performance of mobile models on multiple tasks and benchmarks as well as across a spectrum of different model sizes. We also describe effi…

A Review of Yolo Algorithm Developments Open

Peiyuan Jiang, Daji Ergu, Fangyao Liu, Ying Cai, Bo Ma · 2022

Computer science Philosophy Mathematics

Object detection techniques are the foundation for the artificial intelligence field. This research paper gives a brief overview of the You Only Look Once (YOLO) algorithm and its subsequent advanced versions. Through the analysis, we reac…

RU-AI: A Large Multimodal Dataset for Machine Generated Content Detection Open

Tsung-Yi Lin, Michael Maire, Serge Belongie, Lubomir Bourdev, Ross Girshick · 2024

Computer science Geography Geology

This repository contains all the collected and aligned data for RU-AI dataset. It is constructed based on three large publicly available datasets: Flickr8K, COCO, and Places205, by adding their corresponding machine-generated pairs from fi…

A Comprehensive Review of YOLO Architectures in Computer Vision: From YOLOv1 to YOLOv8 and YOLO-NAS Open

Juan Terven, Diana‐Margarita Córdova‐Esparza, Julio-Alejandro Romero-González · 2023

Computer science Engineering Geography

YOLO has become a central real-time object detection system for robotics, driverless cars, and video monitoring applications. We present a comprehensive analysis of YOLO’s evolution, examining the innovations and contributions in each iter…

PoseCNN: A Convolutional Neural Network for 6D Object Pose Estimation in Cluttered Scenes Open

Xiang Yu, Tanner Schmidt, Venkatraman Narayanan, Dieter Fox · 2018

Computer science Mathematics Chemistry

Estimating the 6D pose of known objects is important for robots to interact with the real world.The problem is challenging due to the variety of objects as well as the complexity of a scene caused by clutter and occlusions between objects.…

YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications Open

Chuyi Li, Lulu Li, Hongliang Jiang, Kaiheng Weng, Yifei Geng , et al. · 2022

Computer science Philosophy

For years, the YOLO series has been the de facto industry-level standard for efficient object detection. The YOLO community has prospered overwhelmingly to enrich its use in a multitude of hardware platforms and abundant scenarios. In this…

DSSD : Deconvolutional Single Shot Detector Open

Cheng-Yang Fu, Wei Liu, Ananth Ranga, Ambrish Tyagi, Alexander C. Berg · 2017

Computer science Biology Physics

The main contribution of this paper is an approach for introducing additional context into state-of-the-art general object detection. To achieve this we first combine a state-of-the-art classifier (Residual-101[14]) with a fast detection f…

Mask R-CNN Open

Kaiming He, Georgia Gkioxari, Piotr Dollár, Ross Girshick · 2017

Computer science Philosophy History

We present a conceptually simple, flexible, and general framework for object instance segmentation. Our approach efficiently detects objects in an image while simultaneously generating a high-quality segmentation mask for each instance. Th…

The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale Open

Alina Kuznetsova, Hassan Rom, Neil Alldrin, Jasper Uijlings, Ivan Krasin , et al. · 2018

Computer science Geography Economics

We present Open Images V4, a dataset of 9.2M images with unified annotations for image classification, object detection and visual relationship detection. The images have a Creative Commons Attribution license that allows to share and adap…

Focal Loss for Dense Object Detection Open

Tsung-Yi Lin, Priya Goyal, Ross Girshick, Kaiming He, Piotr Dollár · 2017

Computer science

The highest accuracy object detectors to date are based on a two-stage approach popularized by R-CNN, where a classifier is applied to a sparse set of candidate object locations. In contrast, one-stage detectors that are applied over a reg…

Deep Multi-Modal Object Detection and Semantic Segmentation for Autonomous Driving: Datasets, Methods, and Challenges Open

Di Feng, Christian Schütz, Lars Rosenbaum, Heinz Hertlein, Claudius Gläser , et al. · 2020

Computer science Chemistry

Recent advancements in perception for autonomous driving are driven by deep learning. In order to achieve robust and accurate scene understanding, autonomous vehicles are usually equipped with different sensors (e.g. cameras, LiDARs, Radar…

A Survey of Deep Learning-Based Object Detection Open

Licheng Jiao, Fan Zhang, Fang Liu, Shuyuan Yang, Lingling Li , et al. · 2019

Computer science

Object detection is one of the most important and challenging branches of\ncomputer vision, which has been widely applied in peoples life, such as\nmonitoring security, autonomous driving and so on, with the purpose of locating\ninstances …

DeepFruits: A Fruit Detection System Using Deep Neural Networks Open

Inkyu Sa, Zongyuan Ge, Feras Dayoub, Ben Upcroft, Tristán Pérez , et al. · 2016

Computer science

This paper presents a novel approach to fruit detection using deep convolutional neural networks. The aim is to build an accurate, fast and reliable fruit detection system, which is a vital element of an autonomous agricultural robotic pla…

R3Det: Refined Single-Stage Detector with Feature Refinement for Rotating Object Open

Xue Yang, Junchi Yan, Zi‐Ming Feng, Tao He · 2021

Computer science Geography Economics

Rotation detection is a challenging task due to the difficulties of locating the multi-angle objects and separating them effectively from the background. Though considerable progress has been made, for practical settings, there still exist…

Real-Time Seamless Single Shot 6D Object Pose Prediction Open

Bugra Tekin, Sudipta N. Sinha, Pascal Fua · 2018

Computer science Physics

We propose a single-shot approach for simultaneously detecting an object in an RGB image and predicting its 6D pose without requiring multiple stages or having to examine multiple hypotheses. Unlike a recently proposed single-shot techniqu…

YOLO-v1 to YOLO-v8, the Rise of YOLO and Its Complementary Nature toward Digital Manufacturing and Industrial Defect Detection Open

Muhammad Hussain · 2023

Computer science

Since its inception in 2015, the YOLO (You Only Look Once) variant of object detectors has rapidly grown, with the latest release of YOLO-v8 in January 2023. YOLO variants are underpinned by the principle of real-time and high-classificati…

TextBoxes++: A Single-Shot Oriented Scene Text Detector Open

Minghui Liao, Baoguang Shi, Xiang Bai · 2018

Computer science Mathematics Physics

Scene text detection is an important step of scene text recognition system and also a challenging problem. Different from general object detection, the main challenges of scene text detection lie on arbitrary orientations, small sizes, and…

Gliding Vertex on the Horizontal Bounding Box for Multi-Oriented Object Detection Open

Yongchao Xu, Mingtao Fu, Qimeng Wang, Yukang Wang, Kai Chen , et al. · 2020

Computer science

Object detection has recently experienced substantial progress. Yet, the widely adopted horizontal bounding box representation is not appropriate for ubiquitous oriented objects such as objects in aerial images and scene texts. In this pap…

M2Det: A Single-Shot Object Detector Based on Multi-Level Feature Pyramid Network Open

Qijie Zhao, Tao Sheng, Yongtao Wang, Zhi Tang, Ying Chen , et al. · 2019

Computer science Mathematics Philosophy

Feature pyramids are widely exploited by both the state-of-the-art one-stage object detectors (e.g., DSSD, RetinaNet, RefineDet) and the two-stage object detectors (e.g., Mask RCNN, DetNet) to alleviate the problem arising from scale varia…

Object Bank: A High-Level Image Representation for Scene Classification & Semantic Feature Sparsification Open

Li-Jia Li, Hao Su, Li Fei-Fei, Eric P. Xing · 2018

Computer science Political science Philosophy

Robust low-level image features have been proven to be effective representations for a variety of visual recognition tasks such as object recognition and scene classification; but pixels, or even local image patches, carry little semantic …

Object detection ≈ Object detection