Explanipedia

Kaputt: A Large-Scale Dataset for Visual Defect Detection

Sebastian Höfer, Dorian Henning, Artemij Amiranashvili, Douglas Morrison, Mariliza Tzes, et al. · 2025

We present a novel large-scale dataset for defect detection in a logistics setting. Recent work on industrial anomaly detection has primarily focused on manufacturing scenarios with highly controlled poses and a limited number of object ca…

Learn to Predict Sets Using Feed-Forward Neural Networks

Hamid Rezatofighi, Roman Kaskman, Farbod Motlagh, Qinfeng Shi, Anton Milan, et al. · 2021

This paper addresses the task of set prediction using deep feed-forward neural networks. A set is a collection of elements which is invariant under permutation and the size of a set is not fixed in advance. Many real-world problems, such a…

MOT20: A benchmark for multi object tracking in crowded scenes

Patrick Dendorfer, Hamid Rezatofighi, Anton Milan, Qinfeng Shi, Daniel Cremers, et al. · 2020

Computer science Psychology Political science

Standardized benchmarks are crucial for the majority of computer vision applications. Although leaderboards and ranking tables should not be over-claimed, benchmarks often provide the most objective measure of performance and are therefore…

CVPR19 Tracking and Detection Challenge: How crowded can it get?

Patrick Dendorfer, Hamid Rezatofighi, Anton Milan, Qinfeng Shi, Daniel Cremers, et al. · 2019

Computer science Psychology Geography

Standardized benchmarks are crucial for the majority of computer vision applications. Although leaderboards and ranking tables should not be over-claimed, benchmarks often provide the most objective measure of performance and are therefore…

PoseTrack: A Benchmark for Human Pose Estimation and Tracking

Mykhaylo Andriluka, Umar Iqbal, Eldar Insafutdinov, Leonid Pishchulin, Anton Milan, et al. · 2018

Computer science Physics Psychology

Human poses and motions are important cues for analysis of videos with people and there is strong evidence that representations based on body pose are highly effective for a variety of tasks such as activity recognition, content retrieval …

Joint Learning of Set Cardinality and State Distribution

S. Hamid Rezatofighi, Anton Milan, Qinfeng Shi, Anthony Dick, Ian Reid · 2018

Computer science Mathematics Physics

We present a novel approach for learning to predict sets using deep learning. In recent years, deep neural networks have shown remarkable results in computer vision, natural language processing and other related problems. Despite their suc…

Design of a Multi-Modal End-Effector and Grasping System: How Integrated Design helped win the Amazon Robotics Challenge

S. Wade-McCue, N. Kelly-Boxall, M. McTaggart, Douglas Morrison, A. W. Tow, et al. · 2017

Computer science Engineering Chemistry

We present the grasping system and design approach behind Cartman, the winning entrant in the 2017 Amazon Robotics Challenge. We investigate the design processes leading up to the final iteration of the system and describe the emergent sol…

Mechanical Design of a Cartesian Manipulator for Warehouse Pick and Place

M. McTaggart, Douglas Morrison, A. W. Tow, Robert Smith, N. Kelly-Boxall, et al. · 2017

Computer science Engineering Mathematics

Robotic manipulation and grasping in cluttered and unstructured environments is a current challenge for robotics. Enabling robots to operate in these challenging environments has direct applications from automating warehouses to harvestin…

DeepSetNet: Predicting Sets with Deep Neural Networks

S. Hamid Rezatofighi, Vijay Kumar B G, Anton Milan, Ehsan Abbasnejad, Anthony Dick, et al. · 2017

Computer science Mathematics Economics

This paper addresses the task of set prediction using deep learning. This is important because the outputs of many computer vision tasks, including image tagging and object detection, are naturally expressed as sets of entities rather than …

Semantic Segmentation from Limited Training Data

Anton Milan, Trung Pham, K. Vijay, Douglas Morrison, A. W. Tow, et al. · 2017

Computer science Economics

We present our approach for robotic perception in cluttered scenes that led to winning the recent Amazon Robotics Challenge (ARC) 2017. Next to small objects with shiny and transparent surfaces, the biggest challenge of the 2017 competitio…

Cartman: The low-cost Cartesian Manipulator that won the Amazon Robotics Challenge

Douglas Morrison, A. W. Tow, M. McTaggart, Robert Smith, N. Kelly-Boxall, et al. · 2017

Computer science Engineering Mathematics

The Amazon Robotics Challenge enlisted sixteen teams to each design a pick-and-place robot for autonomous warehousing, addressing development in robotic vision and manipulation. This paper presents the design of our custom-built, cost-effe…

Joint Learning of Set Cardinality and State Distribution

S. Hamid Rezatofighi, Anton Milan, Qinfeng Shi, Anthony Dick, Ian Reid · 2017

Computer science Mathematics Physics

We present a novel approach for learning to predict sets using deep learning. In recent years, deep neural networks have shown remarkable results in computer vision, natural language processing and other related problems. Despite their suc…

RGB-D object detection and semantic segmentation for autonomous manipulation in clutter

Max Schwarz, Anton Milan, Arul Selvam Periyasamy, Sven Behnke · 2017

Computer science Engineering Chemistry

Autonomous robotic manipulation in clutter is challenging. A large variety of objects must be perceived in complex scenes, where they are partially occluded and embedded among many distractors, often in restricted spaces. To tackle these c…

Tracking the Trackers: An Analysis of the State of the Art in Multiple Object Tracking

Laura Leal-Taixé, Anton Milan, Konrad Schindler, Daniel Cremers, Ian Reid, et al. · 2017

Computer science Psychology Geography

Standardized benchmarks are crucial for the majority of computer vision applications. Although leaderboards and ranking tables should not be over-claimed, benchmarks often provide the most objective measure of performance and are therefore…

Online Multi-Target Tracking Using Recurrent Neural Networks

Anton Milan, S. Hamid Rezatofighi, Anthony Dick, Ian Reid, Konrad Schindler · 2017

Computer science Engineering Philosophy

We present a novel approach to online multi-target tracking based on recurrent neural networks (RNNs). Tracking multiple objects in real-world scenes involves many challenges, including a) an a-priori unknown and time-varying number of tar…

Data-Driven Approximations to NP-Hard Problems

Anton Milan, S. Hamid Rezatofighi, Ravi Garg, Anthony Dick, Ian Reid · 2017

Computer science Mathematics Sociology

There exist a number of problem classes for which obtaining the exact solution becomes exponentially expensive with increasing problem size. The quadratic assignment problem (QAP) or the travelling salesman problem (TSP) are just two examp…

PoseTrack: Joint Multi-Person Pose Estimation and Tracking

Umar Iqbal, Anton Milan, Jürgen Gall · 2016

Computer science Economics

In this work, we introduce the challenging problem of joint multi-person pose estimation and tracking of an unknown number of persons in unconstrained videos. Existing methods for multi-person pose estimation in images cannot be applied di…

RefineNet: Multi-Path Refinement Networks for High-Resolution Semantic Segmentation

Guosheng Lin, Anton Milan, Chunhua Shen, Ian Reid · 2016

Computer science

Recently, very deep convolutional neural networks (CNNs) have shown outstanding performance in object recognition and have also been the first choice for dense classification problems such as semantic segmentation. However, repeated subsam…

MOT16: A Benchmark for Multi-Object Tracking

Anton Milan, Laura Leal-Taixé, Ian Reid, Stefan Roth, Konrad Schindler · 2016

Computer science Psychology Physics

Standardized benchmarks are crucial for the majority of computer vision applications. Although leaderboards and ranking tables should not be over-claimed, benchmarks often provide the most objective measure of performance and are therefore…

MOTChallenge 2015: Towards a Benchmark for Multi-Target Tracking

Laura Leal-Taixé, Anton Milan, Ian Reid, Stefan Roth, Konrad Schindler · 2015

Computer science Psychology Geography

In the recent past, the computer vision community has developed centralized benchmarks for the performance evaluation of a variety of tasks, including generic object and pedestrian detection, 3D reconstruction, optical flow, single-object …

Anton Milan