Member-only story
Introduction
Object Pose Estimation (OPE) is a typical task in computer vision that consists of identifying specific object instances in an image and determining each object’s position and orientation relative to some coordinate system. The pose of an object in a 3D scene can be described by means of rotation and translation transformation.
However, using this type of fixed 3D orientation representation can cause pose ambiguities due to symmetries. An overview of the different causes of pose ambiguities is shown in the figure below.
State-of-the-Art Overview
In literature, 6D object pose estimation algorithms from a single RGB image can be divided into three main categories using traditional image processing techniques or deep learning: