Exploring the representation capabilities of the HOG descriptor

Aditya Jayant Tatu, Francois Bernard Lauze, Mads Nielsen, Benjamin Kimia

6 Citationer (Scopus)

Abstract

Object recognition strategies are increasingly based on regional descriptors such as SIFT or HOG at a sparse set of points or on a dense grid of points. Despite their success on databases such as PASCAL and CALTECH, the capability of such a representation in capturing the essential object content of the image is not well-understood: How large is the equivalence class of images sharing the same HOG descriptor? Are all these images from the same object category, and if not, do the non-category images resemble random images which cannot generically arise from imaged scenes? How frequently do images from two categories share the same HOG-based representation? These questions are increasingly more relevant as very large databases such as ImageNet and LabelMe are being developed where the current object recognition strategies show limited success. We examine these questions by introducing the metameric class of moments of HOG which allows for a target image to be morphed into an impostor image sharing the HOG representation of a source image while retaining the initial visual appearance. We report that two distinct images can be made to share the same HOG representation when the overlap between HOG patches is minimal, and the success of this method falls with increasing overlap. This paper is therefore a step in the direction of developing a sampling theorem for representing images by HOG features.

OriginalsprogEngelsk
Titel2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops)
Antal sider8
ForlagIEEE
Publikationsdato2011
Sider1410-1417
ISBN (Trykt)978-1-4673-0062-9
ISBN (Elektronisk)978-1-4673-0063-6
DOI
StatusUdgivet - 2011
BegivenhedInternational Conference of Computer Vision (ICCV) - Barcelona, Spanien
Varighed: 8 nov. 201113 nov. 2011

Konference

KonferenceInternational Conference of Computer Vision (ICCV)
Land/OmrådeSpanien
ByBarcelona
Periode08/11/201113/11/2011

Emneord

  • image recognition
  • image representation
  • very large databases
  • visual databases
  • CALTECH databases
  • HOG descriptor
  • ImageNet
  • LabelMe
  • PASCAL databases
  • SIFT
  • dense grid of points
  • impostor image sharing
  • initial visual appearance
  • metameric class
  • noncategory images
  • object content
  • object recognition strategies
  • random images
  • regional descriptors
  • representation capabilities
  • source image
  • sparse set of points
  • Educational institutions
  • Equations
  • Histograms
  • Mathematical model
  • Object recognition
  • Upper bound
  • Vectors

Fingeraftryk

Dyk ned i forskningsemnerne om 'Exploring the representation capabilities of the HOG descriptor'. Sammen danner de et unikt fingeraftryk.

Citationsformater