Joint spatial-depth feature pooling for RGB-D object classification

Hong Pan; Søren Ingvor Olsen; Yaping Zhu

doi:10.1007/978-3-319-19665-7_26

Joint spatial-depth feature pooling for RGB-D object classification

Hong Pan, Søren Ingvor Olsen, Yaping Zhu

Department of Computer Science

Abstract

RGB-D camera can provide effective support with additional depth cue for many RGB-D perception tasks beyond traditional RGB information. However, current feature representations based on RGB-D camera utilize depth information only to extract local features, without considering it for the improvement of robustness and discriminability of the feature representation by merging depth cues into feature pooling. Spatial pyramid model (SPM) has become the standard protocol to split 2D image plane into sub-regions for feature pooling in RGB-D object classification. We argue that SPM may not be the optimal pooling scheme for RGB-D images, as it only pools features spatially and completely discards the depth topological information. Instead, we propose a novel joint spatial-depth pooling scheme (JSDP) which further partitions SPM using the depth cue and pools features simultaneously in 2D image plane and the depth direction. Embedding the JSDP with the standard feature extraction and feature encoding modules, we achieve superior performance to the state-ofthe- art methods on benchmarks for RGB-D object classification and detection.

Original language	English
Title of host publication	Image analysis : 19th Scandinavian Conference, SCIA 2015, Copenhagen, Denmark, June 15-17, 2015. Proceedings
Editors	Rasmus R. Paulsen, Kim S. Pedersen
Number of pages	13
Publisher	Springer
Publication date	2015
Pages	314-326
ISBN (Print)	978-3-319-19664-0
ISBN (Electronic)	978-3-319-19665-7
DOIs	https://doi.org/10.1007/978-3-319-19665-7_26
Publication status	Published - 2015
Event	Scandinavian Conference, SCIA 2015 - Copenhagen, Denmark Duration: 15 Jun 2015 → 17 Jun 2015 Conference number: 19

Conference

Conference	Scandinavian Conference, SCIA 2015
Number	19
Country/Territory	Denmark
City	Copenhagen
Period	15/06/2015 → 17/06/2015

Series	Lecture notes in computer science
Volume	9127
ISSN	0302-9743

Access to Document

10.1007/978-3-319-19665-7_26

Joint_Spatial_Depth_Feature_Pooling_for_RGB_D_Object_ClassificationFinal published version, 1.88 MB

https://link.springer.com/content/pdf/10.1007/978-3-319-19665-7_26.pdf

Cite this

Pan, H., Olsen, S. I., & Zhu, Y. (2015). Joint spatial-depth feature pooling for RGB-D object classification. In R. R. Paulsen, & K. S. Pedersen (Eds.), Image analysis: 19th Scandinavian Conference, SCIA 2015, Copenhagen, Denmark, June 15-17, 2015. Proceedings (pp. 314-326). Springer. Lecture notes in computer science Vol. 9127 https://doi.org/10.1007/978-3-319-19665-7_26

Joint spatial-depth feature pooling for RGB-D object classification. / Pan, Hong; Olsen, Søren Ingvor; Zhu, Yaping.

Image analysis: 19th Scandinavian Conference, SCIA 2015, Copenhagen, Denmark, June 15-17, 2015. Proceedings. ed. / Rasmus R. Paulsen; Kim S. Pedersen. Springer, 2015. p. 314-326 (Lecture notes in computer science, Vol. 9127).

Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review

Pan, H, Olsen, SI & Zhu, Y 2015, Joint spatial-depth feature pooling for RGB-D object classification. in RR Paulsen & KS Pedersen (eds), Image analysis: 19th Scandinavian Conference, SCIA 2015, Copenhagen, Denmark, June 15-17, 2015. Proceedings. Springer, Lecture notes in computer science, vol. 9127, pp. 314-326, Scandinavian Conference, SCIA 2015, Copenhagen, Denmark, 15/06/2015. https://doi.org/10.1007/978-3-319-19665-7_26

@inproceedings{b8440e4b85a04c57b6bdad3bf27cf974,

title = "Joint spatial-depth feature pooling for RGB-D object classification",

abstract = "RGB-D camera can provide effective support with additional depth cue for many RGB-D perception tasks beyond traditional RGB information. However, current feature representations based on RGB-D camera utilize depth information only to extract local features, without considering it for the improvement of robustness and discriminability of the feature representation by merging depth cues into feature pooling. Spatial pyramid model (SPM) has become the standard protocol to split 2D image plane into sub-regions for feature pooling in RGB-D object classification. We argue that SPM may not be the optimal pooling scheme for RGB-D images, as it only pools features spatially and completely discards the depth topological information. Instead, we propose a novel joint spatial-depth pooling scheme (JSDP) which further partitions SPM using the depth cue and pools features simultaneously in 2D image plane and the depth direction. Embedding the JSDP with the standard feature extraction and feature encoding modules, we achieve superior performance to the state-ofthe- art methods on benchmarks for RGB-D object classification and detection.",

author = "Hong Pan and Olsen, {S{\o}ren Ingvor} and Yaping Zhu",

note = "Achieved the Best Paper award of the conference; Scandinavian Conference, SCIA 2015 ; Conference date: 15-06-2015 Through 17-06-2015",

year = "2015",

doi = "10.1007/978-3-319-19665-7_26",

language = "English",

isbn = "978-3-319-19664-0",

series = "Lecture notes in computer science",

publisher = "Springer",

pages = "314--326",

editor = "Paulsen, {Rasmus R.} and Pedersen, {Kim S.}",

booktitle = "Image analysis",

}

TY - GEN

T1 - Joint spatial-depth feature pooling for RGB-D object classification

AU - Pan, Hong

AU - Olsen, Søren Ingvor

AU - Zhu, Yaping

N1 - Conference code: 19

PY - 2015

Y1 - 2015

N2 - RGB-D camera can provide effective support with additional depth cue for many RGB-D perception tasks beyond traditional RGB information. However, current feature representations based on RGB-D camera utilize depth information only to extract local features, without considering it for the improvement of robustness and discriminability of the feature representation by merging depth cues into feature pooling. Spatial pyramid model (SPM) has become the standard protocol to split 2D image plane into sub-regions for feature pooling in RGB-D object classification. We argue that SPM may not be the optimal pooling scheme for RGB-D images, as it only pools features spatially and completely discards the depth topological information. Instead, we propose a novel joint spatial-depth pooling scheme (JSDP) which further partitions SPM using the depth cue and pools features simultaneously in 2D image plane and the depth direction. Embedding the JSDP with the standard feature extraction and feature encoding modules, we achieve superior performance to the state-ofthe- art methods on benchmarks for RGB-D object classification and detection.

AB - RGB-D camera can provide effective support with additional depth cue for many RGB-D perception tasks beyond traditional RGB information. However, current feature representations based on RGB-D camera utilize depth information only to extract local features, without considering it for the improvement of robustness and discriminability of the feature representation by merging depth cues into feature pooling. Spatial pyramid model (SPM) has become the standard protocol to split 2D image plane into sub-regions for feature pooling in RGB-D object classification. We argue that SPM may not be the optimal pooling scheme for RGB-D images, as it only pools features spatially and completely discards the depth topological information. Instead, we propose a novel joint spatial-depth pooling scheme (JSDP) which further partitions SPM using the depth cue and pools features simultaneously in 2D image plane and the depth direction. Embedding the JSDP with the standard feature extraction and feature encoding modules, we achieve superior performance to the state-ofthe- art methods on benchmarks for RGB-D object classification and detection.

U2 - 10.1007/978-3-319-19665-7_26

DO - 10.1007/978-3-319-19665-7_26

M3 - Article in proceedings

SN - 978-3-319-19664-0

T3 - Lecture notes in computer science

SP - 314

EP - 326

BT - Image analysis

A2 - Paulsen, Rasmus R.

A2 - Pedersen, Kim S.

PB - Springer

T2 - Scandinavian Conference, SCIA 2015

Y2 - 15 June 2015 through 17 June 2015

ER -

Joint spatial-depth feature pooling for RGB-D object classification

Abstract

Conference

Access to Document

Fingerprint

Cite this