Deep reconstruction models for image set classification

Munawar Hayat, Mohammed Bennamoun, Senjian An

Research output: Contribution to journalArticlepeer-review

148 Citations (Scopus)
4 Downloads (Pure)


Image set classification finds its applications in a number of real-life scenarios such as classification from surveillance videos, multi-view camera networks and personal albums. Compared with single image based classification, it offers more promises and has therefore attracted significant research attention in recent years. Unlike many existing methods which assume images of a set to lie on a certain geometric surface, this paper introduces a deep learning framework which makes no such prior assumptions and can automatically discover the underlying geometric structure. Specifically, a Template Deep Reconstruction Model (TDRM) is defined whose parameters are initialized by performing unsupervised pre-training in a layer-wise fashion using Gaussian Restricted Boltzmann Machines (GRBMs). The initialized TDRM is then separately trained for images of each class and class-specific DRMs are learnt. Based on the minimum reconstruction errors from the learnt class-specific models, three different voting strategies are devised for classification. Extensive experiments are performed to demonstrate the efficacy of the proposed framework for the tasks of face and object recognition from image sets. Experimental results show that the proposed method consistently outperforms the existing state of the art methods.

Original languageEnglish
Article number6888522
Pages (from-to)713-727
Number of pages15
JournalIEEE Transactions on Pattern Analysis and Machine Intelligence
Issue number4
Publication statusPublished - 2015
Externally publishedYes


Dive into the research topics of 'Deep reconstruction models for image set classification'. Together they form a unique fingerprint.

Cite this