Abstract
A large body of physiological findings suggests that the vision system understands a scene in terms of its local features, such as lines and curves. A notable algorithm that models this behaviour is the Convolutional Neural Network (CNN). Whilst recognising an object at various scales is trivial for the human vision system, CNNs struggle to achieve the same behaviour. Recent physiological findings suggest two new paradigms. Firstly, the visual system uses both local and global features in its recognition function. Secondly, the brain uses a distributed processing architecture to learn information from multiple modalities. In this paper we combine these paradigms and propose a distributed information integration model called D-Net to improve scale-invariant classification of images. We use a CNN to extract local features and, inspired by Google’s INCEPTION model, develop Filter Pyramid Convolutions (FPC), a trainable method that uses filter pyramids to extract global features. D-Net processes the CNN and FPC features locally, fuses the outcomes and obtains a global estimate via a central processor. We test D-Net on the classification of scaled images from benchmark datasets. Our results indicate that D-Net is potentially effective for classifying scaled images.
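The flow described in the abstract (multi-scale filtering for global features, separate local estimates, then a central fusion step) can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify filter sizes, pooling, or the fusion rule. The fixed box filters (a trained FPC would learn its weights), the mean pooling, the example scores, and the names `filter_pyramid_features` and `fuse` are all assumptions made for illustration.

```python
import math


def conv2d_same(image, kernel):
    """Naive 2-D cross-correlation with zero padding (output keeps the input size)."""
    h, w = len(image), len(image[0])
    k = len(kernel)
    pad = k // 2
    out = [[0.0] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            acc = 0.0
            for di in range(k):
                for dj in range(k):
                    y, x = i + di - pad, j + dj - pad
                    if 0 <= y < h and 0 <= x < w:
                        acc += image[y][x] * kernel[di][dj]
            out[i][j] = acc
    return out


def box_kernel(size):
    """Placeholder filter (assumption): a normalised box filter, not a learned one."""
    value = 1.0 / (size * size)
    return [[value] * size for _ in range(size)]


def filter_pyramid_features(image, sizes=(3, 5, 7)):
    """Apply filters of several sizes in parallel and mean-pool each response."""
    features = []
    for size in sizes:
        response = conv2d_same(image, box_kernel(size))
        total = sum(sum(row) for row in response)
        features.append(total / (len(response) * len(response[0])))
    return features


def softmax(scores):
    """Turn raw class scores into probabilities."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    norm = sum(exps)
    return [e / norm for e in exps]


def fuse(p_local, p_global, w_local=0.5):
    """Assumed fusion rule: a convex combination of the two branch estimates."""
    return [w_local * a + (1.0 - w_local) * b for a, b in zip(p_local, p_global)]


# Toy 8x8 checkerboard image and made-up class scores for the two branches.
image = [[float((r + c) % 2) for c in range(8)] for r in range(8)]
features = filter_pyramid_features(image)
p_local = softmax([2.0, 1.0, 0.1])    # hypothetical CNN-branch scores
p_global = softmax([0.5, 1.5, 0.2])   # hypothetical FPC-branch scores
p_fused = fuse(p_local, p_global)
```

Averaging the branch probabilities is only one simple choice; it keeps the fused output a valid probability distribution, which makes the sketch easy to check.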
Original language | English |
---|---|
Title of host publication | VISIGRAPP 2020 - Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications |
Editors | Giovanni Maria Farinella, Petia Radeva, Jose Braz |
Place of Publication | Netherlands |
Publisher | Scitepress |
Pages | 491-498 |
Number of pages | 8 |
Volume | 5 |
ISBN (Electronic) | 9789897584022 |
ISBN (Print) | 9789897584022 |
Publication status | Published - 2020 |
Event | 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Valletta, Malta; Duration: 27 Feb 2020 → 29 Feb 2020; http://www.visapp.visigrapp.org/?y=2020 |
Publication series
Name | VISIGRAPP 2020 - Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications |
---|---|
Volume | 5 |
Conference
Conference | 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications |
---|---|
Abbreviated title | VISAPP 2020 |
Country/Territory | Malta |
City | Valletta |
Period | 27/02/20 → 29/02/20 |