Image Segmentation with Cascaded Hierarchical Models and Logistic Disjunctive Normal Networks

1 December 2013

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 2 (15505499), 2168-2175
https://doi.org/10.1109/iccv.2013.269

Abstract

Contextual information plays an important role in solving vision problems such as image segmentation. However, extracting contextual information and using it in an effective way remains a difficult problem. To address this challenge, we propose a multi-resolution contextual framework, called cascaded hierarchical model (CHM), which learns contextual information in a hierarchical framework for image segmentation. At each level of the hierarchy, a classifier is trained based on down sampled input images and outputs of previous levels. Our model then incorporates the resulting multi-resolution contextual information into a classifier to segment the input image at original resolution. We repeat this procedure by cascading the hierarchical framework to improve the segmentation accuracy. Multiple classifiers are learned in the CHM, therefore, a fast and accurate classifier is required to make the training tractable. The classifier also needs to be robust against over fitting due to the large number of parameters learned during training. We introduce a novel classification scheme, called logistic disjunctive normal networks (LDNN), which consists of one adaptive layer of feature detectors implemented by logistic sigmoid functions followed by two fixed layers of logical units that compute conjunctions and disjunctions, respectively. We demonstrate that LDNN outperforms state-of-the-art classifiers and can be used in the CHM to improve object segmentation performance.

Keywords

This publication has 18 references indexed in Scilit:

Toward Holistic Scene Understanding: Feedback Enabled Cascaded Classification Models
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2011
A Tree-Based Context Model for Object Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2011
Kernelized structural SVM learning for supervised object segmentation
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2011
LIBSVM
ACM Transactions on Intelligent Systems and Technology, 2011
Detection of neuron membranes in electron microscopy images using a serial neural network architecture
Medical Image Analysis, 2010
TextonBoost for Image Understanding: Multi-Class Object Recognition and Segmentation by Jointly Modeling Texture, Layout, and Context
International Journal of Computer Vision, 2007
Histograms of Oriented Gradients for Human Detection
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2005
Combining Top-Down and Bottom-Up Segmentation
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2005
Robust Real-Time Face Detection
International Journal of Computer Vision, 2004
Fuzzy min-max neural networks. I. Classification
IEEE Transactions on Neural Networks, 1992

Cited by 61 articles