Perceptual organization

Perceptual grouping refers to the human visual ability to extract significant image relations from lower-level primitive image features without any knowledge of the image content and group them to obtain meaningful higher-level structure.

Research in perceptual grouping was started in 1920’s by Gestalt psychologists, whose goal was to discover the underlying principle that would unify the various grouping phenomena of human perception. Gestalt psychologists observed the tendency of the human visual system to perceive configurational wholes, with rules that govern the uniformity of psychological grouping for perception and recognition, as opposed to recognition by analysis of discrete primitive image features. The hierarchical grouping principles proposed by Gestalt psychologists embodied such concepts as grouping by proximity, similarity, continuation, closure, and symmetry.

Proximity

Continuation

Similarity

Closure

Symmetry

The members of the Gestalt school described what is computed in perceptual grouping, but didn’t find out why or how these computations were made. In the 1990’s, Sarkar and Boyer proposed a classificatory structure of perceptual grouping in Computer Vision divided in four levels: the signal level organises individual pixels or interest points into extended regions, edge chains or groups of points; the primitive level organises the features detected in the signal level into contours or region limits; the structural level searches for 2D structures like loops, corners, polygons or closed regions; The 2D structures generated at the structure level are further organised in the assembly level in order to identify relations among them.

Our experience in perceptual organisation consists in the design and implementation of a fully automatic framework for the detection of structured objects in 2D images. The framework starts at the signal level, where a Gabor decomposition extracts the directional primitives present in an image and a pyramid of artificial neural networks organises and integrates the results into a unique output. Next, in the primitive level, a line segment detector based on the principles of the fuzzy Hough transform and the Burns segment detector, is developed to group the edge pixels into straight line segments. The framework ends at the structural level, that integrates the relationships offered by Gestalt psychology (parallelism, continuity, closure, etc.) among line segments in order to form large groups.

Main publications

  • S. Pan, G. Shavit, M. Penas, D. Xu, L. Shapiro, R. Ladner, E. Riskin, W. Hol, D. Meldrum, «Automated classification of protein crystallization images using support vector machines with scale-invariant texture and Gabor features», Acta Crystallographica Section D, D62, 271-279, 2006.
  • M. Penas, M. J. Carreira, M. G. Penedo, N. Barreira, «Efficient combination of the fuzzy Hough transform and the Burns segment detector», Lecture Notes in Computer Science (selected papers EUROCAST’07), 4739, 733-739, 2007.
Abstract
This paper describes a computational framework for the fully automated detection of line segments in 2D digital images. The operation of the framework is divided in two stages, the low level directional primitive detection through Gabor wavelets and growing cell structures, and the segment detection through an efficient and very accurate combination of the fuzzy Hough transform and the Burns segment detector.
  • J. Rouco, M. Penas, M. G. Penedo, M. Ortega, C. Alonso-Montes, «Certainty Measure of Pairwise Line Segment Perceptual Relations using Fuzzy Logic», LNCS, Progress in Pattern Recognition, Image Analysis and Applications, 4756, 477-486, 2007.
Abstract
Perceptual grouping is an important part of many computer vision systems. When inferring a new grouping from the primitive features there is always an uncertainty degree on this detection, that might be useful in further reasonings. In this paper, we present a fuzzy logic based system for the computation of the certainties assigned to pairwise line segment relations and introduce its application to the detection of continuity, identity, junction, L-junction, incidence, T-junction and parallelism relations. The results presented show that the proposed method might be very promising for future applications.
  • M. Penas, M. G. Penedo, M. J. Carreira, «A neural network based framework for directional primitive extraction», Neural Processing Letters, 27 (1), 67-83, 2008.
Abstract
This paper describes a computational framework for the extraction of low-level directional primitives in images. The system is divided in two stages. The first one consists in the extraction of the low level directional primitives, through the Gabor wavelet decomposition. The second one consists in the reduction of the high dimensionality of the Gabor decomposition results by means of auto-organised structures. The main advantages of the system introduced are two: it provides accurate and reliable information, and it produces good results on different image types without intervention of the final user. These advantages will be demonstrated by comparing our system with a classical edge detector.
  • M. Penas, M. J. Carreira, M. G. Penedo, N. Barreira, «Comparison of Alternative Frameworks for Directional Primitive Estraction», Pattern Recognition and Image Analysis, 17, Nº 4, 439-449, 2007.
Abstract
This paper describes and compares three alternative computational frameworks for the extraction of the directional primitives present in an image. These frameworks are based on Gabor decomposition, Sobel filtering, and filtering through the first derivative of a Gaussian filter, respectively. They are divided into two stages: low-level primitive extraction and organization of low-level primitives by means of dynamical neural networks. These alternative frameworks are compared and their relative advantages and disadvantages are out-lined.