Skip Navigation
York U: Redefine the PossibleHOME | Current Students | Faculty & Staff | Research | International
Search »FacultiesLibrariesCampus MapsYork U OrganizationDirectorySite Index
Future Students, Alumni & Visitors
2010 Technical Reports

Intermediate Visual Representations for Attentive Recognition Systems

Antonio J. Rodriguez-Sanchez

Technical Report CSE-2010-06

York University

September 1 2010

Abstract

Computational models of visual processes are of interest in fields such as cybernetics, robotics, computer vision and others. This thesis provides an analysis of a model of attention and of intermediate representation layers in the visual cortex that have direct impact on the next generation of object recognition strategies in computer vision. Biological inspiration - and even biological realism - is currently of great interest in the computer vision community. This thesis includes three major pieces, explained next. First, I believe that visual attention is a requirement to performnon-detection object recognition tasks. In order to test this hypothesiswe compare the Selective Tuning model of attention (Tsotsos et al., 1995) to studies from psychophysics in visual search tasks involving color and 2D shapes. Second, I define a biologically plausible model of Shape Representation which incorporates intermediate layers of visual representation that have not previously been fully explored. I hypothesize that endstopping and curvature cells are of great importance for shape selectivity and show how their combination can lead to shape selective neurons. This Shape Representation model provides a highly accurate fit with neural data from Pasupathy and Connor (2001, 2002).Finally, in the same way curvature parts may be configured into shapes, spatial gradients of velocity vectors may be related to optic flow in a hierarchical representation of visual motion analysis. For my last contribution I provide psychophysical evidence of the role of spatial gradients of velocity in optical flow perception as well as neurophysiological evidence for neurons tuned for such gradients.Following previous authors such as Zucker (1981) and Marr (1982), I have shown that deeper understanding of visual processes in humans and non-human primates can lead to important advancements in computational perception theories and systems.

Download paper in PDF format.



The documents distributed by this server have been provided by the contributing authors as a means to ensure timely dissemination of scholarly and technical work on a noncommercial basis. Copyright and all rights therein are maintained by the authors or by other copyright holders, notwithstanding that they have offered their works here electronically. It is understood that all persons copying this information will adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.