Briefly Describe the Functional Organisation of the Visual System

Briefly describe the functional organisation of the visual

system; describe a computational problem the visual system

has to solve. How is this solution achieved in the nervous

system?

Dionne Angela Donnelly

Module Code: PSYC202

Word Count: 1545

The visual system (VS) consists of the eye, the lateral geniculate nucleus (LGN), the striate

cortex (visual cortex, VC) and the magno- (M), parvo- (P) and koniocellular (K) pathways which

connect them (Eysenck & Keane, 2005). The whole retina-geniculate-striate pathway is an

inverted retinotopic map, which means that neurons will have inverted positions relative to one

another in the brain that are the same as their positions in the retina (Eysenck & Keane, 2005).

One computational problem the VS must solve is the problem of depth perception, or how we

combine information from both eyes to create a single three-dimensional (3D) representation

from the two-dimensional (2D, proximal stimuli) images created on our retinas from the world

around us (the distal stimuli). The information we take from these images appear to be a variety

of cues to depth which work in varying degrees depending on the distance of the stimulus from

the observer (Goldstein, 1999). In an attempt to understand the resolution of this problem each

component of the VS in evaluated in order to understand the role it plays in our perception of

depth. From this it is seen that the amalgamation of cues our eyes receive from the outside world

in order to perceive depth probably occurs within the cortical areas of the brain involved in

higher order processing.

In everyday situations, depth perception is usually based on cues provided by movement

(Eysenck & Keane, 2005). The most important of these cues is motion parallax in which when

we move, objects nearest to us move quickly past us, but those objects furthest away are much

slower (Goldstein, 1999). Static cues are monocular and include: pictorial cues, of which several

examples are linear perspective, texture gradient, image blur and interposition/occlusion;

oculomotor cues consist of convergence and accommodation; the binocular cue is stereopsis,

which is due to disparity of the two retinal images (Eysenck & Keane, 2005, Goldstein, 1999).

2

Disparity is the difference in the angle of the two images on the retinas. The further away details

are from the two fixation points (the parts of the image will fall on the foveas) the greater the

disparity (Goldstein, 1999). Disparity and consequently stereopsis is thought to play a major role

in our visual perception of depth.

Visual perception begins with the reception stage (Kalat, 1988) which involves the absorption of

light energy into the first component of the VS, the eye. Light travels in through the cornea and

passes through the pupil (a hole in the iris) to the lens (controlled by the ciliary muscle) which

focuses the light through the vitreous humour to the back of the eye – the retina (Eysenck &

Keane, 2005). The oculomotor cues originate within the eye: Convergence is when the eyes

move together to focus the fovea onto a nearby object and accommodation is when the lens

changes shape in order to change the level of focus on objects at different distances from the

observer (Eysenck & Keane, 2005, Goldstein, 1999). Convergence and accommodation are poor

cues to depth as they only provide information about stimuli between zero and two metres away

from the observer (Goldstein, 1999). Therefore they are not considered to contribute much to the

overall perception of depth.

The transduction stage takes place in the retina (Kalat, 1988), as the light energy is converted to

electrochemical signals by the photoreceptors – rods and cones. The level of coding is directly

dependent upon the intensity of the stimulus (Kalat, 1988). Firstly, light travels through four

layers of cells (the ganglion (G), amacrine (A), bipolar (B) and then horizontal (H) cells) to the

rods and cones. There are six million cones which are concentrated around the fovea (Eysenck &

3

Keane, 2005) and are specialised for colour vision and more importantly for depth perception -

sharpness. Image blur is one of the pictorial cues to depth perception, if an image is sharper than

its surroundings then it is perceived as being closer to the observer (Eysenck & Keane, 2005).

Therefore, it can interpreted that cones play a role in our perception of depth. There are 125

million rods which are mainly located around the periphery and are specialised for motion

perception and seeing in dim light (Eysenck & Keane, 1995). From this, it could be that rods

play a role in everyday depth perception involving motion parallax. Most of the differences

between rods and cones are due to retinal convergence, as there are 130 million photoreceptors

which converge onto 20 million B cells, which then converge onto only one million G cells

meaning that rods converge many more times than cones, and some cones only converge onto

one G cell each (Goldstein, 1999). There are different kinds of G cell: the M-ganglion cell which

projects down the magnocellular (M) pathway, and the P-ganglion cell, which projects down the

parvocellular (P) pathway. These pathways travel to the brain via the optic nerve down the

retina-geniculate-striate pathway (Eysenck & Keane, 2005).

Upon entering the brain, the signals pass through the optic chiasma (where the pathways from

the two retinas cross and the outer halves of each retinal projection go to the ipsilateral

hemisphere) to the lateral geniculate nucleus (LGN) of the thalamus (Eysenck & Keane, 2005).

The LGN actually receives 80 per cent of its input from the VC but the reason for this is

unknown (Weurger, 2010). The LGN is retinotopic and has six layers. Layers 1, 4, and 6 receive

signals from the contralateral eye. Layers 2, 3, and 5 receive signals from the ipsilateral eye.

These layers are often referred to as ocular dominance columns (ODCs). Also, the layers

correspond to the different pathways, with layers 1 and 2 relating to the M pathway, in which the

4

cells have large retinal fields (RFs) and large cell bodies, responding mainly to motion. This

pathway could therefore account for the transmission of depth information for moving objects.

Layers 3, 4, 5 and 6 relate to the P pathway, with small cell bodies. The P pathway is split into

two areas – blobs (high metabolic activity, respond to colour) and interblobs (low activity,

respond to location and orientation). As interblobs are sensitive to location and orientation it

could be that they too help to carry information regarding depth to the VC. Both pathways also

respond to contrast (Eysenck & Keane, 2005). Just ventral to each layer are the K cells, which

have very small cell bodies (Weurger, 2010), but apart from their association with colour, little is

known about these cells.

After passing through the LGN, the pathways arrive at the six layered VC in the occipital lobe

(Eysenck & Keane, 2005) and the axons mainly end in layer 4C in the primary VC (PVC, V1 or

Area 17) (Weurger, 2010). The mixing of the pathways of both eyes first occurs here, as in the

LGN they are completely segregated (Zigmond et al. 1999). The M pathway projects from the

LGN to layer 4Cα, the P pathway to layer 4Cβ, whilst the K pathway projects to layers 2 and 3

of the PVC (Weurger, 2010, see also Zigmond et al. 1999). More of these cortical neurones

receive information from the fovea than from the periphery (Goldstein, 1999). Zeki (1992, cited

in Eysenck & Keane, 2005) argued that the VC of the macaque monkey is organised into

different functional areas, and we generalise these to the human brain. He stated that the V1 and

V2 are mainly involved in early perception, whilst the V3 and V3A cells respond to form. The

V4 is mainly responsive to colour and line orientation and the V5 is dedicated to motion. The V5

(or middle temporal area) is thought to be involved in stereopsis (Eysenck & Keane, 2005) as

cells located in V2 which deal with disparity also project to V5 (Zeki, 1978, and Britten, 2004

5

cited in Daw, 1995). In a test of this proposal, DeAngelis, Cumming and Newsome (1998, cited

in Eysenck & Keane, 2005) stimulated cell clusters in monkey V5 and found their depth

perception to be biased towards the disparity preferred by those cells. Bülthoff, Bülthoff and

Sinha (1998, cited in Eysenck & Keane, 2005) found that when stereoscopic information of

familiar objects was scrambled, subjects did not realise it was wrong. The researchers argued that

the stereoscopic information is overridden by their expectations of what the stimuli should look

like. This is supported by research by Bruce, Green and Georgeson (2003, cited in Eysenck &

Keane, 2005), who reversed information presented to the right and left eyes, they found that

depth was reversed when looking at random-dot stereograms (RDS) and wire-frame images. But,

this effect was not seen when using photographs, which suggests that other factors such as

expectation of size/shape etc., and other depth cues such as occlusion play a role in modifying

our perception of depth, especially in situations which conflict with our normal perception of

stimuli.

To conclude, it would seem the computational problem of depth perception is solved

predominantly in the VC, showing that resolution of so many different cues requires higher

cortical functioning to solve. However, it is not clearly understood how these cues are integrated

within the cortex nor why binocular cues are sometimes ignored in favour of other cues, such as

prior knowledge of objects and occlusion.

References

Daw, N. W. (1995). Visual Development (Second Ed.). NY: Springer.

6

Eysenck, M. W. & Keane, M. T. (2005). Cognitive Psychology A Student’s Handbook (5th Ed.).

Hove: Psychology Press.

Goldstein, E. B. (1999). Sensation and Perception (5th Ed.). California: Brooks/Cole.

Kalat, J. W. (1998). Biological Psychology (5th Ed.). California: Wadsworth.

Weurger, S. M. (2010). Vision II: From the Retina to the Brain. PSYC202 Perception and

Memory Lecture Slides.

Zigmond, M. J., Bloom, F. E., Landis, S. C., Roberts, J. L. & Squire, L. R. (Eds.) (1999).

Fundamental Neuroscience. London: Academic Press.

7

Documents

Briefly Describe the Functional Organisation of the Visual System