How the Visual Cortex Works — Hubel-Wiesel, V1–V5 Hierarchy, and Neural Coding
About 30% of the human cerebral cortex is dedicated to vision. The visual system extracts edges, motion, color, depth, and object identity from the noisy, low-quality image that arrives through a 130-million-photoreceptor retina. This article traces the path from retinal ganglion cells to high-level visual areas, covering the Nobel Prize-winning work of Hubel and Wiesel, the functional organization of V1, the two processing streams, and what we know about how the brain codes visual information.
1. From Retina to Primary Visual Cortex
Visual signals travel from the retina along the optic nerve. At the optic chiasm, fibers from the nasal half of each eye cross: fibers from the left visual field converge in the right hemisphere, and vice versa. This partial crossing means each cerebral hemisphere processes the contralateral visual field.
The first central relay is the lateral geniculate nucleus (LGN) of the thalamus. LGN has six layers:
- Magnocellular (M) layers 1–2: Large cells, fast conduction, sensitive to luminance contrast and motion. Project mainly to the dorsal stream.
- Parvocellular (P) layers 3–6: Smaller cells, slower, sensitive to color and fine spatial detail. Project mainly to the ventral stream.
- Koniocellular (K) sublayers: Between M/P layers, involved in color (blue-yellow chromatic signals, S-cone input).
2. V1 — Primary Visual Cortex
V1 (primary visual cortex, Brodmann area 17) in the occipital lobe is the main cortical target of LGN output. It contains the most precise retinotopic map and is the largest single visual area in humans (~25 cm² per hemisphere). V1 is organized in six layers:
Functional Columns
V1 neurons are organized into vertical columns (~50 µm diameter) that share functional properties. Two types of modules tile the cortical surface:
- Orientation columns: Cells within a column respond preferentially to edges of the same orientation (e.g., 45°, 90°, 135°). Orientation preference shifts smoothly across the cortex in ~180° "hypercolumns" (~750 µm across).
- Ocular dominance columns: Alternating ~500 µm wide bands driven preferentially by the left or right eye (although both eyes project to each neuron).
- Cytochrome oxidase blobs: ~200–300 µm round patches in layers 2/3, rich in mitochondria, contain color-opponent (double-opponent) cells, receive K-cell input.
3. Hubel and Wiesel's Nobel Discoveries (1981)
David Hubel and Torsten Wiesel spent the 1960s–70s recording single neurons in the primary visual cortex of cats and monkeys with microelectrodes. Their discoveries reshaped neuroscience:
Simple Cells
Simple cells have elongated, subdivided receptive fields with distinct ON and OFF subregions. They respond maximally to an oriented edge or bar at a specific position and orientation in the visual field. Their response can be modeled as a linear spatiotemporal filter:
Complex Cells
Complex cells respond to oriented edges anywhere within their (larger) receptive field — they are PHASE INVARIANT. Moving an oriented edge across the receptive field produces sustained firing; the cell doesn't distinguish whether the edge is in an ON or OFF region. Modeled as a sum of squared simple cell outputs at opposite spatial phases:
Critical Period and Plasticity
Hubel and Wiesel discovered that monocular deprivation of kittens during a critical period (roughly postnatal weeks 3–8) causes permanent loss of responsiveness of V1 neurons to the closed eye — structural reorganization of ocular dominance columns. This revealed experience-dependent plasticity in early visual development and explained the basis for amblyopia (lazy eye) treatment windows.
4. V2, V3, V4, MT — The Hierarchy
| Area | Location | Key Functions | Notable Properties |
|---|---|---|---|
| V1 | Occipital pole (calcarine sulcus) | Edges, orientation, spatial frequency, binocularity | Finest retinotopy; 6-layer LGN input |
| V2 | Adjacent to V1 (lunate sulcus) | Complex edges, color, disparity (depth), illusory contours | Stripe architecture (thick/thin/pale); feedback to V1 |
| V3/V3A | Above/below V2 | Complex shapes, global form | Large receptive fields; less well-studied |
| V4/hV4 | Ventral occipital | Color, form, recognition | Color constancy; V4 lesions → achromatopsia |
| MT/V5 | Posterior superior temporal sulcus | Motion direction, speed, optic flow | Directionally selective; MT lesions → akinetopsia |
| MST | Adjacent to MT | Complex optic flow, heading, smooth pursuit | Large receptive fields; >50% area of visual field |
5. The Two Visual Streams
Ungerleider and Mishkin (1982) proposed that visual signals bifurcate after V1 into two parallel processing pathways:
6. Spatial Frequency Channels
The visual system decomposes images into multiple spatial frequency bands (cycles per degree of visual angle), analogous to a Fourier decomposition:
7. Neural Coding and Sparse Representations
A key question: How does a population of neurons represent visual information?
Rate Coding vs. Temporal Coding
Rate coding: Information carried by mean firing rate over 100–200 ms windows. Simple and robust; most V1 neurons are understood this way. Temporal coding: Information in precise spike timing, spike synchrony, or oscillation phase — important in some higher areas, especially for binding features across cortex.
Sparse Coding
Olshausen and Field (1996) showed that if you train a neural network to represent natural image patches using a small number of active units, the learned basis functions spontaneously become Gabor-like — oriented, localized, bandpass. This sparse coding hypothesis suggests V1 evolved to efficiently represent natural images:
fMRI and Population Receptive Fields
fMRI measures the BOLD (Blood Oxygenation Level Dependent) signal — a hemodynamic proxy for local neural activity, delayed by ~4–8 s. Retinotopic mapping using phase-encoded stimuli (rotating wedge / expanding ring) reveals V1–V4 and MT in individual subjects. Population receptive field (pRF) models estimate the retinal location and size best driving each cortical voxel — showing systematic magnification and progression of pRF sizes across the visual hierarchy.