Regular readers will know that I hate arbitrary maxims labelled as ‘photographic’ rules simply because there is no such thing as a ‘universal scene’ or universal set of parameters for every image. Every composition is different, and every creative intention is different, which means the whole premise of there being a fixed set of laws that make a ‘good’ image or ‘image that works’ can only be nonsense. However, I do think there are some fundamental principles of human vision – and consequently psychological response to elements in an image – that we cannot ignore since they directly influence the response of our audience to the ideas we are trying to present. That is what I wish to address today: what are the autonomous/ subconscious/ reflex/ automatic – pick your preferred term – visual responses that we should be aware of and seek to utilise when we compose an image? Think of this post as the predecessor to The Four Things: it’s the underlying reason why some of the Things have to be the way they are.
Here’s what I’ve found so far, and their implications on the underlying structure of an image – there may well be more that I’m not consciously aware of:
We are attracted to anything that breaks pattern
This is the first principle in many ways, because everything that comes after this is contingent upon it: a uniform surface that fills our entire field of vision does not attract any attention in itself, yet the slightest flaw or imperfection or break draws our eyes straight away. The same is true of a scene in the real world: anything that seems out of place, either logically or in a contrast/color/tone continuity sort of way, will jump out. The only time something that’s uniform and homogeneous draws attention is when it’s a much smaller part of a whole – then it breaks pattern from the rest of the frame by being empty.
Photographic implication: Make sure your subject breaks pattern to stand out; think of camouflage and reverse camouflage.
We follow lines
Lines break pattern because they divide up the overall composition – our eyes then follow the break until we reach the end, presumably looking for the reunification of the two sides of the break. Where a line begins and ends inside a frame, we mostly ignore it. Where it does not, or takes up a large enough portion of the frame, we cannot, and we follow it to the end. Since most cultures read top left to bottom right, we tend to follow lines in this order of priority; however, we also follow lines from bottom to top of the frame because that follows the general layout of the world – things at our feet are generally closer than things in the middle of our field of view, unless this is broken by other circumstances.
Photographic implication: Use lines to lead the eye to the intended subject, or draw the audience through the frame past subjects in the intended causal direction for your story.
We look inside frames
Frames are closed lines that land up isolating a single specific section of the composition from the rest of it. They are boundaries between the isolated element and the background. For the same reasons we use frames to isolate a picture from the rest of the wall, but still require the rest of the wall for situational context, an image is stronger when structured this way.
Photographic implication: Put your subject inside an area of uniform textural continuity; it doesn’t have to be an actual frame, but even a plain background running most of the way around will do the trick.
We prioritise more and bigger
A big object will stand out more than a small one within the same presentation space, as will a brighter one, a more colourful one, etc. Tension and balance are created when you have two objects of similar or identical visual weight competing for your attention. They do not have to be identical objects, just objects of identical prominence relative to the background.
Photographic implication: Consider the visual mass distribution of your composition: it shouldn’t tip to one side as though it’s trying to throw you outside the frame, and the heaviest part should be where you want your audience’s attention to settle. Note that there are factors that can significantly alter visual weight such as pattern recognition…
We pattern recognise familiar objects and text
Humans have a sort of built in visual vocabulary: objects we encounter regularly are stored in our subconscious so that they don’t require conscious and active effort to process every time we see them; this is directly related to the element of surprise. If we have seen it before, the brain has an expectation of both the object and any causality of events attached to it – and we only become surprised when actual fact does not match our experience. What this means is that we are drawn to certain elements in an image – e.g. human faces, text, animals etc. – and much more strongly than their normal visual prominence as an abstract form would suggest. It also means that we perhaps don’t evaluate them as closely as an unfamiliar object, because our brains make up the gaps – we see what we expect to see.
Photographic implication: Familiar elements can be very small and still have the same visual impact; they can be large and have much higher visual impact; however, if they exist in a composition and are not the main subject, the they can land up distracting much more than you might expect.
We draw causal inferences from proximity
Since the space of the ‘stage’ in an image is defined and restricted by the frame boundaries, we look at relative spacing between elements to determine if there’s any implied relationship between them; less distance implies closeness and familiarity. A composition is somewhat misleading in that it can exclude surrounding elements, too; you can eliminate competition for proximity simply by leaving it out of the frame.
Photographic implication: The impression of closeness can be created by removing external distractions, or by artificially creating the feeling of space by putting subjects in the foreground and using a wide lens to exploit perspective.
We draw spatial inferences from shadow overlap and contrast
If an object projects a shadow on top of another object, it is assumed to be in front of it; the ‘bottom’ is always assumed to be darker than the ‘top’ – even if this may not physically be the case by the laws of gravitation. More distant objects are assumed to have lower contrast – think misty mountains.
Photographic implication: If you’re lighting something physically impossible, you can shoot it in any orientation you wish, so long as the ‘top’ and ‘bottom’ difference in brightness is maintained – even very subtle differences can be enough. On top of that, further structure in an image can be created by both gradients in luminance and contrast – this encourages progression through the frame and suggests varying distances, too.
This is a rather heavy article, so I’ll give it a day or so to digest and continue it in Part 2…
My ultimate photographers’ daybag in collaboration wth Frankie Falcon is available here until the end of October, in a strictly limited production run.
Visit the Teaching Store to up your photographic game – including workshop and Photoshop Workflow videos and the customized Email School of Photography. You can also support the site by purchasing from B&H and Amazon – thanks!
Images and content copyright Ming Thein | mingthein.com 2012 onwards. All rights reserved