Mix My Record in Stereo!
What is stereo? One of Wikipedia’s definitions is “stereo usually means 2-channel sound recording and sound reproduction using data for more than one speaker simultaneously.” I think that this is a very good definition because it is simple and does not assume too much. Stereo is not synonymous with panning or just sound coming out of two speakers. Stereo is not a name for a fancy musical playback setup. It is simply two independent sound channels that play back simultaneously. One of the conditions of the above statement is that in order to each channel to be played back independently two speakers are needed. Things really get interesting if the person who made the two channels understands how binaural hearing, the human ear, and these two sound sources can be used together to create some interesting effects and a stereo image.
Part of understanding how to create a good stereo images is to first understand a little bit about how binaural hearing works (humans by default are binaural). Binaural literally means having two ears. If we’ve learned anything about sound it’s that as the number our sound sources and the number our sound receivers (ears and microphones) increases the complexity and possibilities of mixing those sounds increases exponentially. There are three main things to pay attention to when mixing in stereo for a binaural audience (that’s all the time these days right).
1. Inter aural amplitude differences
2. Inter aural timing differences
3. The head related transfer function
(When I say inter aural I’m basically referring to how the sound is different in the left and right ears.)
Inter Aural Timing Differences
Note that the list above works from the low end of the frequency spectrum to the high end, I’ve started with timing differences because they are the most commonly manipulated by engineers. Inter aural timing differences refer to the difference in time that it takes the same sound to reach either ear. Image you are mixing and you’ve panned the rhythm guitar hard left. If you take a minute to think about how your brain figures out that the rhythm guitar is now coming from the left it’s because your left ear actually hears it before the right hear (ever so slightly). Depending on how much later your right ear hears the sound helps your brain actually place the sound in 2 dimensional space. This is how you know often were to look and verify the source of any sound, be it a car approaching, a bee on a flower, or where the phone is when it rings. Your brain does this well with frequencies roughly between 150Hz and 1.5 KHz beyond those ranges the waveforms of the sounds become either too big or too small for the brain to decipher in this way.
If we take a step back in the frequency spectrum and look at how our brain handles frequencies below 150 Hz we’ll find some interesting things. The waveform of 150 Hz is 2.25m or 7.4 feet in length. The size of these low frequency waveforms make it seem like the low frequencies are coming from everywhere, if you think about it one or two cycle could easily fill a room. This is why subwoofer placement in a surround sound system is not as critical as the other speakers. This is also why by default the bass is typically panned up the middle. If you try panning the bass guitar to the hard left or right in a mix it’s hard to close your eyes and place it, it will likely sound like it’s still in the middle. The lower the frequency the less control you have over it in the stereo image. There is a technique that can work for lower frequencies but begins to lose effectiveness around 80-100 Hz.
You can fabricate the timing difference between speakers by manually delaying one of the signals rather than panning. This technique is sometimes referred to at the Haas Panner Technique. If you delay the sound in the right channel somewhere between 0.2 and 1.12 ms (9-49 samples at 44.1KHz sampling rate), you can effectively make the sound seem to come from the left speaker. Note that DAW without plug-in delay compensation can yield effect when it is unintended or unwanted.
The Head Related Transfer Function
The Head Related Transfer Function has to do with the complex way that all the junk between our ears affects the way sound waves hit our ears. Due to the size, density and shape of the human head higher frequencies or smaller wavelengths will exhibit more pronounced effects. Typically frequencies above 2.5 KHz are most effected by the head. The head transfer function produces both phase and frequency effects, so the best way to counteract or control these effects are through EQ. Here is another one of those places in record where the term “There is no such thing as a free lunch” really applies. EQ used as a tool to add and blend layers to a mix, but it can also have an effect on the stereo image. Unfortunately there are no hard fast rules for EQing for a good mix and EQ for a good stereo image, every mix will be different, the process to marry the two will undoubtedly be careful listening and trail and error tweaking.
Inter Aural Amplitude Differences
Inter Aural Amplitude Differences are possibly the easiest to achieve. They occur when a sound is simply louder in one ear than the other. The easiest way to manipulate this is panning, but you could also accomplish this by processing the left channel differently from the right, though you will undoubtedly add some of the other aforementioned effects as well.
Another thing to mention about stereo image is which mic techniques are most conducive to stereo imaging and which techniques can reproduce the effects above. Below is a chart of which techniques will capture which effects.

End

