Back to blog
MixingMay 9, 202612 min read

How to Mix Vocals: The Full Chain From Pitch Correction to Print

Vocals are the most important element in 95% of modern mixes. They are also the element where the difference between amateur and professional is most audible. The good news: vocal mixing is a solved problem. There is a roughly stable signal chain that works for most pop, rock, country, R&B, and folk vocals, and the variations between styles are mostly differences of degree, not kind. Once you understand the chain and what each stage is actually doing, you can apply it intelligently rather than copying presets and hoping.

The chain in order: gain staging and clean-up, subtractive EQ, compression (often two stages), additive EQ and saturation, de-essing, send to reverb, send to delay, optional second compressor or limiter for control. That is the full path from raw recording to print-ready vocal. Skip a stage and you will hear the absence; do too much in any one stage and you will hear the excess.

Gain staging first. Look at your raw vocal track. Peaks should hit roughly -6 dBFS, with the body of the performance averaging around -18 to -12 dBFS RMS. If your singer was hot and you are clipping, ride the gain down with clip gain (Pro Tools) or a gain plugin. If they were quiet and the noise floor is too audible, ride the quiet sections up. Doing this with clip gain rather than compression keeps your dynamic range intact and gives every downstream plugin clean, consistent input. This is the single most-skipped step in amateur mixing and it is the reason your compressor sounds wrong no matter what settings you try.

Subtractive EQ comes next. The goal is to remove what does not belong, not to add what is missing. Start with a high-pass filter at 80 to 100 Hz for most male vocals, 100 to 130 Hz for most female vocals. This kills mic stand thumps, plosives that survived the pop filter, and low-end mud that has nothing to do with the voice. Next, sweep with a narrow boosted band looking for honkiness in the 250 to 500 Hz range and harshness in the 2 to 4 kHz range. When you find the offending frequency, cut by 2 to 4 dB with a moderate Q. Do not cut more than that — you are not trying to reshape the voice, just to clean it up.

First compressor: an opto-style or FET-style compressor doing slow, musical level control. Ratio 2:1 to 3:1, attack 10 to 30 ms, release set by ear so the gain reduction needle pumps with the syllables. Aim for 3 to 5 dB of gain reduction on the loudest words. This is the compressor that makes the vocal sit consistently in the mix without sounding squashed. The LA-2A and 1176 hardware emulations are the workhorses here. If you only own one, the LA-2A clone gets you 80% of the way to most pop vocal sounds.

Second compressor (optional but common in modern pop): a faster compressor doing peak control. Ratio 4:1 to 6:1, attack 1 to 5 ms, fast release. Aim for another 2 to 4 dB on the loudest peaks. Stacking two compressors with different time constants does what one compressor cannot — slow musical leveling plus fast peak catching, without either compressor having to work hard enough to sound bad. This is one of the open secrets of modern vocal mixing.

Additive EQ after compression. Now that the dynamic range is controlled, you can add air and presence without it becoming painful. A gentle high shelf around 10 to 15 kHz, boosted 2 to 4 dB, opens up the top end. A narrow boost around 4 to 6 kHz adds presence and intelligibility. Be careful: every dB you add at 4 kHz also adds sibilance, which is why de-essing comes next.

De-essing is targeted compression on the sibilant frequencies (typically 5 to 10 kHz). Use a dynamic EQ or a true de-esser plugin. Set the threshold so it kicks in only on the harsh "S" and "T" sounds, not on the body of the vowels. 3 to 6 dB of reduction on offending consonants is usually enough. If your de-esser is constantly engaged, your additive EQ is too aggressive — pull back the high shelf.

Reverb sends. Use a stereo reverb bus, not an insert. Plate reverb (1.5 to 2.5 seconds decay) is the safest choice for most styles. Pre-delay of 20 to 50 ms keeps the dry vocal forward while still adding space. EQ the reverb return: high-pass at 300 Hz, low-pass at 8 kHz. This keeps mud out of the low end and harshness out of the top, leaving a band of "air" that sits behind the dry vocal without competing with it.

Delay sends are where modern vocal production gets interesting. A 1/4-note delay synced to tempo, sent at -20 dB to -30 dB below the dry vocal, adds depth without being audible as a discrete echo. Slap delay (80 to 120 ms, single repeat, no feedback) adds vintage character. Both can be sent to the reverb afterward for delay-then-reverb texture.

Reference tracks. A vocal mix without reference is a vocal mix flying blind. Pull up the most-streamed vocal in the same genre and key as your song. A/B your vocal against it on the same playback system. The professional vocal will almost always sit slightly more present, slightly less dynamic, and have less low-end and more high-end than yours. Use that gap as your todo list.

Common mistakes: too much compression early (kills the life), too much reverb (pushes the vocal back), no de-essing (painful highs), no high-pass (muddy mix), trying to fix tuning with EQ (use Melodyne or Auto-Tune for that). Each of these is a beginner pattern; eliminating them is a measurable upgrade in mix quality.

Pick one finished song you love and reverse-engineer its vocal chain by ear. Match the dynamics first, then the tonal balance, then the space. Doing this once teaches you more than a year of preset hunting. The Vocal Chain drill series in Studio Mix walks you through the same loop with synthetic stems.

Start Free