Why Coaxial Speaker Geometry Creates Sound That Single Drivers Cannot Reproduce
Sabbat E16 Noise Cancelling Wireless Earbuds
If you moved your head exactly two inches to the right while listening to a pair of speakers, would the sound change?
For nearly every multi-driver speaker ever made, the answer is yes. And the reason has nothing to do with the quality of the components, the amplifier, or the recording. It has to do with geometry — specifically, the physical distance between two drivers that are trying to act as one.
This is not a minor quirk. It is a fundamental limitation baked into the physics of sound. And it is the reason that some of the most obsessive audio engineers in history — from recording studio designers in 1940s Hollywood to hearing-aid researchers in modern Cambridge — have pursued a deceptively simple solution: put both drivers on the exact same axis.
The result is called a coaxial speaker. Understanding why it exists, and why it still cannot be easily replaced by a single driver, reveals something profound about the nature of sound itself.

The Two-Inch Problem That Ruins Your Music
Sound travels through air at approximately 343 meters per second. That number feels abstract until you convert it into something tangible: at a frequency of 1,000 Hz — roughly the pitch of a soprano's high note — a single sound wave spans about 34 centimeters from peak to peak. At 10,000 Hz, the wavelength shrinks to just 3.4 centimeters. At 20,000 Hz, near the upper limit of human hearing, it is barely 1.7 centimeters.
These numbers matter because speakers typically divide the audible spectrum between two or more drivers. A woofer handles the low frequencies. A tweeter handles the highs. Between them lies a crossover region — usually somewhere between 2,000 and 5,000 Hz — where both drivers are actively reproducing sound simultaneously.
In a conventional speaker, the tweeter sits above or beside the woofer. The physical separation between them might be 10 centimeters, 15 centimeters, sometimes less. This seems negligible until you remember that at the crossover frequency, the wavelength of sound is comparable to that separation.
When both drivers produce the same frequency from different physical locations, their sound waves arrive at your ears at slightly different times. At some positions, the wave crests align — reinforcing the sound. At others, a crest from one driver meets a trough from the other — canceling it. Move your head even a few inches, and the pattern shifts entirely.
This phenomenon is called lobing, named for the lobe-shaped patterns visible when engineers measure a speaker's output at different angles. It is not a defect in manufacturing. It is an unavoidable consequence of geometry. Any two sound sources separated by a distance that approaches half the wavelength of the frequencies they share will produce these interference patterns.
The result is a speaker whose tonal character changes depending on where you sit. In a recording studio, where the engineer sits in one precise location, this can be managed. In a living room, where listeners scatter across a couch, it cannot.
When Two Waves Collide: The Physics You Were Never Taught
To understand why coaxial design matters, you need to understand what actually happens when two sound waves meet.
Sound is a pressure disturbance — alternating regions of compressed and rarefied air molecules. When two such disturbances occupy the same space, they combine according to a principle called superposition. Where a compression from one source meets a compression from another, the pressure doubles. Where a compression meets a rarefaction, they cancel. The net result depends entirely on the relative timing — the phase — of the two waves.
This is not abstract theory. It is the same physics that makes noise-canceling headphones work: they generate a wave that is the exact mirror image of the incoming noise, and the two annihilate each other at your eardrum.
In a multi-driver speaker, this interference happens unintentionally. At the crossover frequency, both the woofer and tweeter are reproducing the same notes. A violin's A4 (440 Hz) produces overtones at 880 Hz, 1,320 Hz, 1,760 Hz — right in the crossover zone. If the listener is positioned such that the path from the violin's recorded fundamental to the woofer is slightly different in length than the path to the tweeter, the overtones arrive at slightly different times.
The ear perceives this not as silence (the cancellation is rarely complete) but as a subtle hollowing or coloration of the instrument's timbre. The violin still sounds like a violin, but something about it feels slightly off — less present, less three-dimensional, less real.
Engineers have a rule of thumb: when the distance between two sound sources is less than one-quarter wavelength at the crossover frequency, interference effects become negligible. At a typical crossover of 2,500 Hz (wavelength ≈ 13.7 cm), that means the drivers must be within about 3.4 centimeters of each other. In a bookshelf speaker, the physical gap between a woofer's center and a tweeter's center is often 10 centimeters or more.
This is the fundamental math problem that coaxial design solves.

The Point Source — Audio's Holy Grail Since 1928
In acoustics, the theoretical ideal has a name: the point source. It is a concept borrowed from physics — a source of radiation so small that it occupies a single mathematical point in space. From a point source, waves propagate as perfect spheres, uniform in every direction, uncolored by the shape or size of the source itself.
No real speaker is a point source. Every driver has physical dimensions. A 6-inch woofer is not a point — it is a surface that pushes air. But some designs approximate the ideal more closely than others.
A single full-range driver comes closest in concept. All frequencies emanate from one diaphragm. There are no crossover regions, no path-length differences, no lobing. The phase relationships between a sound's fundamental and its harmonics are preserved perfectly.
But a single driver has its own physics problem — actually, several of them.
First, bandwidth. No practical diaphragm can move slowly enough to reproduce 20 Hz bass while simultaneously vibrating fast enough to reproduce 20,000 Hz treble. The diaphragm's mass limits its high-frequency response. Its stiffness limits its low-frequency excursion. Engineering a driver that covers the full range means compromising on both extremes.
Second, Doppler distortion. When a single diaphragm is simultaneously producing a low bass note (requiring large, slow excursions) and a high treble note (requiring tiny, rapid vibrations), the bass excursion physically moves the source of the treble. The treble frequency gets modulated by the bass motion — exactly like the pitch of an ambulance siren changes as the vehicle passes. The result is intermodulation: new frequencies that were never in the original recording, audible as a subtle grit or harshness.
Third, directivity. As frequency increases, a driver's output becomes more directional — beaming forward like a flashlight rather than spreading outward like a lantern. A 6-inch driver beams significantly above about 2,300 Hz. A 1-inch tweeter does not beam until above 13,000 Hz. This mismatch means a single large driver sounds different off-axis than on-axis.
So the single driver preserves phase coherence but sacrifices bandwidth, adds Doppler distortion, and beams at high frequencies. The multi-driver speaker solves those problems but introduces lobing and phase errors. This is the central tension in loudspeaker design.
The coaxial approach attempts to capture the benefits of both.
How Altec Lansing Changed Recording Studios Forever
In 1943, a company called Altec Lansing introduced the Duplex 601 — the first commercially viable coaxial loudspeaker. Its successor, the Model 604, appeared in 1944 and within a few years became the standard studio monitor in virtually every major American recording facility.
The 604's design was ingeniously simple in concept and brutally difficult in execution. It combined a 15-inch woofer with a high-frequency compression driver mounted behind it. The compression driver fired through a horn that passed through the center of the woofer's magnetic structure, exiting through a narrow opening at the apex of the woofer cone.
This meant both drivers shared the same axis. High and low frequencies originated from essentially the same physical location. The lobing that plagued separate-driver designs was eliminated. An engineer could move around the control room and hear consistent tonal balance — a revolutionary capability at the time.
The 604 was not perfect. The horn's mouth created diffraction — sound waves bending around its edges, reflecting off the woofer cone, and arriving at the listener slightly delayed relative to the direct sound. This produced a subtle coloration that became more noticeable at higher volumes. And because the high-frequency driver was physically positioned behind the woofer, its sound arrived at the listener slightly later than the low frequencies — a time-alignment error measured in fractions of a millisecond that nonetheless blurred transient detail.
Despite these flaws, the 604's advantages were so significant that it remained in continuous production for over six decades. Great Plains Audio in Oklahoma still manufactures versions using the original Altec tooling — a testament to a design that got the fundamentals right in 1944.
The 604's influence on recorded music is almost impossible to overstate. From the 1950s through the 1980s, the vast majority of popular music — rock, jazz, soul, country — was mixed and mastered on Altec 604s or their descendants. The sound of those records, the balance of frequencies that defines mid-century recording aesthetics, was shaped by a coaxial speaker.
The British Counterpart: Tannoy's Waveguide Revolution
Across the Atlantic, another company was pursuing coaxial design with a fundamentally different approach. Tannoy, founded in London in 1926 by Guy R. Fountain (the name is a contraction of "tantalum alloy," a material used in the company's early rectifiers), had been building loudspeakers for public address systems and cinemas since the 1930s.
In 1947, Tannoy's chief engineer Ronald Rackham created the Dual Concentric driver. Where the Altec 604 placed a compression horn in front of the woofer, Rackham's design placed the high-frequency driver behind the woofer's motor unit and fired it through an acoustic waveguide — a concept borrowed from microwave engineering — that passed through the center of the woofer's magnetic structure and exited at the apex of the cone.
The critical innovation was this: the woofer cone itself served as the final section of the high-frequency horn. The cone's flare continued the compression driver's expansion rate, creating a smooth, continuous acoustic path from the HF diaphragm to the listening room. This meant the high-frequency output was radially symmetric — uniform in all directions — rather than merely mirror-image symmetric as in the Altec design.
Rackham set the crossover point at 1,000 Hz, nearly a full octave above the horn's natural cutoff frequency. This generous margin resulted in extraordinarily low coloration — the horn's resonant character was pushed well below the frequencies where it could be heard.
The first Tannoy Dual Concentric, the 15-inch Monitor Black, used magnets producing 12,000 gauss (1.2 Tesla) for the low-frequency voice coil and 18,000 gauss (1.8 Tesla) for the high-frequency coil — formidable figures for the 1940s. Decca Records purchased the first six units. EMI visited Tannoy's factory in 1951 and adopted the design for their studios. Within a decade, Tannoy Dual Concentrics were the European studio standard, serving the same role that Altec 604s played in America.
The Tannoy approach had one significant weakness: because the woofer cone formed part of the high-frequency horn, the woofer's motion modulated the tweeter's output more severely than in the Altec design. This intermodulation distortion increased with volume — a problem that would not be fully addressed until digital signal processing arrived decades later.

The Doppler Paradox: Why Your Woofer Lies to Your Tweeter
The most persistent challenge in coaxial design is not mechanical or electrical — it is physical. It is a consequence of the Doppler effect, the same phenomenon that makes a passing ambulance siren rise in pitch as it approaches and fall as it recedes.
In a coaxial driver, the tweeter is physically mounted to the woofer — or at least positioned within the woofer's structure. When the woofer moves forward to produce a bass note, it carries the tweeter with it. When it moves backward, the tweeter moves backward too. The tweeter is now a moving sound source, and its high-frequency output is frequency-modulated by the woofer's motion.
Consider a concrete example. Suppose the woofer is producing a 100 Hz bass tone while the tweeter is simultaneously reproducing a 5,000 Hz signal. As the woofer cone moves forward (toward the listener), the tweeter's apparent frequency increases slightly — say to 5,010 Hz. As the cone moves backward, the frequency decreases to 4,990 Hz. The result is not a clean 5,000 Hz tone but a modulated signal with sidebands at 5,100 Hz and 4,900 Hz.
These sidebands are frequencies that were never in the original recording. The ear perceives them as a subtle harshness or metallic quality overlaid on high-frequency content — particularly noticeable on vocal sibilance, cymbal shimmer, and string overtones.
The magnitude of this Doppler distortion is proportional to the woofer's cone displacement. At low volumes with modest bass content, the effect is negligible. At high volumes with deep bass — think orchestral crescendos or electronic music drops — it becomes audible.
This is the paradox at the heart of coaxial design. By placing both drivers on the same axis to eliminate spatial interference, you introduce a new form of temporal interference. You solve lobing but create Doppler distortion.
Engineers have developed several countermeasures. Some designs mechanically decouple the tweeter from the woofer using compliant mounting systems that absorb the woofer's motion. Others use three-way coaxial configurations — adding a dedicated midrange driver — which reduces the excursion required from any single driver and thus reduces Doppler modulation.
In the 2000s, Fulcrum Acoustic pioneered the use of digital signal processing to actively compensate for coaxial imperfections. Their system analyzes the woofer's motion in real time and pre-distorts the tweeter's signal to counteract the predicted Doppler shift. It is the audio equivalent of the image stabilization system in a camera — correcting for physical movement electronically.
Genelec took a different approach with their Minimum Diffraction Coaxial (MDC) driver, detailed in a paper presented at the Audio Engineering Society convention. Their design uses elastic material to bridge the gap between the tweeter dome and the midrange cone, creating a continuous, smooth surface that eliminates the diffraction that plagues traditional coaxial designs. The result is a driver with exceptionally smooth frequency response both on-axis and off-axis.
Your Ear Is a Phase Detector
The reason all of this matters — the lobing, the Doppler distortion, the diffraction — is that the human auditory system is exquisitely sensitive to phase relationships between frequencies.
When you hear a violin play A4 (440 Hz), you are not hearing a single frequency. You are hearing a fundamental at 440 Hz plus harmonics at 880 Hz, 1,320 Hz, 1,760 Hz, 2,200 Hz, and upward — each at decreasing amplitudes, each shaped by the instrument's resonant body and the player's technique. Your brain uses the precise timing relationships between these harmonics to construct your perception of the instrument's timbre, its position in space, and the character of the room around it.
This is not metaphor. The cochlea — the spiral-shaped organ in your inner ear — contains approximately 15,000 hair cells, each tuned to a specific frequency. When a sound enters the ear, different hair cells respond to different frequency components, and the brain receives a time-stamped map of which frequencies arrived when. Differences as small as 10-20 microseconds between ears are sufficient for spatial localization.
When a speaker preserves the phase relationship between a fundamental and its harmonics — delivering them to the eardrum with the same timing they had in the original recording — the brain constructs a stable, detailed, three-dimensional percept. Musicians call this imaging: the sense that you can hear exactly where each instrument is positioned in space.
When phase is disrupted — when the woofer's reproduction of the fundamental arrives at a slightly different time than the tweeter's reproduction of the harmonics — the imaging collapses. Instruments blur together. The soundstage flattens from three dimensions into two. Everything becomes louder or softer but less precise.
This is why audiophiles obsess over phase coherence. It is not esoterica. It is the difference between hearing a performance and hearing a reproduction of a performance.
A single-driver speaker, by definition, preserves phase coherence perfectly. All frequencies originate from the same diaphragm at the same time. But as we have seen, a single driver sacrifices bandwidth, adds Doppler distortion, and beams at high frequencies.
A coaxial speaker is the closest multi-driver approach to this ideal. Because both drivers share the same spatial origin, the path-length difference is effectively zero. The phase relationship between the fundamental and its harmonics is preserved — not perfectly, as in a single driver, but far more closely than in any conventionally separated multi-driver design.
The result is a speaker whose imaging holds up across a wider listening area. You do not need to sit in one specific chair to hear the three-dimensional quality of the sound. The music maintains its spatial integrity throughout the room.
The Miniaturization Miracle: From Studio Monitors to Your Ear Canal
For decades, coaxial design was the province of studio monitors and high-end home speakers — large, expensive, impractical for portable audio. The physics did not change, but the scale of the problem did when audio engineers began shrinking speakers to fit inside earbuds.
In an earbud, the listening distance is not meters or even centimeters — it is millimeters. The driver sits at the entrance to (or inside) the ear canal, roughly 10-15 millimeters from the eardrum. At this distance, phase errors that would be negligible in a room become significant.
The ear canal also creates a sealed acoustic environment. Sound waves bounce off the canal walls, creating standing waves and reflections that interact with the driver's output. In this confined space, any phase inconsistency between drivers is amplified rather than dispersed.
The crossover challenge becomes even more acute. In a room speaker, the crossover frequency is typically set between 2,000 and 5,000 Hz. In an earbud, where the tiny acoustic chamber shapes the response dramatically, crossover design is arguably more critical to the final sound than the choice of drivers themselves.
The breakthrough that enabled coaxial design in compact form factors came in 1988 from an unexpected direction. KEF, a British loudspeaker company, introduced the Uni-Q driver. Their innovation used newly available rare-earth neodymium magnets to shrink the tweeter assembly — magnet and all — small enough to fit entirely inside the woofer's voice coil. No horn. No waveguide. Just a conventional dome tweeter sitting at the exact apex of the woofer cone.
This was a fundamental shift. Previous coaxial designs (Altec, Tannoy) used compression drivers and horns — effective but bulky. KEF's approach used the same type of tweeter found in conventional speakers, just miniaturized. The woofer cone naturally acted as a waveguide for the tweeter, controlling its dispersion without additional physical structures.
The miniaturization principle that KEF demonstrated in bookshelf speakers applied even more powerfully in earbuds. When a coaxial driver is small enough to fit in an earbud shell, the physics of point-source radiation work strongly in its favor. The driver is so close to the eardrum that the wavefront has almost no space to develop lobing patterns. The phase coherence is preserved across the entire frequency range, from the deepest bass to the highest treble.
In a coaxial earbud, the tweeter sits at the center of the woofer within the same acoustic chamber. High and low frequencies originate from the same point. The crossover network — whether implemented as an electrical circuit, an acoustic filter using tubing and chambers, or a combination — must ensure that at the crossover frequency, both drivers contribute equally without constructive or destructive interference artifacts.
The result is something that no single driver can achieve: the full frequency range reproduced with the phase coherence of a point source, but without the bandwidth limitations and Doppler distortion of a single-diaphragm design. It is a compromise, like all engineering solutions, but it is a compromise that leans in the direction of what human hearing evolved to expect — sound from a single location, arriving with coherent timing.
This is why coaxial geometry, an idea born in 1928 and refined through nine decades of engineering, continues to matter. Not because it is new, but because the physics it addresses — wave interference, phase coherence, and the human brain's extraordinary sensitivity to timing — are as old as hearing itself.
Sabbat E16 Noise Cancelling Wireless Earbuds
Related Essays
Why Hybrid IEMs Sound Like One Voice, Not Two
Sennheiser Momentum 4: The Physics of Sound, Silence & 60-Hour Stamina
The Science of Safe Listening: How Bone Conduction Headphones Keep You Aware
Why Bone Conduction + Noise Cancelling Is a Physical Contradiction
The Physics of Noise Cancellation: Why Perfect Silence Fails
Beyond the Eardrum: The Hidden Physics of Bone Conduction
The Neckband Renaissance: Engineering Comfort and Fidelity in the Wireless Age
The 10-Driver IEM: A Guide to Crossovers, Tuning Switches, and Acoustic Vents
Why Do Headphones Sound Different? The Science of Audio Physics