Home | Audio Magazine | Stereo Review magazine | Good Sound | Troubleshooting |
By: DON DAVIS [Don Davis, who has more than 30 years of experience in the audio field, is a Fellow of the Audio Engineering Society, a member of the Acoustical Society of America, and a senior member of the Institute of Electrical and Electronic Engineers. He is co-owner, with his wife Carolyn, of Synergetic Audio Concepts or Syn-Aud-Con, as it is more often known. This audio consulting firm has had more than 6,000 persons attend its seminars on sound system design, installation, operation, and maintenance techniques. Davis has written hundreds of articles, as well as the best-selling How to Build Speaker Enclosures with the late Alexis Badmaieff and the newly revised Sound System Engineering with his wife. He is also a contributor to Handbook for Sound Engineers: The New Audio Cyclopedia, which is a recent addition to the Howard W. Sams & Co. Audio Library series. LEDE is a trademark of Synergetic Audio Concepts; LEDR is a trademark of Electro Acoustic Systems Inc., and RPG is a trade mark of RPG Diffusor Systems, Inc.] The Live-End Dead-End (LEDE) concept evolved from an insight I had during the mid-970s while measuring the acoustic response of monitor loudspeakers in recording studio control rooms. I was using analyzers that had been converted to do the Heyser Transform, which is the frequency -domain portion of Tim -Energy-Frequency or TEF measurements. Most obvious to me was the inability of a monitor, in a typical control room of that period, to reproduce accurately the composite record of the signal delays recorded in the studio. I saw that the LEDE room could be a neutral playback environment that would allow the recording environment to be heard with correct temporal imaging of the recorded material. The LEDE effect is a psychoacoustic effect. The original design goal for recording control rooms was to give the mixing engineer's ears the acoustic clues of a larger space, thus allowing the perception of hearing the studio rather than the control room. Those readers familiar with the illusive nature of the five physical senses should not be surprised to find that hearing is easily fooled. Human hearing is little understood even by those studying its complexities, despite the many devoted and talented workers in this field. As simple a subject as the integration time of the human hearing mechanism is under question as the newest measurement techniques reveal serious anomalies in the older data. Because LEDE theory and practice deal primarily, though not totally, with the time domain, understanding the various time relationships shown in Figs. 1 and 2 is important. The top portion of Fig. 1 is a schematic drawing of the temporal behavior of the ear/brain, with graphs of the amplitude response of the outer and middle ear below.
Sound Fields To better understand the importance of the theoretical and practical work now being done, we should first discuss a few fundamentals of sound fields in small rooms-small in the acoustical sense. Figure 2 illustrates some key time-domain characteristics of human hearing. The useful range of reflections is such that signal delay and signal level combine to create the Haas effect, but it does not evoke false imaging. This illustration encourages the belief that a "Haas-kicker" (i.e., a strong reflection which triggers the Haas effect following an initial signal delay gap) should be within 20 to 30 mS at a level optimally 5 to 10 dB below the direct sound. This would place it within the range of useful reflections and well past the danger of "image shift." The directivity of such returns is also a critical factor. Initial Signal Delay Gap: A fundamental error often committed by those unfamiliar with small-room acoustics is to believe that classical statistical methods can be used. Let's first view the sound field as a record of signal delays using a chart with vertical coordinates of level, in dB, and horizontal coordinates of delay, in seconds, from some arbitrary zero point. As shown in Fig. 3, the first signal to arrive is the direct sound, LD. This is followed by the initial signal delay gap, ISD, well known in concert-hall acoustics as the initial time delay gap, ITD. (Since time can't be delayed, I prefer the term "signal," which can be delayed.) The conclusion of the ISD is signaled by the arrival of the first significant reflection (i.e., the first reflection that is higher in level than 30 dB) below the LD, which is usually the highest level in its group of early reflections. Following the early reflections will be the reverberant sound field, if present. In small rooms, this reverberant sound field normally never appears above the ambient noise level, LAMB, in the room, and consequently we are concerned only with the direct sound level, Lo, and the early reflected level, LIRE, and not at all with the reverberant sound level, LR. The kind of reflected energy capable of triggering the Haas effect can come from a single discrete reflection or from a cluster of returns, such as those from a quadratic residue diffusor like the ones discussed in Manfred R. Schroeder's 1984 JAES paper. A first significant reflection can be broadly defined as the first appreciable level of energy to arrive after the direct sound from the source. The first significant return also has constraints of coming from the proper direction (or containing the comb filtering that mimics the proper direction) and having the correct frequency content (i.e., containing the higher frequencies that provide the correct pinnae clues). The primary goal of an LEDE listening space is to extend the ISD that occurs acoustically in the listening room to a value greater than that present in the performer's area, such as a recording studio (see Fig. 4). Leo Beranek has written in Music, Acoustics and Architecture, "Persons trained in listening-for example, blind people, who receive all their clues about the environment around them through the senses other than the eye-can 'measure' the size of a room or judge the distance to a wall behind them by the length of the time interval between the direct sound and the first reflected sound." Beranek goes on to note that this capability is not restricted to the unsighted, but that "experienced music listeners ... sense the approximate size of a hall ... by the length of the 'initial time delay gap.' " The LEDE technique, by virtue of the distance the direct sound must travel to encounter a first reflection, adjusts the initial time delay gap to the same figure that Beranek judged as desirable in the best concert halls, namely 20 mS._ It is no coincidence that the same 20 mS is the optimum delay for the maximum Haas effect in good, diffuse, semi-reverberant spaces. William B. Snow, of Bell Labs fame, in 1957 wrote in Application of Acoustical Engineering Principles to Home Music Rooms, "The direct sound alone carries the information giving the sense of direction, by allowing the listener to observe initial transients clearly during the short time interval before the many-directioned reflections begin to arrive at his ears."
The horizontal axis is 0 to 80 mS; the vertical axis is 6 dB/div. This ETC shows the actual distribution of acoustic energy vs. time in an LEDE control room. The increased density is created by the generation of deliberate specular reflections which repeatedly re-drive through the quadratic residue diffusors, QRD.
Fig. 6B-A "3-D" view of the energy depicted in the ETC of Fig. 6A. The horizontal axis is 20 to 10,000 Hz; the vertical axis is 6 dB/div. The oblique axis is time, with T2 being later than T. Note here how rapidly the energy divides into room modes, definitely not a statistically random sound field.
Fig. 7B-A perfect example of what a "3-D" sound field should look like, an exceptional LEDE control room. (Courtesy of Charles Bilello.)
Fig. 8B-This is the same small room as in Fig. 8A, after the installation of a Helmholtz resonator. Note that the low frequencies cannot be absorbed by soft, fuzzy materials. (Courtesy of Doug Jones and WFMT, Chicago.) Haas Effect: The Haas effect is the inability of the brain to discriminate be tween echoes and delays of sounds that arrive approximately 10 to 20 mS after the original waves. The reflected sound is still present but psychologically does not exist so far as the listener is concerned. The auditory system temporarily fuses sounds in this 20-mS zone. This ability to fuse sounds is what allows us to blend the direct sound, early reflections, and reverberation into one sound perception. Critical Frequency: We can also generate a graph, as in Fig. 5, with the sound pressure level, in dB, as the vertical scale and the frequency, in Hz, as the horizontal scale. The key to this graph is the critical frequency, fc, which is the frequency at which the wavelength is roughly comparable to the largest dimension of the room. Be cause the equation for f0 is merely a handy approximation for the point where the room modes begin to over lap, it can be condensed to this relatively simple form, 3 c/D, where c is the velocity of sound in air and D is the smallest room dimension. The reverberation time, RT60, is simply the apparent rate of decay, in dB per second, converted to an equivalent RT60.
Room Modes: In a room that has a decay of 300 dB per second, i.e., a room where a line drawn across the peak energy of the early reflections mimics a reverberant decay, the apparent reverberation time would be 0.2 S. Let's have a look at the actual density of the sound field in such a room; see Fig. 6. Note that T1 and T2 at the bottom of Fig. 6A indicate the time span of the oblique scale on Fig. 6B. It can be clearly seen that the modes do not overlap. Figure 7 shows what the sound field in a superb LEDE control room looks like. Once again, looking at the frequency-versus-level graph, Fig. 5, we can see that below fc there are standing waves, so that movement of the listener about the room can result in major changes in level as the listener's ears leave a peak and enter a null. The change in level can be in excess of 40 dB. Through the use of tuned resonators, these modes can be both broadened and damped. See Fig. 8 for an example of the effects of damping. Pressure Zone: The pressure zone is that region of the room where the encounter of energy with a boundary is still adding to nearly 6 dB. If we accept a phase shift of 60° as still near enough to coherent addition (i.e., 5.5 dB), then the pressure zone you associate with the floor, for a frequency of 20 Hz, is: 60° 1.130 feet S - 9 4 feet deep 360° 20 Hz This formula takes the fraction of a complete wave that is still adding coherently (i.e., 60°/360°) and multiplies that fraction by the wavelength for the specific frequency, which is calculated by dividing the speed of sound, 1,130 feet/S, by the frequency, in this case 20 Hz. Diffusion Zone: In Fig. 5, the region from fc to 4fá is the diffusion zone. The diffusion problem has been elegantly solved through the mathematics of Manfred R. Schroeder; Peter D'Antonio of RPG Diffusor Systems has made the equations into practical products. The creativity of D'Antonio (known in the industry as Dr. Diffusor) is shown in Fig. 9. In AES Preprint No. 2365 from the 81st Convention, D'Antonio writes: The primary purpose of reflection phase grating (RPG) diffusion is to 1) provide uniformly high density of closely spaced reflections at the listener position, without any density gradients or discontinuities; 2) provide a dense pat tern of uniformly distributed, irregularly spaced frequency notches, and insure that any inadvertent reflection combinations with slight time differences, which could result in broadband frequency anomalies, are minimized; 3) uniformly backscatter a broad frequency band width over a wide angle, and 4) reduce the backscattered energy to minimize frequency coloration and image shifting, resulting from interference with the direct sound.
(Courtesy of Peter D'Antonio.)
Frequency Dependency: Sometimes it seems that only polarity is not frequency dependent; indeed, polarity is defined in the IEEE Dictionary as "not frequency dependent." The question naturally arises, "What frequencies are pertinent to the LEDE concept?" The answer is, "The specular frequencies, those frequencies which can be modeled by light rays." The beauty of using reflections instead of absorption is that, once above the frequencies of diffusion, the result of reflecting the energy is non-frequency dependent. This frequency range is in the center of the ear's sensitivity, and the higher frequencies provide the pinnae with its directional clues. Specular reflections occur in the region above 4f0, and it is in this region that LEDE must be applied. If the term LEDE is to be applied correctly, then the following rule must be satisfied: Control of frequencies between f0 and 4f, by means of diffusors is such that no reflected energy departs dramatically from exponential decay or exhibits energy voids over time. Reflection-Free Zone: The original JAES paper on LEDE in September 1980 stated that to qualify as an LEDE room there had to be "an effectively anechoic path between the monitor loudspeakers and the mixer's ears which extends for at least 2 to 5 mS beyond the studio's initial time delay gap." Note that no limitations were placed on how this was to be accomplished, inasmuch as it was realized at the time that the end result could be accomplished either by absorption or by the loudspeaker's directivity in combination with the room geometry, i.e., a highly directional loudspeaker in a wide room (see Fig. 10). An excellent example of the former approach is shown in Fig. 11, which illustrates how Doug Jones of Electro Acoustic Systems made an ordinary room meet this particular part of the LEDE criteria. When this technique is employed, it allows the designer to use specular surfaces at the front of the room as sources of energy spaced later in time to again "drive" the rear wall diffusors, thus adding to the total energy density present over a longer time interval. In control-room design, taking advantage of source directivity has always been the way to avoid reflections from the control room window.
Note that the side plots show relative directivity and on -axis frequency response.
Monitor Loudspeakers At frequencies below fc, an LEDE room is a "live" room. If the room construction has been massive enough so that bass frequencies stay in the room, the wall in which the monitors are mounted lends substantial support. Loudspeakers should not be left standing outside of a wall surface in an LEDE room. They should be mounted on shock mounts inside the wall and should use the wall at low frequencies. Fortunately, at the specular frequencies above 4fc, there are loudspeakers available that approach reasonable control of their polar responses. In accurately engineering an LEDE listening space, the loudspeakers chosen must be measured carefully through the critical region from 500 to 2,000 Hz for their directional responses (see Figs. 12 through 14). Bipolar and similar designs are not optimum choices for LEDE rooms. If one is used, the LEDR tape, discussed later, can reveal to the listener the audible compromise encountered. Characteristics of a good loud speaker include: 1) The fewer crossovers, the better (two-way systems are usually best); 2) Full signal alignment; 3) Identical amplitude and phase responses for both speakers; 4) Identically controlled polar responses for both speakers; 5) Ability to produce sound pres sure levels of at least 130 dB SPL at 10 feet; 6) Sufficiently well-damped enclosures which, once shock mounted, radiate little spurious energy; 7) A reasonable impedance to match the narrow range of stable amplifiers capable of driving such loud speakers to the sound pressure level specified above; 8) Harmonic distortion below 2% at all frequencies above 50 Hz; 9) Directivity factor of at least 5, with a 0 of 10 the most useful, for the frequency band from 500 to 5.000 Hz, and 10) Time -domain behavior for each individual driver which is the conjugate of a network at the crossover frequency, and which adjusts the frequency-dependent behavior of the drivers back to uniformity. Imaging The work of Carolyn "Puddie" Rodgers has shown that the pinnae trans form incoming signals, superimposing upon the original signal a comb filter-like spectrum. Recent evaluations of misaligned loudspeakers and early re flections (i.e., those from less than 3 feet away) reveal that they can generate pseudo-pinnae clues, resulting in dramatic image shifts. Directional clues to a large degree reside in comb filter information at high frequencies. Signal delays that generate comb filters cause specific and predictable directional effects. Misalignment of loudspeakers causes comb filters. This effect can be devastating in an LEDE room designed for aligned loudspeakers, because the misalignment causes a change in polar response (see Fig. 15). LEDR Tape Researchers at Northwestern University have developed a special tape recording that, when played in an LEDE room over signal -aligned loud speakers, causes the listener to perceive the sound as rising straight up out of the right loudspeaker almost to the ceiling, and then going in back of the listener, circling around the listener, moving forward, and finally drop ping back down from the ceiling area into the left loudspeaker. Any aberration in either the loudspeakers or the room mars this remarkable imaging. (The LEDR tape can be purchased for $250 in two formats: Quarter-inch, 15 ips open-reel analog or Sony Beta F1 digital, from Doug Jones, Electro Acoustic Systems, 715 Monroe St., Evanston, III. 60202.) LEDE in the Home Where loudspeakers cannot be built into the wall, I suggest that great care be taken to avoid diffraction effects around the enclosure. "Wings" made of material such as Sonex mounted on plywood panels can be very effective. The greatest difficulty I encounter in consumer loudspeakers is their claim to be aligned when they're not. The high -resolution energy -time curves (ETCs) which the late Richard Heyser included in his loudspeaker reviews have been a totally reliable way to inspect loudspeaker alignment. Particular care needs to be exercised with regard to the vertical polar response of loudspeakers for use in the home, as the ceiling is quite often too low and highly reflective. Floors can be handled adequately with heavy carpeting and pads. Paul Klipsch's advocacy of corner placement for the past 50 years is still correct. Corner placement has the following advantages: The entire audience-coverage angle is within 90°, polar control is excellent at specular frequencies, and the best low-frequency modal response in acoustically small rooms is obtained. It is absolutely necessary to intro duce diffusion behind the listener. Where the budget allows, quadratic residue diffusors are best. Bookcases can also be useful. Just make certain that the rear surface is not a hard, flat wall. In addition, make certain that it is not absorptive. Whatever absorptive material is chosen, it can and should be hidden be hind acoustically transparent grille cloth-like material. I prefer using Sonex panels because the wedges of this material scatter sound at the higher frequencies, and they smooth the transition to acoustic transparency near the critical frequency, fc, normally en countered in small rooms. Whatever material is chosen, the designer needs to know its behavior at specific angles of incidence, not its statistical behavior in a reverberation chamber. Absolute Polarity Figures 16 and 17 illustrate the phase measurement of a loudspeaker, in polarity and out of polarity. Energy-time curves and energy-frequency curves do not show any effect of being out of polarity, yet it is audible. Only phase measurements reveal absolute polarity. Ed Long's MDM near-field monitors allow front-panel switching of polarity, as do a few preamplifiers. On a recent demonstration Compact Disc, I observed the need to switch absolute polarity on both channels for every selection. Incorrect absolute polarity manifests itself as sounding like two holes in the wall. When switched, the result is a solid curtain of sound with correct imaging between the two loudspeakers. These minor changes become quite audible when one is listening to good loudspeakers in an LEDE environment. What does an LEDE room sound like? You shouldn't hear it at all. What you should hear is the characteristic acoustic signature of the room in which the recording you are listening to was made. Most good classical recordings include the sound of the original room on the disc, but the small listening room usually masks it. When the physically and acoustically small room is converted into a physically small, acoustically large room by LEDE means, then this signature is heard from the loudspeakers. When the recordings are essentially multi-channel mono (often up to 32 channels), it makes little difference what kind of a room you listen in. Many folk, country -western, and some con temporary recordings are properly re corded, in terms of being useful to re produce the ambience of the recording space over the playback system. Many recording engineers, upon firs hearing one of their non-LEDE recordings played back in an LEDE environment, say "That's terrible!" Only after time elapses do they realize that what's terrible is the mix and not the environment. LEDE rooms are harsh critics of technically deficient recordings. On the other hand, technically proficient recordings are a sensuous reward in an LEDE room. There are presently more than 200 Techron TEF analyzers in the field, mostly in the hands of professional sound contractors and acoustical consultants. These consultants typically charge $500 for a day of measurement, which includes an operator for the TEF analyzer. If five audio enthusiasts were to contract for such measurements to be made in a hi-fi dealer's showroom, choosing a loudspeaker would become an easily documented task-for just $100 per person. Use of the LEDR tape in their own listening rooms would reveal if they needed further TEF analysis of those rooms. Eliminating the loudspeaker as a variable by means of TEF analysis makes playing the LEDR tape in the home an acid test of a listening room's quality. You would be astounded to see how easy it is to find interfering reflections in a home listening room using the TEF analyzer (see Fig. 18). The direction from which reflections come has an important bearing on how that energy is perceived. Thanks to the creativity of Farrel Becker, we can now easily plot the direction as well as the travel time of each and every reflection. I am willing to predict that by the mid -1990s, no serious home system will be made without TEF analysis. Kurt Graffy of Paoletti/Lewitz in San Francisco has recently started working with custom installers of high -quality music systems. Graffy's firm is hired to find the offending surfaces in the mu sic room and suggest the placement of absorption and/or diffusors to establish a clean ISD gap and precise stereo imaging.
Ideally, the consultant should be engaged at the time a listening room is being designed, just as with a control room. Doug Jones, for example, has been engaged to design and super vise the building of several home listening rooms in recent months. His wedding of psychoacoustics and TEF has enabled him to make valuable contributions to the design of home entertainment rooms. I can't resist presenting the measurement shown in Fig. 19. It is a TEF RASTI measurement (RASTI stands for rapid speech transmission index). We now are able to measure speech intelligibility objectively, with as great an accuracy as live listening groups can do with carefully constructed intelligibility tests. How long do you suppose it will be until we can do the same for music? Selected Bibliography Barron, M. F. E., "The Effects of Early Reflections on Subjective Acoustic Quality in Concert Halls," Ph.D. Thesis, University of Southampton (1974). Blauert, Jens, Spatial Hearing (MIT Press, Cambridge, Mass., 1983). D'Antonio, P. and J. H. Konnert, "The Reflection Phase Grating Diffusor: Design Theory and Application," Journal of the Audio Engineering Society (JAES), Vol. 32, No. 4 (April 1984), pgs. 228-238. Davis, D. and C. Davis, "The LEDE Concept for the Control of Acoustic and Psychoacoustic Parameters in Recording Control Rooms," JAES, Vol. 28, No. 9 (September 1980), pgs. 585-595. Henry, J., "On Acoustics Applied to Public Buildings," paper presented to the American Association for the Advancement of Science, August 1856, later published in the Smithsonian Institution publication. Jones, D. R., "Designing a Stereo Room for Spatial Hearing Research on Stereo Imagery Using the TEF," paper presented at the International Computer Music Conference, Paris, 1984. Jones, D. R., W. L. Martens, and G. S. Kendall, "Optimizing Control Rooms for Stereo Imagery," paper presented to the Acoustical Society of America, Nashville, Tenn., November 5, 1985. Kendall, G. S. and C. A. P. Rodgers, "The Simulation of Three-Dimension al Localization Cues for Headphone Listening," Proceedings of the 1982 International Computer Music Conference. Kuttruff, H., Room Acoustics (Halstead Press, New York, 1973). Madsen, E. R., "Extraction of Ambiance Information from Ordinary Recordings," JAES, Vol. 18, No. 5 (October 1970), pgs. 490-496. Peutz, V. M. A., "Articulation Loss of Consonants as a Criterion for Speech Transmission in a Room," JAES, Vol. 19, No. 11 (December 1971). Rodgers, C. A. P., "Pinna Transformations and Sound Reproduction," JAES, Vol. 29, No. 4 (April 1981), pgs. 226-234. Schroeder, M. R., "Progress in Architectural Acoustics and Artificial Reverberation: Concert Hall Acoustics and Number Theory," JAES, Vol. 32, No. 4 (April 1984), pgs. 194-203. Snow, W. B., "Application of Acoustical Engineering Principles to Home Mu sic Rooms," IRE Transactions on Audio (November/December 1957), pgs. 153-159. Syn-Aud-Con Tech Topics, Vol. 12. No. 1 (Fall 1984). (adapted from Audio magazine, Aug 1987) Also see: Designing a Home Listening Room (Sept. 1987) = = = = |
Prev. | Next |