Sounds Casey O’Callaghan A dissertation presented to the faculty of Princeton University in candidacy for the degree of Doctor of Philosophy Recommended for acceptance by the Department of Philosophy November 2002 c Copyright by C. O’Callaghan, 2002. All rights reserved. Abstract This dissertation is about sounds and auditory perception. It is motivated in the spirit of work on the metaphysics and perception of color. In Chapter 1, I motivate and develop an account according to which sounds are events. In particular, a sound is the event of an object or interacting bodies disturbing a surrounding medium in a wave-like manner. The sound just is the disturbance event itself. I develop this ‘Event View ’ of sounds to account for several pervasive sound-related phenomena, including sound wave transmission through barriers and constructive and destructive interference. The subject of Chapter 2 is echoes. I argue that echo experiences are illusory experiences of ordinary primary sounds. Just as there is no new object that we see at the surface of a mirror, there is no new sound that we hear at a reflecting surface. Rather, the sound we hear as an echo just is the original sound, though its perception involves illusions of place, time, and qualities. The case of echoes need not force us to adopt a conception according to which sounds are persisting object-like particulars that travel through space. A theory of sounds must address the audible qualities: pitch, loudness, and timbre. In Chapter 3 I argue that despite the failures of simple forms of physicalism about the audible qualities to account for psychophysical evidence, a promising form of physicalism can be developed on the basis of recent accounts of pitch experience. According to this alternative view, pitch is a physical property that sounds possess independently of their relations to perceivers, and is not the “subjective correlate of frequency.” Pitch, however, is interesting only from an anthropocentric perspective. In Appendix A, I begin to develop the alternative (physicalist) view proposed in Chapter 3. Recent work on critical bands provides the basis for the account of pitch I advocate. Finally, I argue that the account in terms of critical bands can be extended to provide candidates for the properties of timbre and loudness. iii By listening, one will learn truths. By hearing, one will only learn half truths. Lucky Numbers 6, 14, 19, 27, 30, 34. From a fortune cookie. iv Contents Abstract iii List of Figures vii List of Tables viii Acknowledgements ix 1 Sounds and Events 1.1 What is a Sound? . . . . . . . . . . . . . . 1.2 Three Theories of Sound . . . . . . . . . . 1.3 Locatedness and the Wave View . . . . . 1.4 The Argument from Vacuums . . . . . . . 1.5 The Event View . . . . . . . . . . . . . . 1.6 Transmission . . . . . . . . . . . . . . . . 1.7 Destructive and Constructive Interference 1.8 Concluding Remarks . . . . . . . . . . . . . . . . . . . . 1 1 1 3 7 10 13 17 19 . . . . . . . . . . . 21 21 22 24 25 27 30 30 31 34 37 40 2 Echoes 2.1 Introduction . . . . . . . . . . . . . . . 2.2 The Event View of Sounds . . . . . . . 2.3 The Problem of Echoes . . . . . . . . 2.4 The Solution . . . . . . . . . . . . . . 2.5 Are Echoes Distinct Sounds? . . . . . 2.6 Do Echoes Show That Sounds Are Not 2.6.1 The Argument from Echoes . . 2.6.2 Sounds Do Not Travel . . . . . 2.6.3 Re-encounters . . . . . . . . . . 2.6.4 Some Unsavory Consequences . 2.6.5 Conclusion . . . . . . . . . . . v . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Events? . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.7 2.8 Is the Illusion Tolerable? . . . . . . . . . . . . . . . . . . . . . Concluding Remarks . . . . . . . . . . . . . . . . . . . . . . . 3 Audible Qualities 3.1 Introduction . . . . . . . . . . . . . . . . 3.2 Simple Views of Pitch . . . . . . . . . . 3.2.1 The Very Simple View . . . . . . 3.2.2 The Simple View . . . . . . . . . 3.3 Problems with the Simple View . . . . . 3.4 The Standard (Subjectivist) View . . . 3.5 The Alternative (Physicalist) View . . . 3.6 Audible Qualities and the Event View of 3.7 The Doppler Effect . . . . . . . . . . . . 3.8 Concluding Remarks . . . . . . . . . . . A The Alternative View A.1 Pitch and Critical Bands . . . . . . . A.1.1 The Pitch of Complex Tones A.2 The Musical Relations . . . . . . . . A.3 Spectrum Shift . . . . . . . . . . . . A.4 Timbre and Loudness . . . . . . . . Bibliography . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Sounds . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40 42 . . . . . . . . . . 43 43 44 44 45 47 48 52 54 56 59 . . . . . 66 66 72 76 80 83 87 vi List of Figures 3.1 3.2 3.3 3.4 3.5 Sinusoidal Motion . . . Fourier Composition . . The Mel Scale of Pitch . The Bark Scale of Pitch Vibration Transmission . . . . . . . . . . . . . . . . . . . . 60 61 62 63 65 A.1 Critical Bandwidth as a Function of Frequency . . . . . . . . A.2 Equal Loudness Contours . . . . . . . . . . . . . . . . . . . . 85 86 vii . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . List of Tables 3.1 Selected Critical Bands . . . . . . . . . . . . . . . . . . . . . viii 64 Acknowledgements I thank the students and faculty of Princeton University, who have been so generous with their time while discussing these ideas. Particular thanks in this respect are due to Paul Benacerraf, John Burgess, Gilbert Harman, Scott Jenkins, Simon Keller, Sean Kelly, Michael Nelson and Jeffrey Speaks. In addition, Mark Johnston bought me a very important beer, Brian Loar’s guidance has been invaluable, and Ann Getson’s wit has been a presence. Special thanks are due to my advisor, Gideon Rosen, who has provided a paradigm of how philosophy should be done; to Peg Kelly and Frank Kelly; and to Emily Koehn, the most devout Event theorist. ix Chapter 1 Sounds and Events 1.1 What is a Sound? Sounds are the public objects of auditory perception. When a car starts it makes a sound; when hands clap the result is a sound. Sounds are what we hear during episodes of genuine hearing. Sounds have properties such as pitch, timbre and loudness. But this tells us little about what sort of thing a sound is—which metaphysical category it belongs to. This is the question I wish to answer. 1.2 Three Theories of Sound Locke held that sounds are properties of bodies. More specifically, he held that sounds were secondary qualities: sensible qualities possessed by bodies in virtue of the “size, figure, number, and motion” of their parts, but nonetheless distinct from these primary attributes. 1 Robert Pasnau has recently proposed an account according to which sounds are physical properties of ordinary external objects. 2 On what I shall call the Property View an object “has” or “possesses” a sound when it vibrates at a particular frequency and amplitude. Pasnau follows Locke in claiming that sound is a property of objects, though he reduces sound to the primary quality that is the categorical base of Locke’s power, i.e., that of vibration or motion of a particular sort. The received view of auditory scientists and physicists is quite different. It holds that a sound is a disturbance that moves through a medium such as 1 2 Locke, Essay II.viii (1975), p. 138. Pasnau (1999) and (2000). 1 CHAPTER 1. SOUNDS AND EVENTS 2 air or water as a longitudinal compression wave. Vibrating objects produce sounds, but sounds themselves are waves. When we hear sounds we do not hear bodies or properties of bodies; we hear the pattern of pressure differences that constitutes a wave disturbance in the surrounding medium. The common interpretation of Aristotle is that he held a very similar view. De Anima II.8 says that “sound is a particular movement of air,” 3 which seems to indicate that Aristotle held a version of the received view, or as I shall call it, the Wave View. We can, however, take our interpretative cues from other passages in the same chapter and arrive at a view that has certain advantages over the other two theories and will be at the core of the alternative I will develop. At 420b13, Aristotle says that “everything which makes a sound does so because something strikes something else in something else again, and this last is air.” 4 So, a striking causes or makes a sound when it happens in air. The sound itself is a movement. But the sound need not be the motion of the air itself. Instead it may be the event of that medium’s being disturbed or moved. The idea is to treat ‘movement’ as the nominalization of a transitive verb and focus on constructions like ‘x moves y’ instead of ‘y is moving’. “For sound is the movement of that which can be moved in the way in which things rebound from smooth surfaces when someone strikes them,”5 then means that sound is the air’s being disturbed by the motion of an object. A sound is not motion, but the act of one thing moving another. This is not the Wave View that most attribute to Aristotle,6 but the beginnings of the Event View of sound. According to the Event View, sounds are particular events of a certain kind. They are events in which a moving object disturbs a surrounding medium and sets it moving. The strikings and crashings are not the sounds themselves, but the causes of sounds. The waves in the medium are not the sounds themselves, but rather the effects of sounds. Sounds so conceived possess the properties we hear sounds as possessing: pitch, timbre, loudness, duration, and as we shall see, spatial location. When all goes well in ordinary auditory perception, we hear sounds as they are. 3 Aristotle, De Anima 420b10 (1987), p. 179. Aristotle, De Anima 420b13 (1987), p. 179. 5 Aristotle, De Anima 420a20 (1987), p. 178. My italics. 6 See Pasnau (2000). 4 CHAPTER 1. SOUNDS AND EVENTS 1.3 3 Locatedness and the Wave View According to the Wave View, sounds are waves: a particular sound is a train of waves that is generated by a disturbance and moves through the surrounding medium. But this is not how things seem. When we hear a sound, we hear it to be located at some distance in a particular direction. In ordinary cases sounds themselves, not merely their sources, seem to be located distally. Auditory scientists call this phenomenon ‘externalization’. 7 Sounds are not perceived to travel through the air as waves do. They are heard to be roughly where the events that cause them take place. If auditory experience is not systematically illusory with respect to the perceived locations of sounds, then sounds are not waves, since they are not perceived to be where the waves are. The argument depends on a phenomenological claim. Sounds are perceived to have more or less determinate locations. When we hear a clock ticking, the sound seems to be “over there” by the clock; voices are heard to be in the neighborhood of speakers’ heads and torsos; when a door slams in another part of the house, we know at least roughly where the accompanying racket takes place. I mean that we experience sounds, in a wide range of cases, to be located at a distance from us in a particular direction. When we do not, as when a sound seems to “fill a room” or “engulf” us, the sound is perceived to be “all around,” or at least in a larger portion of the surrounding space. Hearing a sound located “in the head” when listening to earphones is another sort of sound location perception, albeit a touch odd. 8 Given the phenomenological facts, the degree to which auditory location perception is illusory or misleading should follow from a theory of sound. No theory should make the fact of location perception a wholesale illusion, though individual instances of location perception might mislead about the actual locations of sounds. Thus, I might correctly hear a stereo speaker’s sound as located at the speaker itself; but I might undergo an illusion as of hearing the sound to be located five feet to the right of that speaker. In both cases, I correctly perceive the sound to have a location, but the experience is inaccurate in the second instance. Occasionally, sound location perception is to some degree anomalous, as when sound seems to be all around in a reverberant room, when it seems to be in the head during headphone 7 Gelfand (1998) refers to this phenomenon as ‘extracranial localization’: “Sounds heard in a sound field seem to be localized in the environment,” p. 374. 8 Gelfand (1998) refers to this phenomenon as ‘intracranial lateralization’: “Sounds presented through a pair of earphones are perceived to come from within the head, and their source appears to be lateralized along a plane between the two ears,” p. 374. CHAPTER 1. SOUNDS AND EVENTS 4 listening, or when the sound seems to be “behind” a jet plane overhead. Whether and when a sound can literally fill a reverberant room, be inside the head of a subject, or be behind an airplane will depend upon one’s theory. The phenomenon of locatedness spells prima facie trouble for the Wave View. Sound waves pervade a medium and move through it at speeds determined by the density and elasticity of the medium. Yet we neither hear sounds as air sloshing around the room nor as moving roughly 340 meters through the air each second. Imagine a scenario in which engineers have rigged a surround-sound speaker system to produce a sound that seems to be generated by a bell across the room. This sound subsequently seems to speed through the air toward you and enter your head like an auditory missile. This would indeed be a strange experience, one unlike our ordinary experiences of sounds generated by relatively stable objects and events. Sounds are perceived to be relatively stationary with respect to their sources. The sound of a moving train seems to move only insofar as the train itself moves. When the train stops moving, so does its sound. The trouble for the Wave View is serious. Since sounds are heard as having stable distal locations, either the sound is not identical with the sound waves, or we misperceive one important aspect of sounds. If the sound is identical with the sound waves, the situation is not that we sometimes misperceive sounds, as when a sound ahead is heard to be behind; rather, we systematically misperceive the locations of sounds. That is, we hear the locations of all sounds incorrectly since we never hear a sound to move just as wavefronts do. Since sounds are among the things we hear, we should take the phenomenology of auditory experience seriously when theorizing about what sounds are. If the phenomenon of locatedness is not systematic misperception, then sounds are not sound waves. The Wave theorist might reply, The immediate objects of auditory perception—what we hear— are waves. Sounds just are waves. Waves and their properties are the causes of perceptions of pitch, loudness, and duration; however, we hear these qualities to be located at the place where the waves originate, i.e., at their source. Sounds seem to be where their sources are, and to this extent, auditory perception is illusory. But this illusion is a beneficial one, given our interest in sound sources as constituents of the environment. It is no surprise that we hear sounds to be located where distal objects and events are. CHAPTER 1. SOUNDS AND EVENTS 5 The Wave theorist’s response avoids the conclusion that sounds are not identical with waves by accepting that we are subject to wholesale illusion in one salient aspect of auditory experience. The strategy is to assuage concern about the location illusion by providing another candidate for bearer of the spatial properties and by highlighting the illusion’s potential benefits. Notice the tactic: by invoking the location of the source, the Wave theorist avoids assigning potentially problematic locations to sounds. But sounds have pitch, timbre, and loudness. An account that entails that sounds are located where instances of pitch, timbre, and loudness are heard to be is preferable to one that convicts auditory perception of systematic illusion about the locations of its objects. Can we eliminate illusion from the Wave theorist’s account entirely? One promising approach rejects the phenomenological claim as it stands. Instead, it says that we hear sounds to have pitch, timbre, loudness, and duration, though not as having locations. Rather, we hear ordinary events and objects as located and as the generators or sources of audible qualities that lack spatial properties entirely. We do not mistakenly perceive the locations of sounds, we simply fail to perceive their locations. This is the Wave theorist’s best way to avoid the dilemma. She says sounds are not heard to have locations, they are heard to have located sources. The picture is this: sounds are waves; waves have sources; sounds are heard to come from their sources, but not themselves to have locations; only sources are perceived to have locations. This description provides a compelling account of the phenomenology that is consistent with the Wave View. Unfortunately for the Wave theorist, it fails. To see why it fails we need to consider just how a sound can seem to come from a location. The problem is that sounds themselves must be heard to have particular locations for sound sources to be heard as located. For the Wave theorist, the basic audible qualities are qualities of sounds, and sounds are waves. Thus, waves have the audible qualities. The response we are considering is that we hear objects and events as located by way of the sounds they generate. But we cannot hear just non-located audible qualities and located objects, full stop. This would amount to a precarious perceptual situation. How could non-located qualities provide locational information about their sources in perception? One way is for locational information to be encoded temporally, for example, by time delays between waves reaching the ears. This information must be conveyed somehow in conscious perception. In audition this is accomplished by awareness of the audible qualities of pitch and timbre, and their loudness and duration. If these are the aspects of sounds we experi- CHAPTER 1. SOUNDS AND EVENTS 6 ence, and if they are to convey information about the locations of material objects and events, then sounds themselves, and their qualities, must be heard as located. If the Wave View is correct, the location illusion remains. A distinction can be drawn between hearing sounds themselves as located and perceiving information about the locations of material objects, stuffs, and events in the environment by means of audition. The question is whether the latter would be possible without the former. If hearing sounds and their qualities as located is required in order to perceive or form judgements about the locations of material objects and events in audition, then the Wave View is forced to say that auditory perception is in one central respect illusory. Ordinary introspection of one’s auditory experience supports the claim that we indeed hear sounds as located. Imagine an experiment in which the engineers mentioned above generated a pure 1000 Hz tone of moderate loudness that seemed to be about ten feet directly ahead of you in the middle of the air. You are allowed to open your eyes and walk around, but you can neither see nor feel nor smell anything in the sound’s vicinity. Your experience would be of just a sound and its qualities, and not of any other objects, stuffs, or events located where it seems to be. The sound is heard as an independent existent located in space. Though most ordinary auditory experiences are of sounds generated by events and things, these sounds are heard as being entities distinct from their sources. Sounds are heard to have locations, by way of which they provide perceptual information about the locations of their sources. 9 If the phenomenological claim is an accurate description of the experience of sounds, and if it is true that in order to perceive the locations of sound sources, audible qualities and sounds themselves must be perceived as located, then either the Wave View attributes widespread illusion to auditory perception or the Wave View is false and sounds are not simply waves in a medium. Short of accepting and explaining the illusion, the Wave theo9 Another way we might imagine the situation is as follows. We hear sounds, but not as located in any way. We learn the locations of sound sources by an unconscious process something like what is referred to as ‘blindsight’ by perceptual theorists (see, e.g., Block (1995)). So, the locational information is represented through a course of subconscious perception. Our perceptual awareness of the connection between sounds and sources must then be explained. If this can be done and the description we are considering is correct, my claim that audible qualities are consciously heard as having locations must be false or due to widespread illusion. I think it is manifestly correct that audible qualities are heard as having locations, and that this is demonstrable by various behavioral and introspective tasks. Widespread illusion is precisely what the Wave theorist is trying to avoid. It is preferable to explore alternative conceptions of what sound is than to go to such lengths to make the theory fit with the phenomenology. CHAPTER 1. SOUNDS AND EVENTS 7 rist’s best strategy is to impugn my description of the phenomenology. She should say that sources, not sounds themselves, are heard as located. This requires rejecting the argument that in order to hear sound sources as located, sounds must be heard as located. The blindsight analogy raised in footnote 9 is one route around the dilemma for the Wave theorist shy about claiming that we fail to perceive, or systematically misperceive, the locations of sounds. I am convinced that the phenomenological claim is correct as it stands and that sources are heard as located only if sounds are. So I need a way to avoid this difficulty. My theory must not imply that sounds move through the air. The Property View is tailor-made to capture the phenomenology of locatedness. It, however, falls to a separate objection. 1.4 The Argument from Vacuums The Property View says that sounds are properties of things like bells, tuning forks, and whistles; more specifically, sounds are the vibrations of material objects. The view entails that sounds are roughly where we perceive them to be. Unfortunately, the Property View also entails that sounds can exist in the absence of a transmitting medium. That is, sounds can exist in a vacuum (just as things can have colors in the dark), since all that’s required for an object to have a sound is that it vibrate in the right way. Nevertheless, we have good reasons to believe that the existence of a sound requires a medium. If there can be no sounds in vacuums, the Property View is false. 10 In Berkeley’s first dialogue between Hylas and Philonous, Hylas argues against the Property View in favor of the Wave View by deploying the Argument from Vacuums. It begins with the premise that a bell struck in water or air makes a sound, but in a vacuum it does not. Hylas concludes that sound must be in the medium. PHILONOUS. Then as to sounds, what must we think of them: are they accidents really inherent in external bodies, or not? HYLAS. That they inhere not in the sonorous bodies, is plain from hence; because a bell struck in the exhausted receiver of an air-pump, sends forth no sound. The air therefore must be 10 This argument is independent of my causal argument against the Property View (see “What Sound Is Not: Pasnau on Sound” (ms)). According to the causal argument, the Property View cannot accommodate the fact that sounds are generated by their sources— that is, that there exists a causal relation between a sound and its source. See “What Sound Is Not” for further discussion of why the Property View fails. CHAPTER 1. SOUNDS AND EVENTS 8 thought the subject of sound. [The sound which exists without us] is merely a vibrative or undulatory motion in the air. 11 The argument is: 1. A bell struck in a vacuum makes no sound. 2. So, sound does not exist in the absence of air. 3. So, air is the subject of sound (i.e., the Wave View is true). 4. The Property View is false. Notice a few things. If a bell struck in a vacuum makes no sound, then sound does not exist in the absence of air and the Property View is false. 12 But it does not follow that the Wave View is true. Air might be required for the existence of sound without itself being the subject of sound. Even if its first premise is true, the Argument from Vacuums does not establish the truth of the Wave View. Room exists for an alternative theory of sounds according to which no sounds occur in vacuums. Furthermore, the first premise must be established if it can be used against the Property View. Why say there are no sounds in vacuums? Hylas baldly assumes there are not. We would like to have some reason, preferably independent of an explicit theoretical commitment, for denying (or affirming) that sounds exist in vacuums. A first pass: “When the bell is struck in a vacuum we know there is no sound because none can be heard by any ordinary creatures; if a medium is added, we can hear it, so the sound must require the medium.” Problem: The fact that no sounds are ever heard in the absence of a medium shows only that a medium is required for there to be veridical perception of sounds. It does not show that a medium is necessary for there to be a sound. 13 This is the Property theorist’s wedge in the Argument from Vacuums. Suppose one strikes a bell in a vacuum chamber containing a (hypothetical) perceiver. The perceiver can hear nothing. Our problem is that without a theory of what sounds are we are unable to confirm whether or not there is a sound. Barring a declared theoretical commitment, how do we decide if the bell makes a sound? 11 Berkeley (1975), pp. 171–2. Also quoted in Pasnau (1999), p. 321. Air is meant to be representative of any medium in which sound waves travel. 13 Berkeley, of course, had reason enough to conclude that there are no sounds in vacuums, since he accepted that nothing exists unperceived. 12 CHAPTER 1. SOUNDS AND EVENTS 9 Perhaps because the bell struck in a vacuum is not a possible object of auditory experience, it does not make a sound. Though I see no good reason to deny that there are sounds beyond the ken of perception, this argument gets us closer to what we are looking for. A sound, if anything, is the bearer of the properties of pitch, timbre, and loudness. Suppose we could establish that there is neither pitch, nor timbre, nor loudness when the bell is struck in the vacuum. We could then reasonably conclude that there is no sound. The bell struck in a vacuum has no sound because it has none of the qualities necessary for the existence of a sound. The sound of a bell seems to have different qualities when the bell is struck in air and water, and different ones yet in helium and liquid mercury. When the very same striking event occurs in a vacuum, it is inaudible. If a sound exists in a vacuum, it must have some definite pitch, timbre, and loudness. What loudness, for example, does it have? The loudness it would have been heard to have if it had been surrounded by water? The loudness it would have been heard to have if it had been surrounded by air? A decision here will be to a significant extent arbitrary and will not reflect the relevant ways in which the loudness of a sound depends upon the medium in which it is generated.14 The Property theorist might hope that ideal or standard conditions for perceiving the “true sounds” of things can be formulated as they can for colors.15 But the way in which daylight or white light might be counted as ideal for revealing the true colors of things finds no analog in sound. Neither air nor water nor helium does a substantially better job divulging the subtle vibrations of an object in the way that full-spectrum light reveals the reflective properties of a surface. If there is no ideal medium in which to hear the true sound of an object, that is, if the qualities of a sound depend upon the medium in which it is generated, then it is doubtful whether the object vibrating in a vacuum has a pitch, timbre, or loudness. It is therefore doubtful whether there is any sound. The medium-dependence of audible qualities shows that we are justified in drawing a stronger conclusion than that some necessary condition for sound perception is missing in a vacuum. A necessary condition for there to be a sound is missing.16 If sounds do not occur in vacuums, the Property 14 See §A.4 for further discussion of this dependence. Pasnau (1999) appeals to just such a hope, p. 322. 16 Might we say that sounds are properties that objects have only when in the presence of a medium, and thereby save the Property View? The sound property assigned to the object must, however, depend on the specific properties of the particular medium surrounding it in order to avoid the objection raised in the text. The sound differs only when the medium differs. This is no longer the Property View. It is a relational view 15 CHAPTER 1. SOUNDS AND EVENTS 10 View is false. Moreover, we showed in §1.3 that the Wave View entails systematic illusion about where sounds are. The Event View, however, is a natural alternative that attributes the right locations to sounds and does not entail that sounds exist in vacuums. The Event View is that particular sounds are events in which a medium is disturbed or set into wave-like motion by the movement of a body or interacting bodies. These disturbance events take place where we perceive sounds to be, and, because no medium is present to be affected, a vacuum contains no sounds. 1.5 The Event View Particular sounds are events.17 Sounds take time and involve change—at a minimum they begin, and usually they end. A number of qualitatively different stages or a single tone of uniform loudness may compose a sound. The sounds are the events in which a medium is disturbed or changed or set into motion in a wave-like way by the motions of bodies. Events such as collisions and vibrations of objects cause the sound events. Among their effects may be sound waves propagating through a medium and the auditory experiences of perceivers. Medium-disturbing events are what we hear to have particular pitch, timbre, loudness, and location. A body counts as in a state of sounding—making a noise—just in case it is in the midst of generating or causing a particular sound. Whenever there is a sound there is a sounding. The tuning fork struck in air is a simple case. The striking is an event that “makes” a sound in virtue of the process by which the arms of the fork oscillate and create regular compressions and rarefactions in the surrounding air. Its creating the disturbance constitutes the tuning fork’s sounding. The event of the tuning fork’s disturbing the medium is the sound. We perceive this sound event to have a constant pitch and timbre, a duration, a location, and diminishing loudness. In contrast, the sound of an owl’s call is a more complex event characterized by a temporally extended pattern of changing pitch, timbre, and volume. Each call sounded is an event consisting of the involving object and medium which is closer to the truth about sound, not the view that sound is an intrinsic property of objects. 17 My theory of sounds as events should be relatively insensitive to what particular theory of events is the correct one (e.g., Lewis (1986), Davidson (1970), Kim (1973)). Within reason, whatever events turn out to be, sounds should be events. Accordingly, I wish to work with the intuitive notion of events as particulars which take time, occur over an interval, and may or may not essentially involve change. CHAPTER 1. SOUNDS AND EVENTS 11 owl’s lungs and syrinx disturbing the surrounding air in a given pattern. The tuning fork and the owl alike are recognizable by the sounds they create. Auditory perception also makes us aware of events in our environment. We learn by audition how the furniture is arranged and when it is being moved. How is this possible if sounds themselves are the events that we hear? The Event View says that a sound is an event whose cause is the event heard to “have” or “make” the sound, and implies that the sound and its cause are in close spatio-temporal proximity. When we hear the sound of a glass breaking, that sound is an audible event constituted by the fracturing glass affecting the air. The breaking of the glass causes the medium-affecting event that is the sound event. This latter event has the audible qualities of pitch and loudness. The medium-affecting event is near the breaking event, but the two do not occur in just the same space-time region. That the two sorts of events occur close to each other, however, does not sufficiently explain why we are aware of sound generating events in auditory perception. A sound also carries qualitative information that can be used to identify its generating event after perceivers learn to associate the sound with the cause. The sound’s pattern of pitch, timbre, loudness, and duration indicates that a glass has broken; the location of the sound points us in the direction of the mess. So, there is the event of an object or substance setting a medium into periodic motion. This is a sound. The kind of motion depends on the form and makeup of the object or substance, what it does to disturb the medium, and the physical characteristics of the medium itself. The sound event has a location and a pattern of pitch, timbre, loudness, and duration. There are also the generating events that cause sounds and the objects that are said to make a sound in virtue of instances of their sounding. Sounds are individuated along three primary dimensions: causal source, spatio-temporal continuity, and qualitative change. Intuition is sometimes silent, but we do have implicit in our practices principles for saying when sounds are the same or different. The Event View captures these principles. To count as the numerically same sound particular, a candidate must have the very same token causal source and be spatially and temporally continuous throughout its entire history. If either the causal source changes or there is a spatial or temporal discontinuity, we say that there are different relevant sound particulars—a temporally seamless transition from a trumpet playing B-flat to another trumpet playing the same note counts as involving two different sound tokens. The sense in which the sound from a single trumpet is different when it seamlessly goes from playing a B-flat to an A is that CHAPTER 1. SOUNDS AND EVENTS 12 the trumpet’s state of sounding is different. Perhaps different sound events correspond to these different states of sounding at the two times. Still, there is one sound event of which each note instance is a part, and in this sense both are parts of a single continuous sound. Such a sound might extend over considerable time and space and change greatly in its qualitative characteristics. At times it may be loud and high-pitched and at others it may be faint and low, but as long as it has the same causal source in terms of its generating event or object, and is spatio-temporally continuous, it counts as the very same sound particular. It follows that numerically distinct instances of sounds that fall under the same qualitative characterization are not the very same sound in any sense stronger than qualitative identity. Temporally discrete sounds from the same causal source and spatially separated sounds generated by different sources can at best be different instances of the same qualitative sound type. Philosophers sometimes do, however, speak of performances of songs and symphonies as events that are tokens of sound types, despite the fact that they need be neither temporally nor spatially continuous—they may incorporate periods of silence and multiple sources. We can say the same of bird calls. But these are complex events that involve patterns of individual sound events when they occur. Distinct sound particulars are arranged to comprise a whole that may require or allow for discontinuities of various kinds. The ontology of music and complex sound universals enjoys its own vast literature. What I want to point out is that the Event View is capable of capturing the ways in which we take sounds to be individuated. The principles I have mentioned may be disputed; but there is often obscurity about how events are to be individuated. The Event View, in that case, predicts— correctly—that there is a certain amount of obscurity and arbitrariness in our verdicts concerning how many sounds we have heard. 18 The Event View is a natural way to avoid the objections posed in §1.3 and §1.4. Particular soundings have audible locations determined by where the medium-disturbing process occurs. Sounds, then, move through space in just those ways we expect them to, for example, when a train passes in the distance. The subject-directed missile-like sound does not ordinarily arise. The Event View also accounts for what we learn about sound from the Argument from Vacuums: we are justified in claiming that a medium is necessary for there to be a sound. Since a medium is required for there to be a medium disturbance, there is no sound in a vacuum. The Event theorist 18 Thanks to Gideon Rosen for disputing several of my sound individuation principles and helping me to see this last point. CHAPTER 1. SOUNDS AND EVENTS 13 maintains that sounds are neither entirely in the surrounding medium nor simply properties of objects. If the arguments against the prevailing views are compelling, the Event View is a theoretically cogent solution. Several lines of objection force elaboration of the Event View. The Event View provides for natural accounts of several phenomena that pose difficulties for any theory of sound. 1.6 Transmission So far, I have said that a sound is an event of a medium’s being disturbed or set into motion in a particular way by the activities of an object, body or mass. But this seems too lenient, to allow too many sounds. Consider the following two forms of objection. First Form Suppose you are underwater and hear the sound of something that happens in the air above, say, the striking of a bell. The Event View seems to imply that there is a sound at the interface of the air and water since indeed there is a medium-affecting event there. The air, a mass or body, sets the water, a medium, into motion. This is phenomenologically inaccurate. We do not hear the sound to be at the surface of the water; we hear it to be above in the air.19 Second Form The preacher outside Marx Hall is loud. When I shut the window I do not hear him as well. The window muffles the sound. Nevertheless, the window also sets the medium inside the room into motion. According to the Event View, is the sound located at the windowpane? We do not hear it as being there—the sound still seems outside. In both forms, sound waves generated in one medium pass into another kind of medium. The first describes travel across a single interface; the second involves travel through a solid barrier. In each, at the relevant interface— the air-water interface in the first case and the window-room interface in the second—the motion of a body disturbs the medium it adjoins. Yet since we do not ordinarily take ourselves to hear sounds at such places, intuition 19 Here, and in the discussion that follows, I will ignore misperception of the sound’s location that results from refraction. This does not affect the arguments and conclusion. CHAPTER 1. SOUNDS AND EVENTS 14 has it that no sound occurs at either the interface or the barrier. Must the Event theorist count these events as sounds? 20 The problem of transmission is not unique to the Event View. Each of the views canvassed faces a version of the objection. The Property View is in roughly the same straits as the Event View. The Property View implies that the sound is a property of the air mass in the first case and the windowpane in the second, since each vibrates at a particular frequency and amplitude. Even the Wave View, on which sounds are waves, faces a dilemma. What is the source of the sound? Is it the bell or the air-water interface? The preacher or the window? Each is in a sense the cause of the waves “in the medium” in which the sound is heard. An acceptable version of the Wave View must acknowledge that we perceive locations in auditory perception, even if these are the locations of sound sources. Just as the Event theorist needs to say which events are the sounds, the Wave theorist must say which things count as sources of sounds. Though the problem is not unique to the Event View, the Event theorist owes an account of sounds and transmission. The Event theorist’s options are: (a) deny there is a sound where transmission occurs and explain why the Event View does not entail that there is; (b) accept that sounds accompany transmission events and reconcile this with the intuitive description of the experience. Contrary to first appearances, option (b) is somewhat attractive. 21 Ultimately, however, this re20 In fact, there is a slightly more troubling extension of this objection. Sound waves are transmitted by means of collisions among the particles of a medium. If each one of these collisions is an event in which an object (a particle) disturbs a medium, then the Event View seems to imply that each of these events is a sound. We might respond that these molecules do not count as objects in the ordinary sense enlisted in the conception of a sound. This is unsatisfactory. We are discussing ontology, and must not artificially limit the scope of what counts as an object in order to suit the purposes of our favorite theory. At any rate, this extension illustrates the potential extent of the difficulty that sound wave transmission poses for the Event theorist. 21 Suppose we accept that what occurs at a transmitting surface or barrier is a sound— that the medium disturbance is a genuinely audible event. Granted, this event is caused in very different way from the primary sound produced by an event such as striking a bell or shutting the door. It comes about because sound waves reach a medium with a considerably different density and elasticity and transfer a portion of their energy to that medium (the sound transmission coefficient of a medium is a constant that depends on the density, mass, and elasticity of the material. The portion of wave energy transmitted from one medium to another is a direct function of the degree of similarity between the sound transmission coefficients of the two materials). But what occurs at the transmitting surface or barrier is still notable from the point of view of the Event theorist. The surface’s activity is the proximal cause of motion in the surrounding medium that culminates in auditory experiences. Two puzzling facts about auditory perception in such cases remain. First, transmitted sounds seem to come from where the original, or primary, sound event CHAPTER 1. SOUNDS AND EVENTS 15 sponse and the burgeoning world of sounds it proposes are unsatisfactory. It strains the imagination to suppose that a multiplying of sounds occurs each time sound waves travel across an interface or through a barrier. Our accounting should be more sober. Suppose we deny that a sound occurs when a “new” medium is disturbed by a pre-existing sound wave. Option (a) suggests a different way to conceive of the perceptual situation. We say that the interface or barrier distorts our perception of the primary sound’s location and qualities, not that we perceive a secondary sound with its own location and set of qualities caused by the primary sound. A single sound exists above the water or outside the window, but one may not have an ideal experience of that sound if impediments to perception intervene. This picture is more accurate from a phenomenological standpoint. The indications are that we have a perceptual bias toward the locations of sound generating events, that is, cases in which sounds are caused to exist by events of the everyday material sort. Auditory perception makes us aware of sounds produced by events such as doors shutting and waves breaking— sound generating events. We hear the sound created by the striking of the bell above water, and the sound of the preacher proselytizing outside the takes place. The sound of the preacher does not ordinarily seem to be at the window; it seems to be outside. If there is in fact a sound at the window, it must preserve cues about the location of the initial sound. We perceive sounds as localized thanks to information contained in sound waves. When the waves from a source arrive at a surface or barrier, most of the information they carry about the direction and distance to that source remains intact when the abutting medium is set into motion. The result is a sound perceived to be located at the primary source, not at the transmitting barrier. If sounds occur where transmission does, the secondary sound preserves the primary sound’s location information. This accounts for the misperceived locations of transmitted sounds. Second, we classify sounds according to the types of events that we perceive to produce them. Yet sounds do not ordinarily appear to be produced by collisions between sound waves and surfaces or barriers. The primary sound, however, determines the purported secondary sound’s qualities. Though less loud, the supposed secondary sound’s qualitative and temporal profile would be roughly that of the primary sound. The secondary sound, which is caused by waves from the preacher’s voice reaching a window, would thus resemble the primary sound. Given the preservation of location cues and the qualitative similarity of primary and secondary sounds, secondary sounds would be carriers of information about occluded primary sounds and would not be perceived to arise from events that occur at the window. This view attributes a great deal of illusion to auditory perception. The previously mentioned attraction is that it makes being a sound an intrinsic property of events. If being a sound is being a medium-disturbing event, then what makes something a sound is just that it has the physical makeup required of such a disturbing. There are not further extrinsic requirements stemming from the event’s causal or temporal relations. If a window interacts with a medium to produce a disturbance, that window makes a sound no matter how its activities arose. CHAPTER 1. SOUNDS AND EVENTS 16 window. Events of transmission occur when the waves from one sound event cause motion in an object or body that is passed on to another medium. We do not hear events of transmission or indeed anything at their locations when we hear a sound beyond an interface or through a barrier. The language of this distinction suggests a theoretical solution consistent with the Event View. To speak of a sound as generated by a source implies that the sound is caused by and distinct from the event that brought it about; we also speak of sound waves being generated by sound producing events. The idiom suggests that neither the sound nor the sound waves exist in any form prior to the event of generation. In contrast, the idiom of transmission applies to an event that is involved in passing along a wave disturbance that already exists. When a transmission event causes a medium disturbance of the sort that poses trouble for the Event theorist, both events depend for their existence upon a prior sound event. The distinction between medium disturbing events that introduce sound waves into an environment and those that transmit them is natural and based on the events’ roles in a regular causal network. The medium disturbing events that are the sounds are the events in which a wave disturbance is introduced into the environment by the activities of some material object, body, or mass. Disturbance events in which a prior sound’s waves are passed on or transmitted into a different medium are not in any ordinary sense sounds. An event of transmission must have a sound in its causal history, and a (generated) sound need not. Being a sound is a matter partly of occupying a particular causal-functional role. A central feature of the causal role distinguished by how we speak is that sounds are events caused by generating events such as collisions, but are not caused by waves simply passing through barriers and interfaces. Sound generating events such as cymbal collisions cause or generate sounds. Sounds are events in which a disturbance is introduced into a surrounding medium, that is, they are events in which an object actively disturbs a medium. The sound waves that result from the sound event, which are also in a sense generated, are transmitted through the surrounding medium and the materials and objects it contains. Events in which the motion of sound waves causes barriers and intervening matter to set adjacent media into motion are not events in which a disturbance is introduced into the environment. Such transmission events are among the ordinary inaudible activities of sound waves that carry information about the initial CHAPTER 1. SOUNDS AND EVENTS 17 sound and propagate through various materials. Transmission events do not ordinarily give rise to sounds.22 This account appeases intuition. The problem of deciding which of multiple sounds we listen to when sound waves pass through an interface or barrier does not get off the ground. But the “innocent” picture according to which being a sound is entirely a matter of what happens near the surfaces of objects whose activities affect a medium is threatened. We must adopt a broader perspective that acknowledges the causal relations of several distinct kinds of events. This is not cause for alarm, nor is it a surprise given the organization of sound-related experience. Sounds furnish us with awareness of sound generating events, which are of paramount interest for what they tell us about the world. They tell us such things as how the furniture is arranged and when it is being moved. Transmission events, however, enjoy little utility beyond what we learn through their effects on how we perceive the primary sounds they occlude: when we perceive a sound as muffled, we learn that a barrier may intervene. Given, first, our interest in ordinary events that take place among material bodies, and, second, how these events are related to sounds, it is no wonder that the primary disturbances should be distinguished by audible qualities. 1.7 Destructive and Constructive Interference As commonly demonstrated in high school physics classrooms, sound waves interfere with each other. Suppose you are in an anechoic room in which two tuning forks tuned to E above middle C are simultaneously struck. As you move around the room, there are places from which you hear the sound to be soft and places from which you hear the sound to be loud; there are places from which you hear neither sound. This phenomenon occurs because at any time the total pressure at a point in the room equals the algebraic sum of the pressures of all the sound waves at that point. It is therefore possible, when sound waves are out of phase with each other, for the total pressure at some point or in some area to remain constant while separate sound waves pass through that point or area simultaneously. A listener positioned at such a point hears nothing. When 22 I say “ordinarily” in light of the following sort of case. Suppose sound waves reach a barrier and induce vibrations in that object. The barrier might then itself generate a sound in addition to the sound whose waves induced the barrier’s vibrations. This is not an ordinary case of sound wave transmission, however, and should be subsumed instead under resonance. Resonating is sounding since the resonating object actively disturbs the medium, and does not merely passively transmit existing sound waves. CHAPTER 1. SOUNDS AND EVENTS 18 sound waves cancel, the interference is destructive. Likewise, when the waves are completely in phase at a point, the total pressure varies with the sum of the components’ amplitudes. The sound seems twice as loud as either tuning fork at these points thanks to constructive interference. Altering the phase or vibration characteristics of one of the tuning forks may result in beating, a periodic variation in perceived amplitude from a particular point. Here is the problem. Take the example of complete destructive interference described above. The Wave theorist can explain that you hear no sound from where you stand because the pressure is constant at that point and hence there is no sound.23 By contrast, the Event View implies that each tuning fork makes a sound even though you hear neither one from the point of interest. If sounds are not sound waves and the Event View is correct, then you hear no sound at all when there are two. Is the gap a fault line in the Event View? Interference phenomena do not undermine the Event View. The interference arguments do show that waves carry information about sounds. The Event theorist should not deny this when he says that the sound is not identical with the waves. Waves can be involved in the process by means of which a sound is heard without the sound’s just being the waves. The Event View provides an intuitive and compelling alternative to the standard account of destructive interference. The Event View says there are two sounds, two events of a disturbance being introduced into a medium. These disturbances travel as compression waves and may reach a perceiver, where they cause perceptions of the original sound event. Waves obey the principles of interference, and if no variations in pressure exist, no sounds are heard. Ordinarily, a lack of pressure variations indicates the absence of sounds and sound sources. Complete destructive interference resembles the absence of sounds because factors conspire to create nodes where the pressure does not vary. These factors include the spatial arrangement of the two sources, the frequency and amplitude at which the sources oscillate, and the temporal relations among the activities of the sources, i.e., the phase difference of the sources. A perceiver located at a node will hear neither sound, and may believe that no sounds occur. This does not entail that the room contains no sounds. The observer is simply unable to perceive the sounds because of her particular point of view. 23 Of course, there are still in a sense two waves passing through that area, but their summed amplitude is zero. So, in a sense, there are two sounds at that point even though none is heard. The Wave theorist does not escape entirely. If, however, by ‘the wave’ we mean something that depends only on the total pressure at various points, there is no wave and no sound at the point of interest. CHAPTER 1. SOUNDS AND EVENTS 19 That there are indeed two sounds can be confirmed in several ways. One can move to a point where one or the other sound is audible, move one or both of the sources so that the nodes are shifted, alter the phase difference in the vibrations of the two objects to remove nodes completely, or simply remove one of the sources to eliminate interference entirely. These exercises show that each tuning fork makes a sound that can be heard independently of the other in the right circumstances. Sometimes, however, another sound’s presence can interfere with perceiving a given sound. Experience need not reveal from a particular vantage point all the surrounding environment’s sounds. What we perceive from a very limited vantage point need not be the entire story about sounds. The case of constructive interference is very similar. Due to the spatial and temporal relations among events of sounding, a perceiver in the right location may experience multiple sources to have greater loudness than any single source present. This is again the result of the additive properties of sound waves. It is less surprising that the subject’s loudness experience should increase in the presence of two sources than that it should decrease, as in destructive interference. Beating is perhaps less intuitively comprehensible, but is also an explicable result of how the source events are arranged in time and space, and of the subject’s vantage point on these events. 24 1.8 Concluding Remarks The Event View replaces the picture according to which sounds fill the air and travel as waves. Instead, sounds are events that occur where objects and bodies interact with the surrounding medium. Sounds are events that take place near their sources, not in the intervening space. Sound waves travel through the air carrying information about these distal events, and are the proximal causes of sound experiences in subjects; however, sound waves are not sounds. The revision more accurately captures how we experience sounds to be. 24 Experiences related to interference phenomena can, on the positive side, provide information about the sounds actually present that would be difficult to obtain otherwise with unaided perception. Experience can tell us when two pitches are identical or if two sounds are exactly in phase when such facts are beyond the scope of ordinary perceptual discrimination. We tune a guitar by eliminating the beating between the open A string’s note and the A of a pitch pipe. The two are in tune when the sound does not waver. Furthermore, one can tell that two sounds are exactly in phase when their combined loudness peaks. CHAPTER 1. SOUNDS AND EVENTS 20 The Event View is a natural account of what sounds are that avoids the dilemma concerning where sounds are located. It implies that sounds are distally located and stationary relative to their sources without making them solely the properties of material things. We should not accept the view that sounds are properties of objects because we have good reason independent of the received view to think that sounds cannot exist in vacuums. The event that the Event theorist identifies as the sound cannot occur in the absence of a medium. Taking sounds to be particular events of objects disturbing a surrounding medium also furnishes a unified picture of what counts as a sound in cases that pose problems for any such theory. Sounds do not occur at barriers where transmission takes place. The phenomena accompanying constructive and destructive interference arise because of the spatial and temporal relations among sound sources and because information about sounds is transmitted by waves. In the next chapter, I develop an account according to which hearing an echo is hearing with distortions of place and time. The Event View entails no mysteries about sounds and sound experience. 25 25 Thanks to Gideon Rosen, Sean Kelly, Simon Keller, Jeffrey Speaks, David Lewis, and Scott Jenkins for comments and discussion of these issues. Chapter 2 Echoes 2.1 Introduction Suppose you are at a fireworks display. You stand in an open field with a single large brick building behind you. A colorful bomb’s recognizable boom follows on the heels of its visual burst, but a moment later the boom’s echo sounds at the brick wall behind the field. You have just heard a primary sound followed by its echo. I have argued that sounds are events. In particular, a sound is the event of an object or interacting bodies disturbing a surrounding medium in a wave-like manner. According to this Event View, the sound just is the disturbance event. A theory of sound should provide an account of echoes and echo perception. Echoes pose two potential problems for the Event View. First, echoes appear to be distinct sounds located at reflecting surfaces. But since the brick wall, for instance, merely reflects sound waves and does not actively disturb the surrounding medium, the Event View appears to have no sound to identify as the echo. Second, if the existence of echoes shows that sounds themselves travel and can be re-encountered, then sounds are not the events I have suggested. I shall argue that echoes are not distinct from primary sounds, but also that the case of echoes does not show that sounds travel. The account of echoes and echo perception that I offer is that hearing an echo is hearing the primary sound, but with distortion of place, time, and qualities—hearing an echo is thus like seeing an object with a mirror. The experience of the echo and of the primary sound seem to have distinct objects because we ordinarily perceive events in their entirety only once. Sounds need not travel since that by means of which we hear them does. 21 CHAPTER 2. ECHOES 2.2 22 The Event View of Sounds Recall the motivations for the Event View. The received view of physicists and auditory scientists, which I have called the Wave View, is that sounds are longitudinal pressure waves that propagate through a surrounding medium. My central argument against the Wave View also provides a positive reason to recognize as the sound the particular event that I identify. The argument rests on the phenomenological claim that sounds themselves, and not just their sources, are perceived to be located. Ordinarily, they are heard to be located at a distance and in a particular direction. That sounds, not just sources, are heard to be located can be appreciated in several ways. First, sounds, if anything, are characterized by the audible qualities of pitch, timbre, and loudness. Reflection upon experience indicates that the bearers of audible qualities are themselves distally located. Second, we form perceptually-based beliefs concerning the locations of material things and events on the basis of auditory experiences. Such beliefs are made possible, in part, by impressions of the sounds things and events produce. If information about the locations of things and events is to be gleaned from hearing sounds, then either sounds must be heard to be located roughly where those things and events are, or sounds must be heard to come from such locations. How are we to take talk of sounds’ being heard to “come from” a location? It might be that sounds are heard to come from a particular place by being first heard at that place, and then being heard at intermediate locations. This is not the case with ordinary hearing. Sounds are not heard to travel through the air as scientists have taught us that waves do. Sounds themselves are heard to be located. The sense in which sounds do “come from” particular locations is that they have causal sources in those locations. If auditory experience is not systematically illusory with respect to the perceived locations of sounds, and if auditorily-based beliefs about the locations of things are possible, then the Wave View is false. On the other hand, Robert Pasnau has recently defended what I have called the Property View, according to which sounds are properties of the objects ordinarily taken to be their sources. 1 Pasnau’s Property View fares better with respect to the perceived locations of sounds, but fails to accommodate the fact that sounds are generated by their sources—that is, that there exists a causal relation between a sound and its source. 1 Pasnau (1999) and (2000). CHAPTER 2. ECHOES 23 It also entails that sounds can exist in a vacuum. I have argued that a descendent of Berkeley’s Argument from Vacuums can be developed which shows that a surrounding medium is a necessary condition on there being a sound. The perceived sound of a tuning fork struck in air differs from the perceived sound of the same tuning fork struck in water. In helium, the sound seems different still. What is the sound of the tuning fork in a vacuum? Since there is no analog for sound of white light’s or daylight’s normative significance in revealing the surface properties of a thing—no single medium is substantially better than a host of others at revealing the way an object vibrates—there is no medium that has any claim to reveal the “true sound” of the tuning fork. There is then no principled way to decide what audible qualities the tuning fork has in the vacuum. Since the qualities of the sound depend upon the medium in which it is made, and since any decision about which audible qualities the tuning fork has in a vacuum will be largely arbitrary, we are justified in concluding that the tuning fork has no sound in the vacuum. The medium-relativity of audible qualities indicates that a theory of sound should entail that sounds cannot exist in a vacuum. My Event View identifies the sound with the disturbing of the surrounding medium by the activities of an object or interacting bodies. Information about sounds is transmitted by means of pressure waves, but the sound is not the waves. The Event View captures the locatedness of sounds and preserves the causal connection between sounds and their sources. It also entails that a surrounding medium is necessary for there to be a sound. I have provided accounts of wave transmission through barriers and of constructive and destructive interference. 2 Now I will address echoes, a special case of sound wave reflection. 2 When sound waves pass through barriers, no new sound occurs since only a preexisting wave disturbance is passed on. This is not a case of sound waves being created by activities that do not amount to sound waves. During destructive interference, sounds that are actually present in the environment are not heard because of the way information about sounds is transmitted, i.e., by waves that undergo interference effects. Complete destructive interference mimics the situation in which there is no sound at all (and therefore no sound waves). Constructive interference results in experience as of a sound that is louder than any sound actually present. This, too, results from the properties of sound waves. It is more intuitively palatable, however, that two sounds should present themselves as a single sound of greater loudness. CHAPTER 2. ECHOES 2.3 24 The Problem of Echoes An account of echoes and echo perception should say what an echo is, explain the distinctive phenomenology of echo experiences, and respect scientific descriptions of the physical and perceptual processes involved. When you hear the sound of the firework and its echo, certain features of the experience are distinctive. First, you hear the echo after you hear the primary sound.3 Next, you hear the primary sound to be located near the explosion itself, but you hear the echo to be located near the reflective brick wall. Though the echo appears to be a distinct sound, investigation (perhaps mere visual awareness) reveals no sound source at the brick wall. Nothing at the reflecting surface generates the sound. Finally, the echo and primary sound ordinarily have similar qualities and duration. The degree of distortion depends on the qualities of both the sound and the reflecting surface. Now, when the firework explodes, the surrounding air is disturbed, and pressure waves travel outward toward you and toward the brick wall. When the waves reach you, they contribute to your experience of the primary sound. As waves reach the brick wall, an elastic collision takes place and the wall re-directs the waves. These re-directed waves reach you and produce the experience as of a second sound distinct from but somehow related to the initial sound. The trouble for the Event View is that a mere elastic collision occurs at the brick wall. The brick wall does not introduce a disturbance into the surrounding medium in virtue of its own activities. It simply gets in the way of sound waves that are already present. The Event View appears to have no disturbance event to identify as the echo. If, however, sounds are particulars that persists and travel through the medium, a simple resolution exists. Hearing an echo after a primary sound is hearing the very same sound particular at two different stages of its continuous career. This, however, is incompatible with the Event View, according to which sounds are events whose locations are stationary relative to their 3 This time gap is essential to enjoying the distinctive echo experience. When secondary sound waves arrive at the ears less than about 50 ms after primary sound waves, the result is an experience as of a single sound located between the two wave sources. Between roughly 50 ms and two seconds, the result is experience as of a primary sound and an echo. When the arrival delay is greater than about two seconds, experience is as of two separate sounds that are entirely unrelated. I will focus on cases in which the echo experience occurs and hope to generalize. CHAPTER 2. ECHOES 25 sources. If the case of echoes shows that sounds themselves travel and can be re-encountered, then sounds are not the events I have suggested. 2.4 The Solution The echo just is the primary sound. According to the Event View, the echo is the primary sound event perceived with distortion of place, time, and qualities. The illusion is due to the behavior of sound waves and how sounds are perceptually localized. Sound waves, which transmit information about sounds, are the proximal causes of auditory experiences. Their direction of onset determines the perceived location of a sound, and their rate of travel results in the delayed echo experience. Filtering and dispersal at the reflecting surface account for differences in the echo’s perceived qualitative profile. Hearing an echo, then, is hearing a primary sound. You hear the primary sound, then after a short delay you hear the sound qua echo. What you hear in each instance is the very same sound—the primary sound. The primary sound is not an object-like particular that travels through space. It is an event that occurs only once at the location of the sound generating event. The traces of the primary sound—its sound waves—travel and encounter reflecting surfaces. When the waves return, an appropriately situated subject will have an experience as of an echo, a seemingly distinct (though somehow causally related) sound located in the direction of the reflecting surface. The subject, however, hears only the original primary sound. This account is analogous to a plausible treatment of seeing objects with mirrors. Mirrors facilitate our seeing the very same objects and events that occur in front of them, albeit with distortion of place (and perhaps qualities). Likewise, reflecting surfaces allow us to hear the very sounds that occur in front of them, albeit with distortion of place and time. But just as there is no new object that you see when you look in a mirror, there is no new sound that you hear at a reflective surface. Hearing an echo after first hearing the primary sound is, on this account, an unobjectionable sort of re-encounter with the very same sound. The sound event occurs once, say between t b and te . You experience it once between t1 and t2 and again between t3 and t4 (both later than t2 ) because the waves it creates return. The sound neither travels nor returns to you; you experience the same distal event because of the way the event’s traces travel. The situation is something like this. Suppose you hear the sound of a firework. You then travel faster than the speed of the sound waves, CHAPTER 2. ECHOES 26 overtake them, and halt. You then hear the sound again—it seems to be in the same place it was before. We need not say that the sound travels, only that the sound waves travel. Because information about sounds is transmitted through a medium via relatively slow waves, you are lucky enough to experience the same sound event twice. The medium disturbance you hear when you hear the sound for the second time is the very same disturbance event you heard earlier. Echo perception is similar. A reflecting surface, however, saves you the trouble of supersonic travel. You pay the price with distortion of location. Hearing an event that is past is thus like seeing an event that is past. When you see a supernova from across the galaxy, you see that event as it happened long ago. But you experience it to be present—to be taking place now. So your experience includes a temporal illusion. Now, suppose there were big mirrors in outer space. You could then see the very same earthly event twice: once when it happens and once after its traces are reflected. I could watch Game 7 of the 2001 World Series on November 4, 2001, and then watch it again on November 4, 2002, in a mirror located one-half light year away. If the mirror was big enough, I might even think there was a game being played on a far-off planet that looked remarkably like Earth. The case with echoes is a less exaggerated parallel. Hearing an echo doesn’t involve such great distances. Still, the echo experience and the primary sound experience seem not to have the same object. If the echo is the same sound as the primary sound, why do we not recognize them as such?4 The apparent distinctness is due to the nature of events and to how we conceive of them in contrast to objects. If a primary sound and an echo were the same object experienced at different times during one continuous career, we would expect ordinary object recognition and re-identification to occur, given their qualitative similarity. With objects we count on this sort of recognition to ground the perceived continuity of the material world. 5 Events and time-taking particulars, however, are tied to a specific time and place when they occur. Though Game 7 of the 2001 World Series might 4 Of course, we sometimes fail to recognize an object as one we have seen before, but when the object is qualitatively similar on both occasions this requires special circumstances, such as great distances or the type of disorder mentioned in the footnote below. Barring failures of recognitional capacities (including memory), we can usually be made to recognize persisting objects. 5 Capgras syndrome is one form of delusional misidentification syndrome (DMS) in which patients suddenly begin to believe that people and objects familiar to them have been replaced by exact qualitative duplicates. This failure of perceived continuity is notable and debilitating. See Breen, et al. (2000). CHAPTER 2. ECHOES 27 have been located at any of various times and places, it it fact occurred November 4, 2001, at Phoenix, Arizona. That very event cannot occur again or elsewhere. And we implicitly recognize this: similar events experienced at different times and places are taken to be distinct events. So, if we happen to perceive the very same event over again, it should seem like a distinct event.6 Since echo phenomenology arises when the very same event is heard to be at a later time and different place, precisely what we should expect is that the echo should seem distinct from the primary sound. The perceived distinctness of echoes from primary sounds is predicted by the Event View. So the Event View has the resources to count echoes as real sounds, despite the absence of its characteristic disturbance event from the perceived location of the echo. The account relies on securing the correct way to conceive of echoes and echo perception. Upon doing so, we see that the Event View has the right event on offer—the primary disturbance event. 2.5 Are Echoes Distinct Sounds? Given that sounds are stationary relative to their sources, and that echoes seem to be located at a distance from primary sounds, it is natural to say that echoes are distinct from their primary sounds. The account I have provided holds otherwise. What reasons have we to think that primary sounds and echoes are not distinct sounds? In a recent paper,7 Matthew Nudds has argued that if echoes are distinct from primary sounds, then an echo cannot be distinguished from a qualitatively similar sound produced by a different source. We do distinguish between the firework’s echo and a similar bang made by a firecracker tossed out one of the brick building’s windows. This is supposed by Nudds not to be possible if echoes are distinct from primary sounds. This argument fails, so it cannot be used against the distinctness claim. Suppose we accept that echoes are distinct sounds located at reflecting surfaces. Insofar as the primary sound, the echo, and the qualitatively similar sound from a different source are all full-blown sounds, it is no objection 6 What about watching an instant replay? The same event (metaphorically) perceived a second time does not seem distinct. But this is simply a matter of habituation—we are used to seeing instant replays, which we know are representations of prior events. If you have been asleep since the days of entirely live television, you might take a live picture and its replay to be pictures of qualitatively similar events. Thus, we might become used to thinking of echoes as primary sounds heard again with distortions of place, time, and qualities. 7 Nudds (2001), p. 227. CHAPTER 2. ECHOES 28 that we cannot distinguish the echo and the qualitatively similar sound simply on the basis of their intrinsic properties. Neither has intrinsic properties that confer upon it a particular status—i.e., primary sound or echo—qua sound and provide grounds for distinguishing it from the other. But we can distinguish echoes from mere qualitatively similar sounds by their causal relations. Echoes are sounds caused in part by sounds and sound generating events that occur elsewhere. The firework’s echo is a sound at the brick wall that has in its causal history the sound generating event and sound that occur when the firework explodes. The mere qualitatively similar sound, however, has as its cause a sound generating event closer to home, and lacks the history of an echo. The sound of a firecracker that goes off near the brick wall may be mistaken for the echo of the firework’s sound. But it lacks the right sort of causal relation to the firework’s explosion and its sound. 8 8 What is “the right sort” of causal relation? Suppose the firecracker’s sound is causally related to the firework’s sound, perhaps by a sound detector that activates a detonator. Still arguing that the primary sound and the echo are distinct sounds, we say that the echo is a sound that has in its causal history a sound generating event and sound located elsewhere, where nothing intervenes but sound waves that propagate normally. Suppose my striking a tuning fork causes, by no intervening means other than sound wave propagation, a second tuning fork to begin vibrating and sounding on its own. Does the condition fail to distinguish between echoes and resonant sounds? In the case just described, the second tuning fork’s sound is generated by the activities of that tuning fork, even though those activities are caused by sound events that occur elsewhere. But an echo is arguably not generated by the events that occur at its location. An echo is a sound that is entirely generated elsewhere. The reflecting surface makes no contribution to the sound, where this notion is fleshed out in terms of sound energy or a measure on the qualities of loudness and pitch. The resonating object, however, contributes to the sound. According to this proposal, when sound waves are reflected, a sound occurs. The sound does not travel; the sound waves travel. The primary sound generating event causes the echo, and only sound waves intervene. An echo is distinguishable from a qualitatively similar sound since it is generated by events that occur elsewhere and has a sound in its causal history. This is how to argue that primary sounds and echoes are distinct. I do not find this satisfactory. It is a plausible principle that sound generating events do not produce sounds at a distance. If the foregoing were correct, when sound waves from a sound generating event encountered a reflecting surface, that sound generating event would produce a new sound. Such “generation at a distance” might be explained in terms of sound wave behavior, but suggests an ad hoc tailoring of the theory to meet the phenomenology: why should the interaction between sound waves and a reflective surface give rise to an entirely new sound? Now, we should not take too seriously our pre-theoretical verdicts about the causal structure of sound generating events and various types of sounds. It might be that a stable structure that acknowledges echoes as distinct sounds can be devised. The strategy, as I indicate in the text, is not worth pursuing. CHAPTER 2. ECHOES 29 Despite the phenomenological distinctness of primary sounds and echoes, there are still four reasons that together suggest that echoes are not independent sounds that reside at reflecting surfaces. First, awareness of an echo normally furnishes awareness of the event that made the sound. Whether this awareness is direct or indirect derives from whether sounds furnish direct or indirect awareness of such events, not anything special about echoes. Both the sound of the firework’s blast and its echo furnish awareness of the explosion, so if you only heard the echo you would still be aware of the explosion. 9 Awareness of ordinary events by means of echoes is not a deficient way to be auditorily aware of such events. This suggests that the echo just is the explosion’s sound, not a qualitative duplicate that furnishes, as it were, false or indirect awareness of earlier events. Second, we do not attribute dispositions to produce sounds with particular audible qualities to reflecting surfaces. Brick walls and canyons are not disposed to make or have sounds when they reflect sound waves. Such dispositions are attributed to the proper range of what we take to be sound sources. But sounds normally seem to be caused, produced, or generated roughly where they are perceived to be located. When a sound does not seem to be located where there is a sound source, this is likely the result of a physical-cum-perceptual process by which a sound is heard where there is none. Third, what occurs at the reflecting surface is not the introduction of a wave disturbance into the surrounding medium. An elastic collision between the surface and the medium occurs, causing the direction of wave propagation to change. The reflecting body passively re-directs the waves. Since there is no new sound source—on either a Wave View, a Property View or an Event View—it is reasonable to conclude that the echo is not a new sound. Finally, as mentioned in §2.4, an analogy with the visual case of seeing an object with a mirror is compelling. Just as there is no distinct object located at the mirror’s surface, there is no distinct sound located at the reflecting surface. Illusion of place occurs when objects are seen with mirrors, and 9 Consider the following unusual phenomenon. A man is working on your house. When you are in the right part of the house, and the wind is blowing in the right direction, you cannot hear the sound of the hammering on the other side of the house. But, you can hear the echo of the hammering as reflected off the exterior wall of a house down the street. To hear this echo is still to be aware of the hammering. Thanks to Gideon Rosen for this example. CHAPTER 2. ECHOES 30 likewise, distortions of place and time occur when reflecting surfaces enable us to hear the sounds that occur in front of them. These four claims together suggest that echoes are not distinct sounds that occur at reflecting surfaces. The account I have given explains the apparent distinctness of primary sounds and echoes while maintaining that primary sound and echo perception share a common external object. 2.6 Do Echoes Show That Sounds Are Not Events? The observation that echoes are not distinct from primary sounds does not lead immediately to the account of echoes and echo perception that I give in §2.4. Suppose a sound is a particular that persists over time and travels like an ordinary material object. An echo is then a sound that has existed for some time and that has rebounded from a surface. Hearing a primary sound and its echo would then be hearing the very same persisting particular at different stages during its continuous career. 10 If the case of echoes forces such a conception upon us, then sounds are not events and sounds are not stationary relative to their sources, so the Event View is false. 2.6.1 The Argument from Echoes Matthew Nudds has recently picked up this line of thought and offered the following argument against the claim that sounds are events. One of Newton’s minor triumphs in the Principia was a derivation of the velocity of sound. To test his derivation he measured the time for an echo to return from the end of a colonnade in Neville’s Court of Trinity College. Lacking anything resembling a stop watch, he adjusted a pendulum to swing in rhythm with the echoes of successively made sounds (Westfall (1980), pp. 455– 456). Newton used the pendulum to measure the time it took for a sound to travel from its source down the colonnade to a wall and then back again; he heard the particular sound that he had produced a moment earlier reflected back to him. Here we have a simple example of a sound existing even after the event which produced it ceases (we can suppose) to exist, and which moves independently of whatever produced it. It is also an example of a 10 Note that this conception fits nicely with the Wave View, though someone who endorses it need not be a Wave theorist. You might think that sounds travel through space without identifying the sound with the waves. CHAPTER 2. ECHOES 31 single sound being heard more than once: Newton re-encounters a particular sound, as he must do if he is to time its journey up and down the colonnade. Sounds, then, are particulars. Normally we think of events as things that, unlike objects, cannot be re-encountered. The fact that we can re-encounter sounds suggests that they are perhaps best thought of as kinds of objects, rather than as events; either way, not as properties. 11 What I shall call the Argument from Echoes is this: 1. Newton measured the speed of a sound by measuring the time it took for the sound to travel down a colonnade and back. 2. Hearing an echo is re-encountering a particular sound. 3. Events, unlike objects, cannot be re-encountered. 4. Therefore, sounds are object-like particulars and not events. 2.6.2 Sounds Do Not Travel The account just sketched is incompatible with my conception of sounds and echoes. According to the Event View, sound waves, but not sounds, travel independently of the objects and events that generate them. This view is meant to capture the central phenomenological fact about locational hearing—that sounds are located at a distance and in a particular direction. Echoes, too, seem to be located distally near reflective surfaces. Given that we do not hear sounds to move as sound waves do, we want a non-question-begging reason to believe (1), according to which sounds themselves travel. Consider the following two alternatives to (1). 1*. Newton measured the speed of sound waves by measuring the time it took for him to hear an echo after hearing the primary sound. 1**. Newton measured the speed of sound waves, but not of a sound by measuring the time it took for him to hear an echo after hearing the primary sound. Though (1**) is a strong replacement for (1), (1*) is uncontroversial. But (2) does not follow from even (1*). Hearing an echo may not be re-encountering the same sound at a later stage of its career, even if we owe the episodes 11 Nudds (2001), p. 221–222. CHAPTER 2. ECHOES 32 of hearing to the same sound waves. The conclusion that sounds are not events does not follow from the argument reconstructed with (1*). It is certainly widely accepted that sounds travel. But why? In contrast to (1*), (1) is not obvious. By now the following theme is well-rehearsed: sounds are usually heard to travel only when their sources do. If sounds seemed to you to emerge from their sources and to speed through the air toward your head, we would send you to a specialist. Maybe echoes are special. Perhaps the Argument from Echoes reflects the fact that hearing a sound followed by its echo forms the basis of a strong intuitive case for the belief that sounds travel. After all, the argument is not meant to pre-judge the theoretical question of what a sound is. Rather, it aims to enlist conceptions about sounds and echoes that are implicit in ordinary beliefs to show that sounds are not events. Indeed, the perceptual facts in the case of echoes ground a natural inference to the effect that sounds travel. But the transition from experience to belief is not a matter of simply lending credence to that which is already apparent in experience. Believing that sounds travel requires more than just accepting the way things appear to you. Take a sound, s. That s travels independently of its source is not ordinarily part of the content of an auditory experience, and it does not arise as part of the content of the experience when a sound is followed by an echo.12 So the case of echoes must present further reasons that make the belief that sounds travel intuitive. One preliminary. We do assume that each sound is generated by some event that is not a sound. It appears to be a working “assumption” of the auditory perceptual system that each sound must be produced by some event in the environment.13 Furthermore, it is an introspectible part of experience when hearing a sound that the sound is a sound of something going on in the surroundings. The belief is plausible since empirical evidence repeatedly confirms that sounds are caused by “happenings” that take place among 12 One might think that this observation is all that is required to show that sounds do not travel, if one thinks that a sound cannot have properties beyond those it is experienced to have. This is too easy. I leave it open that sounds can have properties we are unable to detect with our ears alone, and that there can be sounds we are entirely unable to hear. Sounds may indeed move through space; whether or not they do will depend on what the correct theory of sound is. This is not to say that our goal should not be a theory that coheres with the contents of experience—sounds are, after all, among the things we hear. We would like it if perception served as a reliable guide to reality, but whether it does with respect to sounds is a matter to be decided by a variety of concerns. 13 See Blauert (1997), especially Chapter 2, “Spatial Hearing with One Sound Source”, and Chapter 3, “Spatial Hearing with Multiple Sound Sources in Enclosed Spaces”. CHAPTER 2. ECHOES 33 material things—we are hard-pressed to find sounds for which investigation reveals no source events. Now the fact that echoes strike us as causally related to events that occur elsewhere—in fact, as generated by distal events—establishes a connection to what we take to be the source of the primary sound. The echo of the firework’s blast strikes us as being connected to the explosion that generates the primary sound. Though the primary sound and the echo seem distinct, they appear to share a common causal history. But since no source appears to exist where the echo is heard to be, from (i) the qualitative similarity of primary sound and echo; (ii) the time delay between the experience of primary sound and of echo; and (iii) the shared causal history of the primary sound and echo, it is natural to conclude that what we think of as the primary sound and the echo are the very same sound. Once we overcome the perceived distinctness and identify the primary sound and the echo in thought, two facts—that the primary sound is heard to be near the source at one time, and that the echo is heard to be at some distance from the source slightly later—may encourage us to believe that the sound has travelled. Note that forming this belief requires us to ignore or implicitly reject the possibility that two different sounds (with different locations) are generated by the same event.14 This itself requires accepting either that a sound cannot be generated at a distance from its source or that each sound has a unique generating event. The latter is somewhat difficult to motivate, but the former is quite plausible. Even after accepting all of that, you are only entitled to believe that the sound travelled from the perceived location of the primary sound to the perceived location of the echo. Nothing about the naı̈ve perceptually-based way of thinking about the situation allows you to go beyond this to the belief that sounds travel as waves do, i.e., that the sound travelled from its source to the reflecting surface and back to you. The case of echoes adds little to the case for (1). The force of (1) must come from one of two sources: an appeal to science or an inference that fills the gap in the above line of thought. It is common knowledge that information about sounds is transmitted by pressure waves in a medium. To accept that sounds travel because sound waves travel, however, is to complacently accept a caricature of scientific doctrine. 15 Part 14 I will assume that my account of §2.4, in which hearing an echo involves illusions of place and time, does not strike us intuitively. 15 Disagreement about what a sound is exists even in the scientific community. Frequently researchers analyze and divide the terrain, and avoid commitment on the more CHAPTER 2. ECHOES 34 of the business of science is to explain by way of discovering causal connections. Though sounds cause auditory experiences, and pressure waves cause auditory experiences, the identification is unwarranted. To conclude, from ‘A causes C’ and ‘B causes C’, that ‘A = B’, is as fallacious as concluding, from ‘guns kill people’ and ‘people kill people’, that ‘guns are people’. Given that sounds and sound waves differ in significant causal and perceptible respects, it begs the question to conclude on the basis of science that sounds travel. Consider the following two inferences that attempt to bridge the gap between sound sources and perceivers. i. The sound was generated over there; I am here; since I heard the sound it must have travelled to reach me. ii. I heard the sound; then I travelled faster than the speed of sound waves, stopped, and heard it again; so, the sound must have travelled. But since the sound itself is perceived in every case to be stationary and distant, these inferences are either question-begging or fallacious. It begs the question to assume that the sound is identical with the waves, and therefore travels. It is fallacious to conclude that since distance separates perceivers from sound generating events, sounds must travel. Sounds need not travel since that by means of which perceivers hear them—sound waves—can move independently of sounds. Premise (1) of the Argument from Echoes is seriously undermotivated. 2.6.3 Re-encounters I have tried to undermine the claim that sounds travel independently of their sources in part because the conception of sounds that it underwrites is unattractive. That hearing an echo after hearing its primary sound is re-encountering the same persisting sound particular finds little support in perceptual experience. As I pointed out in §2.4, primary sounds and their echoes seem to be distinct. A single object perceived at different times does not. Hearing an echo is unlike re-encountering a person you have met before, and it is unlike general question. To the extent that a commitment is made on the issue of identification, physicists most frequently speak of sounds as waves. But there is a tendency among perceptual researchers to speak of distally located acoustic events and sounds in addition to sound waves and auditory experiences. See Bregman (1990), Luce (1993), Carlile (1996), Blauert (1997), and Gelfand (1998), for illustrations of this contrast. CHAPTER 2. ECHOES 35 glimpsing someone carrying home the vase you saw earlier in a store window, even though the echo has an equally rich qualitative signature. Echo perception does not bear the marks of object-recognition and identification that characterize the experience of material objects and continuants with relatively stable qualities. We find it reasonable to count echoes and primary sounds as distinct before doing philosophy. But to count earlier and later states of persisting things as distinct objects is counterintuitive. 16 Matters get worse. Given a plausible thesis concerning sounds, accepting an interpretation of (2) that makes the Argument from Echoes valid leads to trouble. First Thesis Sounds are extended in time. Just as concrete objects occupy space, sounds occupy time. Each sound has a duration: it begins at one time; it ends at another (if it ends at all); it may involve a great deal of qualitative change. We arrive at the First Thesis by way of two lemmas. Lemma One Sounds are not, or certainly do not seem, wholly present at each moment at which they exist. Compare this with objects. It is plausible that all that is required to be a particular object can be present at a moment, and thus, that a particular object can be wholly present at each moment at which it exists. At any particular moment, however, only a portion—a temporal part—of a sound is present. That is, rather than a sound in its entirety, only a part corresponding to a property profile at a time is present at each moment during which the sound exists. Lemma Two Having a qualitative profile over time is central to the identity of a particular sound. Consider a spoken word or a particular squawk from a raven. Each has qualitatively different stages that constitute it, and sounds with different profiles over time are different sounds. The two lemmas and the First Thesis are independently plausible statements that express judgements grounded by our intuitive conception of sounds. 16 In the debate about whether each material object is a series of suitably related temporal parts or a single thing that undergoes changes in properties through time, that is, whether things perdure or endure, it is the latter endurantist view that appears natural to common sense and perceptual experience (see, e.g., Thompson (1983), who says of the metaphysic of temporal parts: “It seems to me a crazy metaphysic—obviously false,” p. 210). The perdurantist must motivate the view with philosophical considerations. CHAPTER 2. ECHOES 36 Now the Argument from Echoes relies on the claim that events are not the sorts of things that we can re-encounter. We need a working conception of what it is to encounter something in the relevant sense. I will continue to assume that to encounter something is just to enjoy perceptual awareness of it. So to the extent that we can speak meaningfully about encountering an event, nothing more is required than to be in a suitable relation of perceptual awareness to it. An alternate sense of ‘encounter’ means roughly the same as ‘bump into’. This sense seems appropriate if sounds are the travelling causes of auditory experiences, as the Argument from Echoes would have it, but applies paradigmatically to objects and not to events. Since the Argument from Echoes employs the fact that sounds can be re-encountered to show that they are not events, this must not be the sense of ‘encounter’ in question. Otherwise, to show that we bump into sounds would suffice to show that they are not events. But the argument seems to be: We can certainly encounter events, we just cannot re-encounter them. So, if the argument turns on something peculiar about echoes, ‘encounter’ must mean something like ‘perceive’ or ‘enjoy perceptual acquaintance with’. 17 The claim must be that while we can re-encounter sounds, we do not normally think that we can re-encounter, that is, re-perceive, events. Sharpening the sense of ‘re-encounter’ in this way is not enough to make the Argument from Echoes valid. Consider the following thesis. Second Thesis, Weak Version We sometimes re-encounter a particular sound at a later stage during its completion. Suppose I wake up to hear the loud, high-pitched beginning of the local emergency siren’s wail. I then descend into the silent basement for two minutes, after which I emerge to hear the nearly completed sound’s fading low-pitched moan. Each time I experience a different part of the sound. So, we can re-encounter a particular sound by encountering different parts of the same temporally extended sound. Events, however, are often re-encountered in this way, so the Weak Version of the Second Thesis is consistent with sounds’ being events. What the Argument from Echoes needs in order to show that sounds are not events is that we can re-encounter the very same parts of a particular sound on separate occasions. From this it follows that if we can encounter all of the parts of a sound on each of two different occasions, we are capable of re-encountering the sound in its entirety. The import of (2) must be the following strengthened version of the Second Thesis. 17 To make the notion line up with ordinary usage might require adding some proximity requirement. Perceptual acquaintance suffices for the arguments that follow. CHAPTER 2. ECHOES 37 Second Thesis, Strong Version We sometimes re-encounter a particular sound in its entirety. The Strong Second Thesis is the one intended to ground the argument that sounds are not events. We do not ordinarily think of events such as particular closings of doors and birthday parties as things that can be reencountered in their entirety since they occur at specific times and places. Halloween is a different event each year that belongs to a class of like events that each occur on some October 31. While we take for granted reencounters with persisting objects, to re-encounter a particular Halloween would take a trick, such as time-travel. In my account of echo perception, the reflection of sound waves from a surface serves as the occasion for such a trick. You re-perceive the primary sound event because its traces rebound and return. So for the Argument from Echoes to be valid, the import of (2) must be that Newton re-encountered, without any tricks and in its entirety, the very same sound particular. The trouble comes when we accept the First Thesis with its lemmas while accepting (1) and a version of (2) that implies the Strong Second Thesis. From the Strong Second Thesis, a particular sound can [without tricks] be encountered in its entirety on two separate occasions. From the First Thesis, each sound has duration—it begins and ends. Now consider hearing a particular sound and its echo. It is an experiential fact that between t 1 and t2 , call this interval tp , we experience the primary sound. Likewise, the echo is heard during some later interval te , which begins at t3 and ends at t4 . By (2) and the Strong Second Thesis, the sound heard at t e is the same (entire) particular sound heard at t p . Now, in light of (1), the sound must travel and exist continuously between t p and te . It follows from this that the re-encountered sound is perceived to begin and end multiple times during which it continuously exists. Since something cannot continue to exist after it has ceased to be, either sound duration perception—perceiving that sounds begin and end—is illusory and the grounds for the First Thesis are undermined, or the claim that sounds are persisting particulars that can be re-encountered in their entirety is false. 2.6.4 Some Unsavory Consequences Suppose (1), (2), and the Strong Second Thesis are correct. A proponent might come to their defense as follows. CHAPTER 2. ECHOES 38 When you perceive a sound to begin and end, you perceive the sound’s spatial boundaries, its “front” and “back”, as it travels past. We mistake the beginning of our experience of the sound for the temporal beginning of the sound, and the end of our experience for the temporal end of the sound. The sound, properly speaking, begins and ends only in the spatial sense. It has a front and a rear, the experiences of which lead us to believe that the sound begins to exist and then ceases. By experiencing the spatial boundaries of a sound multiple times, the re-encountered sound can seem to begin and end entirely on separate occasions. The explanation of our perceptually-based confidence in the First Thesis adverts to encounters with the spatial boundaries of sounds as they travel past. These encounters correspond to perceived beginnings and endings of sounds, while only our experience of the sound can properly be said to begin and end. From this conception, several unsavory consequences follow which together suggest that we are on the wrong track if our goal is to provide a theory of the objects of auditory experience that is phenomenologically apt. These consequences draw attention to desiderata that an alternative might satisfy. First, either it paradoxically implies that sound generating events, such as firework explosions and door closings—events that cause sounds—are not events, or it implies that we enjoy veridical perceptual awareness of the durations of sound generating events by way of illusory awareness of the durations of sounds. When we perceive sounds we are aware of the events that bring them about. To hear the sound of the firework is to be perceptually aware of the event that caused the sound—it is to be aware that something has exploded; to hear the sound of a door closing is to be aware that a door is closing. Since even on the conception of a sound in question, the primary sound and the echo share a common sound generating event, we perceive the very same sound generating event when we encounter the sound and the echo. If to perceive an event is all that is required to encounter that event, we re-encounter the sound generating event when we hear an echo after hearing its primary sound. Since according to (4) we do not re-encounter events in their entirety, the case of echoes shows that sound generating events are not events! That is absurd. Suppose we say that auditory perceptual awareness of sound generating events is indirect and that we can re-encounter events indirectly by directly perceiving something else, e.g., in this case a sound CHAPTER 2. ECHOES 39 and an echo.18 When we perceive something to begin and end by means of audition, our awareness is as of directly perceiving a sound to begin and end. We are supposing, however, that this awareness is illusory—that we only directly perceive the spatial boundaries of a sound that lacks duration of the sort we attribute to it. How then are we indirectly aware of the duration of the sound generating event? Our indirect awareness of the duration of the sound generating event is supposed to occur in virtue of our direct awareness of some aspect of the sound itself.19 But it is strange to say that we indirectly perceive something (the duration of a sound generating event) by way of directly perceiving something else (the spatial boundaries of a sound) of which we have no introspectible awareness. It is stranger yet to say that we indirectly but veridically perceive something (the duration of a sound generating event) by way of an illusion of direct perception (of the duration of a sound). If veridical and introspectible awareness is required of that which is directly perceived in order for it to furnish veridical indirect awareness, then the appeal to indirect perception (of the sound generating event and its duration) does not avoid the absurd conclusion. The requirement may be rejected if a more permissive attitude is taken toward the standards that must be met for perceptual awareness, but even in that case a considerable burden must be shouldered. The following must be accepted. that sounds do not have the durations we think they have; that it is possible to re-encounter, in the sense of ‘indirectly perceive’, events; and that it is possible to veridically perceive the durations of sound generating events by way of the illusion of sound duration. Suppose that the First Thesis must be rejected, and that perceiving a sound involves perceiving a set of adjacent qualities that move through space. When you hear a continuous scale played on an oboe, you experience one note after another as they reach and move past you. What begins and ceases is the stream of qualities that passes you and continues through the medium. But then perceiving the qualities of sounds is also shot through with illusion. Qualities which are at your ears, on this proposal, are heard to be elsewhere. Each of the oboe’s notes is perceived as a stationary complex 18 So also we may watch the dousing of the fire, then perceive it again, indirectly, by way of the resulting puff of smoke. 19 For it seems patently false—a violation of the transparency of auditory perception— that we indirectly perceive the duration of a sound generating event by way of directly perceiving the duration of our auditory experience. CHAPTER 2. ECHOES 40 of audible qualities located in the vicinity of the oboe. If the sound moves through the air, then this proposal is akin to the claim that colors travel through space as light does though we see them to be located at surfaces. Though sounds are not perceived as qualities of surfaces, they are perceived as distally located. The forms of projective illusion attributed to perception are similar.20 Thus, the second unsavory consequence of this view is that the qualities of sounds are mistakenly perceived as located near their sources. Third, it follows that the structure of our perceptual judgements and the resulting conception of how sounds exist unperceived are faulty. We think that sounds can exist unperceived in the following sense. When a car passes me and goes out of range, I cannot hear its sound. The sound still exists, but I cannot perceive it given my location and auditory capabilities. We do not ordinarily think that a sound exists unperceived in the sense that it exists after we have heard it and after the sounding has stopped. But on the proposal being entertained, sounds exist long after the events that produce them do, and, indeed, continue to exist well after we hear them. So, even though you think the sound of a spoken word no longer exists after you hear it, it goes on existing for as long as the corresponding waves travel through the medium. 2.6.5 Conclusion It is more difficult to motivate the claim that sounds travel than you would expect from the way we sometimes talk. Echoes offer little to further the case. The claim that sounds travel also underwrites a conception of sounds and echoes that conflicts with central elements of the way we take sounds to be. Since the Argument from Echoes assumes that sounds are persisting particulars that travel and can be re-encountered, it fails to show that sounds are not events. 2.7 Is the Illusion Tolerable? The Event theorist’s account of echo perception posits spatial, temporal, and qualitative illusions. Why, since I took the Wave View to task for its systematic location illusion, is this not objectionable? One might even think 20 Note that this is not the same kind of projective illusion which is supposed to occur when mental properties are experienced as properties of external objects, which some theorists (e.g., Boghossian and Velleman (1989) and (1991)) believe occurs during color perception. CHAPTER 2. ECHOES 41 that the Wave View fares better in the case of echoes, since it only posits a location illusion and an explicable duration illusion. Illusions, per se, are not reasons for objection. But we are interested in perception for what it can potentially reveal to us about the world. To the extent that we take perception to be a reliable guide to certain aspects of the world, we have an interest in reducing the amount of illusion we attribute to perceptual experience. But illusions can inform us about the mechanisms involved in perception and about aspects of the world which we cannot directly observe. So we also have an interest in discovering the illusions we actually fall prey to. If, however, an account proposes an illusion whose spell we have independent reason to believe we are not under, all else equal we should prefer an alternative which does not posit that illusion. I have suggested that we have no reason to think that we are under the illusions posited by the Wave View, and also that in the case of echoes there are compelling reasons to believe that we do fall prey to the illusions posited by the Event View. The Event View’s illusions have a signficantly different status from the Wave View’s. According to the account I have given of echoes and echo perception, the illusions arise as a special case. While in ordinary sound experience we hear sounds for the most part as they are, the experience of an echo occurs only in quite special circumstances. The illusions posited by the Wave View are systematic and pervasive. The Event theorist’s illusions are explicable. Quality illusions occur because filtering and scattering that occur at reflecting surfaces distort information contained in sound waves. Illusions of place and time occur because sound wave reflection mimics the situation in which a sound source exists at the reflection site. So there are external states of affairs which explain why experience attributes the properties it does when the illusion occurs. Because of this, the illusions in the case of echo perception are also predictable once the mechanisms of ordinary sound perception are known. Finally, the illusions that occur during echo perception are analogous to illusions that occur in the visual case which we find interesting but unproblematic. Hearing the firework’s echo to be at the brick wall is like facing a large mirror and thinking there is a piano in a room ahead of you. Hearing the echo to occur after the primary sound is like seeing a supernova from across the galaxy.21 21 You might worry that if the Event View is correct, the time-gap that occurs even in ordinary sound perception—because sound waves travel relatively slowly—is problematic. But it is simply an exaggeration of the time-gap that occurs in vision, and goes undetected CHAPTER 2. ECHOES 2.8 42 Concluding Remarks I have provided an account of echoes and echo perception that is consistent with the Event View of sounds, according to which an echo just is a primary sound whose perception involves distortion of place, time, and perhaps qualities. Echoes are not distinct from primary sounds. The impression of distinctness occurs because when we immediately perceive happenings that are past we perceive them as present, and because without special tricks we perceive events in their entirety only once. Once we recognize that echoes are not distinct from their primary sounds, we need not be forced into the view that sounds are object-like particulars which persist and travel. This view rests upon shaky foundations and has consequences which force us to reject claims central to our perceptual beliefs about sounds. Without that view the Argument from Echoes does not establish the conclusion that sounds are not events. So we should prefer the Event View, according to which a sound is the event of an object or interacting bodies disturbing a surrounding medium in a wave-like manner. 22 unless somewhat large distances are involved. Note that Armstrong (1961) confusedly thought sounds might not pose a time-gap problem. In the case of a star, it may be questioned whether our immediate perception really involves any temporal illusion. It may be suggested that what we immediately perceive is not the star, but a present happening, causally connected with the extinction of the star many years ago. The star sends a message to us, as it were, and we immediately perceive the message, not the star. Now this suggestion may be correct in the case of a sound [fn. This was pointed out to me by Dr. A.C. Jackson]. There seems to be some force in thinking of sound as actually spreading out from its source, like a balloon rapidly inflating. (And here I am not speaking of the sound-waves.) So when two people ‘hear the same sound’ it may be argued with some plausibility that they immediately hear two different things, because they are in different positions (pp. 147–148). Armstrong goes on to reject the analogy in the case of a star, since the only immediate object of sight is “the star itself” and not light waves. He concludes that we sometimes immediately perceive past happenings, though they seem present. I find it implausible that a sound is like a rapidly inflating balloon, and that two people cannot hear the same sound. So I reject Armstrong’s characterization of sound and accept that hearing past sounds is like seeing past events. Both involve a time-gap and a temporal illusion. 22 Thanks to audiences at Princeton University, Florida State University, U.C. Santa Barbara, and University of St. Andrews for helpful discussion. Chapter 3 Audible Qualities 3.1 Introduction Sounds are events in which objects or interacting bodies disturb a surrounding medium. Particular sounds have pitch, timbre, and loudness. A flute’s notes are higher pitched than a tuba’s; middle C played on a piano has a character different from middle C played on a trumpet; jet planes make louder sounds than tractors. Given how audition works, physics has taught us that frequency, wave shape, and intensity determine which pitch, timbre, and loudness a sound auditorily appears to have. A theory of sounds must address the audible qualities. What exactly are pitch, timbre, and loudness, and how are they related to the physical properties of frequency, wave shape, and intensity? Given psychophysical evidence which indicates that there is neither a linear nor a logarithmic function from the frequency of a sound to the pitch it is experienced to have, auditory researchers have adopted the view that pitch is a subjective or psychological property.1 That is, pitches strictly only belong to experiences. They draw similar conclusions in the cases of loudness and timbre. I shall suggest, however, that pitch, timbre, and loudness—though not identical with frequency, spectral composition, and intensity—can be identified with physical properties of sounds themselves. The Standard (Subjectivist) View of the audible qualities does not follow from the failures of simple forms of physicalism about pitch, timbre, and loudness; indeed, a promising physical candidate for pitch can be extracted from recent work on pitch perception. My focus primarily will be on pitch and problems that arise in saying what pitch is, but I shall suggest that the account can be extended to timbre 1 See §3.3 and §3.4 for a full discussion. 43 CHAPTER 3. AUDIBLE QUALITIES 44 and loudness. According to this Alternative (Physicalist) View, the audible qualities lack the independent scientific interest attached to those of the Simple Views; they are, however, of interest from an anthropocentric point of view. The alternative holds that pitch, timbre, and loudness are properties that sounds have in addition to frequency and intensity. Since pitch, timbre, and loudness can be understood in terms of frequency and intensity, and since frequency and intensity are properties ordinarily ascribed to waves, pitch, timbre, and loudness appear to be properties of waves, not sound events as I have construed them. I shall argue, however, that frequency and intensity can be ascribed to sounds conceived as events. Thus, sound events as I have characterized them have pitch, timbre, and loudness in addition to frequency and intensity. The Event View of sounds is therefore capable of providing an account of the audible qualities according to which they are objective, physical properties of sounds. 3.2 3.2.1 Simple Views of Pitch The Very Simple View The Very Simple View of pitch can be extracted from the story we learned about sounds in primary school. We were taught that the pitch of a sound is the frequency of a pressure wave that travels through a medium such as air, water, or helium.2 Whatever doubts we have about the natures of colors, accepting the Very Simple View makes us happy to say that pitches reside in the world beyond our minds. That is, that the quality we discern when we hear pitch is just the property of having some frequency. When we tune a trombone, one way to get its pitch right is by adjusting the frequency of an F with an electronic tuner. This Very Simple View accounts for salient aspects of the experience of pitch. It explains the linear ordering of pitches: as frequency increases, so does perceived pitch. A natural account of the musical relations also follows from the Very Simple View of pitch. Small whole-number frequency ratios form the bases of the octave (1:2), fifth (2:3), fourth (3:4), et al. One gets the palpable sense that the natures of such audible relations are revealed by this discovery. 2 Frequency is just the number of cycles per second, measured in Hertz (Hz). CHAPTER 3. AUDIBLE QUALITIES 3.2.2 45 The Simple View The Very Simple View is far too simple. The identification of pitch with frequency approaches adequacy for pure or sinusoidal tones—sounds whose accompanying waves are constituted by sinusoidal pressure variations. 3 (See Figure 3.1: Sinusoidal Motion). For sinusoidal tones, increasing frequency increases perceived pitch, and decreasing frequency decreases perceived pitch. Pure sinusoidal tones are rarely encountered in nature, and many complex sounds that are not themselves sinusoids are perceived to have pitch. Fourier showed that any complex sound which is not itself a sinusoid can be analyzed in terms of sinusoids of various frequencies in differing proportions. Fourier’s Theorem applies in virtue of the additive principles of wave-like motion. So we can characterize any sound by citing the (maximum or rootmean-square4 ) amplitude or intensity of each of its sinusoidal constituents. (See Figure 3.2: Fourier Composition of a Square Wave). The constitutive role of individual sinusoids can be taken quite literally: the phenomenon of sympathetic resonance demonstrates that complex tones are made up of sinusoids of different frequencies. A pure sinusoidal tone from one tuning fork causes a second tuning fork with the same characteristic frequency to begin sounding in virtue of its own induced sinusoidal motion. A complex tone from a human voice, one of whose Fourier-predicted components shares a tuning fork’s characteristic frequency, induces that tuning fork to resonate just as a sinusoidal tone of that frequency does. But a complex sound without a Fourier component at the tuning fork’s characteristic frequency induces at best a weakened resonant sounding. The best explanation of sympathetic resonance is that a tuning fork’s resonant sounding is caused by a sinusoidal constituent which is genuinely present in the complex sound. Pure sinusoidal tones are a variety of periodic sound. That is, they repeat a certain motion regularly over any given interval. Some complex sounds are also periodic. As a matter of empirical fact, just the periodic sounds have pitch. Though periodic tones may occur within more “messy” or “noisy” sources and thus cause pitch experiences, and sounds that are not exactly periodic may appear to have pitch, pitched tones are generally periodic sounds. If a complex signal is periodic and repeats at regular in3 I shall argue in §3.3, however, that the Very Simple View fails even for pure sinusoidal tones. 4 The root-mean-square amplitude is a measure of the average magnitude of a varying instantaneous amplitude. It is given by the square root of the mean of the squared instantaneous amplitudes. For a sinusoid, the root-mean-square value is 0.707 times the peak amplitude. Gelfand (1998), pp. 19–21. CHAPTER 3. AUDIBLE QUALITIES 46 tervals, the frequency of each of its components (or partials) must have an integer-multiple relationship to a fundamental frequency. The fundamental frequency of a complex sound is the greatest common whole-number factor of the sound’s constitutive frequencies. Thus, the fundamental frequency of the complex (square) tone in Figure 3.2 is 1000 Hz. Helmholtz demonstrated that the fundamental frequency of a complex periodic tone determines its perceived pitch. 5 So the Simple View of pitch is that the pitch of a sound is identical with its fundamental frequency; that is, the pitch of a periodic sound is the greatest whole-number frequency by which the frequency of each of its sinusoidal components is divisible without remainder. According to this formulation a sinusoidal component at the fundamental frequency may or may not actually be present in the complex sound. The phenomenon of the “missing fundamental” demonstrates that the fundamental frequency component need not be present for a sound to have the same pitch as a sinusoid of that frequency. A tone with constituents at 1200 Hz, 1300 Hz, and 1400 Hz has the same perceived pitch as a 100 Hz sinusoid.6 Telephones, which filter low frequencies, illustrate the principle. One hears a man’s voice to have the same pitch in person and over the telephone, though the fundamental is absent from the telephone speaker’s sound.7 The Simple View identifies pitch with fundamental frequency, which depends upon the frequencies of a sound’s sinusoidal constituents. It follows that pure sinusoidal sounds and complex sounds such as those that exhibit sawtooth or square wave patterns can share the same pitch in virtue of sharing a fundamental frequency. The Simple View identifies pitches with 5 Helmholtz (1954). See Helmholtz (1954) and Schouten (1940). 7 Terhardt, e.g., (1974) and (1979), has argued that this simple conception is inadequate to account for all types of pitch phenomena. He has distinguished between spectral pitch, which is pitch heard in virtue of constituent frequencies that are actually present in the sound, and virtual pitch, which is heard despite the absence of a spectral constituent. Terhardt has pointed out that a view analogous to the Simple View fails to account for the fact that a single complex sound is often heard to have multiple pitches, determined by both spectral and virtual pitches. He has also noted that both types of pitch may be heard simultaneously, even at the same frequency! Terhardt’s conception depends, however, upon individual pitches, spectral and virtual, being determined by constitutive frequencies that are present in the sound. Indeed, individual spectral and virtual pitches are identified with particular frequencies. So the Simple View that pitch is fundamental frequency may not accommodate all salient pitch phenomena, but it is sufficient for my purposes that once determined, pitches are identifiable with particular frequencies, and that pitch determination is a matter that depends entirely upon which frequencies are present in a complex sound. 6 CHAPTER 3. AUDIBLE QUALITIES 47 particular frequencies, and thus preserves the Very Simple View’s influential account of the musical relations. It has the further advantage of being supported by the physiology of auditory perception. The basilar membrane, located within the cochlea, is like a long trapezoidal ribbon, different parts of which most actively resonate in sympathy with a particular sinusoidal frequency. The basilar membrane thus performs a sort of Fourier analysis, decomposing a complex signal into its sinusoidal components. This spectral information is converted into electrical potentials by the hair cells, which activate auditory neurons. Given frequencies activate portions of the auditory nerve that are “tuned” to those frequencies, and this tuned or tonotopic organization continues up through the auditory cortex where higher cognitive processes determine pitch or fundamental frequency from spectral information about the sound. 8 The significance of Fourier decomposition and tonotopic organization is that frequency parameters appear to be preserved and represented through various stages of the auditory physiology. In fact, electrodes can recover and reproduce a sound presented to the ear from the subsequent auditory nerve signal. 9 It appears that pitch perception depends upon determining the frequencies present in a complex signal. If pitch is fundamental frequency, the problem of pitch perception is physiologically tractable. 3.3 Problems with the Simple View The results of psychoacoustics give strong evidence of a subjective pitch scale that correlates rather loosely with fundamental frequency. Though pitch changes only if frequency changes, the magnitude of a pitch change is neither identical with nor a constant function of the magnitude of its corresponding frequency change. Consider two types of psychoacoustic experiment.10 The first type consists of ‘Fractionalization’ experiments. Subjects are presented with a given tone, and are then instructed to adjust a second (simultaneously presented) tone so that its pitch is one-half that of the first tone.11 This process is repeated for many tones of different frequen8 A cat’s cortex is arranged in individual columns which are tuned to characteristic frequencies. See Woolsey (1960) and Gelfand (1998), Chapter 6. 9 This is the so-called Wever-Bray effect. “Wever and Bray reported that if the electrical activity picked up from the cat’s auditory nerve is amplified and redirected to a loudspeaker, then one can talk into the animal’s ear and simultaneously hear himself over the speaker.” (Gelfand (1998), p. 136). 10 As described in Gelfand (1998), p. 354. 11 Stevens, Volkmann, and Newman (1937). CHAPTER 3. AUDIBLE QUALITIES 48 cies. The second type are called ‘Equal Intervals’ experiments. In one such experiment, subjects are instructed to adjust the frequencies of five tones until they are separated by equal pitch intervals. 12 Fractionalization and Equal Intervals experiments yield a remarkably consistent scaling of pitch as a function of frequency, according to which equal pitch intervals do not correspond to equal frequency intervals. 13 Doubling frequency does not uniformly double pitch: the frequency of a 1000 Hz tone must be tripled in order to double its pitch; doubling the pitch of a 2000 Hz tone requires quadrupling its frequency. The accepted pitch scale extracted from psychoacoustic data assigns to equal pitch intervals equal magnitudes in units of mels. The mel scale is therefore an extensive or numerical pitch scale, in contrast to the intensive frequency scale for pitch. 14 (See Figure 3.3: Pitch in Mels as a Function of Frequency).15 Suppose we accept that the notion of pitch magnitude is well-founded, and that Fractionalization and Equal Intervals experiments reliably determine the relationship between perceived pitch and frequency. If pitch perception is for the most part veridical, it follows that pitch is not identical with frequency. It also follows, perhaps surprisingly, that the octave relation is not a doubling of pitch. The octave relation is in some sense “the same again (but higher/lower)”, but it is not “double in pitch”. Likewise, none of the musical relations is a simple, small whole-number ratio between pitch magnitudes. We require another characterization of the octave, fifth, et al., as relations between sounds with particular pitches. The Simple View is inadequate if we wish to identify pitch with some physical property of sounds that we veridically perceive. The relational structure among pitches differs from the relational structure of frequencies. In particular, changes in pitch magnitude do not correspond to uniform changes in frequency magnitude. 3.4 The Standard (Subjectivist) View The accepted view among auditory researchers is that pitch is a subjective or psychological quality merely correlated with the frequency of a sound. Gelfand (1998), for instance, states that “in formal terms, pitch is the psychological correlate of frequency, such that high frequency tones are heard 12 Stevens and Volkmann (1940). Stevens, et al. (1937) and Stevens and Volkmann (1940). 14 Extensive scales preserve ratios between quantities, but intensive scales need not. 15 Another accepted pitch scale is slightly different. I shall discuss, in §3.5 and in Appendix A, the Bark scale introduced by Zwicker (1961) and Zwicker and Terhardt (1980). 13 CHAPTER 3. AUDIBLE QUALITIES 49 as being ‘high’ in pitch and low frequency tones are associated with ‘low’ pitches.”16 The Standard View among psychoacousticians is that pitch is a mental quality in virtue of which we imperfectly perceive the frequencies of sounds. Thus stated, the accepted view is that pitch is a quality of experiences much like a “quale” or “subjective feel” associated with frequency perception. The Standard View is a form of philosophical subjectivism that arises in response to the divergence between pitches and the external physical properties thought to be responsible for pitch experiences. Pitches are thought to be internal, mental properties, and thus subjective in a natural sense. On the Standard View, sounds have frequency but not pitch. As such, it is a form of error theory concerning auditory perceptual experience. Since we take auditory experiences to furnish awareness of sounds and their pitches, if the Standard View is correct then pitch is what Alex Byrne has aptly described as “a perfectly monstrous illusion.” 17 Our ascriptions of pitch properties to sounds are simply never true. But it just does not follow from the fact that perceived pitch intervals do not correspond to like frequency intervals that pitch is a property of experiences that cannot be ascribed truly to sounds. I wish to propose that an Alternative View of pitch according to which pitches are physical properties of sounds is equally viable and captures more of the desiderata that should be met by a philosophical theory of sounds. 18 If sounds have a physical property whose variations correspond uniformly to variations in perceived pitch and that is responsible for pitch experiences, then we can avoid both eliminativism about pitch and error theories concerning pitch perception.19 16 Gelfand (1998), p. 353. Similarly, Mestre, et al. (1998) say, “pitch is the subjective quality associated with frequency.” 17 Byrne uses this phrase to refer to the status of colors according to error theorists. Byrne (forthcoming), p. 9. Boghossian and Velleman (1989) and (1991), and Hardin (1988) are contemporary proponents of color eliminativism. 18 In Appendix A, I attempt to develop such a physicalist alternative and to defend it against objections that purport to demonstrate its inability to account for important features of pitch and pitch experiences. 19 What about other options that neither attribute systematic error to pitch perception nor eliminate pitches from the world of sounds? What we would like is a view that ascribes pitches to sounds and entails that most of the time, when things are going well with hearing, we perceive sounds to have the pitches they actually have. One such view is that pitches are dispositions, construed objectively, to cause subjects to have pitch experiences. We must take care in formulating dispositionalism. If pitches are dispositions to produce auditory experiences with a certain phenomenal character (compare, with respect to color, CHAPTER 3. AUDIBLE QUALITIES 50 For now, we can summarize the respects in which the Standard View falls short of a fully satisfactory account of the audible qualities, and thereby illustrate the motivations for an alternative. (1) If we accept that pitch is the property of sounds that is causally responsible for our experiences as of sounds’ having pitch, and we accept that sounds are external, objective (mind-independent) entities, we will prefer an account according to which instances of the audible qualities are (1a) entirely external to perceivers and (1b) objective qualities that are instantiated in the world independently of perceivers and their responses. (1a) weighs against the Standard View, and (1b) also counts against other forms of subjectivism that assign distal locations to pitch instances. 20 Peacocke (1984)), we have no direct perceptual access to the pitches of sounds since we are immediately aware only that we have an experience with certain phenomenal pitch properties. We can then infer or otherwise come to believe that sounds have dispositions to cause in us particular phenomenal pitch properties, but we will never be auditorily aware that some sound has a particular pitch. Byrne (ms) offers a more developed version of this argument against what he calls ‘reductive dispositionalism’. This form of dispositionalism does not leave us much better off than the Standard View. Better to say that pitches are dispositions to produce experiences as of a sound’s having a certain pitch. Byrne (ms) calls this view ‘nonreductive dispositionalism’. But the nonreductive dispositionalist must accept that auditory experiences present relational properties of sounds—e.g., being disposed to sound high-pitched to perceivers of kind k—that we are strongly inclined to think are monadic, or that auditory experiences do not present sounds as having pitches, properly understood. Neither disjunct is attractive, but perhaps nonreductive dispositionalism, suitably motivated, is capable of capturing most of what we want from a theory of pitch. I do not hope to settle the question of whether pitches, and sensible qualities more generally, are dispositions. My aim is to present a view of pitch according to which pitches are physical properties of sounds, and to defend it against objections that purport to demonstrate its inability to account for important features of pitch and pitch experiences. If sounds have a physical property that is responsible for pitch experiences, variations in which correspond to variations in perceived pitch, then we can avoid eliminativism about pitch and error theories of pitch perception. Given that the physical bases of dispositions are central to understanding what dispositions are and when they are correctly ascribed (cf. Martin (1994), Lewis (1997), and Fara (2001)), my project should be of interest to the dispositionalist who aligns herself against the Standard View according to which pitch is a psychological or subjective property. 20 Might sounds indeed bear pitch properties that are external to subjects though still in some sense subjective or mind-dependent? If so, the extreme conclusions of the Standard View might be avoided while maintaining the subjectivity of pitch. The most familiar form of subjectivism about sensible qualities is formulated in terms of dispositions to produce characteristic kinds of perceptual responses (e.g., Locke (1975) and McGinn (1983)). Pitch, then, is the subjective property sounds have in virtue of causing subjects to have pitch experiences. But nothing about a sound’s being disposed to produce pitch experiences makes it the case that its pitch “depends” in some interesting sense upon subjects. Rosen (1994) presents convincing arguments that to claim of certain properties that they CHAPTER 3. AUDIBLE QUALITIES 51 (2) The Standard View entails that sounds themselves are not immediately characterizable in terms of pitch. If sounds can be correctly described and classified by their pitches without reference to their frequencies or mention of perceivers’ responses, no widespread-error theory, eliminativism, or projectivism is correct for pitch. (3) Knowledge about at least some of the properties of sounds seems to be possible solely on the basis of (perhaps suitably idealized) perceptual experiences. If such knowledge is not acquired by inference, quasi-inference, or some other indirect method, then the Standard View and other error theories about the processes involved in perception and belief formation about the audible qualities are false. (4) Many philosophers maintain that the contents of perceptual experiences are exhausted by their representational or intentional contents—that experiences can be fully characterized by how they present the world as being. Intrinsic qualities of experiences, in particular, need not be mentioned in a full characterization of experiential contents. 21 The Standard View, however, essentially makes reference to the psychological property of pitch to account for the character of subjects’ auditory experiences. If intentionalism is the correct theory of perception, an explanation of the content of auditory experience that adverts to a non-psychological pitch property must be given. (5) Perception seems to be a relatively reliable mode of access to information about the world. We think that beliefs formed on the basis of our perceptual experiences are at least prima facie justified. To this extent we have reason to resist the claim that auditory experiences are in some important respect illusory. A view that vindicates pitch experience is preferable to one that attributes to hearing some substantial measure of error, either in that we distort frequency relations or in that our pitch experiences fail to discern any real property of sounds. are “response dependent” is to mistake a distinction at the level of concepts or representations for a distinction at the level of properties or facts. Perhaps pitches are basic, subject-dependent properties of sounds that cannot be identified with dispositions. But that makes pitches and their subjectivity, alike, mysterious from the perspective of a moderately naturalistic understanding of the world. The Standard View thus appears to be the most viable view that is in some way subjectivist and which meets the standards of explanation required by auditory science. 21 See, e.g., Harman (1990), Dretske (1995), Lycan (1996), Tye (1995) and (2000), and Byrne (2001). CHAPTER 3. AUDIBLE QUALITIES 3.5 52 The Alternative (Physicalist) View Physicalism about pitch amounts to the claim that there are properties of sounds themselves that can be described in the terminology of a physical theory and which correlate well with the pitch experiences of normal human hearers who are in circumstances that are favorable for hearing the pitches of sounds.22 We need only look for an alternative to physicalism if such a property is not in the offing or if further philosophical considerations prevent us from identifying pitch with any property expressible in the language of physics.23 The Standard Subjectivist View is motivated by the implicit assumption that the psychophysical results discussed in §3.3 show that no physical property of sounds can be identified with pitch since the most likely candidate—frequency—cannot. I wish to challenge this implicit assumption. Proponents of the Standard Subjectivist account of pitch classify sensory response types according to the frequencies and intensities of the sounds that give rise to those responses. In fact, Zwicker and Terhardt (1980) express the subjective pitch of a sound as a function of its (fundamental) frequency, and derive from this function a pitch scale in units called ‘Barks’ which, in its relationship to frequency, closely resembles that of the mel scale. 24 (See Figure 3.4: Pitch in Barks as a Function of Frequency). But frequency is, of course, a physical property of sounds. The physicalist about pitch has as her target precisely a property of sounds that varies with frequency just as the subjectivist’s psychological pitch does. I argue in Appendix A that extant accounts of pitch experience provide the materials to characterize just such a property. Accepted subjectivist accounts, including Zwicker and Terhardt’s, explain pitch experiences in terms of the activities of different portions of the auditory system that respond to the presence of energy within ranges of frequency (critical bands) which vary in magnitude throughout the audible spectrum. (See Table 3.1: Selected Critical Bands). But such responsiveness on the part of the auditory perceptual system is responsiveness to the presence of energy within certain perceptually salient frequency ranges since the responses vary in proportion to critical band energies. The energy distribution across an ordering of those frequency ranges is thus a good candidate 22 Similar claims can be made for timbre and loudness. One such consideration may arise out of cases of spectral shift or inversion of sensible qualities with respect to external physical properties. See Appendix A, §A.4 for discussion. 24 The function from frequency to pitch in Barks (b) is: 23 b = 13 arctan(0.76f /1000Hz) + 3.5 arctan(f /7500Hz) 2 . CHAPTER 3. AUDIBLE QUALITIES 53 upon which to base a realist account of pitches as properties of sounds. Characteristics of the distribution of energy across ordered frequency ranges are perfectly objective properties that can be ascribed to sounds independently of considerations about perceivers. The physical property of sounds that serves as pitch candidate is thus causally responsible for the activity of the auditory perceptual system that subserves pitch experience. Energy within a particular range of frequency causes activity proportional to that energy in a portion of the auditory system which is tuned to that frequency range. Since that kind of activity is the basis of pitch experience, pitch experiences are perceptual responses to the physical pitch candidate. But since particular pitch experiences are not simply responses to the frequencies of sounds, pitch perception is not just frequency misperception. Pitch is therefore not the “psychological correlate” of frequency. If the physicalist proposal is correct, then certain sounds have pitch in addition to frequency and intensity. Pitch is not identical to frequency, but according to the current proposal, the pitch of a sound is intimately related to its (fundamental) frequency, just as frequency is related to positions over time. Pitch is thus a property that depends upon the frequencies of a sound’s simple components—given a full characterization of the attributes of the frequency ranges to which audition is sensitive, the pitch of a sound can be determined from its frequency constituents. Pitch, that is, can be computed as a complex function of frequency. Zwicker and Terhard’s function thus specifies the relationship between the frequency and the objective pitch of a sound. Pitch, however, is anthropocentric. Humans and creatures like us are sensitive to pitch in virtue of how our auditory systems are arranged, but pitch is not terribly interesting from either a physicist’s point of view or from the perspective of giving the simplest complete characterization of sounds. It may be that pitch perception is more efficient from the standpoint of designing the mechanisms for auditory perception, or is more useful for our needs than frequency perception would be. Neither the fact that pitch is not useful in scientific discussions of sound, nor the fact that only creatures with a given type of sensory apparatus can detect pitch, however, implies that pitches are not real properties of sounds that can be captured in physical terms. The account developed in Appendix A, in terms of weighted energies within an ordering of frequency ranges, can be extended in several important directions. First, to give an account of the pitches of complex sounds; second, to give an account of the musical relations that hold between sounds CHAPTER 3. AUDIBLE QUALITIES 54 in virtue of pitch; third, to explain how timbre and loudness might be objective properties of sounds. The accounts given in Appendix A show that the physical properties upon which the account of pitch is based play a prominent role in demonstrating that the audible qualities need not be considered mere psychological correlates of the objective properties—frequency, spectral composition, intensity—in terms of which sounds are ordinarily characterized. So, according to the alternative, pitch is an objective property of sounds that is not identical with (fundamental) frequency. Pitch is the physical property that disposes sounds to produce experiences as of pitch in suitably equipped perceivers. This is the property in virtue of which sounds that differ in timbre and loudness can be equivalent in “height”, and in virtue of which periodic or musical sounds can be ordered according to the ratio scale obtained by psychophysical methods. Particular sounds have pitch in addition to frequency, though pitch is the more salient property from the point of view of auditory perception. Pitch lacks the naturalness of frequency, and is thus interesting only from an anthropocentric perspective. 3.6 Audible Qualities and the Event View of Sounds I have been speaking as if the audible qualities should be understood in terms of properties ordinarily ascribed to sound waves. Frequency and intensity, in terms of which the critical bands account of Appendix A is developed, are wave properties. The account is thus available to proponents of the Wave View of sounds. But the Event View holds that sounds are events. In particular, sounds are events in which objects or interacting bodies disturb a surrounding medium in a wave-like manner. Since I have claimed that pitch, timbre, and loudness, in addition to frequency and intensity, are properties of sounds, they must be properties of the events I have identified as sounds. There is, however, a clear sense in which frequency and intensity can be ascribed to sound events. First consider frequency as it is ascribed to waves. Frequency is a function of positions and times. The positions of particles in the medium at different times characterize their vibratory motion, which in turn determines the pressure at a point in the medium. Frequency is the number of cycles of motion per second that particles undergo, which we can track by noting their points of maximum displacement. But particle displacements are caused by the activities of the bodies that disturb them, and sounding CHAPTER 3. AUDIBLE QUALITIES 55 bodies can themselves be ascribed a frequency. 25 Since the displacement of a vibrating body immediately causes surrounding particles to be displaced, the frequency with which a sounding body reaches one point of maximum displacement corresponds directly to the frequency of the resulting wave in the medium. (See Figure 3.5: Vibration Transmission). Now, for simplicity we can treat the location of the disturbing event at a time as the surface of interaction between the object and the medium. 26 When tracked through time, this surface itself vibrates with a frequency that corresponds to the vibration frequency of particles that constitute the surrounding medium. Let the frequency of the sound event be defined as the vibration frequency of the disturbance surface. Since wavelength is inversely proportional to wave frequency (frequency equals wavelength divided by the speed of sound waves in the medium), the number of wave peaks that pass through a point in a medium during a given time interval is just the number that were created by the sounding body’s maximum displacement during an earlier time interval of the same duration. 27 The frequency of the sound event—that is, the frequency of the medium-disturbing event—is therefore identical to the frequency of the subsequent wave. So the Event View ascribes to sounds frequencies that match those of their waves in the medium. Pitch is a complex function of frequencies according to the Alternative (Physicalist) View. Since the frequency of the sound event equals that of the resulting wave, and since frequency is itself a function of locations and times, pitch is a complex function of the location of the sound event over time. Sounds then have the same Bark values as the waves they produce. Sound events can therefore be characterized by pitches expressed in Barks. Acoustical measurements of sound level are ordinarily expressed in decibels of intensity level (dB IL = 10 log(I/I 0 )).28 Intensity is defined as the power (the rate at which energy is emitted (P = ∆E/∆t)) passing through a 25 Many sounding bodies have complex vibration patterns with different modes of sinusoidal vibration. In this case, a sounding can be ascribed a complex pattern of frequencies. 26 Cf. Bennett (1988): “The ‘location’ of an event is its spatiotemporal location, i.e. where and when it occurs. . . . A zone may be sizeless along one or more of its dimensions: . . . some fill spatial volumes and presumably others occupy only planes, lines, even points,” p. 12. 27 Frequency is thus independent of the speed at which waves travel in a medium. Where I0 is a reference intensity level. Also note that since intensity is proportional to the square of pressure, converting from decibels of intensity level (dB IL) to decibels of sound pressure level (dB SPL) requires squaring pressure values. 28 CHAPTER 3. AUDIBLE QUALITIES 56 surface per unit area (I = P/A). We can therefore express the intensity of a sound event as the power transmitted per unit area on the surface that gives the sound event’s location. This quantity is, in fact, often cited by physicists interested in sound production. Because the power transmitted through this surface depends upon the specific properties of the surrounding medium, the intensity of the sound event varies with different media. Since the loudness of a sound is a function of its energies or intensities within various critical frequency bands, the loudness of a sound event is medium-dependent. The Event View gives a more satisfactory account of a sound’s loudness than does the Wave View because the Event View captures the phenomenon of loudness constancy. That is, we do not perceive a sound itself to change in loudness when we move away from it—a speaker’s voice does not itself get louder just because I move to the front of the room. I may be in a better position for optimal or comfortable hearing, but the qualities of the sound do not differ with distance from its source. According to the Event View, a sound’s intensity is a matter of the power through the surface that demarcates the disturbing-event’s location. And loudness is a complex function of the sound’s intensity within various critical bands. But since intensity diminishes as the inverse-square of distance from a sound source, the Wave View, which attributes loudness (as a function of intensity) to sound waves, does not capture loudness constancy. According to the Wave View, the loudness of a sound itself changes as the sound moves away from its source. 3.7 The Doppler Effect When a sound source and a subject move relative to each other, the pitch of the sound appears to the subject different from the pitch of the sound when the source and subject are stationary. An approaching train’s whistle sounds higher in pitch than a stationary one, and a receding train’s whistle sounds lower in pitch; a stationary train’s whistle sounds higher or lower pitched when a subject approaches or recedes. The Event View apparently implies that the sound of the whistle remains constant across all such cases— the medium-disturbing event does not evidently change in light of source or subject motion. The Doppler effect comes in two forms. When a subject moves toward (or away from) a stationary source (perceiver-motion Doppler ), he experiences the sound as having a higher (or lower) pitch than when he is stationary. More (or fewer) wavefronts reach the organs of hearing per unit of time and therefore cause an experience as of a higher (or lower) pitch. CHAPTER 3. AUDIBLE QUALITIES 57 The Wave View explains the perceiver-motion Doppler effect nicely. The subject has an illusory or erroneous pitch experience because he moves relative to the stationary source. The Event View can offer a similar explanation. The subject undergoes an illusory pitch experience because information about sounds is transmitted through the medium by waves. The sound of the stationary train’s whistle remains the same despite the subject’s motion. Source-motion Doppler presents a unique problem for the Event View. When a source moves toward (or away from) a stationary subject, the apparent pitch shift differs from that which occurs when the subject moves at the same rate toward a stationary source. 29 According to the Wave View, the sound itself changes in character with source motion since the waves in the medium have a different frequency. Sound waves are actually more tightly bunched as a result of the source’s motion through the medium. The subject thus enjoys a veridical experience of a sound with different qualities. The Event View, which identifies the sound with the disturbing event, cannot readily explain the apparent change in sound quality in terms of wave bunching. The pitch of the sound is determined by the vibratory motion of the surface that defines the event’s location—but that vibratory activity remains constant despite source motion. Event theorists have two options. We can try to accommodate a difference in pitch in the sourcemotion Doppler, or we can deny that either type of Doppler effect amounts to a genuine difference in the sound’s pitch. The latter subsumes both types of Doppler to perceptual illusion. 29 Perceiver-motion Doppler: Suppose a subject moves toward a sound source which emits 1000 waves per second (1000 Hz) at 100 meters per second. Since the speed of sound waves (c) in air is 340 meters per second, the wavelength (λ) of 1000 Hz waves is 0.34 meters (λ = c/f ). So, in one second she will pass 1000 waves plus 100/0.34 waves, and the apparent frequency is 1294 Hz. Source-motion Doppler: Suppose a source emitting 1000 waves per second moves toward a stationary subject at 100 meters per second. After one second, 1000 waves will occupy a distance of 240 meters. This results in a wavelength of 0.24 meters and frequency of 1417 Hz. (Physics-faq/acoustics: Acoustics FAQ). CHAPTER 3. AUDIBLE QUALITIES 58 I opt for the latter.30 We should say that the sound event maintains a stable qualitative character through changes in its position relative to observers—whether due to its own motion or to the motion of observers. The source-motion Doppler is thus an apparent shift in the pitch of a sound. Is it independently plausible that both sorts of Doppler effect are cases of perceptual illusion? Suppose that despite auditory appearances there is a form of constancy judgement for pitch in both sorts of Doppler effect. Such constancy judgements should be present in both the perceiver-motion and the source-motion Doppler. That is, suppose that we withhold endorsement of the claim that a sound shifts in pitch when either a subject or a sound source is in motion with respect to the other, and instead judge that the sound maintains a constant character despite our motion or the motion of the source. It is plausible that such constancy judgements actually occur (and can be elicited) when we hear sounds with moving sources—I don’t actually think that a train’s sound changes when it speeds past the platform. Rather, I’m inclined to think that I hear the train’s sound without distortion when we both are stationary. If this is genuinely plausible, then we have good reason to count both sorts of Doppler effect as perceptual illusions, just as we do when visually-perceived qualities are Doppler “shifted”. Despite the way things visually appear, once we take into account the motions of celestial bodies, we do not judge that the colors of the stars are the ones they visually seem to have. Thus, the Event View implies what is already reasonable, that relative motion is not sufficient to change the audible qualities of a sound. 30 Why? Suppose that the pitch of a sound does change with source motion. We might say that the disturbance event is different in this case because the interaction between source and medium differs as a result of source motion. It is reasonable that the event itself is different because its effects are different. We should resist this for at least two reasons. First, though different causes and effects are sufficient for there to be a different particular event, in this case they are not sufficient to characterize the different qualitative character of the disturbing event. That is, pitch is a quality of sounds, and different effects do not give reason to attribute a difference in pitch to the disturbance event without sufficient intrinsic differences in the sound event itself. Since the disturbance event, and in particular its vibratory motion, does not change in relevant respects when it is in linear motion, we should not say that its pitch changes. Second, even if we did say that the sound event changes with source motion, it would follow that in cases of source-motion Doppler the sound has a different qualitative character at different places on the surface that gives the event’s location. That is, the pitch of the sound at the front of the whistle is different from the pitch of the sound at the rear of the whistle. If motion suffices for pitch change in one direction, it suffices for pitch change in the other. So the character of the medium disturbance varies with location along the event surface. It is hard to see how linear motion alone could justify attributing different pitch qualities to different spatial parts of a sound event. CHAPTER 3. AUDIBLE QUALITIES 3.8 59 Concluding Remarks I have argued that the failure of simple forms of physicalism about the audible qualities does not rule out the prospect of an account according to which pitch is an objective, physical property of sounds. With attention to current accounts of pitch experience, which are framed in terms of responsiveness to sounds with certain physical attributes, we can extract a physical property that both causes pitch experiences and varies with perceived pitch in the right ways. The account also can be extended to capture timbre and loudness. Pitch is thus a property of sounds that is discerned during auditory experiences, but which is not identical with fundamental frequency. Pitch is also not simply the subjective or psychological correlate of frequency; it is an objective property that sounds have in addition to frequency. Frequency may be a more natural property by which to classify sounds from the perspective of the physical sciences, but pitch is an anthropocentrically interesting property to which human hearers are sensitive. Since pitch is a complex property that depends upon the frequency and intensity of a sound, and frequency and intensity can be ascribed to medium disturbance events without damage to those notions, the Event View has available a full account of the audible qualities. CHAPTER 3. AUDIBLE QUALITIES 60 Figure 3.1: Sinusoidal motion. [From Gelfand (1998), Hearing: An Introduction to Psychological and Physiological Acoustics, Third Edition, New York: Marcel Dekker, Figure 1.5, p. 15, with permission.] CHAPTER 3. AUDIBLE QUALITIES 61 Figure 3.2: Fourier composition of a square wave. [From Gelfand (1998), Hearing: An Introduction to Psychological and Physiological Acoustics, Third Edition, New York: Marcel Dekker, Figure 1.12, p. 24, with permission.] CHAPTER 3. AUDIBLE QUALITIES 62 Figure 3.3: Pitch in mels as a function of frequency. [From Gelfand (1998), Hearing: An Introduction to Psychological and Physiological Acoustics, Third Edition, New York: Marcel Dekker, Figure 12.1, p. 354, with permission.] CHAPTER 3. AUDIBLE QUALITIES 63 Figure 3.4: Pitch in Barks as a function of frequency. [From Gelfand (1998), Hearing: An Introduction to Psychological and Physiological Acoustics, Third Edition, New York: Marcel Dekker, Figure 12.2, p. 355, with permission.] 64 CHAPTER 3. AUDIBLE QUALITIES b Bark 0 fl ,fu Hz 0 1 100 2 3 fc Hz b Bark ∆fG Hz 50 0.5 100 150 1.5 100 250 2.5 100 350 3.5 100 200 400 5 510 6 630 450 570 7 8 10 1270 5.5 2000 6.5 140 840 7.5 150 15 2700 16 3150 17 3700 18 4400 1000 8.5 160 19 9.5 10.5 210 1600 11.5 240 20 6400 21 7700 22 9500 1850 12.5 280 23 1480 1720 24 b Bark ∆fG Hz 1850 12.5 280 2150 13.5 320 2500 14.5 380 2900 15.5 450 3400 16.5 550 4000 17.5 700 4800 18.5 900 5800 19.5 1100 7000 20.5 1300 8500 21.5 1800 10500 22.5 2500 13500 23.5 3500 5300 190 1370 fc Hz 2320 120 700 1170 12 13 110 920 1080 11 4.5 770 9 fl ,fu Hz 1720 14 300 4 b Bark 12 12000 15500 Table 3.1: Selected critical bands. Bark value b, lower (f l ) and upper (fu ) frequency limit of critical bandwidths, ∆f G , centered at fc . [Adapted from Zwicker and Fastl (1999), Psychoacoustics: Facts and Models, Second Edition, New York: Springer-Verlag, Table 6.1, p. 159, with permission.] CHAPTER 3. AUDIBLE QUALITIES 65 Figure 3.5: Vibration transmission. [From Gelfand (1998), Hearing: An Introduction to Psychological and Physiological Acoustics, Third Edition, New York: Marcel Dekker, Figure 1.4, p. 14, with permission.] Appendix A The Alternative View A.1 Pitch and Critical Bands What, then, is pitch? The Standard (Subjectivist) View supposes that pitch is a subjective attribute of sensations or experiences because the relationship between frequency and experienced pitch is neither linear nor logarithmic. Consider pitch, expressed in mels, as a function of frequency (recall Figure 3.3). The standard account holds that this reflects the relationship between frequency as an objective property of sounds and pitch as an attribute of experiential states. But I shall claim that nothing about the case should keep us from thinking that both pitch and frequency are objective properties of sounds. The plausibility of this claim depends upon whether we can provide a candidate property that varies as pitch does with frequency and which explains why pitch is related to frequency as it is. The function from frequency to pitch would thus inform us about the relationship between two different properties of sounds, rather than about the relationship between properties of sounds and sensations. Recent accounts of pitch provide insight into both why the standard view is a subjectivist view, and into how an objectivist theory of pitch should be developed.1 Zwicker and Terhardt (1980) have developed an instructive account of the relationship between frequency and pitch. They have derived a function from frequency to subjective pitch, expressed in units called ‘Barks’, which strongly resembles that of the mel scale. 2 (See Figure 3.4: Pitch in 1 The scientific details of discussion to follow are drawn primarily from Zwicker and Fastl (1999), especially Chapters 4 through 7, Gelfand (1998), and Zwicker and Terhardt (1974). 2 The term ‘Bark’ is derived from the name of Barkhausen, an auditory scientist who studied the relationship between loudness and intensity (Zwicker and Fastl (1999), p. 160). 66 APPENDIX A. THE ALTERNATIVE VIEW 67 Barks as a Function of Frequency). The function from frequency to pitch in Barks (b) is: b = 13 arctan(0.76f /1000Hz) + 3.5 arctan(f /7500Hz) 2 . The resulting Bark scale, like the mel scale, preserves perceived ratios and assigns the same magnitudes to pitch differences judged to be equivalent. Now, the Bark scale is based on the notion of a critical frequency band. Critical bands are supposed to be psychologically real entities which explain the perceived pitch of a tone given its frequency and energy properties. A critical band is characterized by a frequency range (its critical bandwidth) around a given center frequency, to which that band is responsive. Evidence for the existence of critical bands comes from experiments involving masking, where one tone is used to interfere with a subject’s ability to perceive or detect the presence of another test tone. Masking experiments provide significant evidence for critical bands because the capacity one tone has to mask another is not independent of their respective frequencies. In particular, a masking tone whose frequency is near to that of a test tone more effectively masks the test tone than does a masking tone whose frequency is farther from that of the test tone. The effectiveness of masking decreases as frequency separation increases. This gives evidence of a sort of frequency selectivity within the auditory processing system. That is, as discussed in §3.2.2, different portions of the auditory system deal with particular frequency ranges. When tones are near in frequency, masking occurs because the sensation produced by one tone interferes with that of the other. As frequency separations increase, different portions of the auditory system carry information about the sounds, so less interference results and masking is less effective. Critical bands depend upon the tonotopic organization of the auditory system. Critical bands are ordinarily thought of as like filters with a roughly triangular shape around a given center frequency. That is, they pass the most energy at the center or characteristic frequency, and pass less energy as a tone deviates from this center frequency. Beyond the critical bandwidth, they pass no energy. According to this model, the auditory system is like a bank of many overlapping filters that are responsive to different frequency ranges and whose outputs determine the qualities a sound is heard to have. Critical bands and their characteristic properties can be discerned by a number of experimental procedures. 3 One such procedure discerns critical bandwidth by determining the minimum bandwidth of noise required to 3 See Zwicker and Fastl (1999), Ch. 6, for a survey of such procedures. APPENDIX A. THE ALTERNATIVE VIEW 68 maximally mask a simple tone at the center frequency. For a test tone at a given center frequency, a narrow band of noise around the center frequency begins to mask the test tone. As the noise band widens, it further masks the test tone until a bandwidth is reached beyond which further noise does not contribute to masking. This is said to be the critical bandwidth. Now, the effectiveness with which noise or a masking tone interferes with the detection of a centered test tone drops off with departure from the center frequency until the critical bandwidth is exceeded and no further masking results. So, the critical bandwidth is that bandwidth of noise that masks a simple tone at a given frequency just as effectively as wideband white noise. Because frequency components farther from the center frequency contribute less to masking than those nearer to the center frequency, each frequency within a critical band can be assigned a weighting with respect to the relative importance to masking of energy at that frequency. From the perspective of understanding pitch, critical bands are very significant for two reasons. First, critical bandwidth varies greatly with frequency. (See Figure A.1: Critical Bandwidth as a Function of Frequency). Critical bandwidth is approximately 100 Hz for frequencies up to approximately 500 Hz, but above 500 Hz a rough estimate of critical bandwidth is 0.2 times the center frequency. Each critical band deals with a unique range of frequencies, and that range increases substantially as center frequency increases. This leads to the second significant (and surprising!) result: critical bandwidth correlates very well with pitch. Critical frequency bands simply correspond to nearly equivalent pitch distances (approximately 100 mels per critical band or one Bark). Whether we take a critical band centered at 1000 Hz (critical bandwidth 160 Hz) or one centered at 5000 Hz (critical bandwidth 1000 Hz), critical bandwidths amount to equivalent pitch distances.4 4 In addition, each Bark corresponds to rough 27 units of the minimum detectable pitch difference or just noticeable different between tones; a unit of just noticeable frequency difference, however, grows substantially as frequency increases. Just noticeable frequency differences range from ∼ 2 Hz at very low frequencies to nearly 200 Hz at high frequencies (Zwicker and Fastl (1999), p. 161). Critical bands also correspond to roughly equal distances along the basilar membrane or cochlear partition (1 Bark ≈ 1.3 mm), whereas equal cochlear distances correspond to increasing frequency ranges from apex to base (0.2 mm ≈ 15–20 Hz at apex, 0.2 mm ≈ 500 Hz at base (Zwicker and Fastl (1999), p. 160)). APPENDIX A. THE ALTERNATIVE VIEW 69 From these two facts, we can characterize the difference in pitch between two frequencies, ∆p(f1 , f2 ), in terms of the function, G(f ), from center frequency to critical bandwidth.5 Z f2 G(x) dx. ∆p(f1 , f2 ) = f1 Pitch differences are a function of the critical bandwidths and center frequencies of the various experimentally determined critical bands. But because the basilar membrane is thought to be an important part of the basis for both critical bands and pitch perception, and because the basilar membrane is organized such that less physical space is assigned to a given frequency range as frequency increases (see footnote 4), researchers posit that there are only a finite number of critical bands whose center frequencies become more widely spaced as frequency increases. That is, each unit of physical space along the basilar membrane subserves the same number of critical bands, but increasing frequency ranges, so the center frequencies of critical bands are separated by increasing frequency differences. Critical band center frequencies decrease in density along the frequency scale. The Bark scale thus assigns one unit to each critical bandwidth. Each unit of Barks corresponds to the same number of centered critical bands. (See Table 3.1: Critical Bands). What’s significant, again, is that the procedure does not begin by determining the different frequency ranges that correspond to equal pitch ranges. Rather, it begins by discerning the masking relationships between tones of different frequencies. These masking relationships show that critical bands exist and make apparent the frequency selectivity of the auditory system. But that critical bands discerned in this way should amount to equal pitch intervals is anything but evident. It is nothing short of surprising that an accurate pitch scale can be obtained simply by assigning one unit of criticalband rate to each experimentally discerned critical frequency band. Zwicker and Terhardt’s Bark scale is thus an expression of pitch in intervals that correspond to the psychoacoustically determined critical bands, and the result strongly resembles that of the mel scale of ratio pitch. The Bark scale yields an expression of subjective pitch as a function of frequency, but also explains this relationship in terms of the psychological and physiological mechanisms of sound perception. Now, according to the “filters” conception of critical bands, the experienced attributes of a sound depend upon the output or critical band level 5 I owe thanks to Adam Elga for discussion of how to express this formulation. APPENDIX A. THE ALTERNATIVE VIEW 70 (in terms of energy) of each of the critical bands in the filter bank. What is distinctive about the critical band levels caused by sounds that are heard to have pitch? How do they differ from those produced by sounds experienced to be unpitched? A wideband white noise whose energy is independent of frequency results in a roughly equivalent critical band level for each of the overlapping critical bands. If the critical band profile for a sound is a plot of the critical band levels that sound produces for each critical band, the critical band profile of white noise is a flat line at some energy level. 6 However, for a narrower band of noise, critical bands within which some portion of that energy falls have greater critical band levels than critical bands within which none of the noise falls. The resulting critical band profile is that of a plateau. But noise has no pitch. Sinusoids are the simplest pitched sounds, and are the constituents of more complex pitched sounds. Sinusoids produce a distinctive critical band profile. A sinusoidal tone at a particular frequency causes a substantially greater critical band level for critical bands centered near that frequency than for any other critical band. The critical band profile for a sinusoid is that of a sharp peak at the critical bands centered closest to its frequency. If the experienced attributes of a sound depend upon its critical band profile, then the apparent pitch of a sinusoid depends upon its producing a maximum critical band level in a very small subset of adjacent critical bands among the many ordered critical bands.7 A proponent of the Standard View might identify pitches with critical band maxima or with something that depends upon such activity. Either way, the pitch that a sound appears to have depends upon the profile of energies across many critical bands. Pitch is thus either identifiable with or the immediate result of a certain type of critical band activation pattern. Since critical bands are internal psychological entities, pitch is an internal subjective property. So the Standard View posits internal psychological entities—critical bands— to explain, among other things,8 the dependence of masking on frequency separation. A function from frequency to subjective pitch can be derived just by taking into account the experimentally determined attributes of the sequence of critical bands. The pitch of a sinusoidal tone depends upon the pattern of critical band activations it produces. The critical band profile associated with a sinusoidal tone is that of a peak critical band level in a 6 But see §A.4 for an amendment required in light of loudness phenomena. Indeed, very narrow band noise is often heard to have a pitch, in particular for noise bands at high frequency. 8 Critical bands also figure prominently in accounts of loudness. See §A.4. 7 APPENDIX A. THE ALTERNATIVE VIEW 71 single critical band or in a few adjacent critical bands. Thus, subjective or psychological pitch depends upon a critical band’s having a significantly greater level of activity than the others. This critical band’s location in the ordering of critical bands determines the Bark value associated with the tone. But something analogous to critical band levels or profiles can also be ascribed to sounds themselves. If that is correct, then pitch can be seen as an objective property that sounds possess. We begin with the straightforward recognition that critical bands are themselves characterized in terms of a center frequency, a bandwidth, and a scaling factor for energy at each frequency within the critical bandwidth. But frequency and energy or intensity are properties of sounds. So if we take a critical band to be a simple frequency range around a given center frequency, then sounds themselves fall within critical bands in virtue of their frequency characteristics. Given the scaling factor associated with each frequency within a critical band, a critical band can be characterized as a class of pairs that consist of a frequency and an energy weighting factor. The highest energy factor determines the center frequency of the critical band, and energy weighting factors decrease with distance from the center frequency. Now we can ascribe to every sound a critical band profile in the following manner. Ascribe to the sound a critical band level for each critical band. The critical band level of a sound, for a given critical band, is the sum of the products of the sound’s energy at a frequency and the scaling factor for that frequency, for each frequency within the critical band. The critical band profile for a sound is given by ordering the critical band levels for all critical bands; it specifies the amount of energy that the sound has within each of the fully characterized critical bands. Once we determine the critical band profile of a sound, we have a basis from which to determine its various attributes, including its pitch, loudness, and timbre. As before, for a sinusoidal tone to have a particular pitch is for it to have a single critical band maximum within its critical band profile. The critical band with the peak critical band level determines the sound’s pitch in Barks. Pitch is thus an objective though complex property of sounds. Having a pitch is not a simple matter of having a particular frequency, though frequency determines pitch given a specification of critical bands. The account depends upon rejecting the claim that pitches, and critical band profiles in general, are just properties of the auditory perceptual systems of subjects. Rather, the alternative holds that pitches and critical band levels are physical properties of sounds that our auditory apparatus is suited to detect. APPENDIX A. THE ALTERNATIVE VIEW 72 Activity internal to our perceptual systems reflects the presence of energy within various critical bands; that is, various parts of our auditory physiology, and the psychological mechanisms they ground, are tuned to energy within different frequency ranges. So the property of having energy within the specified critical bands is the property discerned in sound perception. Having a critical band level sufficiently greater than other critical band levels amounts to having a pitch. An analogy with a device that determines the acceleration of an object illustrates the essence of the contrast between the Standard View and the alternative I have proposed. Acceleration is a property that an object has which can be calculated as a function of its position over time—acceleration depends upon positions and times, which look to be the simplest observable properties of an object that characterize its activity. Now, we are not inclined to say that the acceleration detector measures a property whose nature depends upon the construction of the device, in the sense that there could be no such property if the device did not exist. Once we recognize that some property of the object corresponds to the device’s output, there is no temptation to say that the calculated acceleration is a property of the detecting device or that acceleration metaphysically depends upon detecting devices. Acceleration is a property of the object which the device is designed to detect. Similarly, once we recognize that critical band properties of sounds correspond to pitch experiences, there is little temptation to identify pitch as the mere subjective correlate of frequency. Pitch experiences track a property of sounds that varies with frequency. 9 Pitch is thus a complex property of sounds that depends upon their patterns of critical band levels. It is given by a complex function of the frequencies at which a sound has energy, and the function from frequency to pitch in Barks expresses the pitch of a sound itself—not merely a sensation. Since this function is derived from the attributes of the critical bands, and because critical band properties can be ascribed naturally to sounds, pitch is a property of sounds that is related to but not identical with frequency. A.1.1 The Pitch of Complex Tones The foregoing account of pitch in terms of critical band energies applies straightforwardly to simple sinusoidal tones, for which a single critical band has a maximum value. But complex sounds with many sinusoidal con9 Of course, critical band properties may not be of interest to physicists in the way that acceleration is. They are, however, of interest to psychophysicists and to those who wish to understand the distal physical causes of pitch experiences. APPENDIX A. THE ALTERNATIVE VIEW 73 stituents also have pitch. The standard account of pitch perception for complex tones has it that fundamental frequency is determined through analysis of a tone’s Fourier frequency components. How can the alternative account I have sketched explain pitch perception for complex tones without adverting to fundamental frequency perception? Recent accounts of pitch perception posit a complicated analysis of constituent frequencies to extract fundamental frequency. I shall consider one such model of pitch perception—that of Terhardt, e.g., (1980), (1982a), and (1982b)—and argue that it can be extended to the critical bands account of pitch developed above. The result is that pitch perception for complex sounds involves determining a critical band value in Barks; but the critical band value itself corresponds roughly to fundamental frequency. Complex sounds have more widely distributed critical band energy patterns than do simple sinusoids, but complex periodic tones still exhibit local maxima at several critical bands. Pitches are therefore best conceived as types of critical band profiles. A sound whose critical band profile has many separate peaks corresponding to its sinusoidal constituents can be a member of the same pitch as one whose critical band profile has a single peak. The first problem is how to determine which complex sounds are members of the same pitches as sounds that have single-peaked critical band profiles. A simple answer is that they share a fundamental frequency, but we would like an answer framed in terms of aspects of critical band profiles. The second problem is how, if we are not perceiving fundamental frequency (because we do not perceive frequency at all), we come to perceive simple and complex sounds as having the same pitch. I shall suggest that Terhardt’s account provides the materials to deal with both problems. Call each local maximum a pitch determinant with a particular Bark value. Each pitch determinant corresponds to a particular frequency; for harmonic complex sounds—ones with pitch—these frequencies are integer multiples of each other. No such simple relationship exists, however, between the corresponding Bark values of pitch determinants. For instance, a complex harmonic sound with 200 Hz fundamental frequency and harmonics at 400 Hz, 600 Hz, and 800 Hz (or f, 2f, 3f, 4f) has pitch determinants with Bark values of roughly 2 Barks, 4 Barks, 5.75 Barks, and 7.2 Barks. Terhardt’s account of pitch perception relies on learned “templates” for particular harmonic sequences, which are formed on the basis of regular experience with harmonic complexes, in particular those which are predominant in APPENDIX A. THE ALTERNATIVE VIEW 74 speech.10 We form the ability to recognize particular harmonic complexes, such as (150 Hz, 300 Hz, 450 Hz, 600 Hz, . . . ), through their spectral constituents. We thereby acquire internal templates with such constituents as components. But each component of a template has a corresponding value in Barks, so the templates can themselves be considered as sequences of pitches in Barks (e.g., (B1, B2, B3, B4, . . . , Bn)) as opposed to frequencies (with the form (f, 2f, 3f, 4f, . . . , nf)). Templates construed in such terms amount to sets of critical band maxima. Each constituent of a complex tone is a member of various templates in which it occurs as an upper harmonic. Terhardt’s claim is that an analysis on the subharmonics of each constituent of a complex tone determines the pitch that tone is perceived to have. 11 The analysis proceeds as follows. For each of the n constituents of a complex tone, the first eight subharmonics of fn are given by the fundamentals of the first eight harmonic sequences or templates of which fn is an upper harmonic.12 Consider the complex sound with constituents at 200 Hz, 400 Hz, 600 Hz, and 800 Hz. The subharmonics for 200 Hz are first determined: 100 Hz is the second subharmonic (1:2) of 200 Hz since 200 Hz is the second upper harmonic in the sequence (100 Hz, 200 Hz, 300 Hz, . . . ); 66.7 Hz is the third subharmonic (1:3) since 200 Hz is the third upper harmonic in the sequence (66.7 Hz, 133.3 Hz, 200 Hz, 266.7 Hz, . . . ). The final five subharmonics of 200 Hz are given by the ratios 1:4–1:8. Subharmonics are then determined for each of the complex tone’s constituents. A complex periodic sound’s constituents share various subharmonics. Call a shared subharmonic a pitch candidate. Though several subharmonics may coincide for two or more of the constituents, and therefore determine several pitch candidates, the pitch candidate shared by the largest number of constituents determines the perceived pitch of a complex sound. A comparison across subharmonics reveals that the fundamental frequency is shared as a subharmonic by the most constituents. That is, 200 Hz, 400 Hz, 600 Hz, and 800 Hz all share only 200 Hz as a subharmonic. Thus, 200 Hz determines the perceived pitch of the complex sound. Sometimes other pitch candidates are selected as “the pitch” of a complex sound, and often 10 Such learning is thought to take place very early in life, and probably begins in the womb. See §A.2 for discussion of the basis of template formation—of why such templates are acquired. 11 I will adopt, for simplicity, the non-standard convention of including a tone itself among its subharmonics and upper harmonics. 12 So, for fn , the first eight subharmonics are given by gm where fn = 1g1 , fn = 2g2 , fn = 3g3 , . . . , fn = 8g8 . APPENDIX A. THE ALTERNATIVE VIEW 75 other pitch candidates are discernible within the complex sound. However, a complex sound is most commonly pitch-matched with a sinusoid at its fundamental frequency. This analysis explains both the “spectral” pitch of complex sounds with a constituent at the fundamental frequency, and the “virtual” pitch of sounds with a missing fundamental. I claim that this analysis can be carried out with templates consisting not of frequencies in whole-number multiple relationships, but of pitch values in Barks which are derived from the critical bands model. Suppose we have a complex tone consisting of constituents at 2 Barks, 4 Barks, 5.75 Barks, and 7.2 Barks. Each of these constituents appears in learned templates derived from experience with complex harmonic sounds. Just as with the frequency model, coincidence among the subharmonics of the pitch determinants gives the pitch candidates and the pitch of the complex sound (which corresponds roughly to the fundamental frequency). Since templates are just representations of sequences of simple pitches, the pitch of a complex sound can be seen as settled by the pitches of its constituents and their subharmonics, which are given by simple harmonic sequences of critical band values. Having a particular pitch, for a complex sound, is a matter of having critical band maxima that substantially coincide in their membership in other salient (harmonic) sequences of simple pitches (alternatively, salient critical band profiles). Thus, though the pitch value of a complex sound has a corresponding frequency value that roughly equals the sound’s fundamental frequency, what we perceive is its pitch. The point of the subharmonics analysis is to determine which harmonic sequence each of the complex sound’s constituents are most likely to be members of. Pitches, then, can be seen as types of critical band profiles whose members exhibit a particular kind of subharmonic sharing. The subharmonic shared determines the pitch value in Barks. Though pitch perception for complex sounds may be a quite complicated affair, it need not rely on determining fundamental frequency. The alternative account holds that pitch is an objective, physical property of sounds. The motivation for construing pitch as a subjective or psychological quality stemmed from the discrepancy between frequency and perceived pitch. Given that sounds indeed have the properties I have identified as the pitch properties, and that these properties are causally responsible for what we take to be veridical pitch experiences, explanation of pitch perception need not advert to mind-dependent properties. Pitches as the alternative characterizes them are not, however, terribly interesting from the point of view of the physical sciences. But they are salient from the perspectives of auditory experience and the mechanisms by which we hear APPENDIX A. THE ALTERNATIVE VIEW 76 sounds. Pitch is thus an anthropocentric property. 13 But the property of having a pitch—that is, having or determining a maximal energy within a given member of an ordering of frequency ranges—is neither mental nor mind-dependent in any important respect. A.2 The Musical Relations The musical relations—octave, fifth, fourth, et al.—are commonly analyzed as small whole-number frequency ratios between tones. But the octave, fifth, et al., seem to be relations sounds bear to each other in virtue of their pitch. If pitch is not just frequency, we need an account of the musical relations that captures their distinctive features in terms of pitch. This need is pressing. If we do not directly perceive frequency, but only pitch, as a property of sounds, how do we perceive octave relations if not by perceiving pitch relations? Tones that stand in musical relations share certain affinities with each other. The octave is the limit case of affinity, in that tones that stand in the octave relation are “the same” in some sense while still different in pitch height.14 Tones separated by a fifth share more affinity than tones in other musical relations, but less affinity than octave-related tones. An account of the musical relations should explain the relevant respects of affinity or sameness. So far I have been speaking of the pitch ordering as a simple linear ordering from lower to higher. This can be adapted to incorporate the musical relations. Imagine twisting the line into a helix around a vertical axis so that tones separated by an octave—tones that are “the same” in the relevant respect—have the same angle of rotation around the vertical axis. So, for example, all of the C’s fall on a vertical line in ascending order of height. According to this representation of pitch, the same notes at different pitch heights have the same angular position, and each rotation of the helix corresponds to an octave. We want to know why certain sounds that differ in pitch fall on a particular vertical line around the helix—that is, we want to know the respect in which tones separated by an octave are the same. Since pitch is explicated in terms of critical bands, we want to know how that re13 Just as colors are anthropocentric properties according to the view that colors are types of surface spectral reflectances. See, e.g., most recently, Bradley and Tye (2001), and Byrne and Hilbert (forthcoming in BBS ). 14 It bears restating that the octave is not a doubling in pitch, though those with musical training sometimes apply the standard frequency analysis of the octave and mistakenly take it to be a doubling of pitch. APPENDIX A. THE ALTERNATIVE VIEW 77 spect of similarity can be gleaned from the critical bands explication of pitch. Three empirical facts warrant mention. First, octave judgements on average diverge slightly from 2:1 frequency ratios. It follows that normal subjects either misperceive frequency ratios or perceive as the octave something other than small whole-number frequency ratios. Second, subjects make far fewer correct octave judgements for tones that are separated by more than an octave or two. Finally, octave judgements concerning sinusoidal tones are much more difficult to make than octave judgements for complex tones. Complex sounds, recall, have multiple pitch determinants corresponding to local maxima among critical band values. The pitch-determining analysis for harmonic complex sounds yields multiple pitch candidates given by coinciding subharmonics of the pitch determinants, but the subharmonic shared by the most pitch determinants is the pitch. The analysis into subharmonics depends upon the notion of templates formed on the basis of experience with harmonic sounds, primarily as they occur in speech. The question arises, Why these particular templates? It may be that brute and simple repetition is responsible for our acquiring templates with pitch values corresponding to frequency multiples (f, 2f, 3f, 4f, . . . ). A much more plausible explanation is that the elements of such templates, when they occur simultaneously, have a high degree of what researchers have described as ‘sensory consonance’.15 There are several factors that contribute to sensory consonance, including the phenomena described as “roughness”, “sharpness”, and “tonalness”. Details aside, this is significant because roughness, sharpness, and tonalness are each given an account in terms of critical band energies. So, Zwicker and Fastl state with respect to roughness: It is assumed that our hearing system is not able to detect frequency as such, and is only able to process changes in excitation level or in specific loudness at all places along the critical-band rate scale; thus the model for roughness should be based on the differences in excitation level that are produced by the modulation [which results in roughness]. . . . Because masking level is an effective measure for determining excitation, it can be used to estimate the excitation-level differences produced by amplitude modulation.16 Similarly, regarding sharpness: 15 16 See, e.g., Zwicker and Fastl (1999), Gelfand (1998), and Terhardt (1974). Zwicker and Fastl (1999), p. 261. APPENDIX A. THE ALTERNATIVE VIEW 78 It is helpful in developing a model of sharpness to treat sharpness as being independent of the fine structure of the spectrum; the overall spectral envelope is the main factor influencing sharpness. In Chaps. 6 and 8, it was shown that the spectral envelope is psychoacoustically represented in the excitation level versus critical-band rate pattern, or in the specific loudness versus critical-band rate pattern. For narrow-band noises . . . sharpness increases with critical-band rate for center frequencies below about 3 kHz (16 Bark). The increment . . . increases strongly for higher critical-band rates.17 The point is that roughness and sharpness are fully explicable in terms of energy within particular critical bands, so that sensory consonance depends entirely on the amount and distribution of energy across various critical bands. Particular harmonic templates can therefore be expressed as sequences of critical band values that have a high degree of consonance. Again, complex harmonic sounds often have multiple pitch candidates which result from coincidence of subharmonics. In some cases a pitch candidate other than one corresponding to the fundamental frequency is even identified as the pitch of the sound. But subsequently-presented complex tones that stand in the octave relation have several pitch candidates in common, and indeed have many subharmonics in common beyond those that constitute pitch candidates.18 Given that alternate pitch candidates for a single complex sound are actually perceptually discriminable (with proper direction of attention), tones that stand in the octave relation have a substantial degree of perceptible commonality. Contrast the octave with the fifth. Two tones related as fifths have fewer pitch candidates in common, and also have fewer subharmonics in common beyond those pitch candidates.19 But the affinity between notes separated by a fifth can be seen to 17 Zwicker and Fastl (1999), pp. 241–242. Take two complex tones with fundamental frequencies of 200 Hz and 400 Hz (with corresponding pitch values of 2 and 4 Barks) and exactly four additional upper harmonics each: 400 Hz, 600 Hz, 800 Hz, 1000 Hz; and 800 Hz, 1200 Hz, 1600 Hz, 2000 Hz, respectively. These two tones have in common pitch candidates at 100 Hz, 133 Hz, 200 Hz, and 400 Hz. In addition, they each have subharmonics at 57.1 Hz, 66.7 Hz, 80 Hz, 114.3 Hz, 150 Hz, 250 Hz, 266.7 Hz, 300 Hz, 333.3 Hz, and 500 Hz. (Each of these frequency values in Hz can be converted to a pitch value in Barks). 19 Suppose we consider the previously-mentioned complex tone with 200 Hz fundamental and a subsequent complex tone with 300 Hz fundamental frequency (each with their first four additional upper harmonics: 400 Hz, 600 Hz, 800 Hz, 1000 Hz; and 600 Hz, 900 Hz, 1200 Hz, 1500 Hz, respectively). These two complex tones have in common, from among their respective groups of subharmonics, pitch candidates only at 100 Hz and 200 Hz. 18 APPENDIX A. THE ALTERNATIVE VIEW 79 stem from their common pitch candidates and a fair number of common subharmonics. This similarity, however, is less pronounced than that between octave-related tones. The respect in which octave-related tones are the same is that they have a high degree of coincidence among their pitch candidates and share a substantial number of subharmonics. 20 Two differently pitched tones therefore stand in the successive octave relation just in case they share the maximum number of pitch candidates possible given the pitch determinants each contains. This sharing results from coincidence among the subharmonics of their respective constituents. Tones stand in the relationship of fifths when their respective subharmonics exhibit a given pattern of sharing with less coincidence. The account can be extended to capture the relative similarity of tones standing in the other musical relations. This avoids the need to posit a basic relation for each of the musical relations. Accounting for the musical relations in terms of shared pitch candidates and subharmonics explains the empirical facts mentioned above. First, the account depends upon positing templates that consist of critical band values, which templates are already needed to explain pitch perception for complex sounds. Template development most likely depends not simply upon frequency ratios, but upon more basic perceptible aspects of tones that are themselves explained in terms of critical bands. Discrepancies between the perceived octave and 2:1 frequency ratios occur because the critical band values that constitute particular templates need not correspond exactly to harmonic sequences. Next, multi-octave judgements are more difficult than single-octave judgements because octave perception depends upon subharmonic coincidence. Complex tones separated by multiple octaves simply have fewer of their first eight subharmonics in common as the number of octaves increases. Octave judgements for sinusoidal tones are, for a similar reason, more difficult to make than octave judgements for complex tones. Simple tones have fewer subharmonics to compare and share no pitch candidates. In sum, successive octave-related tones share pitch candidates in virtue of having in common many of their constituents’ coinciding subharmonics. This sharing is thus a matter of the two tones’ having in common memThey also have in common subharmonics at 50 Hz, 85.7 Hz, 120 Hz, 150 Hz, 300 Hz, and 400 Hz. There are simply fewer opportunities for coincidence than with octave-related tones. 20 Simple sinusoidal tones are thought by researchers to stand in musical relations only in a sense that is parasitic upon the musical relations of complex tones. See Gelfand (1998), p. 356. APPENDIX A. THE ALTERNATIVE VIEW 80 bership in salient harmonic templates. The templates are themselves determined by perceptible aspects of sounds, such as tonality, roughness, and sharpness, that constitute sensory consonance. Each of these attributes is explicable in terms of the critical band energies of various types of sounds. The templates are acquired through experience with harmonic sounds— speech, in particular—but reflect genuinely objective characteristics of sounds. So, sharing subharmonics is a matter of belonging to the same simple pitch sequences, which are characterized by the consonance of their elements. The octave is the maximum degree of such sharing among sounds that differ in pitch. Recall the pitch helix. When pitch height is expressed in terms of Bark values, the octave relation—the relation that exists between successive pitches with the same angular position—is determined by common membership in particular harmonic sequences. Such harmonic sequences are determined by characteristics of critical band energy profiles and explain the pitch of complex sounds. The critical bands account of pitch therefore explains the affinities of tones that stand in musical relations. A.3 Spectrum Shift Spectrum inversion scenarios are familiar from discussions of color and color perception.21 Audition presents an actual instance of a less drastic qualitative shift. Pitch discrimination depends upon activity at various places along the basilar membrane which is transduced by hair cells into electrical impulses along the cochlea. Cochlear implants are designed to replace damaged hair cells by stimulating cochlear nerves just as the basilar membrane would. A processor first analyzes the stimulus into various critical band energies and then presents electrical impulses of appropriate strengths to tuned nerves. But electrodes often are not placed with exact precision, so the implant delivers impulses to nerves that are tuned to inappropriate frequency bands. The result is that sounds seem higher or lower in pitch than they do to normal perceivers. Suppose Shifty has undergone cochlear implant surgery after losing his hearing due to hair cell damage. Sounds at 10 Barks (1270 Hz) now seem to him the same in respect of pitch as sounds at 12 Barks (1720 Hz) used to seem when his hearing was normal. Suppose Norm has undamaged normal hearing. Sounds at 10 Barks seem to Shifty the way sounds at 12 Barks seem to Norm. Now imagine that Shifty was completely deaf from birth before receiving cochlear implants. 21 See, e.g., Hardin (1988), Block (1990), and Byrne and Hilbert (1997). APPENDIX A. THE ALTERNATIVE VIEW 81 Pitch shift due to cochlear implants presents a physicalist about pitch with two challenges. I shall consider them in turn. First, the physicalist must make it plausible that Shifty misperceives the pitches of sounds. For suppose that Shifty and Norm are both presented with a sound at 10 Barks. An intuitive description of the case is that (i) Shifty and Norm have different pitch experiences in response to the very same sound. If (ii) neither of them mistakenly perceives the pitch of the sound, then (iii) pitch must not be the physical property I have suggested—since pitch and Barks come apart, pitch is not what is given by Bark values. Why accept (ii), that both Shifty and Norm have veridical pitch experiences when presented with a 10 Bark tone? Given that it is natural to say that Shifty indeed misperceives the pitches of sounds, (ii) is only plausible if we already hold a version of internalist subjectivism about pitch according to which pitch can vary independently of external physical stimulus, or if sounds have multiple pitches capable of being discerned by different perceivers. But if we accept that pitch is the property of sounds that is responsible for the pitch experiences of ordinary hearers, then Shifty simply misperceives pitch. Couldn’t most of the population be fitted with cochlear implants, thereby making the notion of a “normal hearer” useless? Not so. ‘Normal’ in this context does not mean simply “common”. Rather, it refers to the usual human means of sound perception. The cochlea is suited to detect the presence of energy within various portions of the audible spectrum, but cochlear implants alter the ordinary causal processes involved in audition, and therefore contribute to abnormal pitch perception. Once we rule out skepticism about whether those with auditory perceptual systems quite like our own enjoy pitch experiences that are similar in character to ours, pitches are just the properties of sounds responsible for those common experiences. Abnormal pitch perception is characterized by deviance from such common pitch experiences. Since Shifty’s experiences exhibit such deviation, he misperceives the actual pitches of sounds. This suggests a second argument. Despite the different way in which Shifty’s experiences are caused, they are caused by sounds with particular Bark values in a reliable way. In fact, since Shifty’s auditory experiences are reliably caused by sounds with particular Bark values, and since he uses the language of pitches just as we do, we might think that his thoughts and experiences track just the same pitches that Norm’s do. If so, he does not misperceive the pitches of sounds since his experiences purportedly represent the very same property that Norm’s do. Sounds at, for example, 10 Barks just produce in them experiences with different subjective characters. If so, APPENDIX A. THE ALTERNATIVE VIEW 82 either (i) is false and Norm and Shifty enjoy the same pitch experiences, or pitch experience is more than just a matter of representing an external physical property. Shifty does not, however, perform the same as Norm does in various psychophysical experiments. Importantly, he performs systematically differently on masking experiments used to determine critical bandwidths. While for Norm a band of noise 200 Hz wide maximally masks a tone of 10 Barks (1270 Hz), Shifty requires a band 260 Hz wide to mask the same tone. Likewise, Shifty’s just noticeable frequency differences differ from Norm’s throughout the audible spectrum. Shifty and Norm even make different judgements about the musical relations among tones. The upshot is that since Norm’s critical band sensitivity differs from Shifty’s, their pitch experiences track different properties of sounds. When Shifty hears a sound at 10 Barks (1270 Hz), he perceptually responds to the presence of energy in a larger frequency band than Norm does. Shifty’s pitch experience for a 1270 Hz tone reflects an energy maximum in a 260 Hz wide critical band. Since the pitch a sound is heard to have depends upon its having a critical band maximum within a particular ordering of bands, and since Shifty responds to sound energies within differently centered critical bands, he and Norm hear different pitch properties. But the properties Shifty detects are analogous in structure to those that ordinary perceivers detect. Shifty’s pitch experiences in response to 1270 Hz tones and Norm’s pitch experiences in response to 1720 Hz tones both reflect the presence of energy within a 260 Hz critical bandwidth. So Shifty misperceives pitches because, in light of his miswiring, his 260 Hz wide critical band is miscentered. Shifty and Norm indeed have different pitch experiences, and the difference in Shifty’s pitch experience is captured by his representing a different property of sounds. Why are the critical band properties discriminated by Shifty not equally deserving of the name ‘pitch’ ? Creatures could have evolved, after all, with auditory systems responsive to those properties of sounds. Would they not then have counted as experiencing pitch? Pitches are anthropocentric properties on the physicalist account I have proposed. They are physical properties of sounds that interest us in light of our perceptual capacities. It may be that other creatures could discern properties of sounds quite similar to our pitches, but those would not be our pitches. They would be remarkably similar, and ordered critical bands would explain why their pitch-like properties exhibit a relational structure quite like ours. But they would not be the same properties of sounds that are responsible for our experiencing sounds to have pitch. APPENDIX A. THE ALTERNATIVE VIEW A.4 83 Timbre and Loudness The critical bands account of pitch can be extended to timbre and loudness. Since timbre depends upon the spectral constituents of a sound regardless of phase, the specific timbric quality of a sound is determined by its distribution of energy across all critical bands. Though two sounds may share a pitch, their individual characters are constituted by the presence of energy within different critical bands. Timbre is a matter of a sound’s critical band profile. Loudness poses two unique complications for a theory of the audible qualities. First, sound intensity (in dB) correlates only loosely with perceived loudness. For instance, the loudness of a 40 dB tone at 4.9 Barks (500 Hz) is the same as that of a 60 dB tone at 0.5 Barks (50 Hz). (See Figure A.2: Equal Loudness Contours). Second, two tones of equal intensity that are separated by less than a critical bandwidth have a combined loudness proportional to the sum of their individual intensities. So two 60 dB tones at 8.5 Barks (1000 Hz) match the loudness of a single 63 dB tone at 8.5 Barks (10 log(2I/I 0 ) = 3 dB increase). This continues to be the case as the frequency difference between the two test tones is increased up to the critical bandwidth (160 Hz). However, with frequency separations greater than one critical bandwidth, the combined loudness of the two tones increases gradually until, at a separation of 2000 Hz, their combined loudness matches a single tone of 70 dB—roughly double the loudness of a 60 dB tone. With large frequency differences, the combined loudness of two tones is given by the sum of their individual loudnesses. Furthermore, the loudness of uniform noise is 3.5 times that of an 8.5 Bark (1000 Hz) tone of the same intensity level. Zwicker and Fastl conclude from this that critical bandwidth plays an important role not only for the loudness of noises of different bandwidth, but also for the loudness of two-tone complexes as a function of the frequency separation of the tones.22 The critical bands account of pitch can thus be extended to provide an account of the objective loudness of a sound. The weighting of energies within particular critical bands must accommodate the relative contribution to loudness made by sounds with energy in those critical bands. Indeed, the specific loudness of a narrow-band tone, relative to other tones with the same intensity, depends upon its weighted energy within an appropriate critical 22 Zwicker and Fastl (1999), p. 212. APPENDIX A. THE ALTERNATIVE VIEW 84 band. The total loudness of a tone complex can then be understood as an integration of specific loudness over all critical bands. 23 Two tones that fall within a critical band have less energy over all critical bands, and therefore less loudness, than equivalent tones separated by one or more critical bands. The critical bands account thus provides an account of loudness that lends intelligibility to empirical results which confound simpler views. 24 23 See Zwicker and Fastl (1999), Chapter 8, “Loudness,” for a full explication. Thanks, in particular, to Adam Elga, Gilbert Harman, Gideon Rosen, and Jeffrey Speaks for discussion and comments on the material in Chapter 3 and Appendix A. 24 APPENDIX A. THE ALTERNATIVE VIEW 85 Figure A.1: Critical bandwidth as a function of frequency. [From Zwicker and Fastl (1999), Psychoacoustics: Facts and Models, Second Edition, New York: Springer-Verlag, Figure 6.8, p. 158, with permission.] APPENDIX A. THE ALTERNATIVE VIEW 86 Figure A.2: Equal loudness contours. [From Zwicker and Fastl (1999), Psychoacoustics: Facts and Models, Second Edition, New York: SpringerVerlag, Figure 8.1, p. 204, with permission.] Bibliography [1] Aristotle. (1987). A New Aristotle Reader. J.L. Ackrill (ed.). Princeton: Princeton University Press. [2] Armstrong, David M. (1961). Perception and the Physical World. London: Routledge and Kegan Paul. [3] Bennett, Jonathan. (1988). Events and Their Names. Oxford: Clarendon. [4] Berkeley, George. (1975). Berkeley, Philosophical Works. M. R. Ayers (ed.). London: Dent. [5] Blauert, Jens. (1997). Spatial Hearing: The Psychophysics of Human Sound Localization. Cambridge, MA: MIT Press. [6] Block, Ned. (1990). “Inverted Earth.” In James Tomberlin (ed.), Philosophical Perspectives 4. Atascadero, CA: Ridgeview. [7] ——. (1995). “On a Confusion about a Function of Consciousness.” Behavioral and Brain Sciences 18: 227–247. [8] Boghossian, Paul A. and Velleman, J. David. (1989). “Colour as a Secondary Quality.” Mind 98: 81–103. [9] ——. (1991).“Physicalist Theories of Color.” Philosophical Review 100 (1991): 67–106. [10] Bradley, Peter and Tye, Michael. (2001). “Of Colors, Kestrels, Caterpillars, and Leaves.” Journal of Philosophy 98: 469–487. [11] Breen, N., Caine, D., Coltheart, M., Hendy, J., and Roberts, C. (2000). “Toward an Understanding of Delusions of Misidentification: Four Case Studies.” Mind and Language 15: 74– 110. 87 88 BIBLIOGRAPHY [12] Bregman, Albert S. (1990). Auditory Scene Analysis: The Perceptual Organization of Sound. Cambridge, MA: MIT Press. [13] Byrne, Alex. (2001). “Intentionalism Defended.” Philosophical Review 110: 199–240. [14] ——. (forthcoming). “Color and Similarity.” Philosophy and Phenomenological Research. [15] ——. (ms). manuscript. “Colors and Dispositions.” Unpublished [16] Byrne, Alex, and Hilbert, David (eds.). (1997). Readings on Color, Volume 1: The Philosophy of Color. Cambridge, MA: MIT Press. [17] ——. (forthcoming BBS). “Color Realism and Color Science.” Behavioural and Brain Sceinces. [18] Carlile, Simon. (1996). Virtual Auditory Space: Generation and Applications. Austin, TX: R.G. Landes. [19] Davidson, Donald. (1970). “Events as Particulars.” In Essays on Actions and Events. Oxford: Clarendon Press, 1980. [20] Dretske, Fred. (1995). Naturalizing the Mind. Cambridge, MA: MIT Press. [21] Fara, Michael. (2001). Dispositions and Their Ascriptions. Princeton University Doctoral Thesis. [22] Gelfand, Stanley A. (1998). Hearing: An Introduction to Psychological and Physiological Acoustics. 3rd ed. New York: Marcel Dekker. [23] Hardin, C.L. (1988). Color for Philosophers: Unweaving the Rainbow. Indianapolis: Hackett. [24] Harman, Gilbert. (1990). “The Intrinsic Quality of Experience.” In James Tomberlin (ed.), Philosophical Perspectives 4. Atascadero, CA: Ridgeview. BIBLIOGRAPHY 89 [25] Helmholtz, Hermann. (1954). On the Sensations of Tone . New York: Dover. (Translated conforming to the Fourth German Edition of 1877 by Alexander J. Ellis). [26] Kim, Jaegwon. (1973). “Causation, Nomic Subsumption, and the Concept of Event.” Journal of Philosophy 70: 217–236. [27] Lewis, David. (1986). “Events.” In Philosophical Papers, Volume II. New York: Oxford University Press. [28] ——. (1997). “Finkish Dispositions.” The Philosophical Quarterly 47: 143–58. [29] Locke, John. (1975). An Essay Concerning Human Understanding. Peter Nidditch (ed.). Oxford: Clarendon Press. [30] Lycan, William. (1996). Consciousness and Experience. Cambridge, MA: MIT Press. [31] Martin, C.B. (1994). “Dispositions and Conditionals.” The Philosophical Quarterly 44: 1–8. [32] McGinn, Colin. (1983). The Subjective View: Secondary Qualities and Indexical Thoughts. Oxford: Clarendon. [33] Mestre, J., Mullin, W.J., and Gerace, W.J. (1998). “Course Notes for Physics 114, Theory of Sound with Applications to Speech and Hearing Science.” http://www-unix.oit.umass.edu/~phys114. [34] Nudds, Matthew. (2001). “Experiencing the Production of Sounds.” European Journal of Philosophy 9: 210–229. [35] Pasnau, Robert. (1999). “What is Sound?” Quarterly 49 (1999): 309–324. Philosophical [36] ——. (2000). “Sensible Qualities: The Case of Sound.” Journal of the History of Philosophy 38 (2000): 27–40. [37] Peacocke, Christopher. (1984). “Colour Concepts and Colour Experience.” Synthese 58: 365–382. [38] Physics-faq/acoustics: Acoustics FAQ, http://ear.berkeley.edu/acoustics.html. BIBLIOGRAPHY 90 [39] Rosen, Gideon. (1994). “Objectivity and Modern Idealism: What is the Question?” In Michaelis Michael and John O’Leary-Hawthorne (eds.), Philosophy in Mind. Dordrecht: Kluwer. [40] Schouten, J.F. (1940). “The Residue, A New Concept in Subjective Sound Analysis.” Proceedings of the Koninklijke Nederlandse Akadademie 43: 356–365. [41] Stevens, S.S. and Volkmann, J. (1940). “The Relation of Pitch to Frequency: A Revised Scale.” American Journal of Psych. 53: 329–353. [42] Stevens, S.S., Volkmann, J., and Newman, E.B. (1937). “A Scale for the Measurement of the Psychological Magnitude Pitch.” Journal of the Acoustical Society of America 8: 185– 190. [43] Strawson, P.F. (1959). Individuals. London: Methuen. [44] Terhardt, Ernst. (1974). “Pitch, Consonance and Harmony.” Journal of the Acoustical Society of America 55: 1061–1069. [45] ——. (1979). “Calculating Virtual Pitch.” Hearing Research 1: 155–182 [46] ——. (1980). “Toward an Understanding of Pitch Perception: Problems, Concepts, and Solutions.” In G. van den Brink and F.A. Bilsen (eds.), Psychophysical, Physiological, and Behavioral Studies in Hearing. Delft: University Press, 353–360. [47] Terhardt, E., Stoll, G., and Seewann, M. (1982a). “Pitch of Complex Signals According to Virtual Pitch Theory: Tests, Examples, and Predictions.” Journal of the Acoustical Society of America 71: 671–678. [48] ——. (1982b). “Algorithm for Extraction of Pitch and Pitch Salience from Complex Tonal Signals.” Journal of the Acoustical Society of America 71: 679–688. [49] Thomson, Judith Jarvis. (1983). “Parthood and Identity Across Time.” The Journal of Philosophy 80: 201–220. BIBLIOGRAPHY 91 [50] Tye, Michael. (1995). Ten Problems of Consciousness. Cambridge, MA: MIT Press. [51] ——. (2000). Consciousness, Color, and Content. Cambridge, MA: MIT Press. [52] Westfall, Richard S. (1980). Never at Rest: A Biography of Isaac Newton. Cambridge: Cambridge University Press. [53] Woolsey, C.N. (1960). “Organization of the Cortical Auditory System: A Review and Synthesis.” In G.L. Rasmussen and W.F. Windle (eds.), Neural Mechanisms of the Auditory Vestibular Systems. Illinois: Thomas, Springfield, 165–180. [54] Yost, William A. and Watson, Charles S. (eds). (1987). Auditory Processing of Complex Sounds. New Jersey: Erlbaum. [55] Zwicker, Eberhard. (1961). “Subdivision of the Audible Frequency Range into Critical Bands (Frequenzgruppen).” Journal of the Acoustical Society of America 33: 248. [56] Zwicker, Eberhard and Fastl, Hugo. (1999) Psychoacoustics: Facts and Models, Second Updated Edition. New York: Springer-Verlag. [57] Zwicker, Eberhard and Terhardt, Ernst (eds.). (1974). Facts and Models in Hearing. New York: Springer-Verlag. [58] ——. (1980). “Analytical Expressions for Critical-band Rate and Critical Bandwidth as a Function of Frequency.” Journal of the Acoustical Society of America 68: 1523–1525.
© Copyright 2025 Paperzz