An enormous wealth of acoustic information is present in the t;ermporal firing patterns of audito... more An enormous wealth of acoustic information is present in the t;ermporal firing patterns of auditory neurons. Distributions of interspike intervals across neural populations in the auditory nerve and brainstem form autocorrelation-like stimulus representations that closely predict the ...
Problem: How can radical constructivism gain wider recognition and acceptance? Method: Based on i... more Problem: How can radical constructivism gain wider recognition and acceptance? Method: Based on informal direct observation of other social and intellectual movements, the social and psychological …
Upshot: Emergence and Embodiment is a highly worthwhile and well-crafted collection of essays on ... more Upshot: Emergence and Embodiment is a highly worthwhile and well-crafted collection of essays on second-order cybernetics that draws together ideas related to self-organization, autopoiesis, …
Arguably, the most important barrier to widespread use of automatic speech recognition systems in... more Arguably, the most important barrier to widespread use of automatic speech recognition systems in reallife situations is their present inability to separate speech of individual speakers from other sound sources: other speak-ers, acoustic clutter, background noise. We believe ...
Open-endedness is an important goal for designing systems that can autonomously find solutions to... more Open-endedness is an important goal for designing systems that can autonomously find solutions to combinatorically- complex and ill-defined problems. We distinguish two modes of creating novelty: combinatoric (new combinations of existing primitives) and creative (new primitives). Although combinatoric systems may differ in numbers of possible combinations, their set of possibilities is closed. Creative systems, on the other hand, have open- sets of possibilities because of the partial- or ill-defined nature of the space of possible primitives. We discuss classes of adaptive and self-modifying cybernetic robotic devices in terms of these two kinds of processes. We consider such systems whose hardwares are constructed from genetically-directed pattern-grammars. Here although the space of accessible structures is closed, the space of functions is open. We conclude that genome sequence spaces and gene-product structure spaces are closed, whereas, being ill-defined, phenomic function-sp...
The Journal of the Acoustical Society of America, 2010
ABSTRACT Yoichi Ando is a well-known architectural acoustician who employed genetic algorithms, a... more ABSTRACT Yoichi Ando is a well-known architectural acoustician who employed genetic algorithms, acoustics, and psychophysical models of listener percepts and preferences to optimize the design of the Kirishima International Concert Hall in Japan. Ando's recent book [Auditory and Visual Sensations, guest editor P. Cariani (Springer, New York, 2009)] summarizes decades of psychophysical experiments and neurophysiological observations (ABR, SVR, EEG, MEG). This paper will outline Ando's psychophysics-based approach to architectural acoustics and his correlation-based theory of hearing and vision, along with supporting psychophysical and neurophysiological experimental observations. Ando proposes a temporally coded, correlation-based model of neuronal signal processing in which features of an internal autocorrelation representation subserve "temporal sensations" (pitch, timbre, loudness, duration), while features of an internal interaural cross correlation representation subserve "spatial sensations" (sound location, size, diffuseness related to envelopment). Together these two representations account for the basic auditory qualities most relevant for listening to music and speech in indoor performance spaces. Remarkably, Ando and colleagues have found many analogs of auditory percepts and preferences in vision. These include perception of the missing fundamental of flickering light as well as preferences for flickering lights, oscillatory movements, and texture regularity. Ando's theory suggests possible common temporal processing mechanisms for hearing and vision.
The Journal of the Acoustical Society of America, 2013
A processing scheme for speech signals is proposed that emulates synchrony capture in the auditor... more A processing scheme for speech signals is proposed that emulates synchrony capture in the auditory nerve. The role of stimulus-locked spike timing is important for representation of stimulus periodicity, low frequency spectrum, and spatial location. In synchrony capture, dominant single frequency components in each frequency region impress their time structures on temporal firing patterns of auditory nerve fibers with nearby characteristic frequencies (CFs). At low frequencies, for voiced sounds, synchrony capture divides the nerve into discrete CF territories associated with individual harmonics. An adaptive, synchrony capture filterbank (SCFB) consisting of a fixed array of traditional, passive linear (gammatone) filters cascaded with a bank of adaptively tunable, bandpass filter triplets is proposed. Differences in triplet output envelopes steer triplet center frequencies via voltage controlled oscillators (VCOs). The SCFB exhibits some cochlea-like responses, such as two-tone suppression and distortion products, and possesses many desirable properties for processing speech, music, and natural sounds. Strong signal components dominate relatively greater numbers of filter channels, thereby yielding robust encodings of relative component intensities. The VCOs precisely lock onto harmonics most important for formant tracking, pitch perception, and sound separation.
The stripped-down experimental setup may be missing important sensory proprioceptive and tactile ... more The stripped-down experimental setup may be missing important sensory proprioceptive and tactile observables that may well be crucial for designing useful, effective, and flexible general-purpose …
Upshot: Written by recognized experts in their fields, the book is a set of essays that deals wit... more Upshot: Written by recognized experts in their fields, the book is a set of essays that deals with the influences of early cybernetics, computational theory, artificial intelligence, and …
An enormous wealth of acoustic information is present in the t;ermporal firing patterns of audito... more An enormous wealth of acoustic information is present in the t;ermporal firing patterns of auditory neurons. Distributions of interspike intervals across neural populations in the auditory nerve and brainstem form autocorrelation-like stimulus representations that closely predict the ...
Problem: How can radical constructivism gain wider recognition and acceptance? Method: Based on i... more Problem: How can radical constructivism gain wider recognition and acceptance? Method: Based on informal direct observation of other social and intellectual movements, the social and psychological …
Upshot: Emergence and Embodiment is a highly worthwhile and well-crafted collection of essays on ... more Upshot: Emergence and Embodiment is a highly worthwhile and well-crafted collection of essays on second-order cybernetics that draws together ideas related to self-organization, autopoiesis, …
Arguably, the most important barrier to widespread use of automatic speech recognition systems in... more Arguably, the most important barrier to widespread use of automatic speech recognition systems in reallife situations is their present inability to separate speech of individual speakers from other sound sources: other speak-ers, acoustic clutter, background noise. We believe ...
Open-endedness is an important goal for designing systems that can autonomously find solutions to... more Open-endedness is an important goal for designing systems that can autonomously find solutions to combinatorically- complex and ill-defined problems. We distinguish two modes of creating novelty: combinatoric (new combinations of existing primitives) and creative (new primitives). Although combinatoric systems may differ in numbers of possible combinations, their set of possibilities is closed. Creative systems, on the other hand, have open- sets of possibilities because of the partial- or ill-defined nature of the space of possible primitives. We discuss classes of adaptive and self-modifying cybernetic robotic devices in terms of these two kinds of processes. We consider such systems whose hardwares are constructed from genetically-directed pattern-grammars. Here although the space of accessible structures is closed, the space of functions is open. We conclude that genome sequence spaces and gene-product structure spaces are closed, whereas, being ill-defined, phenomic function-sp...
The Journal of the Acoustical Society of America, 2010
ABSTRACT Yoichi Ando is a well-known architectural acoustician who employed genetic algorithms, a... more ABSTRACT Yoichi Ando is a well-known architectural acoustician who employed genetic algorithms, acoustics, and psychophysical models of listener percepts and preferences to optimize the design of the Kirishima International Concert Hall in Japan. Ando's recent book [Auditory and Visual Sensations, guest editor P. Cariani (Springer, New York, 2009)] summarizes decades of psychophysical experiments and neurophysiological observations (ABR, SVR, EEG, MEG). This paper will outline Ando's psychophysics-based approach to architectural acoustics and his correlation-based theory of hearing and vision, along with supporting psychophysical and neurophysiological experimental observations. Ando proposes a temporally coded, correlation-based model of neuronal signal processing in which features of an internal autocorrelation representation subserve "temporal sensations" (pitch, timbre, loudness, duration), while features of an internal interaural cross correlation representation subserve "spatial sensations" (sound location, size, diffuseness related to envelopment). Together these two representations account for the basic auditory qualities most relevant for listening to music and speech in indoor performance spaces. Remarkably, Ando and colleagues have found many analogs of auditory percepts and preferences in vision. These include perception of the missing fundamental of flickering light as well as preferences for flickering lights, oscillatory movements, and texture regularity. Ando's theory suggests possible common temporal processing mechanisms for hearing and vision.
The Journal of the Acoustical Society of America, 2013
A processing scheme for speech signals is proposed that emulates synchrony capture in the auditor... more A processing scheme for speech signals is proposed that emulates synchrony capture in the auditory nerve. The role of stimulus-locked spike timing is important for representation of stimulus periodicity, low frequency spectrum, and spatial location. In synchrony capture, dominant single frequency components in each frequency region impress their time structures on temporal firing patterns of auditory nerve fibers with nearby characteristic frequencies (CFs). At low frequencies, for voiced sounds, synchrony capture divides the nerve into discrete CF territories associated with individual harmonics. An adaptive, synchrony capture filterbank (SCFB) consisting of a fixed array of traditional, passive linear (gammatone) filters cascaded with a bank of adaptively tunable, bandpass filter triplets is proposed. Differences in triplet output envelopes steer triplet center frequencies via voltage controlled oscillators (VCOs). The SCFB exhibits some cochlea-like responses, such as two-tone suppression and distortion products, and possesses many desirable properties for processing speech, music, and natural sounds. Strong signal components dominate relatively greater numbers of filter channels, thereby yielding robust encodings of relative component intensities. The VCOs precisely lock onto harmonics most important for formant tracking, pitch perception, and sound separation.
The stripped-down experimental setup may be missing important sensory proprioceptive and tactile ... more The stripped-down experimental setup may be missing important sensory proprioceptive and tactile observables that may well be crucial for designing useful, effective, and flexible general-purpose …
Upshot: Written by recognized experts in their fields, the book is a set of essays that deals wit... more Upshot: Written by recognized experts in their fields, the book is a set of essays that deals with the influences of early cybernetics, computational theory, artificial intelligence, and …
Uploads
Papers by Peter Cariani