Susan R. Hertz
Adjunct Professor
(Ph.D., Cornell, 1979)
Department of Linguistics, srh9@cornell.edu
|
|
Research
The overarching question underlying most of my research is what role
universal principles of human speech perception play in the organization of speech/sound patterns
in language. I am currently developing a theory of how listeners extract phonological structure
and other information, such as speaker identity, from the continuous speech signal. I am testing
various hypotheses related to this theory through perceptual experiments involving utterances
that contain mixes of natural and synthetic speech segments. More information about my research
as well as its applications to speech synthesis and other areas of speech processing can be found
at
www.novaspeech.com.
Teaching
Every other year, I teach Linguistics 648, a hands-on, graduate-level course
in speech synthesis. Occasionally, I also teach Linguistics 419, an upper-level introduction to
phonetics.
Selected Publications
Hertz, S. R. (2006)
A model of the regularities underlying speaker variation: Evidence from hybrid synthesis,
Proc. Interspeech 2006i.
Hertz, S. R. and Goldhor, R (2004).
When can segments serve as surrogates? From Sound to Sense: 50+ Years of Discovery in Speech Communications, MIT, Cambridge.
Hertz, S. R., Spencer, I.C., Church, T.F., and Goldhor, R. (2004).
Perceptual consequences of nasal surrogates in English: implications for speech synthesis,
Proc. 147th Acoust. Soc. Amer.
Hertz, S. R. (2002)
Integration of rule-based formant synthesis and waveform concatenation: a hybrid approach to text-to-speech synthesis,
Proceedings IEEE 2002 Workshop On Speech Synthesis.
Hertz, S. R., Younes, R. J., and Zinovieva, N. (1999)
Language-universal and language-specific components in the multi-language ETI-Eloquence text-to-speech system.
Proc. 14th ICPhS 2283-2286.
Clements, G.N., Hertz, S.R. (1996) An integrated approach to phonology and phonetics, in J. Durand and B. Laks (eds.),
Current Trends in Phonology: Models and Methods, CNRS, Paris X and Univ. of Salford Pub.
Hertz, S. R. and Huffman, M. K. (1992) A nucleus-based timing model applied to multi-dialect speech synthesis by
rule,
Proc. International Conference on Spoken Language Processing 2, 1171-1174.
Hertz, S. R. (1991) Streams, phones, and transitions: toward a phonological and phonetic model of formant timing.
J. of Phonetics 19, 91-109.
Special Issue on Speech Synthesis and Phonetics, ed. By R. Carlson.