Skip to main content
more options

Susan R. Hertz

Adjunct Professor

(Ph.D., Cornell, 1979)
Department of Linguistics, srh9@cornell.edu

Research

The overarching question underlying most of my research is what role universal principles of human speech perception play in the organization of speech/sound patterns in language. I am currently developing a theory of how listeners extract phonological structure and other information, such as speaker identity, from the continuous speech signal. I am testing various hypotheses related to this theory through perceptual experiments involving utterances that contain mixes of natural and synthetic speech segments. More information about my research as well as its applications to speech synthesis and other areas of speech processing can be found at www.novaspeech.com.

Teaching

Every other year, I teach Linguistics 648, a hands-on, graduate-level course in speech synthesis. Occasionally, I also teach Linguistics 419, an upper-level introduction to phonetics.

Selected Publications

Hertz, S. R. (2006) A model of the regularities underlying speaker variation: Evidence from hybrid synthesis, Proc. Interspeech 2006i.

Hertz, S. R. and Goldhor, R (2004). When can segments serve as surrogates? From Sound to Sense: 50+ Years of Discovery in Speech Communications, MIT, Cambridge.

Hertz, S. R., Spencer, I.C., Church, T.F., and Goldhor, R. (2004). Perceptual consequences of nasal surrogates in English: implications for speech synthesis, Proc. 147th Acoust. Soc. Amer.

Hertz, S. R. (2002) Integration of rule-based formant synthesis and waveform concatenation: a hybrid approach to text-to-speech synthesis, Proceedings IEEE 2002 Workshop On Speech Synthesis.

Hertz, S. R., Younes, R. J., and Zinovieva, N. (1999) Language-universal and language-specific components in the multi-language ETI-Eloquence text-to-speech system. Proc. 14th ICPhS 2283-2286.

Clements, G.N., Hertz, S.R. (1996) An integrated approach to phonology and phonetics, in J. Durand and B. Laks (eds.), Current Trends in Phonology: Models and Methods, CNRS, Paris X and Univ. of Salford Pub.

Hertz, S. R. and Huffman, M. K. (1992) A nucleus-based timing model applied to multi-dialect speech synthesis by rule, Proc. International Conference on Spoken Language Processing 2, 1171-1174.

Hertz, S. R. (1991) Streams, phones, and transitions: toward a phonological and phonetic model of formant timing. J. of Phonetics 19, 91-109. Special Issue on Speech Synthesis and Phonetics, ed. By R. Carlson.