Age and gender effects in European Portuguese spontaneous speech

1. INTRODUCTION

⌅

The older population is quickly increasing worldwide. Demographic aging, due primarily to lower fertility, also reflects a human success history of increased longevity (He, Goodkind, & Kowal, 2016He, W., Goodkind, D., & Kowal, P. R. (2016). An aging world: 2015. International Population Reports, P95/16-1.
; Kart & Kinney, 2001Kart, C. S., & Kinney, J. M. (2001). The Realities of Aging : An Introduction to Gerontology. Boston: Allyn & Bacon.
). According to the World Health Organization, the number of people aged over 65 is rising, and by 2050, 1 in 6 people in the world will be over the age of 65, up from 1 in 11 in 2019 (United Nations Department of Economic and Social Affairs Population Division, 2019United Nations Department of Economic and Social Affairs Population Division. (2019). World Population Ageing 2019: Highlights.
). Portugal is also one of the developed countries with the highest rate of older population (between 1970 and 2019, the percentage of people aged 65 and over increased from 9.7% to 22.4%) (Statistics Portugal, 2015Statistics Portugal. (2015). Envelhecimento da população residente em Portugal e na União Europeia (Aging of the resident population in Portugal and the European Union). Destaque: Informação à Comunicação Social.
, 2019Statistics Portugal. (2019). Estimativas de População Residente em Portugal - 2018 (Estimates of resident population in Portugal - 2018). Destaque: Informação à Comunicação Social.
; United Nations Department of Economic and Social Affairs Population Division, 2019United Nations Department of Economic and Social Affairs Population Division. (2019). World Population Ageing 2019: Highlights.
).

Aging, as a normal and inevitable process, involves physiological, cognitive, psychological, and social changes (Hermes, Mertens, & Mücke, 2018Hermes, A., Mertens, J., & Mücke, D. (2018). Age-related effects on sensorimotor control of speech production. INTERSPEECH, 1526-1530. Hyderabad. http://dx.doi.org/10.21437/Interspeech.2018-1233
; Makiyama & Hirano, 2017Makiyama, K., & Hirano, S. (2017). Aging Voice. Singapore: Springer.
). The physiological aging affects all speech organs. In this sense, age-related changes in speech organs as well as in neuro-muscular control determine considerable variation in speech production and speech planning across the life span (Hermes et al., 2018Hermes, A., Mertens, J., & Mücke, D. (2018). Age-related effects on sensorimotor control of speech production. INTERSPEECH, 1526-1530. Hyderabad. http://dx.doi.org/10.21437/Interspeech.2018-1233
; Pellegrino, He, & Dellwo, 2018Pellegrino, E., He, L., & Dellwo, V. (2018). The effect of ageing on speech rhythm: A study on Zurich German. Speech Prosody, 133-137. http://dx.doi.org/10.21437/SpeechProsody.2018-27
). These changes alter considerably the way individuals speak over time both segmentally and suprasegmentally (Pellegrino et al., 2018Pellegrino, E., He, L., & Dellwo, V. (2018). The effect of ageing on speech rhythm: A study on Zurich German. Speech Prosody, 133-137. http://dx.doi.org/10.21437/SpeechProsody.2018-27
). Although most studies have focused on the production of segments, such as vowels, the suprasegmental characteristics of speech are also particularly vulnerable to age. However, the effect of aging on suprasegmental characteristics of speech, i.e., in speech rhythm and intonation has been investigated in far less detail.

Concerning European Portuguese (EP), there are only a few studies regarding age-related speech variations, mainly, at the segmental level (Albuquerque, Oliveira, Teixeira, Sa-Couto, & Figueiredo, 2019Albuquerque, L., Oliveira, C., Teixeira, A., Sa-Couto, P., & Figueiredo, D. (2019). Age-related changes in European Portuguese vowel acoustics. INTERSPEECH, 3965-3969. Graz, Austria: ISCA. http://dx.doi.org/10.21437/Interspeech.2019-1818
; Albuquerque et al., 2014Albuquerque, L., Oliveira, C., Teixeira, A., Sa-Couto, P., Freitas, J., & Dias, M. S. M. S. (2014). Impact of age in the production of European Portuguese vowels. INTERSPEECH, 940-944. Singapore. http://dx.doi.org/10.21437/Interspeech.2014-244
; Guimarães & Abberton, 2005Guimarães, I., & Abberton, E. (2005). Fundamental frequency in speakers of Portuguese for different voice samples. Journal of Voice, 19(4), 592-606. http://dx.doi.org/10.1016/j.jvoice.2004.11.004
; Pellegrini et al., 2013Pellegrini, T., Hämäläinen, A., de Mareüil, P. B., Tjalve, M., Trancoso, I., Candeias, S., … Braga, D. (2013). A corpus-based study of elderly and young speakers of European Portuguese: acoustic correlates and their impact on speech recognition performance. INTERSPEECH, 852-856. Lyon, Fance. http://dx.doi.org/10.21437/Interspeech.2013-241
). Still, vowel production in reading tasks failed to provide a clear picture of the age effects in spontaneous speech and has some limitations. To the best of our knowledge, there is almost no data on the acoustic correlates of aging on EP spontaneous speech (Guimarães & Abberton, 2005Guimarães, I., & Abberton, E. (2005). Fundamental frequency in speakers of Portuguese for different voice samples. Journal of Voice, 19(4), 592-606. http://dx.doi.org/10.1016/j.jvoice.2004.11.004
).

For this reason, the aim of this study is to analyze the effects of aging on speech rhythm (duration measures, speech and articulation rate), intonation (speaking fundamental frequency (f₀)) and voice quality (Harmonics-to-Noise Ratio (HNR)) measured in spontaneous speech. This study considers age as a continuous variable in the analysis avoiding the effects of arbitrary age groups division (as in Albuquerque, Oliveira, Teixeira, Sa-Couto, & Figueiredo (2020)Albuquerque, L., Oliveira, C., Teixeira, A., Sa-Couto, P., & Figueiredo, D. (2020). A comprehensive analysis of age and gender effects in European Portuguese oral vowels. Journal of Voice, In press.
).

Since there is a paucity of literature on EP speech acoustics and the available data were collected from a small number of speakers (Albuquerque et al., 2014Albuquerque, L., Oliveira, C., Teixeira, A., Sa-Couto, P., Freitas, J., & Dias, M. S. M. S. (2014). Impact of age in the production of European Portuguese vowels. INTERSPEECH, 940-944. Singapore. http://dx.doi.org/10.21437/Interspeech.2014-244
; Escudero, Boersma, Rauber, & Bion, 2009Escudero, P., Boersma, P., Rauber, A. S., & Bion, R. A. H. (2009). A cross-dialect acoustic description of vowels: Brazilian and European Portuguese. J. Acoust. Soc. Am., 126(3), 1379-1393. http://dx.doi.org/10.1121/1.3180321
; Martins, 1973Martins, M. R. D. (1973). Análise acústica das vogais orais tónicas em Português. Boletim de Filologia, 22, 303-314.
; Oliveira, Cunha, Silva, Teixeira, & Sa-Couto, 2012Oliveira, C., Cunha, M. M., Silva, S., Teixeira, A., & Sa-Couto, P. (2012). Acoustic analysis of European Portuguese oral vowels produced by children. IberSPEECH, 328, 129-138. Madrid, Spain. http://dx.doi.org/10.1007/978-3-642-35292-8_14
), this study also provides valuable insights to understand the aging effects on natural spoken language production.

Knowledge of how speech changes with age is essential for the development of automatic speech recognition (ASR) systems suitable for older voices (i.e., personalized reading aids and voice prostheses) (Vipperla, Renals, & Frankel, 2010Vipperla, R., Renals, S., & Frankel, J. (2010). Ageing voices: The effect of changes in voice parameters on ASR performance. EURASIP J. Aud. Speech Music Process, 1-10. http://dx.doi.org/10.1155/2010/525783
), for clinical assessment and treatment of different speech disorders that are often age-related, and to provide information for other fields of knowledge (i.e., phonetics, speech science, forensic linguistics and biometric recognition).

2. RELATED WORK

⌅

Many studies have addressed the effects of aging on the acoustic properties of speech (Eichhorn, Kent, Austin, & Vorperian, 2018Eichhorn, J. T., Kent, R. D., Austin, D., & Vorperian, H. K. (2018). Effects of aging on vocal fundamental frequency and vowel formants in men and women. Journal of Voice, 32(5), 644.e1-644.e9. http://dx.doi.org/10.1016/j.jvoice.2017.08.003
; Linville, 2001Linville, S. E. (2001). Vocal Aging. Australia, San Diego: Singular Thomson Learning.
; Schötz, 2006Schötz, S. (2006). Perception, analysis and synthesis of speaker age. In Travaux de l’Institut de Linguistique de Lund (Vol. 47). Lund University: Linguistics and Phonetics.
). Most of them have focused on segmental features and have shown an increase of segment duration (Albuquerque et al., 2020Albuquerque, L., Oliveira, C., Teixeira, A., Sa-Couto, P., & Figueiredo, D. (2020). A comprehensive analysis of age and gender effects in European Portuguese oral vowels. Journal of Voice, In press.
; Linville, 2001Linville, S. E. (2001). Vocal Aging. Australia, San Diego: Singular Thomson Learning.
; Schötz, 2006Schötz, S. (2006). Perception, analysis and synthesis of speaker age. In Travaux de l’Institut de Linguistique de Lund (Vol. 47). Lund University: Linguistics and Phonetics.
). Despite spontaneous speech characteristics having been investigated in far less detail, acoustic studies have also revealed age-induced effects at suprasegmental level (i.e., intonation and rhythm), but they have focused mainly on reading speech.

A commonly reported effect of aging on speech is that pause duration as well as the use of speech pauses tend to increase (Hartman & Danhauer, 1976Hartman, D. E., & Danhauer, J. L. (1976). Perceptual features of speech for males in four perceived age decades. J. Acoust. Soc. Am, 59(3), 713-715. http://dx.doi.org/10.1121/1.380894
; Steffens, 2011Steffens, Y. (2011). The Aging Voice. GRIN Verlag.
). The reduced speech (or articulation) rate with age has also been reported, regardless of the speaker’s language (Bourbon & Hermes, 2020Bourbon, A., & Hermes, A. (2020). Have a break: Aging effects on sentence production and structuring in French. 12th International Seminar on Speech Production.
; Hazan et al., 2018Hazan, V., Tuomainen, O., Kim, J., Davis, C., Sheffield, B., & Brungart, D. (2018). Clear speech adaptations in spontaneous speech produced by young and older adults. J. Acoust. Soc. Am., 144(3), 1331-1346. http://dx.doi.org/10.1121/1.5053218.
; Hermes, Bourbon, & Cecile, 2020Hermes, A., Bourbon, A., & Cecile, F. (2020). Aging effects on prosodic structuring in French. SPPL2020: 2nd Workshop on Speech Perception and Production across the Lifespan, 119.
; Linville, 2001Linville, S. E. (2001). Vocal Aging. Australia, San Diego: Singular Thomson Learning.
; Pellegrini et al., 2013Pellegrini, T., Hämäläinen, A., de Mareüil, P. B., Tjalve, M., Trancoso, I., Candeias, S., … Braga, D. (2013). A corpus-based study of elderly and young speakers of European Portuguese: acoustic correlates and their impact on speech recognition performance. INTERSPEECH, 852-856. Lyon, Fance. http://dx.doi.org/10.21437/Interspeech.2013-241
; Pellegrino, 2019Pellegrino, E. (2019). The effect of healthy aging on within-speaker rhythmic variability: A case study on Noam Chomsky. Loquens, 6(1), e060. http://dx.doi.org/10.3989/loquens.2019.060
; Schötz, 2006Schötz, S. (2006). Perception, analysis and synthesis of speaker age. In Travaux de l’Institut de Linguistique de Lund (Vol. 47). Lund University: Linguistics and Phonetics.
; Smith, Wasowicz, & Preston, 1987Smith, B. L., Wasowicz, J., & Preston, J. (1987). Temporal characteristics of the speech of normal elderly adults. J Speech Hear Res, 30(4), 522-529. http://dx.doi.org/10.1044/jshr.3004.522
; Volín, Tykalová, & Bořil, 2017Volín, J., Tykalová, T., & Bořil, T. (2017). Stability of prosodic characteristics across age and gender groups. INTERSPEECH, 3902-3906. http://dx.doi.org/10.21437/Interspeech.2017-1503
). However, Jacewicz et al. (2009Jacewicz, E., Fox, R. A., O’Neill, C., & Salmons, J. (2009). Articulation rate across dialect, age, and gender. Language Variation and Change, 21(02), 233-256.
, 2010)Jacewicz, E., Fox, R. A., & Wei, L. (2010). Between-speaker and within-speaker variation in speech tempo of American English. J. Acoust. Soc. Am., 128(2), 839-850. http://dx.doi.org/10.1121/1.3459842
, Brückl & Sendlmeier (2003)Brückl, M., & Sendlmeier, W. (2003). Aging female voices: an acoustic and perceptive analysis. VOQUAL’03, pp. 163-168.
, and Gerstenberg et al. (2018)Gerstenberg, A., Fuchs, S., Kairet, J. M., Frankenberg, C., & Schröder, J. (2018). A cross-linguistic, longitudinal case study of pauses and interpausal units in spontaneous speech corpora of older speakers of German and French. Speech Prosody. http://dx.doi.org/10.21437/SpeechProsody.2018-43
did not find a decline in speech or articulation rate with aging. In general, the use of more speech pauses, and the associated slowing of speech rate can directly be traced to an aggravation of the lung-function (Schötz, 2006Schötz, S. (2006). Perception, analysis and synthesis of speaker age. In Travaux de l’Institut de Linguistique de Lund (Vol. 47). Lund University: Linguistics and Phonetics.
; Steffens, 2011Steffens, Y. (2011). The Aging Voice. GRIN Verlag.
).

Speaking f₀ has been found to be sensitive to the aging effect, tending to decrease in older females (Goy, Fernandes, Pichora-Fuller, & van Lieshout, 2013Goy, H., Fernandes, D. N., Pichora-Fuller, M. K., & van Lieshout, P. (2013). Normative voice data for younger and older adults. Journal of Voice, 27(5), 545-555. http://dx.doi.org/10.1016/j.jvoice.2013.03.002
; Hazan et al., 2018Hazan, V., Tuomainen, O., Kim, J., Davis, C., Sheffield, B., & Brungart, D. (2018). Clear speech adaptations in spontaneous speech produced by young and older adults. J. Acoust. Soc. Am., 144(3), 1331-1346. http://dx.doi.org/10.1121/1.5053218.
; Morgan & Rastatter, 1986Morgan, E. E., & Rastatter, M. (1986). Variability of voice fundamental frequency in elderly female speakers. Perceptual and Motor Skills, 63(1), 215-218. http://dx.doi.org/10.2466/pms.1986.63.1.215
; Winkler, 2004Winkler, R. (2004). Open Quotient and breathiness in aging voices-changes with increasing chronological age and it’s perception. German-French Summerschool on “Cognitive and Physical Models of Speech Production, Perception and Perception-Production Interaction.” Lubmin.
). For men, most studies have reported that f₀ decreases from young adulthood into middle age and then rises again into older ages (about 35 Hz) (Linville, 2001Linville, S. E. (2001). Vocal Aging. Australia, San Diego: Singular Thomson Learning.
; Pegoraro Krook, 1988Pegoraro Krook, M. I. (1988). Speaking fundamental frequency characteristics of normal Swedish subjects obtained by glottal frequency analysis. Folia Phoniatrica et Logopaedica, 40(2), 82-90. http://dx.doi.org/10.1159/000265888
; Rojas, Kefalianos, & Vogel, 2020Rojas, S., Kefalianos, E., & Vogel, A. (2020). How does our voice change as we age? A systematic review and meta-analysis of acoustic and perceptual voice data from healthy adults over 50 years of age. JSLHR, 63(2), 533-551. http://dx.doi.org/10.1044/2019_JSLHR-19-00099
). Other studies have indicated that speaking f₀ decreases (Vipperla et al., 2010Vipperla, R., Renals, S., & Frankel, J. (2010). Ageing voices: The effect of changes in voice parameters on ASR performance. EURASIP J. Aud. Speech Music Process, 1-10. http://dx.doi.org/10.1155/2010/525783
), while others have suggested a speaking f₀ increase in older age (Dunashova, 2021Dunashova, A. (2021). Intraspeaker variability of a professional lecturer: Ageing, genre, pragmatics vs. voice acting (Case study). International Conference on Speech and Computer, 179-189. http://dx.doi.org/10.1007/978-3-030-87802-3_17
; Ramig & Ringel, 1983Ramig, L., & Ringel, R. (1983). Effects of physiological aging on selected acoustic characteristics of voice. J Speech Hear Res, 26(1), 22-30. http://dx.doi.org/10.1044/jshr.2601.22
; Winkler, 2004Winkler, R. (2004). Open Quotient and breathiness in aging voices-changes with increasing chronological age and it’s perception. German-French Summerschool on “Cognitive and Physical Models of Speech Production, Perception and Perception-Production Interaction.” Lubmin.
).

Another parameter that impacts how voices sound is the Harmonics-to-Noise Ratio (HNR), which quantifies the amount of noise in the voicing signal (Dehqan, Scherer, Dashti, Ansari-Moghaddam, & Fanaie, 2013Dehqan, A., Scherer, R. C., Dashti, G., Ansari-Moghaddam, A., & Fanaie, S. (2013). The effects of aging on acoustic parameters of voice. Folia Phoniatrica et Logopaedica, 64(6), 265-270. http://dx.doi.org/10.1159/000343998
; Ferrand, 2002Ferrand, C. T. (2002). Harmonics-to-Noise Ratio: An index of vocal aging. Journal of Voice, 16(4), 480-487.
; Mautner, 2011Mautner, H. (2011). A Cross-System Instrumental Voice Profile of the Aging Voice: With Considerations of Jaw Posture Effects. University of Canterbury, New Zealand.
; Stathopoulos, Huber, & Sussman, 2011Stathopoulos, E. T., Huber, J. E., & Sussman, J. E. (2011). Changes in acoustic characteristics of the voice across the life span: measures from individuals 4-93 years of age. J Speech Lang Hear Res, 54(4), 1011-1021. http://dx.doi.org/10.1044/1092-4388(2010/10-0036)
; Xue & Deliyski, 2001Xue, S. A., & Deliyski, D. (2001). Effects of aging on selected acoustic voice parameters: preliminary normative data and educational implications. Educational Gerontology, 27(2), 159-168.
) Generally, HNR, in sustained vowels, has been reported to decrease with age, mainly in females (Dehqan et al., 2013Dehqan, A., Scherer, R. C., Dashti, G., Ansari-Moghaddam, A., & Fanaie, S. (2013). The effects of aging on acoustic parameters of voice. Folia Phoniatrica et Logopaedica, 64(6), 265-270. http://dx.doi.org/10.1159/000343998
; Ferrand, 2002Ferrand, C. T. (2002). Harmonics-to-Noise Ratio: An index of vocal aging. Journal of Voice, 16(4), 480-487.
; Xue & Deliyski, 2001Xue, S. A., & Deliyski, D. (2001). Effects of aging on selected acoustic voice parameters: preliminary normative data and educational implications. Educational Gerontology, 27(2), 159-168.
). Nonetheless, in other studies almost no age-related variation was found for women (Ambreen, Bashir, Tarar, & Kausar, 2019Ambreen, S., Bashir, N., Tarar, S. A., & Kausar, R. (2019). Acoustic analysis of normal voice patterns in Pakistani adults. Journal of Voice, 33(1), 124.e49-124.e58. http://dx.doi.org/10.1016/j.jvoice.2017.09.003
; Goy et al., 2013Goy, H., Fernandes, D. N., Pichora-Fuller, M. K., & van Lieshout, P. (2013). Normative voice data for younger and older adults. Journal of Voice, 27(5), 545-555. http://dx.doi.org/10.1016/j.jvoice.2013.03.002
; Schötz, 2006Schötz, S. (2006). Perception, analysis and synthesis of speaker age. In Travaux de l’Institut de Linguistique de Lund (Vol. 47). Lund University: Linguistics and Phonetics.
) or men (Ambreen et al., 2019Ambreen, S., Bashir, N., Tarar, S. A., & Kausar, R. (2019). Acoustic analysis of normal voice patterns in Pakistani adults. Journal of Voice, 33(1), 124.e49-124.e58. http://dx.doi.org/10.1016/j.jvoice.2017.09.003
; Goy et al., 2013Goy, H., Fernandes, D. N., Pichora-Fuller, M. K., & van Lieshout, P. (2013). Normative voice data for younger and older adults. Journal of Voice, 27(5), 545-555. http://dx.doi.org/10.1016/j.jvoice.2013.03.002
). Furthermore, in Schötz (2006)Schötz, S. (2006). Perception, analysis and synthesis of speaker age. In Travaux de l’Institut de Linguistique de Lund (Vol. 47). Lund University: Linguistics and Phonetics.
, a slight decrease followed by an increase after age 50 was observed in male HNR.

For EP, acoustic studies about age-related changes on spontaneous speech are scarce. Previous studies with different speech corpora (e.g., reading or conversation), indicated a slight or significant age-related f₀ decrease in women, and a non-agreement in men (Albuquerque et al., 2019Albuquerque, L., Oliveira, C., Teixeira, A., Sa-Couto, P., & Figueiredo, D. (2019). Age-related changes in European Portuguese vowel acoustics. INTERSPEECH, 3965-3969. Graz, Austria: ISCA. http://dx.doi.org/10.21437/Interspeech.2019-1818
, 2014Albuquerque, L., Oliveira, C., Teixeira, A., Sa-Couto, P., Freitas, J., & Dias, M. S. M. S. (2014). Impact of age in the production of European Portuguese vowels. INTERSPEECH, 940-944. Singapore. http://dx.doi.org/10.21437/Interspeech.2014-244
; Guimarães & Abberton, 2005Guimarães, I., & Abberton, E. (2005). Fundamental frequency in speakers of Portuguese for different voice samples. Journal of Voice, 19(4), 592-606. http://dx.doi.org/10.1016/j.jvoice.2004.11.004
; Pellegrini et al., 2013Pellegrini, T., Hämäläinen, A., de Mareüil, P. B., Tjalve, M., Trancoso, I., Candeias, S., … Braga, D. (2013). A corpus-based study of elderly and young speakers of European Portuguese: acoustic correlates and their impact on speech recognition performance. INTERSPEECH, 852-856. Lyon, Fance. http://dx.doi.org/10.21437/Interspeech.2013-241
). In reading tasks, Pellegrini et al. (2013)Pellegrini, T., Hämäläinen, A., de Mareüil, P. B., Tjalve, M., Trancoso, I., Candeias, S., … Braga, D. (2013). A corpus-based study of elderly and young speakers of European Portuguese: acoustic correlates and their impact on speech recognition performance. INTERSPEECH, 852-856. Lyon, Fance. http://dx.doi.org/10.21437/Interspeech.2013-241
refer that speech rate decreases and the percent pause ratio increases with age for both genders. Given that previous research used different corpora and analysis procedures, it is hard at this time to draw solid conclusions on the effects of age and gender on EP spontaneous speech.

3. METHOD

⌅

This cross-sectional study was approved by the Ethics Committee Centro Hospitalar São João/ Faculty of Medicine, University of Porto, Portugal (approval number N38/18), and all participants signed a written informed consent form.

3.1. Participants

⌅

A total of 112 native Portuguese speakers, from the central region of Portugal, aged between 35 and 97, participated in this study. They covered 4 age groups: [35-49] (15 men, 15 women), [50-64] (15 men, 15 women), [65-79] (15 men, 16 women), and ≥80 (11 men, 10 women).

Each participant completed a written questionnaire and reported: no history of speech-language impairment; no severe hearing problems; and no history of neurological disorders or head/ neck cancer. Also, the participant inclusion criteria consisted of individuals free of upper respiratory tract infection for 3 weeks prior to the experiment. Participants were excluded if they: 1) were current smokers or had smoked within the previous 5 years; 2) self-reported poor general health; 3) wore hearing aids, 4) had received speech and language therapy, and 5) reported that their voice was different than usual on the day of testing (i.e., having a cold or allergy symptoms). Participants who exhibited any observable sign of speech, voice or severe hearing problems, as assessed by a speech pathologist on the moment of recording, and those who were unable to follow directions were excluded.

3.2. Corpus and recording protocol

⌅

Spontaneous speech samples were collected from the participants using the Boston Diagnostic Aphasia Examination (Goodglass & Kaplan, 1983Goodglass, H., & Kaplan, E. (1983). The Assessment of Aphasia and Related Disorders (2nd ed.). Philadelphia, PA.: Lea and Febiger.
) picture description task, with the standardized picture “Cookie Theft” stimulus. Participants were instructed to describe the picture at comfortable pitch and loudness level, after familiarizing themselves with the image, in order to obtain induced spontaneous speech (Morgan & Rastatter, 1986Morgan, E. E., & Rastatter, M. (1986). Variability of voice fundamental frequency in elderly female speakers. Perceptual and Motor Skills, 63(1), 215-218. http://dx.doi.org/10.2466/pms.1986.63.1.215
; Pakhomov et al., 2011Pakhomov, S. V. S., Kaiser, E. A., Boley, D. L., Marino, S. E., Knopman, D. S., & Birnbaum, A. K. (2011). Effects of age and dementia on temporal cycles in spontaneous speech fluency. Journal of Neurolinguistics, 24(6), 619-635. http://dx.doi.org/10.1016/j.jneuroling.2011.06.002
). The instruction given to participants was as follows: “Tell me everything you see in this picture.”.

Recordings took place in quiet rooms using an AKG condenser microphone and USB external soundcard (PreSonus), with a sampling rate of 44100 Hz. The image was presented on the computer screen with software system SpeechRecorder (Draxler & Jänsch, 2017Draxler, C., & Jänsch, K. (2017). SpeechRecorder (3.12.0).
). The participants were seated at a table and the microphone was adjusted to each participant and positioned at an approximately 15-20 cm distance from the mouth.

3.3. Suprasegmental annotation

⌅

The recorded data was segmented through automatic Praat scripts. The speech and pauses were labeled, and in the speech intervals, the vowel onsets were also detected.

The Praat Script Syllable Nuclei v2 (modified by Hugo Quené, Ingrid Persoon, & Nivja de Jong and is available in: https://sites.google.com/site/speechrate/Home/praat-script-syllable-nuclei-v2) (de Jong & Wempe, 2009de Jong, N. H., & Wempe, T. (2009). Praat script to detect syllable nuclei and measure speech rate automatically. Behavior Research Methods, 41(2), 385-390. http://dx.doi.org/10.3758/BRM.41.2.385
) was used to automatically detect silent pauses of over 250 ms length (Cannizzaro, Harel, Reilly, Chappell, & Snyder, 2004Cannizzaro, M., Harel, B., Reilly, N., Chappell, P., & Snyder, P. J. (2004). Voice acoustical measurement of the severity of major depression. Brain and Cognition, 56(1), 30-35. http://dx.doi.org/10.1016/j.bandc.2004.05.003
) and create textgrid files.

The automated alignments of silent pauses were manually checked by two trained analyzers, who verified the accuracy of pause and speech intervals, and also labeled intervals with speaker or environmental noise. The intervals were labeled as: pause (breathing sound was considered as silent pause), speech, verbal non lexical (i.e., filled pauses), noise (i.e., noise that occurs during the speaker’s pauses), vocal non lexical (i.e., laughter, coughing or other human noises), and speech with noise (i.e., speech intervals with environmental noise that could affect the acoustic measurements) (Pellegrino, 2019Pellegrino, E. (2019). The effect of healthy aging on within-speaker rhythmic variability: A case study on Noam Chomsky. Loquens, 6(1), e060. http://dx.doi.org/10.3989/loquens.2019.060
; Schuller et al., 2013Schuller, B., Steidl, S., Batliner, A., Burkhardt, F., Devillers, L., Müller, C., & Narayanan, S. (2013). Paralinguistics in speech and language - State-of-the-art and the challenge. Computer Speech and Language, 27, 4-39. http://dx.doi.org/10.1016/j.csl.2012.02.005
).

Vocal non lexical phenomena were considered as pause time, while verbal non lexical phenomena were not included in the present analysis (Pellegrino, 2019Pellegrino, E. (2019). The effect of healthy aging on within-speaker rhythmic variability: A case study on Noam Chomsky. Loquens, 6(1), e060. http://dx.doi.org/10.3989/loquens.2019.060
; Tuomainen, Hazan, & Taschenberger, 2019Tuomainen, O., Hazan, V., & Taschenberger, L. (2019). Speech communication in background noise: Effects of aging. 19th International Congress of the Phonetic Sciences, 805-809. Australasian Speech Science and Technology Association Inc.
). Speech intervals with noise were not counted for further analysis, and also the beginning and ending of all recordings were not pondered in the analysis due to sentence initial and final acoustic variability (a total of 7% of the speech intervals were excluded).

Regarding the syllables spoken, an adapted Praat script of the BeatExtractor (Barbosa, 2006Barbosa, P. A. (2006). Incursões em torno do ritmo da fala (Incursions around the rhythm of speech). Campinas: FAPESP/Pontes Editores.
, 2010Barbosa, P. A. (2010). Automatic duration-related salience detection in Brazilian Portuguese read and spontaneous speech. Speech Prosody. Chicago.
) was used to detect vowel onset using a beat wave (a normalized and band-specific amplitude). The cut-off frequency was defined automatically. The thresholds were 0.1 and 0.06; the filter was defined as Butterworth and the technique was Amplitude. The total number of syllables were automatically obtained through the sum of all vowel onset detected within all valid speech intervals per speaker. A random check was done to verify the vowel onsets and confirm the script performance.

3.4. Acoustic measurements

⌅

Acoustic parameters (f₀ and HNR) were automatically extracted from the valid speech intervals, using the Praat script ProsodyDescriptor (Barbosa, 2013Barbosa, P. A. (2013). Semi-automatic and automatic tools for generating prosodic descriptors for prosody research. TRASP, 86-89. Aix-en-Provence. https://www.researchgate.net/publication/268278216
), with the f₀ threshold of 75 - 400 Hz for males, and 120 - 600 Hz for females. The script extracted and calculated the parameters f₀ mean (semitones (re 1 Hz)) and HNR. Each value was considered and used to obtain the average of speaking f₀ for each participant. The f₀ scale with values in semitones relative to 1 Hz was converted to Hertz.

In order to analyze the age effects in rhythm, intonation and voice quality, several parameters (Table 1) were extracted from spontaneous speech, based on the suprasegmental annotation of speech, pauses and syllables.

Table 1. Systematic overview of the parameters.

Parameter	Description
Total speech duration (s)	Sum of speech duration of all speech intervals
Total pause duration (s)	Sum of pause duration of all pause intervals
Total recording duration (s)	Sum of all speech and pause intervals
Percent pause time (%)	Total pause duration divided by total time (all speech and pause intervals)
Speech pause ratio	Total time talking divided by total pause time
Number of pauses	Number of pause intervals in the description task
Mean pause duration (s)	Duration average of pause length
Mean speech duration (s)	Duration average of speech length
Pause variability (s)	Standard deviation (SD) of pause length
Speech variability (s)	Standard deviation (SD) of speech length
Number of syllables	Sum of all syllable onset detected within all speech intervals
Speech rate	Number of syllables divided by the total time (include pause intervals)
Articulation rate	Syllables per seconds of speech without pauses
Speaking f0 (Hz)	Average number of vibrations per second of the vocal folds in the entire speech sample
HNR (dB)	Average ratio of the aperiodic energy to the harmonic energy

3.5. Statistical Analysis

⌅

The statistical analysis was conducted with the SPSS software package (SPSS 25.0). For each suprasegmental parameter, a multiple linear regression was conducted with the following explanatory variables: age (continuous), gender (male: reference group, female), and the interaction between age and gender. The regression coefficients and the correspondent 95% IC were calculated. The level of significance was p<0.05 for all statistical analyses.

4. RESULTS

⌅

This section presents the results of the selected acoustic parameters, and their statistical analyses.

The multiple linear regression coefficients are displayed in Table 2 and the results revealed significant differences between genders for the following parameters: speech rate, articulation rate, speaking f₀ and HNR. There was a significant age effect in males for total speech duration, percent pause time, mean pause duration, speech variability and number of syllables; and in females for the suprasegmental parameters: speech rate, articulation rate and HNR.

Table 2. Results of multiple linear regression: the effect of gender and gender*age interaction on suprasegmental parameters. Grey cells represent significant results (p<0.05).

	Intercept			Gender (Male)			Male * Age			Female * Age
	B	95%	CI	B	95%	CI	B	95%	CI	B	95%	CI
Total speech duration (s)	29,20	15,58	42,82	7,24	-12,32	26,81	-0,25	-0,47	-0,03	-0,12	-0,33	0,10
Total pause duration (s)	6,93	0,88	12,99	-0,14	-8,84	8,56	0,03	-0,07	0,12	0,02	-0,08	0,12
Total recording duration (s)	36,13	18,57	53,70	7,10	-18,13	32,33	-0,22	-0,50	0,06	-0,10	-0,38	0,18
Percent pause time (%)	22,81	10,41	35,22	-10,08	-27,90	7,73	0,27	0,07	0,47	0,07	-0,13	0,26
Speech pause ratio	13,15	-2,37	28,67	-8,24	-30,54	14,05	-0,03	-0,28	0,22	-0,11	-0,35	0,14
Number of pauses	9,08	3,23	14,93	3,55	-4,85	11,95	-0,06	-0,15	0,03	-0,02	-0,11	0,08
Mean pause duration (s)	0,91	0,36	1,46	-0,53	-1,32	0,26	0,01	0,00	0,02	0,00	-0,01	0,01
Mean speech duration (s)	4,08	2,13	6,02	-1,26	-4,06	1,53	-0,01	-0,04	0,02	-0,02	-0,05	0,01
Pause variability (SD)	0,31	-0,30	0,92	-0,16	-1,04	0,72	0,01	0,00	0,02	0,01	0,00	0,02
Speech variability (SD)	1,71	1,08	2,34	0,35	-0,56	1,25	-0,01	-0,02	0,00	0,00	-0,01	0,01
Number of syllables	112,14	42,38	181,89	65,89	-34,30	166,08	-1,14	-2,25	-0,02	-0,14	-1,24	0,95
Speech rate	2,40	1,46	3,34	2,03	0,68	3,39	-0,01	-0,03	0,00	0,02	0,00	0,03
Articulatory rate	3,06	2,04	4,08	1,97	0,50	3,44	0,00	-0,01	0,02	0,03	0,01	0,04
Speaking f0 (Hz)	204,29	181,36	227,22	-82,13	-115,06	-49,20	0,07	-0,29	0,44	-0,23	-0,59	0,13
HNR (dB)	17,56	15,29	19,82	-7,07	-10,32	-3,82	0,00	-0,04	0,04	-0,05	-0,09	-0,02
	B= Linear Coefficient

4.1. Rhythmic parameters

⌅

Concerning the mean pause duration (see Figure 1) and the percent pause time (see Figure 2), the regression lines showed an increase with age, mainly in males, which means pause duration increases from approximately 0.7 s at age 35 to 1.4 s at age 100, and the percent pause time increases 20% in the same age interval. The multiple linear regression revealed that, for both parameters, the age effect is only significant for men.

Figure 1. Scatterplot and regression lines for mean pause duration by age and gender. Each symbol corresponds to one speaker. Solid line and circles: females; dashed line and triangles: males.

Figure 2. Scatterplot and regression lines for percent pause time by age and gender. Each symbol corresponds to one speaker. Solid line and circles: females; dashed line and triangles: males.

Regarding articulation rate (Figure 3): (1) age effect is gender dependent, with men presenting a 9.4% higher mean articulation rate than women; (2) difference between genders decreases with age, with older men and women presenting similar mean articulation rate, due to an increase in women’s articulation rate. The regression model confirmed the gender differences and also showed a significant increase with age for female. Speech rate (Figure 4), as articulation rate, presented a tendency to increase in women. In men, a higher tendency to decrease, due to a rise in the percent pause time with age was observed.

Figure 3. Scatterplot and regression lines for articulation rate by age and gender. Each symbol corresponds to one speaker. Solid line and circles: females; dashed line and triangles: males.

Figure 4. Scatterplot and regression lines for speech rate by age and gender. Each symbol corresponds to one speaker. Solid line and circles: females; dashed line and triangles: males.

Additionally, a decrease of the total speech duration with age, mainly in males was seen (Figure 5), which is in line with the significant decrease of the number of syllables also observed for males. For both genders, speech duration variability decreases with age, but is only significant in males.

Figure 5. Scatterplot and regression lines for total speech duration by age and gender. Each symbol corresponds to one speaker. Solid line and circles: females; dashed line and triangles: males.

4.2. Speaking f₀

⌅

In Figure 6 a large age effect in speaking f₀ was not observed. In men speaking f₀ tended to increase with aging. Conversely, in women speaking f₀ tended to slightly decrease with age. As expected, male speakers had a significantly lower speaking f₀ when compared to female speakers regardless of age.

Figure 6. Scatterplot and regression lines for Speaking f₀ by age and gender. Each symbol corresponds to one speaker. Solid line and circles: females; dashed line and triangles: males.

4.3. Harmonics-to-Noise Ratio

⌅

As for HNR, as can be seen in Fig. 7, males presented lower values than females, and the regression results confirmed the gender differences. In the case of females, their values tended to be lower with age, while no change in males was observed. The regression line showed a decrease of about 4 dB between the ages 35 and 100 in females, which was significant.

Figure 7. Scatterplot and regression lines for HNR by age and gender. Each symbol corresponds to one speaker. Solid line and circles: females; dashed line and triangles: males.

5. DISCUSSION

⌅

This study provides a base to complement previous studies in age effects, mostly at the suprasegmental level. The age effects in rhythmic parameters, speaking f₀ and HNR of the EP spontaneous speech were explored through a picture description task. Firstly, the older males produced shorter descriptions than younger adults, which may be related to task nature or indicate differences in linguistic domain (Mortensen, Meyer, & Humphreys, 2006Mortensen, L., Meyer, A. S., & Humphreys, G. W. (2006). Age-related effects on speech production: A review. Lang Cognitive Proc, 21(1-3), 238-290. http://dx.doi.org/10.1080/01690960444000278
).

Our analyses of rhythmic variation with age are in line with previous findings for other languages, mostly for men (Hartman & Danhauer, 1976Hartman, D. E., & Danhauer, J. L. (1976). Perceptual features of speech for males in four perceived age decades. J. Acoust. Soc. Am, 59(3), 713-715. http://dx.doi.org/10.1121/1.380894
; Steffens, 2011Steffens, Y. (2011). The Aging Voice. GRIN Verlag.
), where older men presented significantly more pause time and longer pause duration when compared with the young male speakers. Although men spoke faster than women, as in Jacewicz, Fox, & Wei (2010)Jacewicz, E., Fox, R. A., & Wei, L. (2010). Between-speaker and within-speaker variation in speech tempo of American English. J. Acoust. Soc. Am., 128(2), 839-850. http://dx.doi.org/10.1121/1.3459842
and Verhoeven et al. (2004)Verhoeven, J., De Pauw, G., & Kloots, H. (2004). Speech rate in a pluricentric language: A comparison between Dutch in Belgium and the Netherlands. Language and Speech, 43(3), 297-308. http://dx.doi.org/10.1177/00238309040470030401
, this difference decreased with age. The faster articulation rate in older women is not in agreement with the general trend (Hazan et al., 2018Hazan, V., Tuomainen, O., Kim, J., Davis, C., Sheffield, B., & Brungart, D. (2018). Clear speech adaptations in spontaneous speech produced by young and older adults. J. Acoust. Soc. Am., 144(3), 1331-1346. http://dx.doi.org/10.1121/1.5053218.
; Linville, 2001Linville, S. E. (2001). Vocal Aging. Australia, San Diego: Singular Thomson Learning.
), although some studies (Brückl & Sendlmeier, 2003Brückl, M., & Sendlmeier, W. (2003). Aging female voices: an acoustic and perceptive analysis. VOQUAL’03, pp. 163-168.
; Jacewicz, Fox, O’Neill, & Salmons, 2009Jacewicz, E., Fox, R. A., O’Neill, C., & Salmons, J. (2009). Articulation rate across dialect, age, and gender. Language Variation and Change, 21(02), 233-256.
; Jacewicz et al., 2010Jacewicz, E., Fox, R. A., & Wei, L. (2010). Between-speaker and within-speaker variation in speech tempo of American English. J. Acoust. Soc. Am., 128(2), 839-850. http://dx.doi.org/10.1121/1.3459842
) refer no age-related differences. Furthermore, a longitudinal study of spontaneous speech (Gerstenberg et al., 2018Gerstenberg, A., Fuchs, S., Kairet, J. M., Frankenberg, C., & Schröder, J. (2018). A cross-linguistic, longitudinal case study of pauses and interpausal units in spontaneous speech corpora of older speakers of German and French. Speech Prosody. http://dx.doi.org/10.21437/SpeechProsody.2018-43
) also reported that articulation rate tends to increase in French speakers with age, presumably due to language specific effects or due to age-related changes in the cardiovascular system (i.e., older speakers may inhale more often, but may compensate for the reduced respiratory capacities by an increase in articulation rate to maintain information density) (Gerstenberg et al., 2018Gerstenberg, A., Fuchs, S., Kairet, J. M., Frankenberg, C., & Schröder, J. (2018). A cross-linguistic, longitudinal case study of pauses and interpausal units in spontaneous speech corpora of older speakers of German and French. Speech Prosody. http://dx.doi.org/10.21437/SpeechProsody.2018-43
). Also, Brückl and Sendlmeier (2003)Brückl, M., & Sendlmeier, W. (2003). Aging female voices: an acoustic and perceptive analysis. VOQUAL’03, pp. 163-168.
suggest that the duration of speech pauses is a better indicator of age than the articulation rate.

Additionally, for males, our results indicated that the total speech duration decreases with age, despite the fact that the mean speech duration of each speech interval only tends to decrease slightly. However, the speech duration variability significantly decreases with age, which may indicate that older males tend to perform speech intervals with a similar duration.

Regarding speaking f₀, our data tend to confirm the claim in the literature that f₀ slightly decreases in females with age (Goy et al., 2013Goy, H., Fernandes, D. N., Pichora-Fuller, M. K., & van Lieshout, P. (2013). Normative voice data for younger and older adults. Journal of Voice, 27(5), 545-555. http://dx.doi.org/10.1016/j.jvoice.2013.03.002
; Guimarães & Abberton, 2005Guimarães, I., & Abberton, E. (2005). Fundamental frequency in speakers of Portuguese for different voice samples. Journal of Voice, 19(4), 592-606. http://dx.doi.org/10.1016/j.jvoice.2004.11.004
; Hazan et al., 2018Hazan, V., Tuomainen, O., Kim, J., Davis, C., Sheffield, B., & Brungart, D. (2018). Clear speech adaptations in spontaneous speech produced by young and older adults. J. Acoust. Soc. Am., 144(3), 1331-1346. http://dx.doi.org/10.1121/1.5053218.
; Morgan & Rastatter, 1986Morgan, E. E., & Rastatter, M. (1986). Variability of voice fundamental frequency in elderly female speakers. Perceptual and Motor Skills, 63(1), 215-218. http://dx.doi.org/10.2466/pms.1986.63.1.215
; Titze, 1994Titze, I. R. (1994). Voice classification and life-span changes. In Principles of Voice Production (pp. 169-190). Englewood Cliffs, New Jersey: Prentice Hall.
; Winkler, 2004Winkler, R. (2004). Open Quotient and breathiness in aging voices-changes with increasing chronological age and it’s perception. German-French Summerschool on “Cognitive and Physical Models of Speech Production, Perception and Perception-Production Interaction.” Lubmin.
), which has been attributed to the endocrinological changes that occur after menopause (Linville, 2001Linville, S. E. (2001). Vocal Aging. Australia, San Diego: Singular Thomson Learning.
; Schötz, 2006Schötz, S. (2006). Perception, analysis and synthesis of speaker age. In Travaux de l’Institut de Linguistique de Lund (Vol. 47). Lund University: Linguistics and Phonetics.
; Sebastian, Babu, Oommen, & Ballraj, 2012Sebastian, S., Babu, S., Oommen, N. E., & Ballraj, A. (2012). Acoustic measurements of geriatric voice. Journal of Laryngology and Voice, 2(2), 81-84. http://dx.doi.org/10.4103/2230-9748.106984
). For males the speaking f₀ appears to remain stable with age, based on the regression lines. Nonetheless, the analysis of the same data by age groups in Albuquerque et al. (2021)Albuquerque, L., Valente, A. R. S., Teixeira, A., Oliveira, C., & Figueiredo, D. (2021). Acoustic changes in spontaneous speech with age. VIII Congreso Internacional de Fonética Experimental, 3. Girona.
revealed that speaking f₀ decreases until the age group [65-79] and starts to rise after that age, with an increase of about 10 Hz in the older group, which presented the highest mean value. Thus, a nonlinear variation of speaking f₀ with age, which has been reported in other studies (Titze, 1994Titze, I. R. (1994). Voice classification and life-span changes. In Principles of Voice Production (pp. 169-190). Englewood Cliffs, New Jersey: Prentice Hall.
), was also observed in the data. The increase of speaking f₀ in older males may be associated with the muscle atrophy, or with an increase in stiffness of vocal folds tissue with aging (Higgins & Saxman, 1991Higgins, M. B., & Saxman, J. H. (1991). A Comparison of selected phonatory behaviors of healthy aged and young adults. J Speech Hear Res, 34(5), 1000-1010. http://dx.doi.org/10.1044/jshr.3405.1000
; Linville, 2001Linville, S. E. (2001). Vocal Aging. Australia, San Diego: Singular Thomson Learning.
; Sebastian et al., 2012Sebastian, S., Babu, S., Oommen, N. E., & Ballraj, A. (2012). Acoustic measurements of geriatric voice. Journal of Laryngology and Voice, 2(2), 81-84. http://dx.doi.org/10.4103/2230-9748.106984
). Thus, speaking f₀ tends to converge across genders as age increases.

Lastly, females exhibit higher values of HNR than males, as in most studies (Ambreen et al., 2019Ambreen, S., Bashir, N., Tarar, S. A., & Kausar, R. (2019). Acoustic analysis of normal voice patterns in Pakistani adults. Journal of Voice, 33(1), 124.e49-124.e58. http://dx.doi.org/10.1016/j.jvoice.2017.09.003
; Dehqan et al., 2013Dehqan, A., Scherer, R. C., Dashti, G., Ansari-Moghaddam, A., & Fanaie, S. (2013). The effects of aging on acoustic parameters of voice. Folia Phoniatrica et Logopaedica, 64(6), 265-270. http://dx.doi.org/10.1159/000343998
; Goy et al., 2013Goy, H., Fernandes, D. N., Pichora-Fuller, M. K., & van Lieshout, P. (2013). Normative voice data for younger and older adults. Journal of Voice, 27(5), 545-555. http://dx.doi.org/10.1016/j.jvoice.2013.03.002
; Schötz, 2006Schötz, S. (2006). Perception, analysis and synthesis of speaker age. In Travaux de l’Institut de Linguistique de Lund (Vol. 47). Lund University: Linguistics and Phonetics.
). The significant decrease of HNR with age in females is also consistent with past studies (Dehqan et al., 2013Dehqan, A., Scherer, R. C., Dashti, G., Ansari-Moghaddam, A., & Fanaie, S. (2013). The effects of aging on acoustic parameters of voice. Folia Phoniatrica et Logopaedica, 64(6), 265-270. http://dx.doi.org/10.1159/000343998
; Ferrand, 2002Ferrand, C. T. (2002). Harmonics-to-Noise Ratio: An index of vocal aging. Journal of Voice, 16(4), 480-487.
; Xue & Deliyski, 2001Xue, S. A., & Deliyski, D. (2001). Effects of aging on selected acoustic voice parameters: preliminary normative data and educational implications. Educational Gerontology, 27(2), 159-168.
), and reflects more additive noise in the voiced signal (Dehqan et al., 2013Dehqan, A., Scherer, R. C., Dashti, G., Ansari-Moghaddam, A., & Fanaie, S. (2013). The effects of aging on acoustic parameters of voice. Folia Phoniatrica et Logopaedica, 64(6), 265-270. http://dx.doi.org/10.1159/000343998
; Ferrand, 2002Ferrand, C. T. (2002). Harmonics-to-Noise Ratio: An index of vocal aging. Journal of Voice, 16(4), 480-487.
; Schötz, 2006Schötz, S. (2006). Perception, analysis and synthesis of speaker age. In Travaux de l’Institut de Linguistique de Lund (Vol. 47). Lund University: Linguistics and Phonetics.
). According to Mueller (1997)Mueller, P. B. (1997). The aging voice. Seminars in Speech and Language, 18(2), 159-168. http://dx.doi.org/10.1055/s-2008-1064070
, a lower HNR would be predicted because hoarseness tends to be more prevalent in the voices of older speakers due to decrements in laryngeal function, such as ossification of cartilage, degeneration of muscle, connective tissue, and neural tissue, as well as respiratory changes (Dehqan et al., 2013Dehqan, A., Scherer, R. C., Dashti, G., Ansari-Moghaddam, A., & Fanaie, S. (2013). The effects of aging on acoustic parameters of voice. Folia Phoniatrica et Logopaedica, 64(6), 265-270. http://dx.doi.org/10.1159/000343998
; Mueller, 1997Mueller, P. B. (1997). The aging voice. Seminars in Speech and Language, 18(2), 159-168. http://dx.doi.org/10.1055/s-2008-1064070
; Titze, 1994Titze, I. R. (1994). Voice classification and life-span changes. In Principles of Voice Production (pp. 169-190). Englewood Cliffs, New Jersey: Prentice Hall.
).

For males, almost no age-related variation was found in HNR, as in Ambreen et al. (2019)Ambreen, S., Bashir, N., Tarar, S. A., & Kausar, R. (2019). Acoustic analysis of normal voice patterns in Pakistani adults. Journal of Voice, 33(1), 124.e49-124.e58. http://dx.doi.org/10.1016/j.jvoice.2017.09.003
, Dunashova (2021)Dunashova, A. (2021). Intraspeaker variability of a professional lecturer: Ageing, genre, pragmatics vs. voice acting (Case study). International Conference on Speech and Computer, 179-189. http://dx.doi.org/10.1007/978-3-030-87802-3_17
and Goy et al. (2013)Goy, H., Fernandes, D. N., Pichora-Fuller, M. K., & van Lieshout, P. (2013). Normative voice data for younger and older adults. Journal of Voice, 27(5), 545-555. http://dx.doi.org/10.1016/j.jvoice.2013.03.002
. However, these results should be considered with caution, since the type of speech sample used in the present research did not consist of sustained vowels. Still, a longitudinal study (Dunashova, 2021Dunashova, A. (2021). Intraspeaker variability of a professional lecturer: Ageing, genre, pragmatics vs. voice acting (Case study). International Conference on Speech and Computer, 179-189. http://dx.doi.org/10.1007/978-3-030-87802-3_17
), using reading samples of one male speaker at different ages (age 59 and age 74), found no changes in HNR with age.

Conclusions on the effects of aging on spontaneous speech should be drawn with caution due to differences in the recording environment, and because of the automatic extraction procedures. Although not all labeled syllables were manually verified, they were obtained using the same procedure for all speakers.

6. CONCLUSION

⌅

This article explores the age effects at the suprasegmental level in EP spontaneous speech. These results provide, essentially, a point of departure to establish the normal patterns of rhythm and intonation in spontaneous speech across age among adult Portuguese native speakers.

Fundamentally, as age progresses, male speakers tend to talk less, with a decrease in variability of speech interval duration and with higher pause time. Female participants present a significant increase in speech and articulation rate and also a HNR decrease, which means that women speak faster and with a lower voice quality.

Additional studies about HNR and articulation rate are required, and longitudinal studies might provide additional insights to the question of how getting older modifies the characteristics of spontaneous speech.

Age and gender effects in European Portuguese spontaneous speech

Efectos de la edad y del sexo en el habla espontánea en portugués europeo

1. INTRODUCTION

2. RELATED WORK

3. METHOD

3.1. Participants

3.2. Corpus and recording protocol

3.3. Suprasegmental annotation

3.4. Acoustic measurements

3.5. Statistical Analysis

4. RESULTS

4.1. Rhythmic parameters

4.2. Speaking f₀

4.3. Harmonics-to-Noise Ratio

5. DISCUSSION

6. CONCLUSION

7. ACKNOLEDGEMENTS

REFERENCES

1. INTRODUCTION

2. RELATED WORK

3. METHOD

3.1. Participants

3.2. Corpus and recording protocol

3.3. Suprasegmental annotation

3.4. Acoustic measurements

3.5. Statistical Analysis

4. RESULTS

4.1. Rhythmic parameters

4.2. Speaking f0

4.3. Harmonics-to-Noise Ratio

5. DISCUSSION

6. CONCLUSION

7. ACKNOLEDGEMENTS

REFERENCES

4.2. Speaking f₀