L2 rhythm acquisition and rhythmic transfer in Galician learners of English: A pilot study

1. INTRODUCTION

⌅

The concept of linguistic rhythm was first put forward in the 20th century by Pike (1945)Pike, K. (1945). The Intonation of American English. Ann Arbor, MI: University of Michigan Press.
and Abercrombie (1967)Abercrombie, D. (1967). Elements of General Phonetics. Edinburgh: Edinburgh University Press.
, who defined it in general terms as isochrony of speech intervals. In the following decades, a substantial amount of phonological research (e.g. Dauer, 1983Dauer, R. (1983). Stress-timing and syllable-timing reanalyzed. Journal of Phonetics, 11, 51-62.
; Bertinetto, 1989Bertinetto P.M. (1989). Reflections of the dichotomy “stress” vs. “syllable-timing”. Revue de Phonétique Appliquée, 91-93, 99-130.
) explored the rhythmic characterization of languages and suggested that languages perceived as “stress-timed” show complex consonant clusters and vocalic reduction, while languages perceived as “syllable-timed” do not show vowel reduction and have much simpler syllabic structures. In recent years, some researchers (e.g. Ramus, Nespor and Mehler, 1999Ramus, F., Nespor, M., & Mehler, J. (1999). Correlates of linguistic rhythm in the speech signal. Cognition, 73, 265-292. https://doi.org/10.1016/S0010-0277(00)00101-3
; Grabe and Low, 2002Grabe, E. & Low E.L. (2002). Durational variability in speech and the rhythm class hypothesis. In C. Gussenhoven & N. Warner (Eds.), Papers in Laboratory Phonology 7 (pp. 515-546). Berlin: De Gruyter. https://doi.org/10.1515/9783110197105.2.515
; Dellwo, 2006Dellwo, V. (2006). Rhythm and Speech Rate: A Variation Coefficient for deltaC. In P. Karnowski & I. Szigeti (Eds.), Language and Language-processing (pp. 231-241). Bern: Peter Lang. https://doi.org/10.5167/uzh-111789
) have discussed and put to practice various quantifiable measures to prove the phonetic reality of such classification and thus confirm the existence of objective, acoustically measurable differences between stress-timed and syllable-timed languages. Although the existing rhythm metrics have recently received some criticism (see Ross, Ferjan and Arvaniti, 2008Ross, T., Ferjan, N., & Arvaniti, A. (2008). On the reliability of rhythm metrics. The Journal of the Acoustical Society of America, 124, 2495. https://doi.org/10.1121/1.4782810
; Arvaniti, 2009Arvaniti, A. (2009). Rhythm, timing and the timing of rhythm. Phonetica, 66(1-2), 46-63. https://doi.org/10.1159/000208930
; Kohler, 2009Kohler, K.J. (2009). Rhythm in speech and language. Phonetica, 66(1-2), 29-45. https://doi.org/10.1159/000208929
; Arvaniti, 2012Arvaniti, A. (2012). The usefulness of metrics in the quantification of speech rhythm. Journal of Phonetics, 40, 351-373. https://doi.org/10.1016/j.wocn.2012.02.003
; Rathcke and Smith, 2015Rathcke, T.V. & Smith, R.H. (2015). Speech timing and linguistic rhythm: On the acoustic bases of rhythm typologies. The Journal of the Acoustical Society of America, 137(5), 2834-2845. https://doi.org/10.1121/1.4919322
), they continue to be thoroughly applied in linguistic research, particularly when aimed at finding cross-linguistic rhythmic differences and looking into the differences in rhythmic patterns between L1 and L2 speech (Gabriel and Kireva, 2014Gabriel, C., & Kireva, E. (2014). Speech rhythm and vowel raising in Bulgarian Judeo-Spanish. In N. Campbell, D. Gibbon, & D. Hirst (Eds.), Proceedings of Speech Prosody 2014 (pp. 728-732). https://doi.org/10.21437/speechprosody.2014-133
). This paper presents an investigation into rhythm acquisition in a foreign language using what we could call the ‘classic’ metrics, developed by Ramus, Nespor and Mehler (1999)Ramus, F., Nespor, M., & Mehler, J. (1999). Correlates of linguistic rhythm in the speech signal. Cognition, 73, 265-292. https://doi.org/10.1016/S0010-0277(00)00101-3
, and Grabe and Low (2002)Grabe, E. & Low E.L. (2002). Durational variability in speech and the rhythm class hypothesis. In C. Gussenhoven & N. Warner (Eds.), Papers in Laboratory Phonology 7 (pp. 515-546). Berlin: De Gruyter. https://doi.org/10.1515/9783110197105.2.515
, which will be explained in section 3.

It must be noted that as much as the research on cross-linguistic rhythmic differences has evolved, the attainment, production and evolution of L2 speech rhythm has been paid comparatively little attention in the literature on L2 acquisition and linguistic transfer, although some recent studies have focused on this issue (see, for instance, White and Mattys, 2007aWhite, L. & Mattys, S.L. (2007a) Rhythmic typology and variation in first and second languages. In P. Prieto, Mascaró, & M.J. Solé (Eds.) Segmental and Prosodic Issues in Romance Phonology (pp. 237-257). Amsterdam: John Benjamins. https://doi.org/10.1075/cilt.282.16whi
and 2007bWhite, L. & Mattys, S.L. (2007b). Calibrating rhythm: First language and second language studies. Journal of Phonetics, 35(4), 501-522. https://doi.org/10.1016/j.wocn.2007.02.003
; Kinoshita and Sheppard, 2011Kinoshita, K. & Sheppard, C. (2011). Validating acoustic measures of speech rhythm for second language acquisition. In W. S. Lee & E. Zee (Eds.), Proceedings of the 17th International Congress of Phonetic Sciences (pp. 1086-1089). Hong Kong: City University of Hong Kong.
; Gut, 2012Gut, U. (2012). Rhythm in L2 speech. Speech and language technology, 14/15, 83-94.
; Ordin and Polyanskaya, 2015Ordin, M. & Polyanskaya, L. (2015a). Acquisition of speech rhythm in a second language by learners with rhythmically different native languages. The Journal of the Acoustic Society of America, 138(2), 533-545. https://doi.org/10.1121/1.4923359
). In spite of the limited consideration that this topic has been given, the existing literature on L2 prosodic acquisition generally agrees that L2 rhythm is indeed a real challenge for language learners (Kinoshita and Sheppard, 2011Kinoshita, K. & Sheppard, C. (2011). Validating acoustic measures of speech rhythm for second language acquisition. In W. S. Lee & E. Zee (Eds.), Proceedings of the 17th International Congress of Phonetic Sciences (pp. 1086-1089). Hong Kong: City University of Hong Kong.
). The fundamental reason why L2 rhythm acquisition is problematic is the existence of prosodic transfer from L1 to L2, an idea which is implicitly or explicitly supported by the view that the elements making up the phonetic subsystems of L1 and L2 exist in a common phonological space, and so will necessarily influence one another (Flege, Schirru and MacKay, 2003Flege, J.E., Schirru, C., & MacKay, I.R.A. (2003). Interaction between the native and second language phonetic subsystems. Speech Communication, 40, 467-491. https://doi.org/10.1016/S0167-6393(02)00128-0
). As a natural consequence, L1 rhythmic transfer is taken to be a crucial factor in the process of L2 prosody learning (Rasier and Hiligsmann, 2007Rasier, L. & Hiligsmann, Ph. (2007). Prosodic transfer. Theoretical and issues. Nouveaux cahiers de linguistique Franτaise, 28, 41-66.
).

Generally speaking, there seems to be a wide consensus with regard to the view that every adult learner’s point of departure when learning an L2 is their L1. In the process of L2 acquisition, especially when this commences after the childhood period, L2 phonological features interact with L1 phonology and are assumed to cause adult learners difficulty in acquiring native-like L2 phonology, as their L1 phonological system makes it difficult for them to perceive and, therefore, produce, a number of phonic features in the target language (see Broselow and Kang, 2013Broselow, E. & Kang, Y. (2013). Second language phonology and speech. In Herschensohn, J. & M. Young-Scholten (Eds.), The Cambridge Handbook of Second Language Acquisition (pp. 529-554). Cambridge: Cambridge University Press. https://doi.org/10.1017/CBO9781139051729
). In fact, it has been observed that, although adult L2 learners often attain native-like proficiency in syntactic, morphological, and lexical systems of L2, they rarely or never master the complete L2 sound system (Scovel, 1969Scovel, T. (1969). Foreign accents, language acquisition and cerebral dominance. Language Learning, 25, 209-235. https://doi.org/10.1111/j.1467-1770.1969.tb00466.x
and 1988Scovel, T. (1988). A Time to Speak: A Psycholinguistic Inquiry into the Critical Period for Human Speech. Cork, Ireland: Newbury House.
). Moreover, the mastery of suprasegmentals seems to emerge at very advanced stages in L2 pronunciation (James, 1988James, A. (1988). The Acquisition o f a Second Language Phonology. Tübingen: Narr.
; Brown, 2000Brown, H. (2000). Principles of Language Learning and Teaching. London: Pearson Education.
).

In order to explore this issue further, Li and Post (2014)Li, A. & Post, B. (2014). L2 acquisition of prosodic properties of speech rhythm: Evidence from L1 Mandarin and German learners of English. Studies in Second Language Acquisition, 36(2), 223-255. https://doi.org/10.1017/S0272263113000752
analyze the rhythm produced by Chinese (syllable-timed) and German (stress-timed) learners of English (also stress-timed) with intermediate or advanced proficiency level and conclude that while learners from both L1 backgrounds produce rhythm metric values that increasingly approach the L2 target-i.e. the higher their level of English, the more similar those measures are to the target-, their rhythmic development also shows signs of L1 transfer.

In a similar vein, Ordin and Polyanskaya (2015)Ordin, M. & Polyanskaya, L. (2015b). Acquisition of English speech rhythm by monolingual children. In INTERSPEECH-2015 (pp. 3120-3124). https://doi.org/10.21437/Interspeech.2015-628
compare French (syllable-timed) and German (stress-timed) L2 learners of English at beginner/intermediate, and advanced/proficiency levels and corroborate Li and Post’s (2014)Li, A. & Post, B. (2014). L2 acquisition of prosodic properties of speech rhythm: Evidence from L1 Mandarin and German learners of English. Studies in Second Language Acquisition, 36(2), 223-255. https://doi.org/10.1017/S0272263113000752
conclusions in that rhythm metric values of both learner groups show that durational variability increases as L2 acquisition progresses. According to the authors, that could be an indicator of universal L2 acquisition development. Interestingly, though, they also show that while the most proficient German learners of English achieve target values, the French learners of English do not. This could indicate that L1 speakers of a syllable-timed language like French (and Chinese) find it more difficult to acquire the speech rhythm of a stress-timed language like English than L1 speakers of a different stress-timed language (like German), a statement that requires further exploration. In this regard, it is worth noting that Polyanskaya and Ordin (2015) have investigated the attainment of rhythmic patterns by monolingual English children and adults to show that the speech rhythm of children universally develops from more syllable-timed to more stress-timed as language acquisition progresses. In other words, the existence of a stress-timed rhythm implies the existence of a syllable-timed rhythm in an earlier developmental stage, but not vice versa, which proves that stress-timed rhythm is typologically more marked than syllable-timed rhythm (see Van Maastricht et al., 2019Van Maastricht, L., Krahmer, E., Swerts, M., & Prieto, P. (2019). Learning direction matters: A study on L2 rhythm acquisition by Dutch learners of Spanish and Spanish learners of Dutch. Studies in Second Language Acquisition, 41(1), 87-121. https://doi.org/10.1017/S0272263118000062
).

Although the general L2 adult learners’ goal as non-native speakers of the language is the quality of pronunciation that will not detract from their ability to communicate (Celce-Murcia, 1991Celce-Murcia, M. (Ed.) (1991). Teaching English as a Second or Foreign Language (2nd ed.). Boston, MA: Heinle & Heinle Publishers.
), native-like pronunciation is taken to be the somewhat abstract ideal that L2 learners aim at, as it is automatically judged to be a synonym for “understandable pronunciation” or, in more general terms, “good pronunciation” (see Leather and James, 1991Leather, J. & James, A. (1991). The acquisition of second language speech. Studies in Second Language Acquisition, 13, 305-341. https://doi.org/10.1017/S0272263100010019
; Ryu, 2002Ryu, Y. (2002). Pronunciation of English as a second or foreign language learners [sic]: the reexamination of teaching pronunciation [Unpublished MA dissertation]. Missoula, USA: The University of Montana.
). L2 speech usually exhibits a certain degree of foreign accent resulting from the production of specific segmental and prosodic characteristics that differ from those produced by L1 speakers and is thus easily perceived by L1 speakers (see Polyanskaya, Ordin and Busa, 2016Polyanskaya, L., Ordin, M., & Busa, M.G. (2016). Relative salience of speech rhythm and speech rate on perceived foreign accent in a second language. Language and Speech, 60(3), 333-355. https://doi.org/10.1177/0023830916648720
).

The judgements regarding the level of native-like quality of a learner’s pronunciation have been complemented by the use of rhythm metrics to actually detect to what extent L2 rhythm resembles that of native speakers. The crucial study in this direction was carried out by White and Mattys (2007a)White, L. & Mattys, S.L. (2007a) Rhythmic typology and variation in first and second languages. In P. Prieto, Mascaró, & M.J. Solé (Eds.) Segmental and Prosodic Issues in Romance Phonology (pp. 237-257). Amsterdam: John Benjamins. https://doi.org/10.1075/cilt.282.16whi
, who first investigated how some metrics-chiefly, %V and VarcoV-reflect the evolution in the process of L2 acquisition in speakers of English and Spanish. In a later study (White and Mattys, 2007bWhite, L. & Mattys, S.L. (2007b). Calibrating rhythm: First language and second language studies. Journal of Phonetics, 35(4), 501-522. https://doi.org/10.1016/j.wocn.2007.02.003
), the same authors compare those results to L1 English speakers’ judgements and conclude that L1 English speakers rate Spanish speakers of English as more non-native-like when their VarcoV values are lower, i.e., when they are farther away from the typical native values for that metric (both American and British English yield higher VarcoV values than American and Iberian Spanish (Espinosa, 2019Espinosa, G.E. (2019). English speech rhythm in instructed learners. Its development as shown by VarcoV. In G. E. Espinosa, M. Fernández Beschtedt, P. A. Formiga, & M. A. Verdú (Eds.), Conocimiento y diversidad en el estudio y la enseñanza de lenguas, Neuquén: Universidad Nacional del Comahue (pp. 204-212). Retrieved from http://bibliotecadelenguas.uncoma.edu.ar/items/show/426
)). In order to master native English pronunciation, L2 learners must improve their production of vowel reduction in unstressed syllables as much as possible, as this is “a meaningful sign of the ongoing acquisition of English speech rhythm” (Wenk, 1986, p. 125Wenk, B. (1986). Crosslinguistic influence in sound language phonology: Speech rhythms. In E. Kellerman & M. Sharwood Smith (Eds.). Crosslinguistic Influence in Second Language Acquisition (pp. 120-133). Oxford: Pergamon Press Ltd.
). This is confirmed by rhythm metrics, as acquiring a more stress-timed rhythm necessarily implies a greater degree of variation in vowel duration.

As stated above, studies point to the idea that speech rhythm development proceeds from syllable- to stress-timed characteristics for L2 speakers and native-speaking children alike (see Ordin and Polyanskaya, 2014Ordin, M. & Polyanskaya, L. (2014). Development of timing patterns in first and second languages. System, 42, 244-257. https://doi.org/10.1016/j.system.2013.12.004
). Arguably, it is thanks to input analysis and practice that learners finally produce the prosodic properties of the target language. Indeed, although it has been acknowledged that “[p]ronunciation of adult L2 learners is particularly resistant to change, even if those learners have received targeted pronunciation instruction” (Kennedy and Trofimovich, 2010, p. 171Kennedy, S. & Trofimovich, P. (2010). Language awareness and second language pronunciation: A classroom study. Language Awareness, 19, 171-185. https://doi.org/10.1080/09658416.2010.486439
), studies like Tsiartsioni’s (2011)Tsiartsioni, E. (2011). Can pronunciation be taught? Teaching English speech rhythm to Greek students. In E. Kitis, N. Lavidas, N. Tpointzi, & T. Tsangalidis (Eds.), Proceedings of the 19th International Symposium on Theoretical and Applied Linguistics (ISTAL) (pp. 447-458). Tesalonica: Aristotle University of Thessaloniki.
analyze vocalic and consonant variability measures and observe an adaptation of a more English-like rhythm by those having received rhythm instruction. The question remains whether before getting to that final stage learners develop a phonological system which shows characteristics of their L1 and their L2, thus giving rise to an intermediate stage where the rhythm of the L2 is neither completely the same as that of the speakers’ L1 nor exactly the same as the rhythm of the target language (see Eckman, 2012Eckman, F.R. (2012). Second language phonology. In S. M. Gass & A. Mackey (Eds.), The Routledge Handbook of Second Language Acquisition (pp. 91-105). London: Routledge.
; Espinosa, 2018Espinosa, G.E. (2018). La adquisición del ritmo inglés por hablantes nativos de español. El caso de aprendientes argentinos en contexto de instrucción formal [Unpublished PhD thesis]. Buenos Aires: Universidad de Buenos Aires, Argentina.
). The investigation of systematic similarities and differences between L1 and L2 speech rhythm has not yet shed any definitive findings, which entails that maybe the L2 speakers’ level of competence is indeed one of the most important variables to be borne in mind. More research needs to be carried out to show evidence for rhythmic differences between L2 learners of different proficiency levels or with different native language backgrounds (see Li and Post, 2014Li, A. & Post, B. (2014). L2 acquisition of prosodic properties of speech rhythm: Evidence from L1 Mandarin and German learners of English. Studies in Second Language Acquisition, 36(2), 223-255. https://doi.org/10.1017/S0272263113000752
). This leads to the question whether the rhythm metrics are actually able to differentiate between beginners’ and more advanced language learners’ speech rhythm, and whether they can trace rhythmic developments in language acquisition (Gut, 2012Gut, U. (2012). Rhythm in L2 speech. Speech and language technology, 14/15, 83-94.
).

The present article analyze the process of rhythm acquisition in two groups of Galician learners of English with different levels of proficiency in order to determine i) the existence of L1 rhythmic transfer in the acquisition of L2 rhythm, and ii) the influence of L2 proficiency level on the degree of rhythmic mastery on the adult learners’ part. The structure of the article is as follows: Section 2 delves into the objectives and hypotheses that conduct this study. Section 3 explains the methodology of the research study conducted. Section 4 explores the results obtained, and section 5 presents the discussion of those results. Finally, section 6 deals with the final conclusions of the study.

2. OBJECTIVES AND HYPOTHESES

⌅

As mentioned in the Introduction, this study analyzes the production of speech rhythm in the English language classroom by two groups of bilingual Galician/European Spanish learners whose dominant language is Galician and certified, at the moment of recording, a B1 and C1 level of English according to the Common European Framework of Reference for Languages [CEFR] (Council of Europe, 2001Council of Europe. (2001). Common European framework of reference for languages: Learning, teaching, assessment. Cambridge: Press Syndicate of the University of Cambridge.
). Crucially for our study, while British English is generally described as a stress-timed language (see, for instance, Ramus, Nespor and Mehler (1999)Ramus, F., Nespor, M., & Mehler, J. (1999). Correlates of linguistic rhythm in the speech signal. Cognition, 73, 265-292. https://doi.org/10.1016/S0010-0277(00)00101-3
, and Grabe ad Low (2002)Grabe, E. & Low E.L. (2002). Durational variability in speech and the rhythm class hypothesis. In C. Gussenhoven & N. Warner (Eds.), Papers in Laboratory Phonology 7 (pp. 515-546). Berlin: De Gruyter. https://doi.org/10.1515/9783110197105.2.515
), Galician has been shown to be a syllable-timed language in the few studies that have analyzed its rhythmic make-up (Roseano, 2021Roseano, P. (2021). Splitting the rhythmic continuum in rhythmic classes: An HCA approach [Poster presentation]. 4th Phonetics and Phonology in Europe (PaPE), June 21-23. Barcelona, Spain.
).

1. To compare rhythmically the production of English L1 and L2 so as to verify whether the production of English L2 is affected by rhythmic transfer from the learners’ L1.
2. To explore the development of non-native rhythm at different stages in the process of acquisition so as to find out whether the degree of rhythmic transfer decreases as the proficiency level in the foreign language increases (Gutiérrez Díez et al., 2008Gutiérrez Díez, F., Dellwo, V., Gavaldà, N., & Rosen, S. (2008). The development of measurable speech rhythm during second language acquisition. The Journal of the Acoustical Society of America, 123. https://doi.org/10.1121/1.2935815
  ; Gass and Selinker, 2008Gass, S.M. & Selinker, L. (2008). Second Language Acquisition: An Introductory Course (3rd edition). London: Routledge.
  ; Major, 2008Major, R.C. (2008). Transfer in Second Language Phonology - A review. In J. G. Hansen Edwards & M. L. Zampini (Eds.), Phonology and second language acquisition, Studies in Bilingualism (pp. 63-94). Amsterdam: John Benjamins. https://doi.org/10.1075/sibil.36.05maj
  ).

The hypotheses that conduct the research work and give rise to the above objectives are two, namely:

1. Galician learners of English will show intermediate rhythm scores between their L1 and their L2.
2. The degree of rhythmic transfer will be shown to be negatively correlated with the level of proficiency in the foreign language; in other words, increased L2 proficiency will lead to the production of more native-like rhythmic patterns.

3. METHODOLOGY

⌅

In order to conduct this investigation, two groups of native speakers of Galician currently studying a degree in Translation and Interpreting at the University of Vigo were handed a questionnaire to determine their language profile, and subsequently recorded in the soundproof booth at the Language and Cognition Lab of the University of Vigo with a Sennheiser MK4 Digital microphone connected to a Toshiba Satellite Click 2 Pro P30W-B.

The first group (henceforth, group A) comprised 6 subjects who, at the moment of being recorded, were doing their first year of the Translation and Interpreting degree at the University of Vigo, had received English lessons in primary and secondary school for a total twelve years, certified a B1 level of English (Table 1)-a necessary condition to enter the above-mentioned degree-and had not received any phonetic instruction prior to their first term at university. The second group (henceforth, group C) comprised 6 subjects who were doing their fourth year at university and certified a C1 level of English (Table 1)-i.e. they had passed their first year at university. In order to ascertain that the level of English reported by the subjects in groups A and C was accurate, they were evaluated by the first author of this article prior to the completion of the task that this investigation comprised. Group C individuals received explicit phonetic instruction for a period of twelve weeks during which they sat through one two-hour theoretical session per week where they were taught English segmental and prosodic phonology, and two two-hour practical sessions per week where they were instructed to improve their listening and speaking skills putting into practice the theoretical knowledge acquired in the theoretical sessions. In addition to the twelve-week period of formal instruction, group C subjects had lived in the UK for a period of four months.

Table 1. Group A (6 subjects studying Translation and Interpreting at the University of Vigo with a B1 level of English; none of them had spent any time in the UK) and Group C (6 subjects studying Translation and Interpreting at the University of Vigo with a C1 level of English; they had spent one term studying in the UK).

Group	Group A		Group C
Subject	Gender	Age	Gender	Age
1	Female	19	Male	21
2	Female	18	Male	21
3	Female	18	Female	21
4	Female	18	Female	21
5	Female	19	Female	21
6	Male	18	Female	21
		M = 18.3
		SD = 0.52

All twelve subjects had pursued primary and secondary education in Galicia and had taken the exam to access Galician universities, which includes Galician and Spanish tests, so they reported a very good command of both languages. In addition, they all reported to be Galician-dominant, as they had been raised with Galician as their primary language and continued to use it far more regularly than Spanish. Also, they were learning French as their second foreign language and did not speak any further languages. As far as gender is concerned, the majority of the subjects were female, since the majority of students of the above-mentioned faculty are female. Their age ranged from 18-19 (first year students belonging to group A) to 21 (third-year students belonging to group C).

Besides, a control group of 6 Southern British English (SBE) speakers (Table 2), and another control group of 6 Galician (GAL) speakers (Table 2) were recorded in order to be able to compare the rhythm of the experimental subjects’ L1 and L2, and facilitate the observation of developmental patterns in the production of the subjects’ L2 rhythm. In this regard, the degree of difference or similarity with the target language (in this case, SBE) allows us to observe the evolution in L2 rhythm production on the part of the two experimental groups. It must also be noted that the English control group comprised 6 speakers of Southern British English because this variety is representative of the largest amount of input that learners receive in an instructional context (classroom videos and audios) and is the variety that was spoken in the geographical areas in the UK where C1 subjects spent their study-abroad period.

Table 2. Control group 1 (6 Southern British English recorded in the Phonetics Lab at the University of Barcelona) and Control group 2 (6 Galician speakers recorded in the Language and Cognition Lab at the University of Vigo).

Group	Control group 1		Control group 2
Subject	Gender	Age	Gender	Age
1	Male	43	Female	19
2	Male	35	Female	18
3	Male	21	Female	18
4	Female	21	Female	18
5	Female	22	Female	19
6	Female	31	Male	18
		M = 28.8		M = 18.3
		SD = 9.1		SD = 0.52

The recordings of the subjects in the SBE control group were carried out in a soundproof booth at the Phonetics Lab of the University of Barcelona using a Zoom H4n Pro digital recorder and a Shure SM58microphone, while those of the subjects in the GAL control group were done in the Language and Cognition Lab of the University of Vigo with the same equipment used for groups A and C.

The data were obtained by means of a reading task where subjects had to read the text The North Wind and the Sun (Handbook of the IPA, 1999, p. 39IPA (1999). Handbook of the International Phonetic Association. A Guide to the Use of the International Phonetic Alphabet. Cambridge: Cambridge University Press.
) in Galician and/or in English (Figure 1). Speakers were asked to read aloud at a normal speech rate and the resulting readings of the text took approximately 30 seconds for each informant, which implied that reading pace was spontaneously controlled for. They were asked to make pauses only where commas or full stops appeared in the text (this requirement is due to the fact that we wanted to have the same number of pauses in all recordings of each language, since pre-pausal lengthening might have an effect on the duration of intervals). If a speaker hesitated, mispronounced, made extra pauses, etc., we asked them to read the sentence again.

Figure 1. The North Wind and the Sun in Galician and in English.

medium/medium-LOQUENS-9-1-2-e087-gf1.png

The acoustic analysis was carried out with Praat (Boersma and Weenink, 2019Boersma, P. & Weenink, D. (2019). Praat: Doing phonetics by computer [Computer program]. Version 6.1.07, retrieved from http://www.praat.org/
). For each recording, the vocalic and consonantal intervals were annotated in a textgrid following the same criteria as in Kireva and Gabriel (2015Kireva, E. & Gabriel, C. (2015). Rhythmic properties of a contact variety: Comparing read and semi-spontaneous speech in Argentinean Porteño Spanish. In M. Avanzi, E. Delais-Roussarie & S. Herment (Eds.), Prosody and Languages in Contact (pp. 149-168). Berlin: Springer.
), namely:

boundaries between V and C intervals were determined on the basis of formant structure and pitch;
we treated glides as belonging to the V intervals;
for unvoiced plosives and affricates following a pause, the beginning was placed at 0.05s prior to the burst, given that their boundaries can hardly be determined on the basis of other criteria;
pauses and material affected by any kind of speech disfluency were excluded from the analysis.

The textgrids were then processed by means of Correlatore (Mairano and Romano, 2010Mairano, P. & Romano, A. (2010). Un confronto tra diverse metriche ritmiche usando Correlatore. In S. Schmid, M. Schwarzenbach, & D. Studer (Eds.), La dimensione temporale del parlato (pp. 79-100). Bern: EDK.
) in order to calculate and plot vocalic and consonantal rhythm metrics. The metrics used in this study are, as mentioned in the Introduction, the ‘classic’ metrics created by Ramus, Nespor and Mehler (1999)Ramus, F., Nespor, M., & Mehler, J. (1999). Correlates of linguistic rhythm in the speech signal. Cognition, 73, 265-292. https://doi.org/10.1016/S0010-0277(00)00101-3
(Figure 2), and by Grabe and Low (2002)Grabe, E. & Low E.L. (2002). Durational variability in speech and the rhythm class hypothesis. In C. Gussenhoven & N. Warner (Eds.), Papers in Laboratory Phonology 7 (pp. 515-546). Berlin: De Gruyter. https://doi.org/10.1515/9783110197105.2.515
(Figure 3), thus leaving to future experiments other prosodic factors-i.e. stress/intensity, pitch, speech rate, etc.-which may arguably have an effect on L1 and L2 rhythm as well as rhythm transfer. The metrics proposed by the previously mentioned authors are complementary, as Ramus, Nespor and Mehler (1999)Ramus, F., Nespor, M., & Mehler, J. (1999). Correlates of linguistic rhythm in the speech signal. Cognition, 73, 265-292. https://doi.org/10.1016/S0010-0277(00)00101-3
propose “global” rhythm metrics (insofar as the duration of each interval is compared with the duration of all other intervals in the recording), while Grabe and Low (2002)Grabe, E. & Low E.L. (2002). Durational variability in speech and the rhythm class hypothesis. In C. Gussenhoven & N. Warner (Eds.), Papers in Laboratory Phonology 7 (pp. 515-546). Berlin: De Gruyter. https://doi.org/10.1515/9783110197105.2.515
use “local” rhythm metrics (in the sense that the duration of each interval is compared with the duration of the following interval).

Figure 2. Formulae for the global metrics in .

medium/medium-LOQUENS-9-1-2-e087-gf2.png

Figure 3. Formulae for the global metrics in .

medium/medium-LOQUENS-9-1-2-e087-gf3.png

For the interpretation of the results presented in section 4, it is useful to remember that ΔV and VnPVI measure the variability of vocalic intervals, while ΔC and CrPVI measure the variability of consonant intervals. Finally, %V is related to both consonant and vocalic intervals. Ramus, Nespor and Mehler (1999)Ramus, F., Nespor, M., & Mehler, J. (1999). Correlates of linguistic rhythm in the speech signal. Cognition, 73, 265-292. https://doi.org/10.1016/S0010-0277(00)00101-3
suggest that the rhythm metrics reflect the phonological rhythmic properties of the languages. As stated above, given that stress-timed languages have a greater variety of syllable structures and favor consonant clusters, they present a comparatively higher ∆C value and, consequently, a lower %V value. In contrast to this, syllable-timed languages tend towards simpler syllable structures and disfavor consonant clusters, which translates into a lower ∆C value, and a higher %V value. With reference to ΔV, stress-timed languages allow for a comparatively greater number of phonological processes which affect vowels-vowel reduction, long-vs-short vowel contrast, vowel lengthening in certain phonological contexts-and thus have a higher ΔV value than syllable-timed languages, in which these phonological processes are not productive.

The metrics in Figures 2 and 3 capture the durational variability of speech intervals that corresponds to the auditory impression of stress- and syllable-timing. Romance languages like Spanish or Galician have been shown to have an overall lower degree of durational variability (i.e. low ΔV, ΔC, CrPVI and VnPVI values) and higher proportion of vocalic material (i.e. high %V) compared to Germanic languages like English, where high ΔV, ΔC, CrPVI and VnPVI values and low %V have been reported (e.g. Bunta and Ingram, 2007Bunta, F. & Ingram, D. (2007). The acquisition of speech rhythm by bilingual Spanish- and English-speaking 4- and 5-year-old children. Journal of Speech, Language, and Hearing Research, 50(4), 999-1014. https://doi.org/10.1044/1092-4388(2007/070)
; Payne et al., 2012Payne, E., Post, B., Astruc, Ll., Prieto, P. & Vanrell, M.M. (2012). Measuring child rhythm. Language and Speech, 55(2), 203-229. https://doi.org/10.1177/0023830911417687
; Prieto et al., 2012Prieto, P., Vanrell, M.M., Astruc, Ll., Payne, E., & Post B. (2012). Phonotactic and phrasal properties of speech rhythm. Evidence from Catalan, English, and Spanish. Speech Communication, 54(6), 681-702.
; Ramus, Nespor and Mehler, 1999Ramus, F., Nespor, M., & Mehler, J. (1999). Correlates of linguistic rhythm in the speech signal. Cognition, 73, 265-292. https://doi.org/10.1016/S0010-0277(00)00101-3
; White and Mattys, 2007aWhite, L. & Mattys, S.L. (2007a) Rhythmic typology and variation in first and second languages. In P. Prieto, Mascaró, & M.J. Solé (Eds.) Segmental and Prosodic Issues in Romance Phonology (pp. 237-257). Amsterdam: John Benjamins. https://doi.org/10.1075/cilt.282.16whi
and 2007bWhite, L. & Mattys, S.L. (2007b). Calibrating rhythm: First language and second language studies. Journal of Phonetics, 35(4), 501-522. https://doi.org/10.1016/j.wocn.2007.02.003
), which corresponds to a lower degree of stress-timing. Differences in rhythmic metrics between L1 and L2 speech have also been confirmed in several studies (Bond and Fokes, 1985Bond, Z.S. & Fokes, J. (1985). Non-native pattern of English syllable-timing. Journal of Phonetics, 13, 407-420.
; Grenon and White, 2008Grenon, I. & White, L. (2008). Acquiring rhythm: A comparison of L1 and L2 speakers of Canadian English and Japanese. Proceedings of the 32nd Boston University conference on language development (pp. 155-166). Boston: Boston University.
; Ordin and Polyanskaya, 2015Ordin, M. & Polyanskaya, L. (2015b). Acquisition of English speech rhythm by monolingual children. In INTERSPEECH-2015 (pp. 3120-3124). https://doi.org/10.21437/Interspeech.2015-628
; White and Mattys, 2007aWhite, L. & Mattys, S.L. (2007a) Rhythmic typology and variation in first and second languages. In P. Prieto, Mascaró, & M.J. Solé (Eds.) Segmental and Prosodic Issues in Romance Phonology (pp. 237-257). Amsterdam: John Benjamins. https://doi.org/10.1075/cilt.282.16whi
and 2007bWhite, L. & Mattys, S.L. (2007b). Calibrating rhythm: First language and second language studies. Journal of Phonetics, 35(4), 501-522. https://doi.org/10.1016/j.wocn.2007.02.003
). L2 English exhibits a lesser degree of stress-timing compared to L1 English, irrespective of the L1 of the learner (Li and Post, 2014Li, A. & Post, B. (2014). L2 acquisition of prosodic properties of speech rhythm: Evidence from L1 Mandarin and German learners of English. Studies in Second Language Acquisition, 36(2), 223-255. https://doi.org/10.1017/S0272263113000752
; Ordin and Polyanskaya, 2015Ordin, M. & Polyanskaya, L. (2015a). Acquisition of speech rhythm in a second language by learners with rhythmically different native languages. The Journal of the Acoustic Society of America, 138(2), 533-545. https://doi.org/10.1121/1.4923359
).

4. RESULTS

⌅

In this section, we shall present the results of our study by discussing the plots of the rhythm metrics obtained by means of Correlatore. We shall first present the results of the mean values for “global” rhythm metrics, and later the results of “local” rhythm metrics. At the end of the section, we will present the results of the speakers individually.

Figure 4 shows the mean values obtained for the ΔC, ΔV metrics for the four groups analyzed. i.e. SBE, GAL, A and C, while Figure 5 indicates the mean values rendered for the ΔC, %V metrics for the same four groups.

Figure 4. ΔC (Cdev), ΔV (Vperc).

medium/medium-LOQUENS-9-1-2-e087-gf4.png

Figure 5. ΔC (Cdev), %V (Vdev).

medium/medium-LOQUENS-9-1-2-e087-gf5.png

Figures 4 and 5 show, firstly, that GAL and SBE appear at the opposite corners of each plot. This is consistent with the fact that GAL is classified as a syllable-timed language (like Central Peninsular Spanish), while SBE is stress-timed. Secondly, English L2 groups (A in red, C in yellow) appear in an intermediate position between the speakers’ L1 and those same speakers’ foreign language (only the value of %V for C-speakers is slightly out of the diagonal, though not dramatically far). Furthermore, if we consider the level of EFL speakers, we observe that group A in red (subjects with a B1 level of English) is closer to L1, while group C, in yellow (students with a C1 level of English) is closer to L2.

Given the results obtained, three preliminary conclusions may be inferred. To start with, there is rhythmic transfer from the speakers’ L1 (Galician, syllable-timed) to their L2 (English, stress-timed). Not unimportantly, either, the level of proficiency has an effect on the degree of rhythmic transfer, as both consonantal and vocalic intervals are affected by transfer. Thus, the areas where groups A and C stand could be considered ‘interlanguage’ areas where the produced rhythm is neither the rhythm of the subjects’ L1 nor the rhythm of the target language.

The “local” metrics used by Grabe and Low (2002)Grabe, E. & Low E.L. (2002). Durational variability in speech and the rhythm class hypothesis. In C. Gussenhoven & N. Warner (Eds.), Papers in Laboratory Phonology 7 (pp. 515-546). Berlin: De Gruyter. https://doi.org/10.1515/9783110197105.2.515
provide results that are completely in line with those of the “global” metrics used by Ramus, Nespor and Mehler (1999)Ramus, F., Nespor, M., & Mehler, J. (1999). Correlates of linguistic rhythm in the speech signal. Cognition, 73, 265-292. https://doi.org/10.1016/S0010-0277(00)00101-3
. These results are desirable, insofar as all metrics give a cohesive picture of the phenomenon under study, and also foreseeable, since-like Ramus (2002)Ramus, F. (2002). Acoustic correlates of linguistic rhythm: Perspectives. In B. Bel & I. Marlien (Eds.), Proceedings of Speech Prosody 2002 (pp. 115-130). Aix-en-Provence: Université de Provence.
pointed out-the different rhythm metrics should give comparable results when the quantity of data analyzed is large enough. The mean values obtained for the CrPVI and VnPVI metrics in this study are offered in Figure 6.

Figure 6. CrPVI & VnPVI.

medium/medium-LOQUENS-9-1-2-e087-gf6.png

Figure 6 shows that, like for ΔC, ΔV (Figure 4), GAL and SBE appear at the opposite corners of the plot. Once again, this is consistent with the fact that GAL is indeed syllable-timed (like Central Peninsular Spanish), while SBE is stress-timed. Again, English L2 groups appear in an intermediate position between the speakers’ L1 and the speakers’ foreign language. Similarly, if the level of English L2 speakers is considered, we observe, once again, that A is closer to L1, while C is closer to L2.

The above observations provide us with information that confirms the preliminary conclusions previously stated, namely that there is rhythmic transfer from the speakers’ L1 to their L2, that both consonantal and vocalic intervals are affected by transfer, and that the level of proficiency has an effect on the degree of rhythmic transfer-the higher the proficiency level of the learners, the lesser the degree of rhythmic transfer observed.

If we analyze speakers one by one (Figures 7 and 8) a few more detailed considerations can be made. To begin with, on the whole, the distribution along the diagonal does not change in comparison to the data in Figures 4, 5 and 6. Nevertheless, in both Figures there is an area where the clouds of SBE speakers and C speakers overlap. This might mean that some C speakers display near native-like rhythm, which is consistent with the fact that they had a C1 level of English.

Figure 7. Deltas.

medium/medium-LOQUENS-9-1-2-e087-gf7.png

Figure 8. PVIs.

medium/medium-LOQUENS-9-1-2-e087-gf8.png

5. DISCUSSION

⌅

The objective of this study is twofold, as it aims at comparing the production of English L1 and L2 rhythmically so as to verify whether the production of English L2 is affected by rhythmic transfer from the learners’ L1, i.e. syllable-timed Galician, and determining whether the degree of rhythmic transfer decreases as the level of proficiency increases.

The initial hypotheses were that a) the transfer of the rhythm from Galician to English spoken by Galician speakers would be visible-English L2 would lie in an intermediate rhythmic position between GAL and SBE-and b) that the degree of rhythmic transfer would prove variable and directly linked to the learners’ proficiency level (group-C speakers would be closer to L2, while group-A speakers would be closer to GAL).

The statistical analysis carried out by means of Correlatore shows that both hypotheses are confirmed, insofar as the variety of English spoken by Galician speakers is predominantly located in an intermediate rhythmic position between GAL and SBE. As the degree of proficiency increases thanks to the explicit instruction-i.e. conducted in the classroom-and the non-explicit instruction-i.e. connected to the learners’ experience abroad-received by learners, the rhythm of L2 English resembles more and more that of SBE.

The results obtained may be summarized as follows:

There is rhythmic transfer from a syllable-timed L1 (GAL) to a stress-timed foreign language (ENG);
both C and V intervals are affected by rhythmic transfer;
the level of proficiency in the foreign language has an effect on the level of rhythmic transfer, i.e. the degree of rhythmic transfer from Galician L1 to English L2 decreases as the level of the foreign language increases;
there are individual differences (some C speakers have reached native-like rhythm).

The above results confirm previous research (Li and Post, 2014Li, A. & Post, B. (2014). L2 acquisition of prosodic properties of speech rhythm: Evidence from L1 Mandarin and German learners of English. Studies in Second Language Acquisition, 36(2), 223-255. https://doi.org/10.1017/S0272263113000752
; Ordin and Polyanskaya, 2015Ordin, M. & Polyanskaya, L. (2015a). Acquisition of speech rhythm in a second language by learners with rhythmically different native languages. The Journal of the Acoustic Society of America, 138(2), 533-545. https://doi.org/10.1121/1.4923359
) conducted with young adult speakers of different native tongues in that the higher the level of English on the part of the learners, the more similar the rhythm measures are to the target, no matter the learners’ L1. These results also prove that the signs of L1 transfer are present, albeit with a higher or lower predominance, at all stages of the learning process. Additionally, in line with Ordin and Polyanskaya’s (2015)Ordin, M. & Polyanskaya, L. (2015b). Acquisition of English speech rhythm by monolingual children. In INTERSPEECH-2015 (pp. 3120-3124). https://doi.org/10.21437/Interspeech.2015-628
observation, the present study shows that successfully acquiring stress-timed rhythm and, hence, a greater degree of variation in vowel duration, tends to prove considerably difficult for speakers of a syllable timed language-in this case, Galician. Besides, in line with Espinosa’s (2018)Espinosa, G.E. (2018). La adquisición del ritmo inglés por hablantes nativos de español. El caso de aprendientes argentinos en contexto de instrucción formal [Unpublished PhD thesis]. Buenos Aires: Universidad de Buenos Aires, Argentina.
observations, the present study corroborates Tsiartsioni’s (2011)Tsiartsioni, E. (2011). Can pronunciation be taught? Teaching English speech rhythm to Greek students. In E. Kitis, N. Lavidas, N. Tpointzi, & T. Tsangalidis (Eds.), Proceedings of the 19th International Symposium on Theoretical and Applied Linguistics (ISTAL) (pp. 447-458). Tesalonica: Aristotle University of Thessaloniki.
observation that learners who receive rhythm instruction adapt to a more English-like rhythm when speaking the language. Moreover, the data obtained and analyzed offers an answer to Tsiartsioni’s (2011)Tsiartsioni, E. (2011). Can pronunciation be taught? Teaching English speech rhythm to Greek students. In E. Kitis, N. Lavidas, N. Tpointzi, & T. Tsangalidis (Eds.), Proceedings of the 19th International Symposium on Theoretical and Applied Linguistics (ISTAL) (pp. 447-458). Tesalonica: Aristotle University of Thessaloniki.
question whether learners develop a phonological system that may give rise to an intermediate stage where the rhythm of the L2 is neither totally the same as that of the speakers’ L1 nor completely equal to the rhythm of the target language. In this regard, it does seem to be the case that learner English sits at a point in between the speakers’ L1 (Galician) and their L2 (British English) and moves towards their L2 as the level of proficiency increases.

6. CONCLUSIONS

⌅

The present study has looked into the production of English L1 and L2 rhythm by two groups of Galician-dominant learners with a B1 and a C1 level of English. The study pursued two main objectives, namely to determine i) whether a certain degree of rhythmic transfer from the learners’ L1 would show in the results of the rhythmic measurements and, ii) if so, whether the degree of rhythmic transfer observed would be lower in the case of students with a higher proficiency level of English who had also received explicit phonetic instruction for twelve weeks and have spent a university term in England. The initial hypotheses stated that a certain amount of transfer would be revealed in the results of the measurements, and that the degree of transfer would be negatively correlated with the level of proficiency in English.

The twelve experimental subjects were made to read a text in Galician and in English, and the results obtained were compared to the readings made by a control group of six native Galician speakers and another control group of six native British English speakers. The measurements conducted replicated Ramus, Nespor and Mehler’s (1999)Ramus, F., Nespor, M., & Mehler, J. (1999). Correlates of linguistic rhythm in the speech signal. Cognition, 73, 265-292. https://doi.org/10.1016/S0010-0277(00)00101-3
“global” rhythm metrics ΔV, ΔC, and %V, comparing the duration of each vocalic and consonantal interval with the duration of all other intervals in the recording, and Grabe and Low’s (2002)Grabe, E. & Low E.L. (2002). Durational variability in speech and the rhythm class hypothesis. In C. Gussenhoven & N. Warner (Eds.), Papers in Laboratory Phonology 7 (pp. 515-546). Berlin: De Gruyter. https://doi.org/10.1515/9783110197105.2.515
“local” rhythm metrics VnPVI and CrPVI, where the duration of each interval was compared with the duration of the following interval.

The results prove that the rhythm of the foreign language is affected by the rhythmic properties of L1, particularly when the two have different rhythmic structures-as is the case with Galician and British English- and especially at low and intermediate levels of language proficiency. Moreover, as shown by the results of the global and local metrics put to the test, the evinced rhythmic transfer phenomenon from syllable-timed Galician to stress-timed English has an effect on both the consonant and the vowel intervals. This finding ratifies previous observations that L2 English exhibits a lesser degree of stress-timing compared to L1 English (Li and Post, 2014Li, A. & Post, B. (2014). L2 acquisition of prosodic properties of speech rhythm: Evidence from L1 Mandarin and German learners of English. Studies in Second Language Acquisition, 36(2), 223-255. https://doi.org/10.1017/S0272263113000752
; Ordin and Polyanskaya, 2015Ordin, M. & Polyanskaya, L. (2015b). Acquisition of English speech rhythm by monolingual children. In INTERSPEECH-2015 (pp. 3120-3124). https://doi.org/10.21437/Interspeech.2015-628
). Crucially, the results prove that the degree of rhythmic transfer from Galician to English decreases as the level of the foreign language increases, which corroborates the conclusions in previous studies (Espinosa, 2018Espinosa, G.E. (2018). La adquisición del ritmo inglés por hablantes nativos de español. El caso de aprendientes argentinos en contexto de instrucción formal [Unpublished PhD thesis]. Buenos Aires: Universidad de Buenos Aires, Argentina.
; Quesada, 2019). Specifically, these data endorse the results obtained by Li and Post (2014)Li, A. & Post, B. (2014). L2 acquisition of prosodic properties of speech rhythm: Evidence from L1 Mandarin and German learners of English. Studies in Second Language Acquisition, 36(2), 223-255. https://doi.org/10.1017/S0272263113000752
and by Ordin and Polyanskaya (2015)Ordin, M. & Polyanskaya, L. (2015a). Acquisition of speech rhythm in a second language by learners with rhythmically different native languages. The Journal of the Acoustic Society of America, 138(2), 533-545. https://doi.org/10.1121/1.4923359
, who argue that the rhythm metric values of the learner groups get closer to native speakers during the acquisition process.

Given that the research work presented has dealt with group results without delving into fine-grained individual analyses, it evinces the necessity to investigate in detail which specific elements within the acquisition process allow certain individuals to reach a more native-like rhythm than others and, more specifically, to what extent the combined facts that some individuals have received explicit phonetic instruction and have spent a term studying in the UK exert the same amount of influence on their overall performance or one of those two variables has a higher degree of influence on their performance with respect to rhythm. In this regard, although individual differences could be accounted for in terms of overseas experience, explicit phonetic instruction and/or proficiency level, it could also be the case that, all things being equal, differences regarding the acquisition of L2 rhythm may be due to individual aptitude, a variable that should also be addressed in future research.

Since the learner population where the experiment was conducted is bilingual Galician/Spanish, the effect of Spanish L3 on the rhythm of L2 English should not be underestimated. Given that Spanish, like Galician, is a syllable-timed language, future research may aim at determining whether any notable differences exist between Galician-dominant speakers, Spanish monolinguals and Galician/Spanish bilinguals regarding the process (i.e. time, performance level) of acquisition of English rhythm.

Last, the findings obtained may shed some light on what to prioritize in English pronunciation teaching, as they show that the gradual improvement in the pronunciation of segmentals in general and, more specifically, the mastering of vowel durations, may be taught as a corollary of rhythm instruction, since the two are intimately connected.