1. INTRODUCTION
⌅ The concept of linguistic rhythm was first put forward in the 20th century by Pike (1945)Pike, K. (1945). The Intonation of American English. Ann Arbor, MI: University of Michigan Press.
and Abercrombie (1967)Abercrombie, D. (1967). Elements of General Phonetics. Edinburgh: Edinburgh University Press.
,
who defined it in general terms as isochrony of speech intervals. In
the following decades, a substantial amount of phonological research
(e.g. Dauer, 1983Dauer, R. (1983). Stress-timing and syllable-timing reanalyzed. Journal of Phonetics, 11, 51-62.
; Bertinetto, 1989Bertinetto P.M. (1989). Reflections of the dichotomy “stress” vs. “syllable-timing”. Revue de Phonétique Appliquée, 91-93, 99-130.
)
explored the rhythmic characterization of languages and suggested that
languages perceived as “stress-timed” show complex consonant clusters
and vocalic reduction, while languages perceived as “syllable-timed” do
not show vowel reduction and have much simpler syllabic structures. In
recent years, some researchers (e.g. Ramus, Nespor and Mehler, 1999Ramus, F., Nespor, M., & Mehler, J. (1999). Correlates of linguistic rhythm in the speech signal. Cognition, 73, 265-292. https://doi.org/10.1016/S0010-0277(00)00101-3
; Grabe and Low, 2002Grabe,
E. & Low E.L. (2002). Durational variability in speech and the
rhythm class hypothesis. In C. Gussenhoven & N. Warner (Eds.), Papers in Laboratory Phonology 7 (pp. 515-546). Berlin: De Gruyter. https://doi.org/10.1515/9783110197105.2.515
; Dellwo, 2006Dellwo, V. (2006). Rhythm and Speech Rate: A Variation Coefficient for deltaC. In P. Karnowski & I. Szigeti (Eds.), Language and Language-processing (pp. 231-241). Bern: Peter Lang. https://doi.org/10.5167/uzh-111789
) have discussed and put to practice various
quantifiable measures to prove the phonetic reality of such
classification and thus confirm the existence of objective, acoustically
measurable differences between stress-timed and syllable-timed
languages. Although the existing rhythm metrics have recently received
some criticism (see Ross, Ferjan and Arvaniti, 2008Ross, T., Ferjan, N., & Arvaniti, A. (2008). On the reliability of rhythm metrics. The Journal of the Acoustical Society of America, 124, 2495. https://doi.org/10.1121/1.4782810
; Arvaniti, 2009Arvaniti, A. (2009). Rhythm, timing and the timing of rhythm. Phonetica, 66(1-2), 46-63. https://doi.org/10.1159/000208930
; Kohler, 2009Kohler, K.J. (2009). Rhythm in speech and language. Phonetica, 66(1-2), 29-45. https://doi.org/10.1159/000208929
; Arvaniti, 2012Arvaniti, A. (2012). The usefulness of metrics in the quantification of speech rhythm. Journal of Phonetics, 40, 351-373. https://doi.org/10.1016/j.wocn.2012.02.003
; Rathcke and Smith, 2015Rathcke, T.V. & Smith, R.H. (2015). Speech timing and linguistic rhythm: On the acoustic bases of rhythm typologies. The Journal of the Acoustical Society of America, 137(5), 2834-2845. https://doi.org/10.1121/1.4919322
), they continue to be thoroughly applied in
linguistic research, particularly when aimed at finding cross-linguistic
rhythmic differences and looking into the differences in rhythmic
patterns between L1 and L2 speech (Gabriel and Kireva, 2014Gabriel,
C., & Kireva, E. (2014). Speech rhythm and vowel raising in
Bulgarian Judeo-Spanish. In N. Campbell, D. Gibbon, & D. Hirst
(Eds.), Proceedings of Speech Prosody 2014 (pp. 728-732). https://doi.org/10.21437/speechprosody.2014-133
). This paper presents an investigation into
rhythm acquisition in a foreign language using what we could call the
‘classic’ metrics, developed by Ramus, Nespor and Mehler (1999)Ramus, F., Nespor, M., & Mehler, J. (1999). Correlates of linguistic rhythm in the speech signal. Cognition, 73, 265-292. https://doi.org/10.1016/S0010-0277(00)00101-3
, and Grabe and Low (2002)Grabe,
E. & Low E.L. (2002). Durational variability in speech and the
rhythm class hypothesis. In C. Gussenhoven & N. Warner (Eds.), Papers in Laboratory Phonology 7 (pp. 515-546). Berlin: De Gruyter. https://doi.org/10.1515/9783110197105.2.515
, which will be explained in section 3.
It
must be noted that as much as the research on cross-linguistic rhythmic
differences has evolved, the attainment, production and evolution of L2
speech rhythm has been paid comparatively little attention in the
literature on L2 acquisition and linguistic transfer, although some
recent studies have focused on this issue (see, for instance, White and Mattys, 2007aWhite,
L. & Mattys, S.L. (2007a) Rhythmic typology and variation in first
and second languages. In P. Prieto, Mascaró, & M.J. Solé (Eds.) Segmental and Prosodic Issues in Romance Phonology (pp. 237-257). Amsterdam: John Benjamins. https://doi.org/10.1075/cilt.282.16whi
and 2007bWhite, L. & Mattys, S.L. (2007b). Calibrating rhythm: First language and second language studies. Journal of Phonetics, 35(4), 501-522. https://doi.org/10.1016/j.wocn.2007.02.003
; Kinoshita and Sheppard, 2011Kinoshita,
K. & Sheppard, C. (2011). Validating acoustic measures of speech
rhythm for second language acquisition. In W. S. Lee & E. Zee
(Eds.), Proceedings of the 17th International Congress of Phonetic Sciences (pp. 1086-1089). Hong Kong: City University of Hong Kong.
; Gut, 2012Gut, U. (2012). Rhythm in L2 speech. Speech and language technology, 14/15, 83-94.
; Ordin and Polyanskaya, 2015Ordin,
M. & Polyanskaya, L. (2015a). Acquisition of speech rhythm in a
second language by learners with rhythmically different native
languages. The Journal of the Acoustic Society of America, 138(2), 533-545. https://doi.org/10.1121/1.4923359
). In spite of the limited consideration that this
topic has been given, the existing literature on L2 prosodic
acquisition generally agrees that L2 rhythm is indeed a real challenge
for language learners (Kinoshita and Sheppard, 2011Kinoshita,
K. & Sheppard, C. (2011). Validating acoustic measures of speech
rhythm for second language acquisition. In W. S. Lee & E. Zee
(Eds.), Proceedings of the 17th International Congress of Phonetic Sciences (pp. 1086-1089). Hong Kong: City University of Hong Kong.
).
The fundamental reason why L2 rhythm acquisition is problematic is the
existence of prosodic transfer from L1 to L2, an idea which is
implicitly or explicitly supported by the view that the elements making
up the phonetic subsystems of L1 and L2 exist in a common phonological
space, and so will necessarily influence one another (Flege, Schirru and MacKay, 2003Flege, J.E., Schirru, C., & MacKay, I.R.A. (2003). Interaction between the native and second language phonetic subsystems. Speech Communication, 40, 467-491. https://doi.org/10.1016/S0167-6393(02)00128-0
). As a natural consequence, L1 rhythmic transfer is taken to be a crucial factor in the process of L2 prosody learning (Rasier and Hiligsmann, 2007Rasier, L. & Hiligsmann, Ph. (2007). Prosodic transfer. Theoretical and issues. Nouveaux cahiers de linguistique Franτaise, 28, 41-66.
).
Generally speaking, there seems to be a wide consensus with regard to
the view that every adult learner’s point of departure when learning an
L2 is their L1. In the process of L2 acquisition, especially when this
commences after the childhood period, L2 phonological features interact
with L1 phonology and are assumed to cause adult learners difficulty in
acquiring native-like L2 phonology, as their L1 phonological system
makes it difficult for them to perceive and, therefore, produce, a
number of phonic features in the target language (see Broselow and Kang, 2013Broselow, E. & Kang, Y. (2013). Second language phonology and speech. In Herschensohn, J. & M. Young-Scholten (Eds.), The Cambridge Handbook of Second Language Acquisition (pp. 529-554). Cambridge: Cambridge University Press. https://doi.org/10.1017/CBO9781139051729
). In fact, it has been observed that, although
adult L2 learners often attain native-like proficiency in syntactic,
morphological, and lexical systems of L2, they rarely or never master
the complete L2 sound system (Scovel, 1969Scovel, T. (1969). Foreign accents, language acquisition and cerebral dominance. Language Learning, 25, 209-235. https://doi.org/10.1111/j.1467-1770.1969.tb00466.x
and 1988Scovel, T. (1988). A Time to Speak: A Psycholinguistic Inquiry into the Critical Period for Human Speech. Cork, Ireland: Newbury House.
). Moreover, the mastery of suprasegmentals seems to emerge at very advanced stages in L2 pronunciation (James, 1988James, A. (1988). The Acquisition o f a Second Language Phonology. Tübingen: Narr.
; Brown, 2000Brown, H. (2000). Principles of Language Learning and Teaching. London: Pearson Education.
).
In order to explore this issue further, Li and Post (2014)Li,
A. & Post, B. (2014). L2 acquisition of prosodic properties of
speech rhythm: Evidence from L1 Mandarin and German learners of English. Studies in Second Language Acquisition, 36(2), 223-255. https://doi.org/10.1017/S0272263113000752
analyze the rhythm produced by Chinese
(syllable-timed) and German (stress-timed) learners of English (also
stress-timed) with intermediate or advanced proficiency level and
conclude that while learners from both L1 backgrounds produce rhythm
metric values that increasingly approach the L2 target-i.e. the higher
their level of English, the more similar those measures are to the
target-, their rhythmic development also shows signs of L1 transfer.
In a similar vein, Ordin and Polyanskaya (2015)Ordin, M. & Polyanskaya, L. (2015b). Acquisition of English speech rhythm by monolingual children. In INTERSPEECH-2015 (pp. 3120-3124). https://doi.org/10.21437/Interspeech.2015-628
compare French (syllable-timed) and German
(stress-timed) L2 learners of English at beginner/intermediate, and
advanced/proficiency levels and corroborate Li and Post’s (2014)Li,
A. & Post, B. (2014). L2 acquisition of prosodic properties of
speech rhythm: Evidence from L1 Mandarin and German learners of English. Studies in Second Language Acquisition, 36(2), 223-255. https://doi.org/10.1017/S0272263113000752
conclusions in that rhythm metric values of both
learner groups show that durational variability increases as L2
acquisition progresses. According to the authors, that could be an
indicator of universal L2 acquisition development. Interestingly,
though, they also show that while the most proficient German learners of
English achieve target values, the French learners of English do not.
This could indicate that L1 speakers of a syllable-timed language like
French (and Chinese) find it more difficult to acquire the speech rhythm
of a stress-timed language like English than L1 speakers of a different
stress-timed language (like German), a statement that requires further
exploration. In this regard, it is worth noting that Polyanskaya and
Ordin (2015) have investigated the attainment of rhythmic patterns by
monolingual English children and adults to show that the speech rhythm
of children universally develops from more syllable-timed to more
stress-timed as language acquisition progresses. In other words, the
existence of a stress-timed rhythm implies the existence of a
syllable-timed rhythm in an earlier developmental stage, but not vice
versa, which proves that stress-timed rhythm is typologically more
marked than syllable-timed rhythm (see Van Maastricht et al., 2019Van
Maastricht, L., Krahmer, E., Swerts, M., & Prieto, P. (2019).
Learning direction matters: A study on L2 rhythm acquisition by Dutch
learners of Spanish and Spanish learners of Dutch. Studies in Second Language Acquisition, 41(1), 87-121. https://doi.org/10.1017/S0272263118000062
).
Although the general L2 adult learners’
goal as non-native speakers of the language is the quality of
pronunciation that will not detract from their ability to communicate (Celce-Murcia, 1991Celce-Murcia, M. (Ed.) (1991). Teaching English as a Second or Foreign Language (2nd ed.). Boston, MA: Heinle & Heinle Publishers.
),
native-like pronunciation is taken to be the somewhat abstract ideal
that L2 learners aim at, as it is automatically judged to be a synonym
for “understandable pronunciation” or, in more general terms, “good
pronunciation” (see Leather and James, 1991Leather, J. & James, A. (1991). The acquisition of second language speech. Studies in Second Language Acquisition, 13, 305-341. https://doi.org/10.1017/S0272263100010019
; Ryu, 2002Ryu,
Y. (2002). Pronunciation of English as a second or foreign language
learners [sic]: the reexamination of teaching pronunciation [Unpublished
MA dissertation]. Missoula, USA: The University of Montana.
).
L2 speech usually exhibits a certain degree of foreign accent resulting
from the production of specific segmental and prosodic characteristics
that differ from those produced by L1 speakers and is thus easily
perceived by L1 speakers (see Polyanskaya, Ordin and Busa, 2016Polyanskaya,
L., Ordin, M., & Busa, M.G. (2016). Relative salience of speech
rhythm and speech rate on perceived foreign accent in a second language. Language and Speech, 60(3), 333-355. https://doi.org/10.1177/0023830916648720
).
The judgements regarding the level of
native-like quality of a learner’s pronunciation have been complemented
by the use of rhythm metrics to actually detect to what extent L2 rhythm
resembles that of native speakers. The crucial study in this direction
was carried out by White and Mattys (2007a)White,
L. & Mattys, S.L. (2007a) Rhythmic typology and variation in first
and second languages. In P. Prieto, Mascaró, & M.J. Solé (Eds.) Segmental and Prosodic Issues in Romance Phonology (pp. 237-257). Amsterdam: John Benjamins. https://doi.org/10.1075/cilt.282.16whi
, who first investigated how some metrics-chiefly,
%V and VarcoV-reflect the evolution in the process of L2 acquisition in
speakers of English and Spanish. In a later study (White and Mattys, 2007bWhite, L. & Mattys, S.L. (2007b). Calibrating rhythm: First language and second language studies. Journal of Phonetics, 35(4), 501-522. https://doi.org/10.1016/j.wocn.2007.02.003
), the same authors compare those results to L1
English speakers’ judgements and conclude that L1 English speakers rate
Spanish speakers of English as more non-native-like when their VarcoV
values are lower, i.e., when they are farther away from the typical
native values for that metric (both American and British English yield
higher VarcoV values than American and Iberian Spanish (Espinosa, 2019Espinosa,
G.E. (2019). English speech rhythm in instructed learners. Its
development as shown by VarcoV. In G. E. Espinosa, M. Fernández
Beschtedt, P. A. Formiga, & M. A. Verdú (Eds.), Conocimiento y diversidad en el estudio y la enseñanza de lenguas, Neuquén: Universidad Nacional del Comahue (pp. 204-212). Retrieved from http://bibliotecadelenguas.uncoma.edu.ar/items/show/426
)). In order to master native English
pronunciation, L2 learners must improve their production of vowel
reduction in unstressed syllables as much as possible, as this is “a
meaningful sign of the ongoing acquisition of English speech rhythm” (Wenk, 1986, p. 125Wenk,
B. (1986). Crosslinguistic influence in sound language phonology:
Speech rhythms. In E. Kellerman & M. Sharwood Smith (Eds.). Crosslinguistic Influence in Second Language Acquisition (pp. 120-133). Oxford: Pergamon Press Ltd.
).
This is confirmed by rhythm metrics, as acquiring a more stress-timed
rhythm necessarily implies a greater degree of variation in vowel
duration.
As stated above, studies point to the idea that speech
rhythm development proceeds from syllable- to stress-timed
characteristics for L2 speakers and native-speaking children alike (see Ordin and Polyanskaya, 2014Ordin, M. & Polyanskaya, L. (2014). Development of timing patterns in first and second languages. System, 42, 244-257. https://doi.org/10.1016/j.system.2013.12.004
). Arguably, it is thanks to input analysis and
practice that learners finally produce the prosodic properties of the
target language. Indeed, although it has been acknowledged that
“[p]ronunciation of adult L2 learners is particularly resistant to
change, even if those learners have received targeted pronunciation
instruction” (Kennedy and Trofimovich, 2010, p. 171Kennedy, S. & Trofimovich, P. (2010). Language awareness and second language pronunciation: A classroom study. Language Awareness, 19, 171-185. https://doi.org/10.1080/09658416.2010.486439
), studies like Tsiartsioni’s (2011)Tsiartsioni,
E. (2011). Can pronunciation be taught? Teaching English speech rhythm
to Greek students. In E. Kitis, N. Lavidas, N. Tpointzi, & T.
Tsangalidis (Eds.), Proceedings of the 19th International Symposium on Theoretical and Applied Linguistics (ISTAL) (pp. 447-458). Tesalonica: Aristotle University of Thessaloniki.
analyze vocalic and consonant variability measures and observe an
adaptation of a more English-like rhythm by those having received rhythm
instruction. The question remains whether before getting to that final
stage learners develop a phonological system which shows characteristics
of their L1 and their L2, thus giving rise to an intermediate stage
where the rhythm of the L2 is neither completely the same as that of the
speakers’ L1 nor exactly the same as the rhythm of the target language
(see Eckman, 2012Eckman, F.R. (2012). Second language phonology. In S. M. Gass & A. Mackey (Eds.), The Routledge Handbook of Second Language Acquisition (pp. 91-105). London: Routledge.
; Espinosa, 2018Espinosa,
G.E. (2018). La adquisición del ritmo inglés por hablantes nativos de
español. El caso de aprendientes argentinos en contexto de instrucción
formal [Unpublished PhD thesis]. Buenos Aires: Universidad de Buenos
Aires, Argentina.
). The investigation of systematic
similarities and differences between L1 and L2 speech rhythm has not yet
shed any definitive findings, which entails that maybe the L2 speakers’
level of competence is indeed one of the most important variables to be
borne in mind. More research needs to be carried out to show evidence
for rhythmic differences between L2 learners of different proficiency
levels or with different native language backgrounds (see Li and Post, 2014Li,
A. & Post, B. (2014). L2 acquisition of prosodic properties of
speech rhythm: Evidence from L1 Mandarin and German learners of English. Studies in Second Language Acquisition, 36(2), 223-255. https://doi.org/10.1017/S0272263113000752
). This leads to the question whether the rhythm
metrics are actually able to differentiate between beginners’ and more
advanced language learners’ speech rhythm, and whether they can trace
rhythmic developments in language acquisition (Gut, 2012Gut, U. (2012). Rhythm in L2 speech. Speech and language technology, 14/15, 83-94.
).
The present article analyze the process of rhythm acquisition in two groups of Galician learners of English with different levels of proficiency in order to determine i) the existence of L1 rhythmic transfer in the acquisition of L2 rhythm, and ii) the influence of L2 proficiency level on the degree of rhythmic mastery on the adult learners’ part. The structure of the article is as follows: Section 2 delves into the objectives and hypotheses that conduct this study. Section 3 explains the methodology of the research study conducted. Section 4 explores the results obtained, and section 5 presents the discussion of those results. Finally, section 6 deals with the final conclusions of the study.
2. OBJECTIVES AND HYPOTHESES
⌅ As mentioned in the Introduction, this study analyzes the production of
speech rhythm in the English language classroom by two groups of
bilingual Galician/European Spanish learners whose dominant language is
Galician and certified, at the moment of recording, a B1 and C1 level of
English according to the Common European Framework of Reference for
Languages [CEFR] (Council of Europe, 2001Council of Europe. (2001). Common European framework of reference for languages: Learning, teaching, assessment. Cambridge: Press Syndicate of the University of Cambridge.
). Crucially for our study, while British English is generally described as a stress-timed language (see, for instance, Ramus, Nespor and Mehler (1999)Ramus, F., Nespor, M., & Mehler, J. (1999). Correlates of linguistic rhythm in the speech signal. Cognition, 73, 265-292. https://doi.org/10.1016/S0010-0277(00)00101-3
, and Grabe ad Low (2002)Grabe,
E. & Low E.L. (2002). Durational variability in speech and the
rhythm class hypothesis. In C. Gussenhoven & N. Warner (Eds.), Papers in Laboratory Phonology 7 (pp. 515-546). Berlin: De Gruyter. https://doi.org/10.1515/9783110197105.2.515
), Galician has been shown to be a syllable-timed language in the few studies that have analyzed its rhythmic make-up (Roseano, 2021Roseano, P. (2021). Splitting the rhythmic continuum in rhythmic classes: An HCA approach [Poster presentation]. 4th Phonetics and Phonology in Europe (PaPE), June 21-23. Barcelona, Spain.
).
-
-
To compare rhythmically the production of English L1 and L2 so as to verify whether the production of English L2 is affected by rhythmic transfer from the learners’ L1.
-
To explore the development of non-native rhythm at different stages in the process of acquisition so as to find out whether the degree of rhythmic transfer decreases as the proficiency level in the foreign language increases (Gutiérrez Díez et al., 2008Gutiérrez Díez, F., Dellwo, V., Gavaldà, N., & Rosen, S. (2008). The development of measurable speech rhythm during second language acquisition. The Journal of the Acoustical Society of America, 123. https://doi.org/10.1121/1.2935815
; Gass and Selinker, 2008Gass, S.M. & Selinker, L. (2008). Second Language Acquisition: An Introductory Course (3rd edition). London: Routledge.
; Major, 2008Major, R.C. (2008). Transfer in Second Language Phonology - A review. In J. G. Hansen Edwards & M. L. Zampini (Eds.), Phonology and second language acquisition, Studies in Bilingualism (pp. 63-94). Amsterdam: John Benjamins. https://doi.org/10.1075/sibil.36.05maj
).
-
The hypotheses that conduct the research work and give rise to the above objectives are two, namely:
-
-
Galician learners of English will show intermediate rhythm scores between their L1 and their L2.
-
The degree of rhythmic transfer will be shown to be negatively correlated with the level of proficiency in the foreign language; in other words, increased L2 proficiency will lead to the production of more native-like rhythmic patterns.
-
3. METHODOLOGY
⌅In order to conduct this investigation, two groups of native speakers of Galician currently studying a degree in Translation and Interpreting at the University of Vigo were handed a questionnaire to determine their language profile, and subsequently recorded in the soundproof booth at the Language and Cognition Lab of the University of Vigo with a Sennheiser MK4 Digital microphone connected to a Toshiba Satellite Click 2 Pro P30W-B.
The first group (henceforth, group A) comprised 6 subjects who, at the moment of being recorded, were doing their first year of the Translation and Interpreting degree at the University of Vigo, had received English lessons in primary and secondary school for a total twelve years, certified a B1 level of English (Table 1)-a necessary condition to enter the above-mentioned degree-and had not received any phonetic instruction prior to their first term at university. The second group (henceforth, group C) comprised 6 subjects who were doing their fourth year at university and certified a C1 level of English (Table 1)-i.e. they had passed their first year at university. In order to ascertain that the level of English reported by the subjects in groups A and C was accurate, they were evaluated by the first author of this article prior to the completion of the task that this investigation comprised. Group C individuals received explicit phonetic instruction for a period of twelve weeks during which they sat through one two-hour theoretical session per week where they were taught English segmental and prosodic phonology, and two two-hour practical sessions per week where they were instructed to improve their listening and speaking skills putting into practice the theoretical knowledge acquired in the theoretical sessions. In addition to the twelve-week period of formal instruction, group C subjects had lived in the UK for a period of four months.
Group | Group A | Group C | ||
---|---|---|---|---|
Subject | Gender | Age | Gender | Age |
1 | Female | 19 | Male | 21 |
2 | Female | 18 | Male | 21 |
3 | Female | 18 | Female | 21 |
4 | Female | 18 | Female | 21 |
5 | Female | 19 | Female | 21 |
6 | Male | 18 | Female | 21 |
M = 18.3 | ||||
SD = 0.52 |
All twelve subjects had pursued primary and secondary education in Galicia and had taken the exam to access Galician universities, which includes Galician and Spanish tests, so they reported a very good command of both languages. In addition, they all reported to be Galician-dominant, as they had been raised with Galician as their primary language and continued to use it far more regularly than Spanish. Also, they were learning French as their second foreign language and did not speak any further languages. As far as gender is concerned, the majority of the subjects were female, since the majority of students of the above-mentioned faculty are female. Their age ranged from 18-19 (first year students belonging to group A) to 21 (third-year students belonging to group C).
Besides, a control group of 6 Southern British English (SBE) speakers (Table 2), and another control group of 6 Galician (GAL) speakers (Table 2) were recorded in order to be able to compare the rhythm of the experimental subjects’ L1 and L2, and facilitate the observation of developmental patterns in the production of the subjects’ L2 rhythm. In this regard, the degree of difference or similarity with the target language (in this case, SBE) allows us to observe the evolution in L2 rhythm production on the part of the two experimental groups. It must also be noted that the English control group comprised 6 speakers of Southern British English because this variety is representative of the largest amount of input that learners receive in an instructional context (classroom videos and audios) and is the variety that was spoken in the geographical areas in the UK where C1 subjects spent their study-abroad period.
Group | Control group 1 | Control group 2 | ||
---|---|---|---|---|
Subject | Gender | Age | Gender | Age |
1 | Male | 43 | Female | 19 |
2 | Male | 35 | Female | 18 |
3 | Male | 21 | Female | 18 |
4 | Female | 21 | Female | 18 |
5 | Female | 22 | Female | 19 |
6 | Female | 31 | Male | 18 |
M = 28.8 | M = 18.3 | |||
SD = 9.1 | SD = 0.52 |
The recordings of the subjects in the SBE control group were carried out in a soundproof booth at the Phonetics Lab of the University of Barcelona using a Zoom H4n Pro digital recorder and a Shure SM58microphone, while those of the subjects in the GAL control group were done in the Language and Cognition Lab of the University of Vigo with the same equipment used for groups A and C.
The data were obtained by means of a reading task where subjects had to read the text The North Wind and the Sun (Handbook of the IPA, 1999, p. 39IPA (1999). Handbook of the International Phonetic Association. A Guide to the Use of the International Phonetic Alphabet. Cambridge: Cambridge University Press.
) in Galician and/or in English (Figure 1).
Speakers were asked to read aloud at a normal speech rate and the
resulting readings of the text took approximately 30 seconds for each
informant, which implied that reading pace was spontaneously controlled
for. They were asked to make pauses only where commas or full stops
appeared in the text (this requirement is due to the fact that we wanted
to have the same number of pauses in all recordings of each language,
since pre-pausal lengthening might have an effect on the duration of
intervals). If a speaker hesitated, mispronounced, made extra pauses,
etc., we asked them to read the sentence again.
The acoustic analysis was carried out with Praat (Boersma and Weenink, 2019Boersma, P. & Weenink, D. (2019). Praat: Doing phonetics by computer [Computer program]. Version 6.1.07, retrieved from http://www.praat.org/
). For each recording, the vocalic and consonantal intervals were annotated in a textgrid following the same criteria as in Kireva and Gabriel (2015Kireva,
E. & Gabriel, C. (2015). Rhythmic properties of a contact variety:
Comparing read and semi-spontaneous speech in Argentinean Porteño
Spanish. In M. Avanzi, E. Delais-Roussarie & S. Herment (Eds.), Prosody and Languages in Contact (pp. 149-168). Berlin: Springer.
), namely:
-
boundaries between V and C intervals were determined on the basis of formant structure and pitch;
-
we treated glides as belonging to the V intervals;
-
for unvoiced plosives and affricates following a pause, the beginning was placed at 0.05s prior to the burst, given that their boundaries can hardly be determined on the basis of other criteria;
-
pauses and material affected by any kind of speech disfluency were excluded from the analysis.
The textgrids were then processed by means of Correlatore (Mairano and Romano, 2010Mairano,
P. & Romano, A. (2010). Un confronto tra diverse metriche ritmiche
usando Correlatore. In S. Schmid, M. Schwarzenbach, & D. Studer
(Eds.), La dimensione temporale del parlato (pp. 79-100). Bern: EDK.
)
in order to calculate and plot vocalic and consonantal rhythm metrics.
The metrics used in this study are, as mentioned in the Introduction,
the ‘classic’ metrics created by Ramus, Nespor and Mehler (1999)Ramus, F., Nespor, M., & Mehler, J. (1999). Correlates of linguistic rhythm in the speech signal. Cognition, 73, 265-292. https://doi.org/10.1016/S0010-0277(00)00101-3
(Figure 2), and by Grabe and Low (2002)Grabe,
E. & Low E.L. (2002). Durational variability in speech and the
rhythm class hypothesis. In C. Gussenhoven & N. Warner (Eds.), Papers in Laboratory Phonology 7 (pp. 515-546). Berlin: De Gruyter. https://doi.org/10.1515/9783110197105.2.515
(Figure 3),
thus leaving to future experiments other prosodic factors-i.e.
stress/intensity, pitch, speech rate, etc.-which may arguably have an
effect on L1 and L2 rhythm as well as rhythm transfer. The metrics
proposed by the previously mentioned authors are complementary, as Ramus, Nespor and Mehler (1999)Ramus, F., Nespor, M., & Mehler, J. (1999). Correlates of linguistic rhythm in the speech signal. Cognition, 73, 265-292. https://doi.org/10.1016/S0010-0277(00)00101-3
propose “global” rhythm metrics (insofar as the
duration of each interval is compared with the duration of all other
intervals in the recording), while Grabe and Low (2002)Grabe,
E. & Low E.L. (2002). Durational variability in speech and the
rhythm class hypothesis. In C. Gussenhoven & N. Warner (Eds.), Papers in Laboratory Phonology 7 (pp. 515-546). Berlin: De Gruyter. https://doi.org/10.1515/9783110197105.2.515
use “local” rhythm metrics (in the sense that the
duration of each interval is compared with the duration of the
following interval).
.
.
For the interpretation of the
results presented in section 4, it is useful to remember that ΔV and
VnPVI measure the variability of vocalic intervals, while ΔC and CrPVI
measure the variability of consonant intervals. Finally, %V is related
to both consonant and vocalic intervals. Ramus, Nespor and Mehler (1999)Ramus, F., Nespor, M., & Mehler, J. (1999). Correlates of linguistic rhythm in the speech signal. Cognition, 73, 265-292. https://doi.org/10.1016/S0010-0277(00)00101-3
suggest that the rhythm metrics reflect the
phonological rhythmic properties of the languages. As stated above,
given that stress-timed languages have a greater variety of syllable
structures and favor consonant clusters, they present a comparatively
higher ∆C value and, consequently, a lower %V value. In contrast to
this, syllable-timed languages tend towards simpler syllable structures
and disfavor consonant clusters, which translates into a lower ∆C value,
and a higher %V value. With reference to ΔV, stress-timed languages
allow for a comparatively greater number of phonological processes which
affect vowels-vowel reduction, long-vs-short vowel contrast, vowel
lengthening in certain phonological contexts-and thus have a higher ΔV
value than syllable-timed languages, in which these phonological
processes are not productive.
The metrics in Figures 2 and 3 capture the durational variability of speech intervals that corresponds
to the auditory impression of stress- and syllable-timing. Romance
languages like Spanish or Galician have been shown to have an overall
lower degree of durational variability (i.e. low ΔV, ΔC, CrPVI and VnPVI
values) and higher proportion of vocalic material (i.e. high %V)
compared to Germanic languages like English, where high ΔV, ΔC, CrPVI
and VnPVI values and low %V have been reported (e.g. Bunta and Ingram, 2007Bunta,
F. & Ingram, D. (2007). The acquisition of speech rhythm by
bilingual Spanish- and English-speaking 4- and 5-year-old children. Journal of Speech, Language, and Hearing Research, 50(4), 999-1014. https://doi.org/10.1044/1092-4388(2007/070)
; Payne et al., 2012Payne, E., Post, B., Astruc, Ll., Prieto, P. & Vanrell, M.M. (2012). Measuring child rhythm. Language and Speech, 55(2), 203-229. https://doi.org/10.1177/0023830911417687
; Prieto et al., 2012Prieto,
P., Vanrell, M.M., Astruc, Ll., Payne, E., & Post B. (2012).
Phonotactic and phrasal properties of speech rhythm. Evidence from
Catalan, English, and Spanish. Speech Communication, 54(6), 681-702.
; Ramus, Nespor and Mehler, 1999Ramus, F., Nespor, M., & Mehler, J. (1999). Correlates of linguistic rhythm in the speech signal. Cognition, 73, 265-292. https://doi.org/10.1016/S0010-0277(00)00101-3
; White and Mattys, 2007aWhite,
L. & Mattys, S.L. (2007a) Rhythmic typology and variation in first
and second languages. In P. Prieto, Mascaró, & M.J. Solé (Eds.) Segmental and Prosodic Issues in Romance Phonology (pp. 237-257). Amsterdam: John Benjamins. https://doi.org/10.1075/cilt.282.16whi
and 2007bWhite, L. & Mattys, S.L. (2007b). Calibrating rhythm: First language and second language studies. Journal of Phonetics, 35(4), 501-522. https://doi.org/10.1016/j.wocn.2007.02.003
), which corresponds to a lower degree of
stress-timing. Differences in rhythmic metrics between L1 and L2 speech
have also been confirmed in several studies (Bond and Fokes, 1985Bond, Z.S. & Fokes, J. (1985). Non-native pattern of English syllable-timing. Journal of Phonetics, 13, 407-420.
; Grenon and White, 2008Grenon, I. & White, L. (2008). Acquiring rhythm: A comparison of L1 and L2 speakers of Canadian English and Japanese. Proceedings of the 32nd Boston University conference on language development (pp. 155-166). Boston: Boston University.
; Ordin and Polyanskaya, 2015Ordin, M. & Polyanskaya, L. (2015b). Acquisition of English speech rhythm by monolingual children. In INTERSPEECH-2015 (pp. 3120-3124). https://doi.org/10.21437/Interspeech.2015-628
; White and Mattys, 2007aWhite,
L. & Mattys, S.L. (2007a) Rhythmic typology and variation in first
and second languages. In P. Prieto, Mascaró, & M.J. Solé (Eds.) Segmental and Prosodic Issues in Romance Phonology (pp. 237-257). Amsterdam: John Benjamins. https://doi.org/10.1075/cilt.282.16whi
and 2007bWhite, L. & Mattys, S.L. (2007b). Calibrating rhythm: First language and second language studies. Journal of Phonetics, 35(4), 501-522. https://doi.org/10.1016/j.wocn.2007.02.003
). L2 English exhibits a lesser degree of stress-timing compared to L1 English, irrespective of the L1 of the learner (Li and Post, 2014Li,
A. & Post, B. (2014). L2 acquisition of prosodic properties of
speech rhythm: Evidence from L1 Mandarin and German learners of English. Studies in Second Language Acquisition, 36(2), 223-255. https://doi.org/10.1017/S0272263113000752
; Ordin and Polyanskaya, 2015Ordin,
M. & Polyanskaya, L. (2015a). Acquisition of speech rhythm in a
second language by learners with rhythmically different native
languages. The Journal of the Acoustic Society of America, 138(2), 533-545. https://doi.org/10.1121/1.4923359
).
4. RESULTS
⌅In this section, we shall present the results of our study by discussing the plots of the rhythm metrics obtained by means of Correlatore. We shall first present the results of the mean values for “global” rhythm metrics, and later the results of “local” rhythm metrics. At the end of the section, we will present the results of the speakers individually.
Figure 4 shows the mean values obtained for the ΔC, ΔV metrics for the four groups analyzed. i.e. SBE, GAL, A and C, while Figure 5 indicates the mean values rendered for the ΔC, %V metrics for the same four groups.
Figures 4 and 5 show, firstly, that GAL and SBE appear at the opposite corners of each plot. This is consistent with the fact that GAL is classified as a syllable-timed language (like Central Peninsular Spanish), while SBE is stress-timed. Secondly, English L2 groups (A in red, C in yellow) appear in an intermediate position between the speakers’ L1 and those same speakers’ foreign language (only the value of %V for C-speakers is slightly out of the diagonal, though not dramatically far). Furthermore, if we consider the level of EFL speakers, we observe that group A in red (subjects with a B1 level of English) is closer to L1, while group C, in yellow (students with a C1 level of English) is closer to L2.
Given the results obtained, three preliminary conclusions may be inferred. To start with, there is rhythmic transfer from the speakers’ L1 (Galician, syllable-timed) to their L2 (English, stress-timed). Not unimportantly, either, the level of proficiency has an effect on the degree of rhythmic transfer, as both consonantal and vocalic intervals are affected by transfer. Thus, the areas where groups A and C stand could be considered ‘interlanguage’ areas where the produced rhythm is neither the rhythm of the subjects’ L1 nor the rhythm of the target language.
The “local” metrics used by Grabe and Low (2002)Grabe,
E. & Low E.L. (2002). Durational variability in speech and the
rhythm class hypothesis. In C. Gussenhoven & N. Warner (Eds.), Papers in Laboratory Phonology 7 (pp. 515-546). Berlin: De Gruyter. https://doi.org/10.1515/9783110197105.2.515
provide results that are completely in line with those of the “global” metrics used by Ramus, Nespor and Mehler (1999)Ramus, F., Nespor, M., & Mehler, J. (1999). Correlates of linguistic rhythm in the speech signal. Cognition, 73, 265-292. https://doi.org/10.1016/S0010-0277(00)00101-3
. These results are desirable, insofar as all
metrics give a cohesive picture of the phenomenon under study, and also
foreseeable, since-like Ramus (2002)Ramus, F. (2002). Acoustic correlates of linguistic rhythm: Perspectives. In B. Bel & I. Marlien (Eds.), Proceedings of Speech Prosody 2002 (pp. 115-130). Aix-en-Provence: Université de Provence.
pointed out-the different rhythm metrics should give comparable results
when the quantity of data analyzed is large enough. The mean values
obtained for the CrPVI and VnPVI metrics in this study are offered in Figure 6.
Figure 6 shows that, like for ΔC, ΔV (Figure 4), GAL and SBE appear at the opposite corners of the plot. Once again, this is consistent with the fact that GAL is indeed syllable-timed (like Central Peninsular Spanish), while SBE is stress-timed. Again, English L2 groups appear in an intermediate position between the speakers’ L1 and the speakers’ foreign language. Similarly, if the level of English L2 speakers is considered, we observe, once again, that A is closer to L1, while C is closer to L2.
The above observations provide us with information that confirms the preliminary conclusions previously stated, namely that there is rhythmic transfer from the speakers’ L1 to their L2, that both consonantal and vocalic intervals are affected by transfer, and that the level of proficiency has an effect on the degree of rhythmic transfer-the higher the proficiency level of the learners, the lesser the degree of rhythmic transfer observed.
If we analyze speakers one by one (Figures 7 and 8) a few more detailed considerations can be made. To begin with, on the whole, the distribution along the diagonal does not change in comparison to the data in Figures 4, 5 and 6. Nevertheless, in both Figures there is an area where the clouds of SBE speakers and C speakers overlap. This might mean that some C speakers display near native-like rhythm, which is consistent with the fact that they had a C1 level of English.
5. DISCUSSION
⌅The objective of this study is twofold, as it aims at comparing the production of English L1 and L2 rhythmically so as to verify whether the production of English L2 is affected by rhythmic transfer from the learners’ L1, i.e. syllable-timed Galician, and determining whether the degree of rhythmic transfer decreases as the level of proficiency increases.
The initial hypotheses were that a) the transfer of the rhythm from Galician to English spoken by Galician speakers would be visible-English L2 would lie in an intermediate rhythmic position between GAL and SBE-and b) that the degree of rhythmic transfer would prove variable and directly linked to the learners’ proficiency level (group-C speakers would be closer to L2, while group-A speakers would be closer to GAL).
The statistical analysis carried out by means of Correlatore shows that both hypotheses are confirmed, insofar as the variety of English spoken by Galician speakers is predominantly located in an intermediate rhythmic position between GAL and SBE. As the degree of proficiency increases thanks to the explicit instruction-i.e. conducted in the classroom-and the non-explicit instruction-i.e. connected to the learners’ experience abroad-received by learners, the rhythm of L2 English resembles more and more that of SBE.
The results obtained may be summarized as follows:
-
There is rhythmic transfer from a syllable-timed L1 (GAL) to a stress-timed foreign language (ENG);
-
both C and V intervals are affected by rhythmic transfer;
-
the level of proficiency in the foreign language has an effect on the level of rhythmic transfer, i.e. the degree of rhythmic transfer from Galician L1 to English L2 decreases as the level of the foreign language increases;
-
there are individual differences (some C speakers have reached native-like rhythm).
The above results confirm previous research (Li and Post, 2014Li,
A. & Post, B. (2014). L2 acquisition of prosodic properties of
speech rhythm: Evidence from L1 Mandarin and German learners of English. Studies in Second Language Acquisition, 36(2), 223-255. https://doi.org/10.1017/S0272263113000752
; Ordin and Polyanskaya, 2015Ordin,
M. & Polyanskaya, L. (2015a). Acquisition of speech rhythm in a
second language by learners with rhythmically different native
languages. The Journal of the Acoustic Society of America, 138(2), 533-545. https://doi.org/10.1121/1.4923359
) conducted with young adult speakers of different
native tongues in that the higher the level of English on the part of
the learners, the more similar the rhythm measures are to the target, no
matter the learners’ L1. These results also prove that the signs of L1
transfer are present, albeit with a higher or lower predominance, at all
stages of the learning process. Additionally, in line with Ordin and Polyanskaya’s (2015)Ordin, M. & Polyanskaya, L. (2015b). Acquisition of English speech rhythm by monolingual children. In INTERSPEECH-2015 (pp. 3120-3124). https://doi.org/10.21437/Interspeech.2015-628
observation, the present study shows that
successfully acquiring stress-timed rhythm and, hence, a greater degree
of variation in vowel duration, tends to prove considerably difficult
for speakers of a syllable timed language-in this case, Galician.
Besides, in line with Espinosa’s (2018)Espinosa,
G.E. (2018). La adquisición del ritmo inglés por hablantes nativos de
español. El caso de aprendientes argentinos en contexto de instrucción
formal [Unpublished PhD thesis]. Buenos Aires: Universidad de Buenos
Aires, Argentina.
observations, the present study corroborates Tsiartsioni’s (2011)Tsiartsioni,
E. (2011). Can pronunciation be taught? Teaching English speech rhythm
to Greek students. In E. Kitis, N. Lavidas, N. Tpointzi, & T.
Tsangalidis (Eds.), Proceedings of the 19th International Symposium on Theoretical and Applied Linguistics (ISTAL) (pp. 447-458). Tesalonica: Aristotle University of Thessaloniki.
observation that learners who receive rhythm instruction adapt to a
more English-like rhythm when speaking the language. Moreover, the data
obtained and analyzed offers an answer to Tsiartsioni’s (2011)Tsiartsioni,
E. (2011). Can pronunciation be taught? Teaching English speech rhythm
to Greek students. In E. Kitis, N. Lavidas, N. Tpointzi, & T.
Tsangalidis (Eds.), Proceedings of the 19th International Symposium on Theoretical and Applied Linguistics (ISTAL) (pp. 447-458). Tesalonica: Aristotle University of Thessaloniki.
question whether learners develop a phonological system that may give
rise to an intermediate stage where the rhythm of the L2 is neither
totally the same as that of the speakers’ L1 nor completely equal to the
rhythm of the target language. In this regard, it does seem to be the
case that learner English sits at a point in between the speakers’ L1
(Galician) and their L2 (British English) and moves towards their L2 as
the level of proficiency increases.
6. CONCLUSIONS
⌅The present study has looked into the production of English L1 and L2 rhythm by two groups of Galician-dominant learners with a B1 and a C1 level of English. The study pursued two main objectives, namely to determine i) whether a certain degree of rhythmic transfer from the learners’ L1 would show in the results of the rhythmic measurements and, ii) if so, whether the degree of rhythmic transfer observed would be lower in the case of students with a higher proficiency level of English who had also received explicit phonetic instruction for twelve weeks and have spent a university term in England. The initial hypotheses stated that a certain amount of transfer would be revealed in the results of the measurements, and that the degree of transfer would be negatively correlated with the level of proficiency in English.
The twelve experimental subjects were made to read a text in Galician
and in English, and the results obtained were compared to the readings
made by a control group of six native Galician speakers and another
control group of six native British English speakers. The measurements
conducted replicated Ramus, Nespor and Mehler’s (1999)Ramus, F., Nespor, M., & Mehler, J. (1999). Correlates of linguistic rhythm in the speech signal. Cognition, 73, 265-292. https://doi.org/10.1016/S0010-0277(00)00101-3
“global” rhythm metrics ΔV, ΔC, and %V, comparing
the duration of each vocalic and consonantal interval with the duration
of all other intervals in the recording, and Grabe and Low’s (2002)Grabe,
E. & Low E.L. (2002). Durational variability in speech and the
rhythm class hypothesis. In C. Gussenhoven & N. Warner (Eds.), Papers in Laboratory Phonology 7 (pp. 515-546). Berlin: De Gruyter. https://doi.org/10.1515/9783110197105.2.515
“local” rhythm metrics VnPVI and CrPVI, where the
duration of each interval was compared with the duration of the
following interval.
The results prove that the rhythm of the
foreign language is affected by the rhythmic properties of L1,
particularly when the two have different rhythmic structures-as is the
case with Galician and British English- and especially at low and
intermediate levels of language proficiency. Moreover, as shown by the
results of the global and local metrics put to the test, the evinced
rhythmic transfer phenomenon from syllable-timed Galician to
stress-timed English has an effect on both the consonant and the vowel
intervals. This finding ratifies previous observations that L2 English
exhibits a lesser degree of stress-timing compared to L1 English (Li and Post, 2014Li,
A. & Post, B. (2014). L2 acquisition of prosodic properties of
speech rhythm: Evidence from L1 Mandarin and German learners of English. Studies in Second Language Acquisition, 36(2), 223-255. https://doi.org/10.1017/S0272263113000752
; Ordin and Polyanskaya, 2015Ordin, M. & Polyanskaya, L. (2015b). Acquisition of English speech rhythm by monolingual children. In INTERSPEECH-2015 (pp. 3120-3124). https://doi.org/10.21437/Interspeech.2015-628
). Crucially, the results prove that the degree of
rhythmic transfer from Galician to English decreases as the level of
the foreign language increases, which corroborates the conclusions in
previous studies (Espinosa, 2018Espinosa,
G.E. (2018). La adquisición del ritmo inglés por hablantes nativos de
español. El caso de aprendientes argentinos en contexto de instrucción
formal [Unpublished PhD thesis]. Buenos Aires: Universidad de Buenos
Aires, Argentina.
; Quesada, 2019). Specifically, these data endorse the results obtained by Li and Post (2014)Li,
A. & Post, B. (2014). L2 acquisition of prosodic properties of
speech rhythm: Evidence from L1 Mandarin and German learners of English. Studies in Second Language Acquisition, 36(2), 223-255. https://doi.org/10.1017/S0272263113000752
and by Ordin and Polyanskaya (2015)Ordin,
M. & Polyanskaya, L. (2015a). Acquisition of speech rhythm in a
second language by learners with rhythmically different native
languages. The Journal of the Acoustic Society of America, 138(2), 533-545. https://doi.org/10.1121/1.4923359
, who argue that the rhythm metric values of the learner groups get closer to native speakers during the acquisition process.
Given that the research work presented has dealt with group results without delving into fine-grained individual analyses, it evinces the necessity to investigate in detail which specific elements within the acquisition process allow certain individuals to reach a more native-like rhythm than others and, more specifically, to what extent the combined facts that some individuals have received explicit phonetic instruction and have spent a term studying in the UK exert the same amount of influence on their overall performance or one of those two variables has a higher degree of influence on their performance with respect to rhythm. In this regard, although individual differences could be accounted for in terms of overseas experience, explicit phonetic instruction and/or proficiency level, it could also be the case that, all things being equal, differences regarding the acquisition of L2 rhythm may be due to individual aptitude, a variable that should also be addressed in future research.
Since the learner population where the experiment was conducted is bilingual Galician/Spanish, the effect of Spanish L3 on the rhythm of L2 English should not be underestimated. Given that Spanish, like Galician, is a syllable-timed language, future research may aim at determining whether any notable differences exist between Galician-dominant speakers, Spanish monolinguals and Galician/Spanish bilinguals regarding the process (i.e. time, performance level) of acquisition of English rhythm.
Last, the findings obtained may shed some light on what to prioritize in English pronunciation teaching, as they show that the gradual improvement in the pronunciation of segmentals in general and, more specifically, the mastering of vowel durations, may be taught as a corollary of rhythm instruction, since the two are intimately connected.