Adjective-Noun Collocations in L2 Speech
Adjective-Noun Collocations in L2 Speech
Adjective + Noun
Collocations in L2 and L1
Speech: Evidence from the
Trinity Lancaster Corpus
and the Spoken BNC2014
Vaclav Brezina and Lorrae Fox
1 Introduction
Phraseological knowledge represents an important component of
linguistic competence (e.g. Bestgen & Granger, 2014; Gablasova et al.,
2017; Howarth, 1998). It is an essential part of naturally sounding
language production and successful communication. A simple look
at the short excerpt below (1) from the Trinity Lancaster Corpus
(Gablasova et al., 2019) reveals the omnipresence of collocations,
chunks, idiomatic sequences, phrases, etc. in L2 speech. These are
underlined in example (1).
(1) before the starting I er like to er outline the principle er or main points
in my er topic er in the first place traditional classrooms talking
about traditional classrooms if are they the best places or the <only>
places to learn and to teach in? er in the second place traditional
methodologies are they certainly or best tools and at the end of this
presentation I’d like to er to talk you about effective teachers how to
recognise them and how to be one of them [TLC, 2_SP_30].
152
Adjective + Noun Collocations in L2 and L1 Speech 153
(single word) vocabularies, they also tend to experiment more with word
combinatorial mechanisms and, as a result, produce less idiomatic,
less nativelike word sequences’ (2019: 32). Similarly, Omidian et al.
(Chapter 8 of this volume) show the complex nature of phraseological
development. The authors used linear mixed-effects model analysis to
study verb + noun collocations within three dimensions in L2 Italian:
exclusivity, phrase frequency and phrasal diversity. In particular, they
noted aspects of non-linearity in the phrasal diversity and exclusivity
dimensions when considering language proficiency; A1 level (beginner)
learners were the only group demonstrating a longitudinal increase in
collocation diversity while B1 level (intermediate) learners produced
more strongly associated collocations when exploring exclusivity, with
random individual variation also being found.
In contrast, other studies have discovered some effect of L2
proficiency on the use of adjective + noun collocations. Granger and
Bestgen (2014) identified differences between intermediate and advanced
learners’ use of collocations (n-grams) in English academic writing,
including adjective + noun combinations. The study confirmed the L2
proficiency hypothesis, which predicts ‘a smaller proportion of lower-
frequency, strongly-associated, collocations [attested by MI] and a larger
proportion of high-frequency collocations [attested by t-score] in the
intermediate learner texts than in the advanced learner texts’ (Granger &
Bestgen, 2014: 238). Further, Sonbul (2015) investigated frequency effects
on collocational processing in an experimental study, when L1 and L2
speakers were exposed to synonymous adjective + noun collocations.
The study found that proficiency affected the off-line sensitivity
measure, where participants rated written English sentences based on
typicality. Both L1 and L2 speakers were found to show some sensitivity
to collocational frequency in the on-line condition measured by
eye-tracking, demonstrating possible similarities in language processing
shared by the two groups.
The results of previous studies thus offer a complex and often
contradictory picture on the development of L2 phraseological compe-
tence and the relationship between L1 and L2 phraseological use. It is
important to realise that phraseological competence is often evaluated,
especially in pedagogical contexts, within the broader framework of
native and non-native language use. Although there are many common-
alities between L1 and L2 speakers, L1 speaker performance is often
perceived as the target or baseline for L2. Llurda (2016) explained that
this has been fuelled mostly by Western approaches to teaching and
learning languages, as to speak ‘native-like’ has historically been the
goal. Previously, researchers found that even highly proficient language
learners may never achieve this (Long, 1990), while others estab-
lished that some learners can attain language norms typical for native
speakers (Birdsong, 1992). The term ‘native speaker’ is also not without
156 Part 3: The Learner Phrasicon: Developmental Approaches
Table 7.1 Number of L2 speakers (and word tokens) across proficiency levels, L1s
and cultural backgrounds in the Trinity Lancaster Corpus Sample – conversation and
discussion tasks
L1/cultural background Proficiency level according to CEFR
B1 B2 C1/C2
China 80 (63,503) 70 (65,522) 30 (35,548)
India 100 (102,668) 100 (117,019) 30 (50,074)
Italy 100 (79,781) 100 (102,390) 60 (75,415)
Mexico 100 (81,488) 70 (71,648) 60 (71,861)
Russia 40 (22,145) 28 (32,526) –
Spain 100 (74,256) 100 (94,702) 60 (64,672)
Total 520 (423,841) 468 (483,807) 240 (297,570)
3.2 Methodology
The data was searched using Sketch Engine (Kilgarriff et al., 2014) to
extract adjective + noun combinations with possible multiple adjectives
and intervening elements such as hesitation marks (er, erm). A complex
CQL query was formulated to capture this pattern. The query is shown
below:
It was established that this query has a high precision and a high
recall. The precision was 97% with non-accurate hits involving the
use of like and just as spoken features that had been mistagged as
adjectives. Recall was tested on a sample of six texts with three male and
three female speakers across four language backgrounds and different
proficiency levels. These texts were manually annotated and compared
with the results of the automated procedure. The recall was 98.02%. The
obtained precision and recall levels were deemed sufficient for the present
study.
A series of PHP scripts (PHP is a flexible all-purpose scripting
language) using regular expressions were used to further process and
compare the L2 and L1 data. In this process, disfluencies such as n-new
year were normalised to allow a fair comparison of the collocation
strength. The comparison of L1 and L2 collocation use follows the
method applied in Granger and Bestgen (2014), originally devised by
Durrant and Schmitt (2009). Adapting this method, we used the Spoken
BNC2014 as the reference corpus for the L2 performance in the TLC
Sample. The values of collocational strength of individual adjective
+ noun combinations in L2 were assigned based on the L1 use in the
Spoken BNC2014.
The unit of analysis in the comparisons is a collocation type.
This allows more precise measuring of the evidence of collocational
knowledge in individual speakers than if tokens were considered. We
Adjective + Noun Collocations in L2 and L1 Speech 159
4 Results
This section deals with the results of the study, focusing on the
frequency, types and collocational strength of the target feature (adjective
+ noun combinations). The results are then discussed in Section 5.
Figure 7.1 Relative frequency of adjective + noun combinations per 1,000 nouns
(group means and 95% CIs)
Table 7.4 Top 5 nouns and their adjectival collocates in TLC Sample and the Spoken
BNC2014
Corpus Proficiency Nouns Absolute Most frequent Most exclusive adjective
frequency adjective modifications (MI score)
modifications
(collocate
frequency)
TLC B1 people 1474 other, young, poor poor, rich, corrupt
Sample
school 1094 primary, high, nursing, elementary,
middle primary
money 882 little, important, extra, lucky, little
extra
time 777 long, free, last sleeping, spare, free
years 654 next, recent, past recent, past, last
B2 people 2565 other, young, old qualified, illiterate, elder
time 1170 long, free, good spare, rough, olden
things 1093 other, different, spooky, meaningful,
new chemical
school 976 high, primary, primary, elementary,
middle secondary
years 844 last, early, next, past, recent, early
C1/C2 people 2889 young, other, old handicapped, Brazilian,
elder
things 946 other, good, bad politic, mean,
controversial
way 860 good, different, positive, certain, proper
other
time 813 long, next, ancient spare, changing, ancient
children 697 little, other, young little, own, young
BNC2014 L1 – thing other, good, whole petrol-driven, withered,
benchmark strawberry-flavoured
stuff good, other, new power-based, anti-
mosquito, run-of-the-mill
people gay, normal, stupid heterosexual, like-
minded, effeminate
person only, other, good non-lazy, clean-living,
debauched
way other, good, long joking, uncivilized,
convoluted
162 Part 3: The Learner Phrasicon: Developmental Approaches
Mean log Dice (SD) 7.20 (1.13) 7.24 (0.83) 7.39 (0.82)
Adjective + Noun Collocations in L2 and L1 Speech 165
Figure 7.2 Comparison of the three proficiency levels using the mean collocation
frequency (95% CIs)
As can be seen from Table 7.10, two of the indices (mean collocation
frequency and log Dice) increase with proficiency; mean MI score, on
the other hand, follows an opposite trend. Note that while the MI score
is designed to identify exclusive collocates and penalises high frequency
combinations, log Dice highlights combinations that are both exclusive
and relatively frequent (for more information see Brezina, 2018: 73–74
and Gablasova et al., 2017). We can also observe that standard deviations
in each group are fairly high (compared to the mean), indicating large
inter-speaker differences. The overall picture is best illustrated with
three graphs with 95% confidence intervals around the mean values of
the respective statistics. These are displayed in Figures 7.2–7.4. Largely
overlapping 95% CIs indicate non-significant results.
In order to test whether L2 speakers tend to repeat the same
formulaic sequences multiple times, the type/token ratios of the
adjective + noun combinations were also computed per proficiency
166 Part 3: The Learner Phrasicon: Developmental Approaches
Figure 7.3 Comparison of the three proficiency levels using the mean MI score
(95% CIs)
level. Table 7.11 reports the results. As can be seen, the mean type/token
ratios of adjective + noun combinations are comparable across all three
proficiency levels; the L1 baseline, however, has considerably higher
mean type/token ratios, indicating less repetition in L1 speech than in L2
speech. Note that, for comparability, type/token ratios for the L1 baseline
were computed for a subset of scripts with the same range of adjective
+ noun tokens (1–73) as the L2 data; this is because type/token ratios of
any kind are dependent on text length and are negatively correlated with
token counts (Brezina, 2018: 57ff).
Figure 7.4 Comparison of the three proficiency levels using the mean log Dice (95% CIs)
5 Discussion
The results indicate a complex picture related to the use of adjective
+ noun combinations in L2 and L1 speech. The focus of the discussion
will be on L2 use with L1 serving as a reference point. The frequencies
of adjective + noun combinations observed in L2 speech are considerably
lower (on average 19.8 occurrences per 1,000 words) than what has
been reported in the literature for L2 writing. Granger and Bestgen
(2014), for instance, report 45.4 adjective + noun bigrams per 1,000
words in L2 student essays. This effect is likely related to the variation
in the frequencies of noun phrases across different registers of speech
and writing (e.g. Biber, 1988). Indeed, L1 use in the Spoken BNC2014
showed even fewer adjective + noun combinations (on average 15.97
per 1,000 words) due to the fact that these data include less formal
and less informational types of interaction than the TLC Sample data.
Less informational and more involved registers, to use Biber’s (1988)
terminology, typically occur with fewer noun phrases in general. In order
to focus on phraseological aspects of adjective + noun combinations,
168 Part 3: The Learner Phrasicon: Developmental Approaches
(2) I don’t really get the question sorry (B1 level, 2_8_RU_4)
(3) it’s kind of hard question (B1 level, 2_6_IN _65)
(4) this is a a really challenging question (C2 level, IN_13)
young people, other thing, etc., are fairly general and thus have the ability
to occur across different contexts. The combinations identified using the
MI score (e.g. chemical things, handicapped people), on the other hand,
are exclusive and often context specific (Gablasova et al., 2017). In the
literature (e.g. Granger & Bestgen, 2014; Li & Schmitt, 2009), the MI
score is typically contrasted with the t-score, demonstrating the same type
of polarity as in the present study because the t-score is highly correlated
with the collocation frequency; more recent studies (e.g. Siyanova-
Chanturia & Spina, 2019) abandon the problematic t-score in favour of
collocation frequency and other measures. For a more detailed discussion
of appropriate collocation measures, see Gablasova et al. (2017).
On a more general level, collocation measures operate on two
principal scales of frequency and exclusivity (Brezina, 2018: 71). These
scales recognise the frequency and the exclusivity of the collocational
relationship as two important features of word combinations in
language, which are also highlighted in this study. We have seen that
L2 speakers in the TLC Sample used a similar range of high frequency
collocations in a similar frequency rank order as demonstrated by the
large Spearman’s correlations (above 0.77). This range, however, differed
to some extent from the L1 baseline (Spearman’s ρ approximately 0.5).
Frequent adjective + noun combinations that were shared across the
proficiency levels and the L1 baseline included collocations such as little
bit, long time, other people, other thing and good thing. The frequent L1
combinations not shared in the top 20 frequent collocations in the TLC
Sample included examples such as bloody hell, bad thing, good thing and
big thing. However, we have to note that while the L1 interactions were
informal and between interlocutors who knew each other well (friends
and family, Love et al., 2017), the L2 interactions were semi-formal and
between interlocutors with some power hierarchy (Gablasova et al.,
2019). The absence of swear-word combinations (bloody hell) and vague
expressions (bad thing, good thing, big thing) from the top 20 frequency
lists in the L2 data can thus also be explained by this difference in the
nature of the interaction.
A side effect of looking at exclusivity (using the MI score) is the
identification of some combinations which demonstrate lexical rather
than phraseological knowledge. A prime example of these combinations
in this study is the occurrence of compound nouns such as high
school, middle school and mobile phone. Similar combinations (e.g.
nitrous oxide, Hippocratic oath and pop music) figure also in the list
of collocations (n-grams) identified using the MI score in Granger and
Bestgen (2014). Strictly speaking, these combinations point to the use of
particular naming strategies and terminology in L2 production rather
than to the ability to combine words in a particular way (phraseological
competence). From the perspective of vocabulary learning, such
combinations demonstrate the breadth rather than the depth of
170 Part 3: The Learner Phrasicon: Developmental Approaches
vocabulary (cf. Read, 2004). They are thus likely to be correlated with
the topic of the spoken exchange, chosen at a higher level than the
sentence/utterance level (choice of topic at the textual level).
Third, when interpreting the collocational strength of the observed
adjective + noun combinations (see Table 7.10), we need to bear in mind
different properties of collocates identified using different collocation
statistics. In this study, we looked at three measures highlighting different
aspects of the collocational relationship between adjectives and nouns
on the frequency and exclusivity scales (Brezina, 2018; Gablasova et al.,
2017). The collocation frequency highlights frequent, non-exclusive
combinations, while the MI score focuses on infrequent but highly
exclusive combinations. We have also added a third measure, log Dice,
which identifies relatively exclusive combinations, which are also
reasonably frequent. The log Dice score, therefore, represents a measure,
which offers a balance between frequency and exclusivity (Omidian
et al., Chapter 8 of this volume; Rychlý, 2008).
In the study, we observed that all three collocation measures show
an effect of speakers’ L1. L1 has been recognised in the literature (e.g.
Wolter & Gyllstad, 2013) as an important factor in phraseological
knowledge, with L1 transfer effects being reported (Maurer-Stroh, 2005;
Takač & Lukač, 2013). In our study, we contrasted six L1 and cultural
backgrounds with typologically different languages (Chinese vs. Italian
vs. Hindi, etc.). It is therefore not surprising to find differences in the
collocation measures across these backgrounds. However, we need to
exercise caution and not automatically assume L1 transfer without
proper analysis of phraseological mechanisms in the individual L1
contexts (cf. Granger, 2015). A more in-depth investigation of this
variable is, however, beyond the scope of the present study.
The only statistically significant difference related to L2 proficiency
was between the B1 and the C1/C2 group when the mean strength of
collocations was calculated using the log Dice score. This demonstrates
that the higher proficiency level (C1/C2) spoken production included
more adjective + noun combinations that are relatively frequent and
exclusive than the lower B1 level. Examples of high log Dice score
collocations in C1/C2 are the following: good life, extended family, real
time and sore throat. This is in line with Thewissen’s (2015) finding
related to lexical phrase accuracy showing a clear distinction between
the higher proficiency bands and the B1 group, who ‘produced a higher
number of “wide-of-the-mark” errors’ (2015: 187).
From the perspective of second language acquisition, there is a certain
tension between the frequency of co-occurrence and the exclusivity of
the combination. High frequency of co-occurrence of two words points
to a high probability of repeated exposure of the language learner to
these combinations and the possibility of statistical (implicit) learning
(e.g. Saffran, 2003). On the other hand, very frequent combinations
Adjective + Noun Collocations in L2 and L1 Speech 171
might not get noticed and hence might not be committed to memory. As
Divjak (2019: 4) points out: ‘[t]here is no pure frequency effect: experience
cannot be reduced to frequency as experience is filtered through attention
before being committed to memory’. There could be a fundamental
processing difference of formulaic language between L1 and L2 speakers
regarding the frequency and exclusivity of phraseological units. Ellis et al.
(2008) found L1 English speakers had shorter recognition response times
for statistical collocations with higher MI scores, a measure of exclusivity,
while L2 speakers’ shorter reaction times correlated with the frequency
of occurrence of the collocation. Sonbul (2015), in a study of adjective +
noun collocations, found frequency to have a similar effect on typicality
rating of collocations for both L1 and L2 speakers; L2 proficiency also
played a role, with more proficient L2 speakers showing greater sensitivity
to collocation frequency.
Given this complex picture, and the importance of both frequency and
exclusivity, we can argue that for restricted collocations, such as those in
this study, it may be more apt to consider both measures combined. While
in corpus-based research we cannot control for psychological variables
such as attention, we can hypothesise that a collocation measure that takes
into account both frequency and exclusivity of the collocation might be
able to provide access to the proficiency effect better than measures that
deal either with frequency or exclusivity alone. The results of this study
clearly seem to indicate the plausibility of this hypothesis.
Considering the perspective of L2 production, high frequency
combinations have been hypothesised to appear with larger proportions
in lower proficiency texts (intermediate) than in higher proficiency texts
(advanced) (Granger & Bestgen, 2014). This study did not find evidence
for this claim in spoken L2 data: no difference was observed in the use
of the MI and frequency-based collocations across the proficiency levels.
Instead, the evidence points to a more complex picture of non-linear
development of phraseological competence, where individual differences
between speakers play a major role. The need to consider individual
differences and changing contexts has also been highlighted by Larsen-
Freeman (2006) in her case study of five L1 Chinese speakers learning
English, which shows that language attainment develops dynamically.
A similar conclusion has been put forward in Siyanova-Chanturia and
Spina (2019), who investigated the development of adjective + noun
collocations in L2 Italian in a longitudinal learner corpus. The results
demonstrated phraseological development to be complex, with higher
proficiency speakers not using more idiomatic language systematically.
Another longitudinal study, by Omidian et al. (Chapter 8 of this volume),
brings further evidence about phraseological development as a multi-
faceted and non-linear process, noting that exclusivity and diversity in
verb + noun collocations are not only influenced by time and language
proficiency but also, importantly, by individual learner differences.
172 Part 3: The Learner Phrasicon: Developmental Approaches
6 Conclusion
This study offers a unique insight into the use of adjective + noun
collocations in spoken L2, thus complementing previous research,
Adjective + Noun Collocations in L2 and L1 Speech 173
Acknowledgement
The writing of this chapter was supported by ESRC grants no.
EP/P001559/1 and ES/R008906/1.
Notes
(1) [Link]
(2) Speakers were asked to report if they were bilingual and, if so, what their L2 was
other than British English (Table 7.2). Twenty-eight (4.2%) speakers responded pos-
itively and specified the listed languages (see [Link]
[Link]).
References
Bahardoust, M. and Moeini, M.R. (2012) Lexical and grammatical collocations in writing
production of EFL learners. The Journal of Applied Linguistics 5 (1), 61–86.
Bahns, J. (1993) Lexical collocations: A contrastive view. ELT Journal 47 (1), 56–63.
Bestgen, Y. and Granger, S. (2014) Quantifying the development of phraseological
competence in L2 English writing: An automated approach. Journal of Second
Language Writing 26, 28–41.
Biber, D. (1988) Variation Across Speech and Writing. Cambridge: Cambridge University Press.
Biber, D., Gray, B. and Poonpon, K. (2011) Should we use characteristics of conversation to
measure grammatical complexity in L2 writing development? TESOL Quarterly 45 (1),
5–35.
Biber, D., Johansson, S., Leech, G., Conrad, S. and Finegan, E. (1999) Longman Grammar
of Spoken and Written English. London: Longman.
Birdsong, D. (1992) Ultimate attainment in second language acquisition. Language 68 (4),
706–755.
Boers, F., Eyckmans, J., Kappel, J., Stengers, H. and Demecheleer, M. (2006) Formulaic
sequences and perceived oral proficiency: Putting a lexical approach to the test.
Language Teaching Research 10 (3), 245–261.
Brezina, V. (2018) Statistics in Corpus Linguistics: A Practical Guide. Cambridge:
Cambridge University Press.
Choi, W. (2019) A corpus-based study on ‘delexical verb + noun’ collocations made by
Korean learners of English. The Journal of Asia TEFL 16 (1), 279–293.
Connor, U. (1984) A study of cohesion and coherence in ESL students’ writing. Paper in
Linguistics 17 (3), 301–316.
Cook, V.J. (2007) The goals of ELT: Reproducing native-speakers or promoting
multicompetence among second language users? In J. Cummins and C. Davison (eds)
International Handbook of English Language Teaching (pp. 237–248). New York, NY:
Springer.
Adjective + Noun Collocations in L2 and L1 Speech 175
Li, J. and Schmitt, N. (2009) The acquisition of lexical phrases in academic writing: A
longitudinal case study. Journal of Second Language Writing 18 (2), 85–102.
Li, J. and Schmitt, N. (2010) The development of collocation use in academic texts by
advanced L2 learners: A multiple case study approach. In D. Wood (ed.) Perspectives
on Formulaic Language: Acquisition and Communication (pp. 2–46). London & New
York: Continuum.
Llurda, E. (2016) Native speakers? English and ELT: Changing perspectives. In G. Hall
(ed.) The Routledge Handbook of English Language Teaching (pp. 51–64). London:
Routledge.
Long, M.H. (1990) Maturational constraints on language development. Studies in Second
Language Acquisition 12 (3), 251–285.
Love, R., Dembry, C., Hardie, A., Brezina, V. and McEnery, T. (2017) The Spoken BNC2014:
Designing and building a spoken corpus of everyday conversations. International
Journal of Corpus Linguistics 22 (3), 319–344.
Martinez, R. and Schmitt, N. (2012) A phrasal expressions list. Applied Linguistics 33 (3),
299–320.
Maurer-Stroh, P. (2005) ‘House-high favourites?’ A contrastive analysis of adjective-noun
collocations in German and English. ELOPE: English Language Overseas Perspectives
and Enquires 2 (1–2), 57–64.
Molenda, M., Pęzik, P. and Osborne, J. (2018) Self-repetitions in learners’ spoken language:
A corpus-based study. In V. Brezina and L. Flowerdew (eds) Learner Corpus Research:
New Perspectives and Applications (pp. 90–111). London: Bloomsbury.
Ohlrogge, A. (2009) Formulaic expressions in intermediate EFL writing assessment. In
R. Corrigan, A. Moravcsik, H. Ouali and K. M. Wheatley (eds) Formulaic Language,
Volume 2: Acquisition, Loss, Psychological Reality, and Functional Explanations
(pp. 387–404). Amsterdam & Philadelphia: Benjamins.
Paquot, M. (2018) Phraseological competence: A missing component in university entrance
language tests? Insights from a study of EFL learners’ use of statistical collocations.
Language Assessment Quarterly 15 (1), 1–15.
Paquot, M. and Granger, S. (2012) Formulaic language in learner corpora. Annual Review
of Applied Linguistics 32, 130–149.
Read, J. (2004) Research in teaching vocabulary. Annual Review of Applied Linguistics 24
(1), 146–161.
Rychlý, P. (2008) A lexicographer-friendly association score. In P. Sojka and A. Horák (eds)
Proceedings of Recent Advances in Slavonic Natural Language Processing, RASLAN
(pp. 6–9). Brno, Czech Republic: Masaryk University.
Saffran, J.R. (2003) Statistical language learning: Mechanisms and constraints. Current
Directions in Psychological Science 12 (4), 110–114.
Seargeant, P. (2013) Ideologies of nativism and linguistic globalization. In S.A. Houghton
and D.J. Rivers (eds) Native-Speakerism in Japan: Intergroup Dynamics in Foreign
Language Education (pp. 231–242). Bristol: Multilingual Matters.
Shih, H.-H.R. (2000) Collocation deficiency in a learner corpus of English: From an overuse
perspective. Proceedings of the 14th Pacific Asia Conference on Language, Information
and Computation (pp. 281–288). Tokyo: Waseda University International Conference
Center.
Shin, D. and Nation, P. (2008) Beyond single words: The most frequent collocations in
spoken English. ELT Journal 62 (4), 339–348.
Siyanova-Chanturia, A. and Spina, S. (2019) Multi-word expressions in second language
writing: A large-scale longitudinal learner corpus study. [Advance online publication.]
Language Learning 70 (2), 420–463.
Sonbul, S. (2015) Fatal mistake, awful mistake, or extreme mistake? Frequency effects on
off-line/on-line collocational processing. Bilingualism: Language and Cognition 18 (3),
419–437.
Adjective + Noun Collocations in L2 and L1 Speech 177
Takač, V.P. and Lukač, M. (2013) How word choice matters: An analysis of adjective-noun
collocations in a corpus of learner essays. Jezikoslovlje 14 (2–3), 385–402.
Thewissen, J. (2015) Accuracy Across Proficiency Levels: A Learner Corpus Approach.
Louvain-la-Neuve, Belgium: Presses universitaires de Louvain.
Wolter, B. and Gyllstad, H. (2013) Frequency of input and L2 collocation processing: A
comparison of congruent and incongruent collocations. Studies in Second Language
Acquisition 35 (3), 451–482.
Wray, A. (2000) Formulaic sequences in second language teaching: Principle and practice.
Applied Linguistics 21 (4), 463–489.
Wray, A. (2013) Formulaic language. Language Teaching 46 (3), 316–334.