Article published in: The Complementary Contribution of Comparable and Parallel Corpora to Crosslinguistic Studies
Edited by Sylviane Granger and Marie-Aude Lefer
[Languages in Contrast 20:2] 2020
pp. 184–208
Corpus-based contrastive studies
Beginnings, developments and directions
This article outlines the beginnings of corpus-based contrastive studies with special reference to the development of parallel corpora that took place in Scandinavia in the early 1990s under the direction of Stig Johansson. It then discusses multilingual corpus types and methodological issues of their exploration, including the tertium comparationis for contrastive studies based on different types of corpora. Some glimpses are offered of recent developments and current trends in the field, including the widening scope of corpus-based contrastive analysis, concerning language pairs as well as the kinds of topics studied and the methods used. The paper ends by identifying and discussing some challenges for the field and indicating prospects and directions for its future.
Keywords: contrastive analysis, multilingual corpus, parallel corpus, comparable corpus, tertium comparationis
Published online: 06 October 2020