Article published in: Multiple Affordances of Language Corpora for Data-driven Learning
Edited by Agnieszka Leńko-Szymańska and Alex Boulton
[Studies in Corpus Linguistics 69] 2015
pp. 155–176
Textual cohesion patterns for developing reading skills
A corpus-based multilingual learning environment
This chapter proposes a systematic methodology for developing top-down processing strategies for L2/L3 reading, and for creating the contrastive linguistic resources that support them. The work focuses on the acquisition of reading strategies that support text comprehension, using a multilingual electronic thesaurus of textual connectors for Russian, Ukrainian and English. The connectors were systematically derived from large corpora and classified according to the author’s semantic classification scheme. The thesaurus is then used to annotate connectors automatically in the corpus texts, both to support reading strategies and to find texts of a similar type or genre for teaching reading. The methodology was developed primarily for students of Russian learning to read Ukrainian, but it can be applied to the development of reading and writing skills in any L2.
Keywords: connectors, reading skills, reading strategies, text structure, textual cohesion
Published online: 13 May 2015