Article published in:Spanish-English Codeswitching in the Caribbean and the US
Edited by Rosa E. Guzzardo Tamargo, Catherine M. Mazak and M. Carmen Parafita Couto
[Issues in Hispanic and Lusophone Linguistics 11] 2016
► pp. 171–189
The stratification of English-language lone-word and multi-word material in Puerto Rican Spanish-language press outlets
A computational approach
This chapter considers the presence of English in a 3.3-million-word corpus of Puerto Rican news press addressed to distinct social classes: El Vocero, published for a working-class population, El Nuevo Día for a mainstream market, and 80grados for an intellectual readership. Statistical models reveal no significant differences between sub-corpora with respect to the frequency of English unigram and bigram tokens. However, significant differences are returned for English 3+grams sequences: 80grados presents longer, more diverse and complex English spans than do El Nuevo Día and El Vocero. Interpreting the results in view of the social context, we suggest that, in Puerto Rico, the use simplex and compound anglicisms might not signal prestige; it could be code-switching that is linked with status.
Keywords: computational, corpus, loanwords, Spanish-English codeswitching
Published online: 07 September 2016
Bills, G.D., & Vigil, N.A.
Bird, S., Klein, E., & Loper, E.
Bullock, B.E., Serigos, J.L., & Toribio, A.J.
(2015) The use of loan translations and its consequences in an oral bilingual corpus. Paper presented at the 10th International Symposium on Bilingualism . Rutgers University.
Bullock, B.E., Serigos, J.L., Toribio, A.J., & Wendorf, A.
Bullock, B.E., & Toribio, A.J.
(2013) The Spanish in Texas Corpus Project. Center for Open Educational Resources and Language Learning (COERLL), The University of Texas at Austin. http://www.spanishintexas.org
Chesley, P., & Baayen, R.H.
(2002) It’s the effect size, stupid: what effect size is and why it is important. Retrieved July 7, 2014, from http://www.leeds.ac.uk/educol/documents/00002182.htm
Elfardy, H., & Diab, M.T.
Jurafsky, D., & Martin, J.H.
Montes-Alcalá, C., & Lapidus Shin, N.
Poplack, S., Sankoff, D., & Miller, C.
R Development Core Team
(2013) R: A language and environment for statistical computing. Vienna: R Foundation for Statistical Computing. http://www.R-project.org
(1994) Probabilistic part-of-speech tagging using decision trees. In Proceedings of the international conference on new methods in language processing (Vol. 12, pp. 44–49). Manchester, UK.
Solorio, T., Blair, E., Maharjan, S., Bethard, S., Diab, M., Gohneim, M., … & Fung, P.
Solorio, T., & Liu, Y.
Thomason, S.G., & Kaufman, T.
Toribio, A.J., Bullock, B.E., Serigos, J., Neupane, R. & Ball, K.
(2015) Towards developing a metric for U.S. Spanish. Paper presented at the 25th Conference on Spanish in the United States . New York, NY: City University of New York.
Torres Cacoullos, R. & Aaron, J.E.