Prof. Dr. Christian Mair (Freiburg): CCN ("Corpus of Cyber-Nigerian"): Concordancing, annotation and visualisation in a very large web-derived CMC corpus

The paper reports on research carried out since November 2011 in the framework of a DFG-funded project ("Cyber-Creole," DFG MA 1652/9) on the global spread of English-lexifier pidgins and creoles in the participatory web. The Cyber-Creole project is complemented by "RomWeb" (DFG PF 699/4), which is headed by Stefan Pfänder of the Department of Romance Languages and Literatures at the University of Freiburg, Germany. The two projects both use data downloaded from web-based diasporic discussion forums, employ identical strategies of data collection and mark-up and have developed a shared search and analysis interface, NCAT (= Net Corpora Administration Tool). The two projects also share a similar general theoretical orientation, being motivated by an interest in how the new media impact on the spread of standard and non-standard varieties of European ex-colonial languages in conditions of economic, political, cultural and media globalisation. In this regard, their work is intended to contribute to the emerging research paradigm of the sociolinguistics of globalisation. The present brief survey will (i) explore how findings from CMC force us to re-think some basic theoretical assumptions in research on World Englishes (e.g. on the notion of "varieties of English"), (ii) introduce the tool-kit which we have developed for concordancing and visualisation of the data, and (iii) demonstrate its application in the description of the "Cyber-Nigerian" linguistic repertoire developed by one of the communities under investigation.