Building and analysing corpora of computer-mediated communication

King, Brian Walter

File Download

There are no files associated with this item.

Supplementary

Citations:
Appears in Collections:
- English: Chapter in book

Book Chapter: Building and analysing corpora of computer-mediated communication

Title	Building and analysing corpora of computer-mediated communication
Authors	King, Brian Walter
Issue Date	1-Aug-2009
Abstract	This chapter addresses problems encountered during the construction and analysis of a synchronic corpus of computer-mediated discourse. The corpus was not primarily constructed for the examination of the linguistic idiosyncrasies of the online chatting medium; rather it is to be used for corpus-based sociolinguistic inquiry into the language use and identity construction of a particular social group (which in this case could be classed as ‘vulnerable’). Therefore the corpus data needed considerable adaptation during compilation and analysis to prevent those idiosyncrasies from acting as noise in the data. Adaptations include responses to spam (in the form of ‘adbots’), cyber-orthography, the ubiquity of names, overlapping conversations and challenges of annotation. Dif culties with gaining participant permissions and demographic information also required signi cant attention. Attempted solutions to these corpus construction and analysis challenges, which are closely bound to the elds of both cyber-research and corpus linguistics, are outlined.
Persistent Identifier	http://hdl.handle.net/10722/345613
ISBN	9780826496102

DC Field	Value	Language
dc.contributor.author	King, Brian Walter	-
dc.date.accessioned	2024-08-27T09:10:00Z	-
dc.date.available	2024-08-27T09:10:00Z	-
dc.date.issued	2009-08-01	-
dc.identifier.isbn	9780826496102	-
dc.identifier.uri	http://hdl.handle.net/10722/345613	-
dc.description.abstract	<p>This chapter addresses problems encountered during the construction and analysis of a synchronic corpus of computer-mediated discourse. The corpus was not primarily constructed for the examination of the linguistic idiosyncrasies of the online chatting medium; rather it is to be used for corpus-based sociolinguistic inquiry into the language use and identity construction of a particular social group (which in this case could be classed as ‘vulnerable’). Therefore the corpus data needed considerable adaptation during compilation and analysis to prevent those idiosyncrasies from acting as noise in the data. Adaptations include responses to spam (in the form of ‘adbots’), cyber-orthography, the ubiquity of names, overlapping conversations and challenges of annotation. Dif culties with gaining participant permissions and demographic information also required signi cant attention. Attempted solutions to these corpus construction and analysis challenges, which are closely bound to the elds of both cyber-research and corpus linguistics, are outlined.<br></p>	-
dc.language	eng	-
dc.relation.ispartof	Contemporary Corpus Linguistics	-
dc.title	Building and analysing corpora of computer-mediated communication	-
dc.type	Book_Chapter	-
dc.identifier.spage	301	-
dc.identifier.epage	320	-

File Download

Supplementary

Book Chapter: Building and analysing corpora of computer-mediated communication

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats