File Download
There are no files associated with this item.
Supplementary
-
Citations:
- Appears in Collections:
Book Chapter: Building and analysing corpora of computer-mediated communication
Title | Building and analysing corpora of computer-mediated communication |
---|---|
Authors | |
Issue Date | 1-Aug-2009 |
Abstract | This chapter addresses problems encountered during the construction and analysis of a synchronic corpus of computer-mediated discourse. The corpus was not primarily constructed for the examination of the linguistic idiosyncrasies of the online chatting medium; rather it is to be used for corpus-based sociolinguistic inquiry into the language use and identity construction of a particular social group (which in this case could be classed as ‘vulnerable’). Therefore the corpus data needed considerable adaptation during compilation and analysis to prevent those idiosyncrasies from acting as noise in the data. Adaptations include responses to spam (in the form of ‘adbots’), cyber-orthography, the ubiquity of names, overlapping conversations and challenges of annotation. Dif culties with gaining participant permissions and demographic information also required signi cant attention. Attempted solutions to these corpus construction and analysis challenges, which are closely bound to the elds of both cyber-research and corpus linguistics, are outlined. |
Persistent Identifier | http://hdl.handle.net/10722/345613 |
ISBN |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | King, Brian Walter | - |
dc.date.accessioned | 2024-08-27T09:10:00Z | - |
dc.date.available | 2024-08-27T09:10:00Z | - |
dc.date.issued | 2009-08-01 | - |
dc.identifier.isbn | 9780826496102 | - |
dc.identifier.uri | http://hdl.handle.net/10722/345613 | - |
dc.description.abstract | <p>This chapter addresses problems encountered during the construction and analysis of a synchronic corpus of computer-mediated discourse. The corpus was not primarily constructed for the examination of the linguistic idiosyncrasies of the online chatting medium; rather it is to be used for corpus-based sociolinguistic inquiry into the language use and identity construction of a particular social group (which in this case could be classed as ‘vulnerable’). Therefore the corpus data needed considerable adaptation during compilation and analysis to prevent those idiosyncrasies from acting as noise in the data. Adaptations include responses to spam (in the form of ‘adbots’), cyber-orthography, the ubiquity of names, overlapping conversations and challenges of annotation. Dif culties with gaining participant permissions and demographic information also required signi cant attention. Attempted solutions to these corpus construction and analysis challenges, which are closely bound to the elds of both cyber-research and corpus linguistics, are outlined.<br></p> | - |
dc.language | eng | - |
dc.relation.ispartof | Contemporary Corpus Linguistics | - |
dc.title | Building and analysing corpora of computer-mediated communication | - |
dc.type | Book_Chapter | - |
dc.identifier.spage | 301 | - |
dc.identifier.epage | 320 | - |