TitleUbuntu Internet Relay Chat Archives for Multiparticipant Chat Analysis
Publication TypeConference Paper
Year of Publication2013
AuthorsUthus, DC, Aha, DW
Conference NameAAAI Spring Symposium on Analyzing Microtext
PublisherAAAI Press
Conference LocationStanford, CA
Keywordschat analysis, chat corpus, machine learning
Abstract

We present the Ubuntu Chat Corpus as a data source for
multiparticipant chat analysis. This addresses the problem
of the lack of a large, publicly suitable corpora for
research in this medium. The advantages of using this
corpus for research is its large number of chat messages,
its multiple languages, its technical nature, and all of the
original chat messages are in the public domain.

Refereed DesignationRefereed
Full Text
pdf: 
http://www.nrl.navy.mil/itd/aic/sites/www.nrl.navy.mil.itd.aic/files/pdfs/%28Uthus%20%26%20Aha%2C%202013%20AAAI%20SS%29%20The%20Ubuntu%20Chat%20Corpus%20for%20Multiparticipant%20Chat%20Analysis.pdf
NRL Publication Release Number: 
12-1231-3791
pub_tags: 
machine learning
chat analysis
chat corpus
key_pub_tags: