TitleUbuntu Internet Relay Chat Archives for Multiparticipant Chat Analysis
Publication TypeConference Paper
Year of Publication2013
AuthorsUthus, DC, Aha, DW
Conference NameAAAI Spring Symposium on Analyzing Microtext
PublisherAAAI Press
Conference LocationStanford, CA
Keywordschat analysis, chat corpus, machine learning

We present the Ubuntu Chat Corpus as a data source for
multiparticipant chat analysis. This addresses the problem
of the lack of a large, publicly suitable corpora for
research in this medium. The advantages of using this
corpus for research is its large number of chat messages,
its multiple languages, its technical nature, and all of the
original chat messages are in the public domain.

Refereed DesignationRefereed
Full Text
NRL Publication Release Number: 
machine learning
chat analysis
chat corpus