Carnegie Mellon University

Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments

journal contribution
Posted on 2011-06-01. Authored by Kevin Gimpel, Nathan Schneider, Brendan O'Connor, Dipanjan Das, Daniel Mills, Jacob Eisenstein, Michael Heilman, Dani Yogatama, Jeffrey Flanigan, Noah A. Smith

We address the problem of part-of-speech tagging for English data from the popular microblogging service Twitter. We develop a tagset, annotate data, develop features, and report tagging results nearing 90% accuracy. The data and tools have been made available to the research community with the goal of enabling richer text analysis of Twitter and related social media data sets.

History

Publisher Statement

Copyright 2011 ACL

Date

2011-06-01
