Terrier/Tweets11/TopicCategorisation

TREC Tweets2011 2011 Topic Classification

Source

'Evaluating Real-Time Search over Tweets'. In Proceedings of ICWSM 2012.

Description

Manual classification of the 50 topics (MB001 to MB050) used during the TREC 2011 Microblog track. Each query was assigned into three individual categories. The first category corresponds to the 'best fit' into one of 11 news categories. The second category denotes whether the query topic is concerned with a (U.S.) national or international event or entity. The third category denotes whether the query contains a named entity and if so the type of that entity.

Format

Query String<tab>News Class<tab>National/International<tab>Entity Class

Statistics

News Classes

National / International

Entity Classes

Download and Citation Licence

By downloading this data you consent to include a citation to the paper below in any publication that you use the data:

@inproceedings{soboroffICWSM2012,
    author={Soboroff, Ian  and McCullough, Dean and Lin, Jimmy and Macdonald, Craig and Ounis, Iadh and McCreadie, Richard },
    title={{Evaluating Real-Time Search over Tweets}},
    booktitle={Proceedings of the International Conference on Weblogs and Social Media (ICWSM 2012)},
    year={2012},
    location={Dublin, Ireland}
}

Download link: [Upload new attachment "TREC2011MicroblogTrack.topicclassification.list"]