TREC Tweets2011 2011 Topic Classification

Source: 'Evaluating Real-Time Search over Tweets'. In Proceedings of ICWSM 2012.

Description: Manual classification of the 50 topics (MB001 to MB050) used during the TREC 2011 Microblog track. Each query was assigned into three individual categories. The first category corresponds to the 'best fit' into one of 11 news categories. The second category denotes whether the query topic is conserned with a (U.S.) national or international event or entity. The third category denotes whether the query contains a named entity and if so the type of that entity.

Format: Query String<tab>News Class<tab>National/International<tab>Entity Class


News Classes

National / International

Entity Classes

Download and Citation Licence

By downloading this data you consent to include a citation to the paper below in any publication that you use the data:



Please cross-ref using TerrierTeam/xxx