TREC Disk 1 & 2
TREC Disks 1 & 2 are the original TREC test collections.
Indexing Disk 1& 2 is easy with Terrier. Only one property in terrier.properties needs to be altered from the default created by trec_setup, as follows:
#skip indexing some tags for these corpora TrecDocTags.process=TEXT,TITLE,HEAD,HL