Terrier/ClueWeb12

== Terrier/ClueWeb12 ===

Patches

For Terrier 3.5, you will need to apply a few patches:

Terrier 3.6 does not require patching to index ClueWeb12.

Configuration

Generally, you need to follow Terrier/ClueWeb09-B, with some specifics:

trec.collection.class=WARC10Collection
indexer.meta.forward.keys=docno,url
indexer.meta.forward.keylens=26,512
indexer.meta.reverse.keys=docno

TrecDocTags.skip=SCRIPT,STYLE

Retrieval

last edited 2014-06-18 12:46:21 by CraigMacdonald