Terrier/WikiForum

We've decided to concentrate on the wiki for the community participation with Terrier. You can also discuss Terrier on the Terrier/LiveDoc/MailingLists

Question: Why do we need to rebuild the collection.spec file correctly (see Terrier/LiveDoc/TrecExample)? Is the file collection.spec created with the script trec_setup.sh?

From: GianniAmati

VassilisPlachouras: The trec_setup.sh script uses the utility find on Unix/Linux/MacOS X systems and the trec_setup.bat script uses the class [WWW] FileFind, on Windows, so that we obtain the absolute paths for all the files under a given directory.

If under the directory you specify for the trec_setup script, there are only the collection files, then it is not necessary to create the collection.spec file manually. If under the directory where the collection files are stored, there are other files as well, then it may be necessary to check that the automatically generated collection.spec file contains only the collection files, and either edit it or create it manually.

Have someone used the distributed version of Terrier with the terabyte TREC track to compare efficiency between a language model approach and the DFR models, or conventional models?

More information: I have not the data of the terabyte track and I would like to know whether the assignment of non-zero probabilities to terms increase the complexity of the retrieval, especially with long queries or QE.

From: GianniAmati

Template for Questions

Question: goes here

More information: goes gere From: goes here

Question: goes here

More information: goes gere From: goes here


CategoryTerrier