The BLOGS06 test collection

Modified: 21 March 2006

BLOGS06 is a TREC test collection, created and distributed by the University of Glasgow.



Feeds:

Total number of feeds: 100,649
Total number of feed documents collected: 753,681
Average feed documents collected every day: 10,615
Uncompressed Size: 38.6GB
Compressed Size: 8.0GB

Permalink Documents:

Total number of permalink documents: 3,215,171
Average permalink documents collected every day: 45,284
Uncompressed Size: 88.8GB
Compressed Size: 12.6GB

Homepage Documents:

Total number of homepage documents: 324,880
Average homepage documents collected every day: 4,576
Uncompressed Size: 20.8GB
Compressed Size: 4.0GB

Total compressed size of the collection is approximately 25GB (about 148GB uncompressed).
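As a sanity check on the figures above, the following minimal Python sketch sums the per-component sizes and derives the number of collection days implied by each total and daily average. The dictionary layout and the function names (totals, implied_days) are illustrative assumptions and are not part of the collection or its tools. The stated 25GB total appears to correspond to the sum of the compressed sizes.

```python
# Figures copied from the statistics listed above; sizes are in GB.
# The structure and names here are hypothetical, chosen for illustration only.
COMPONENTS = {
    "feeds": {
        "documents": 753_681, "per_day": 10_615,
        "uncompressed_gb": 38.6, "compressed_gb": 8.0,
    },
    "permalinks": {
        "documents": 3_215_171, "per_day": 45_284,
        "uncompressed_gb": 88.8, "compressed_gb": 12.6,
    },
    "homepages": {
        "documents": 324_880, "per_day": 4_576,
        "uncompressed_gb": 20.8, "compressed_gb": 4.0,
    },
}


def totals(components):
    """Return (compressed, uncompressed) totals in GB, rounded to 0.1GB."""
    compressed = sum(c["compressed_gb"] for c in components.values())
    uncompressed = sum(c["uncompressed_gb"] for c in components.values())
    return round(compressed, 1), round(uncompressed, 1)


def implied_days(component):
    """Collection days implied by dividing a total by its daily average."""
    return component["documents"] / component["per_day"]


compressed_total, uncompressed_total = totals(COMPONENTS)
print(f"Compressed total:   {compressed_total}GB")
print(f"Uncompressed total: {uncompressed_total}GB")
for name, component in COMPONENTS.items():
    print(f"{name}: about {implied_days(component):.1f} days of collection")
```

The implied number of days is an inference from the listed averages, not a figure stated on this page.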

Distribution information: