I have attached a breakdown of the scraped torrent data by site. The
data covers only active torrents (those still listed on their
respective sites). There is a large set of null hashes, which appear to
come from earlier scrapes, since we currently retrieve hashes with a
high success rate. I'm assuming we want these null hashes to be updated.
Again, for sites that require downloading the torrent file to retrieve
the hash, the process is much slower, especially when the site limits
downloads. Snarf-it is one example: as far as I can tell, it limits
each IP to 30 downloads. For now I am using a few web proxies to get
around such download limits. Also, as the data may show, some sites may
need more attention in the search scheme, as their low URL counts
suggest. Either that, or they simply don't have much content.
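
For reference, here is a rough sketch of how the hash can be computed
once a .torrent file has been downloaded. It is not our scraper's
actual code: the function names (info_hash, _skip_value) and the choice
of Python are my own assumptions for illustration. It relies only on
the standard BitTorrent (v1) definition of the info hash, which is the
SHA-1 of the bencoded "info" dictionary exactly as it appears in the
file.

```python
import hashlib


def _skip_value(data: bytes, i: int) -> int:
    """Return the index just past the bencoded value starting at data[i]."""
    c = data[i:i + 1]
    if c == b"i":                       # integer: i<digits>e
        return data.index(b"e", i) + 1
    if c in (b"l", b"d"):               # list or dict: l...e / d...e
        i += 1
        while data[i:i + 1] != b"e":
            i = _skip_value(data, i)    # dict keys and values are skipped alike
        return i + 1
    if c.isdigit():                     # byte string: <length>:<bytes>
        colon = data.index(b":", i)
        return colon + 1 + int(data[i:colon])
    raise ValueError(f"invalid bencode at offset {i}")


def info_hash(torrent_bytes: bytes) -> str:
    """Return the hex SHA-1 of the raw bencoded 'info' dictionary."""
    if torrent_bytes[:1] != b"d":
        raise ValueError("top-level value is not a dictionary")
    i = 1
    while torrent_bytes[i:i + 1] != b"e":
        # Each dictionary entry is a byte-string key followed by a value.
        colon = torrent_bytes.index(b":", i)
        key_start = colon + 1
        val_start = key_start + int(torrent_bytes[i:colon])
        key = torrent_bytes[key_start:val_start]
        val_end = _skip_value(torrent_bytes, val_start)
        if key == b"info":
            # Hash the original bytes rather than a re-encoded copy.
            return hashlib.sha1(torrent_bytes[val_start:val_end]).hexdigest()
        i = val_end
    raise ValueError("no 'info' dictionary found")
```

Something along these lines could be run over already-downloaded
files to fill in the null hashes, e.g. info_hash(open(path,
"rb").read()). It raises ValueError on malformed input, so a bad
download can be flagged and retried rather than stored.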

