Tuesday, November 8, 2011

Common Crawl

News

A freely accessible index of 5 billion web pages, their page rank, their link graphs and other metadata, hosted on Amazon EC2, was announced today by the Common Crawl Foundation. "It is crucial [in] our information-based society that Web crawl data be open and accessible to anyone who desires to utilize it," writes Foundation director Lisa Green on the organization's blog.

Interference

Yeltsin was deeply unpopular at that time in Russia, polling no more than 8% and widely blamed for the rise of the gangster oligarch...