Sogou Labs Shares Useful Data

Sogou LabsSogou, the search engine of Sohu, launched its labs recently. The labs will show the innovative products, product prototypes, data on search and Chinese characters, and research reports on search by Sogou engineers.

Currently, products and prototypes in its labs webpage include Sogou Chinese Characters Input software, Sogou Ranks, Webpage Auto Categorization, that is a prototype to classify any Chinese webpages into some predefined categories.

However, most important, I think, is the data on search shared by Sogou. The data shared sofar include

The data can be used in any non-commercial projects with credit to Sogou. I like the open attitude of Sogou, it may help to harness collective intelligence to advance the research in Chinese search engine.

2 Responses to “Sogou Labs Shares Useful Data”

  1. China Snippets on November 24th, 2006 9:56 am

    Interesting. I downloaded the data but I wouldn’t have a clue how to open them. Have you downloaded the data and do you know how to open the file and which software to use.

    Cheers,

    G.

  2. Tangos on November 24th, 2006 10:46 pm

    You mean tar.gz compressed file? you can use Winrar to decompress it.

Post a comment