3 John Sule Imokha

Retrieving information in the World Wide Web, the world’s largest collection of documents is a challenging and important task. The scale of the WWW is immense, consisting of at least twenty billion publicly visible web pages distributed on millions of servers world-wide. There is no enforcement on adherence to formal protocols to publish in the web. Authors publish in a wide variety of formats, which includes deliberately misleading search platforms and hence increasing the chance of retrieving irrelevant web pages and this action has led to the degradation of search results.