Communications on Applied Electronics
Foundation of Computer Science (FCS), NY, USA
Volume 6 - Number 1
Year of Publication: 2016
Authors: Thangaraj M., Sivagaminathan P. G.
DOI: 10.5120/cae2016652375
Thangaraj M., Sivagaminathan P. G. An Improved Generic Crawler using Poisson Fit Distribution. Communications on Applied Electronics. 6, 1 (Oct 2016), 7-13. DOI=10.5120/cae2016652375
The remarkable growth of the Internet has filled the World Wide Web with a huge volume of data, much of which remains unexplored by the users for whom it is intended and who could extract it and assimilate it into knowledge. Retrieving potentially useful information from web data requires a broad-spectrum crawler that collects relevant documents and metadata. A breadth-first crawler algorithm is presented that fetches the related web documents needed to create a web archive for alias extraction. This paper shows that the upgraded crawler, which generates a random crawl depth, performs better than crawling to a predetermined depth. Supplying different mean values to the Poisson function that drives the crawler makes it possible to generate a dynamic random depth.
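The idea in the abstract can be illustrated with a minimal sketch: a breadth-first crawl whose depth limit is drawn from a Poisson distribution instead of being fixed in advance. This is not the authors' implementation. The function names (`sample_poisson`, `bfs_crawl`, `toy_links`), the in-memory link graph `TOY_WEB`, and the choice of Knuth's sampling method are all assumptions made for illustration; a real crawler would fetch pages over HTTP and parse their links.

```python
import math
import random
from collections import deque


def sample_poisson(mean, rng):
    """Draw one Poisson-distributed integer (Knuth's multiplication method).

    Assumption: the paper does not say how depths are sampled; this is one
    standard way to do it with only the standard library.
    """
    threshold = math.exp(-mean)
    count = 0
    product = rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count


def bfs_crawl(seed, get_links, max_depth):
    """Visit pages breadth-first from seed, following at most max_depth hops."""
    visited = {seed}
    order = [seed]
    queue = deque([(seed, 0)])
    while queue:
        url, depth = queue.popleft()
        if depth == max_depth:
            continue  # depth limit reached: do not expand this page
        for link in get_links(url):
            if link not in visited:
                visited.add(link)
                order.append(link)
                queue.append((link, depth + 1))
    return order


# Hypothetical in-memory "web" standing in for real HTTP fetching.
TOY_WEB = {
    "a": ["b", "c"],
    "b": ["d"],
    "c": ["d", "e"],
    "d": ["f"],
    "e": [],
    "f": [],
}


def toy_links(url):
    """Return the outgoing links of a page in the toy graph."""
    return TOY_WEB.get(url, [])


if __name__ == "__main__":
    rng = random.Random(0)
    for mean in (1.0, 2.0, 4.0):
        depth = sample_poisson(mean, rng)  # dynamic random depth for this run
        pages = bfs_crawl("a", toy_links, depth)
        print(f"mean={mean} depth={depth} pages={pages}")
```

Changing `mean` shifts the typical crawl depth: a larger mean tends to produce deeper crawls, while the depth of any single run still varies randomly.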