BUbiNG: Massive Crawling for the Masses
MARINO, ANDREA;
2014-01-01
Abstract
Although web crawlers have been around for twenty years, there is virtually no freely available, open-source crawling software that guarantees high throughput, overcomes the limits of single-machine tools, and at the same time scales linearly with the amount of resources available. This paper aims to fill this gap.