StormCrawler!

StormCrawler alternatives

  • Scrapy

  • Scrapy is an open source and collaborative framework for extracting the data you need from websites in a fast, simple, yet extensible way.

    tags: framework data-mining web-scraping
  • Heritrix

  • Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.

    tags: web-crawler web-crawling web-data-crawling
  • Apache Nutch

  • Apache Nutch is an extensible and scalable open-source web crawler.

    tags: web-crawler web-crawling web-scraper
  • ACHE Crawler

  • ACHE is a web crawler for domain-specific search.

    tags: web-crawler web-crawling web-scraper web-scraping