DiffBot!

DiffBot alternatives

  • Portia

  • Portia is an open source visual scraping tool, allows you to scrape websites without any programming knowledge required! Simply annotate pages you're interested in, and Portia will create a spider to extract data from similar pages.

    tags: web-scraping web-crawler web-crawling screen-scraping
  • import.io

  • import.io is a free web-based platform that puts the power of the machine readable web in your hands. Using our tools you can create an API or crawl an entire website in a fraction of the time of traditional methods, no coding required. Our highly efficient and scalable platform allows you to process 1,000s of queries at once and get real-time data in any format you choose. We also offer an easy to use client library to make exporting, integrating and using your data as simple as extracting it.

    tags: data-mining data-extraction extractor crawler data-export
  • Kantu Web Automation Browser

  • Kantu allows you to visually automate your task, and makes web automation fun again. It lets you create solutions for web automation, web scraping or web testing in minutes. It's like the ultimate robot stand-in for all of your web browser automation needs! Whatever you used to do manually, Kantu can do automatically -- stuff like filling out forms, clicking on links, performing inquiries, you name it, Kantu has it. Multiple page forms? Not a problem for Kantu.

    tags: automation browser-integration computer-vision file-downloading file-uploading
  • Extracty

  • Extracty can extract any web data and create an API to the webpage's information.

    tags: web-based api framework data-mining search-engine-optimization
  • Apify

  • Apify is the world’s most advanced web automation platform. Web crawler that works on every website.

    tags: jquery-crawler web-crawler web-crawling web-scraper web-scraping
  • Webhose.io

  • Webhose.io is an advanced DaaS (Data as a Service) platform.

    tags: social-media search-engine search-tool big-data news-feed
  • Scrapinghub

  • Scrapinghub is the most advanced platform for deploying and running web crawlers (also known as "spiders"). It allows your organization to build crawlers easily, deploy them instantly and scale them on demand, without having to manage servers, backups or cron jobs. Everything is stored in our highly available database and retrievable from our API.

    tags: web-based data-mining company web-scraper
  • Product API by Fetchee

  • Simple API to extract product data for any URL.

    tags: api data-mining retailer web-scraping web-crawler
  • Helium Scraper

  • Extract massive amounts of text, images and filesAutomate it with user friendly Action Trees and/or JavaScriptDefine what to extract with just a few clicks; no programming is requiredExport extracted data to .CSV files, .XML or any other custom formatExport and import data to and from .MDB (Microsoft Access Database) filesCopy and paste from and to any spreadsheet applicationRun SQL queries against extracted dataInject and run JavaScript code in your local copy of any web pageAccess your data from your injected code with SQLProxy rotation supportMulti-tab browsing »

    tags: extract-text web-scraping website-screenshot web-scraper content-extraction
  • Diggernaut

  • Diggernaut is a cloud based service for web scraping, data extraction and other ETL tasks. Imagine spending hours a day manually collecting data from websites you need. It's very cumbersome and time consuming. With Diggernaut, you can speed up the data collection process a thousand times and save time to do more important tasks. Our tiny diggers can do web scraping on your behalf and get data from websites for you. Just leave it up to Diggernaut to get your job done.

    tags: big-data data-mining web-scraping etl web-scraper
  • Web Robots

  • Web Robots have several offers and tools:- For users without programming skills. A Chrome extension which guesses where is listing type data on a web page and coverts this data into CSV or Excel file.- For users with Javascript programming skills. Another Chrome extension which is an Integrated Development Environment to write and execute scraper robots. This allows running robots with all features on user's computer for free.- For companies. Web Robots can provide a fully managed data scraping service or license access to the whole platform where client can create and schedule robots, run them on cloud. »

    tags: web-scraping web-scraper web-scraping-tools web-scraping-software webdata
  • PromptCloud

  • What’s in it for you?Big data has become essential to closely monitor user sentiments and to respond to the dynamic market. But acquiring it places high technology barriers.With an aim to make Big data look really small so that you just get your relevant data served on the table, we have built our Data as a Service platform that uses cloud computing and machine learning techniques. Services range from getting you the information from your desired list of sites to building an internal search index to enable keyword-based searches. »

    tags: web-scraping web-crawling data-crawling-provider daas-provider data-crawling
  • Octoparse

  • Octoparse is a modern visual web data extraction software. Both experienced and inexperienced users would find it easy to use Octoparse to bulk extract information from websites, for most of scraping tasks no coding needed. Users can extract data from 98% of open websites using our tools. Octoparse with its point-and-click interface, makes web-scraping very easy to learn and understand. Use the data extracted to power your business intelligence, build up your customer database.

    tags: cloud-service crawler data-analytics data-extraction data-miner
  • Mozenda

  • Turn web page content into structured data all without coding. *Important* - Mozenda uses a Windows application that must be installed on Windows Vista or newer

    tags: crawler crawling data-extraction data-mining no-coding
  • 80legs

  • 80legs offers powerful web crawling. Extract data from web pages, images, and any other online content. Start crawling websites now faster, easier, and with unlimited reach.

    tags: data-mining crawling spidering harvesting htmlscraping