Scrapy is a free and open source web crawling framework, written in Python.
License: Open Source
import.io is a web-based platform for extracting data from websites without writing any code.The tool allows people to convert unstructured web data into a structured format for use in Machine Learning, Artificial Intelligence, Retail Price Monitoring, Store Locators as well as academic and other research. Developed by import.io
License: Commercial
Feature | Scrapy | import.io |
---|---|---|
Command line interface | ||
Data Mining | ||
Screen scraping | ||
Workflow Automation | ||
Screenshot OCR | ||
Anonymous web scraping | ||
In-app server browser | ||
Content-Type Filtering | ||
Support for Amazon S3 | ||
URL Filtering | ||
WARC Output | ||
Robot Process Automation | ||
Business process automation | ||
Macros | ||
Headless | ||
Jquery crawler | ||
Serverless | ||
Artificial intelligence | ||
Natural Language Processing | ||
Web-Based |