Scraping Software | Scraping Plugins

Software

火车头采集 (LocoySpider)

http://www.locoy.com/

Edition comparison

http://www.locoy.com/product#s3

简数采集 (Keydatas)

http://www.keydatas.com/

Turn complexity into simplicity and put data within reach

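No software download is needed: log in and start using it directly. There are no rules to write by hand; rules are generated through smart recognition plus point-and-click selection. Powerful SEO tools are built in. It is a web scraper that makes collecting and publishing simple and efficient.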

Keydatas scraper: documentation center

http://doc.keydatas.com/

Hawk

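Welcome to Hawk! Hawk is a graphical, what-you-see-is-what-you-get tool for collecting and cleaning data, with no programming required. It is open source under the GPL.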

Project home: https://github.com/ferventdesert/Hawk

Project documentation: https://ferventdesert.github.io/Hawk/

Downloads: https://github.com/ferventdesert/Hawk/releases

Mirror downloads (mainland China): https://gitee.com/deserthawk/Hawk/attach_files

Sample project files: https://github.com/ferventdesert/Hawk-Projects/Hawk3

Tutorial videos: https://space.bilibili.com/312273788/channel/detail?cid=68345

Usage guide

https://ferventdesert.github.io/Hawk/

https://ferventdesert.github.io/Hawk/#3

The latest Hawk5 tutorial videos are all here (finally hosted on Bilibili, which is friendly and ad-free):

https://space.bilibili.com/312273788/channel/detail?cid=68345


Plugins

Chrome extensions

Instant Data Scraper

https://chrome.google.com/webstore/detail/instant-data-scraper/ofaokhiedipichpaobibbnahnkdoiiah

Offered by: webrobots.io

Instant Data Scraper extracts data from web pages and exports it as Excel or CSV files

Instant Data Scraper is an automated data extraction tool for any website. It uses AI to predict which data is most relevant on an HTML page and allows saving it to an Excel or CSV file (XLS, XLSX, CSV).

Community Support group: https://www.facebook.com/groups/instantdata/

This tool does not require website-specific scripts; instead, it uses heuristic AI analysis of HTML structure to detect data for extraction. If the prediction is not satisfactory, it lets the user customize the selections for greater accuracy. This type of scraping technology is much more convenient because it does not require large user-created libraries of scraping scripts, which often become filled with outdated and redundant versions. This means that our scraping method works just as well with small and lesser-known websites as it does with global giants like Amazon. Also, our users do not need any coding, JSON, or XML skills!