Apache Nutch is a highly extensible and scalable open source web crawler software project. Apache Nutch is a highly extensible and scalable open source web crawler software project.Nutch is coded entirely in the Java programming language, but data is written in language-independent formats. It has a highly modular architecture, allowing developers to create plug-ins for media-type parsing, data retrieval, querying and clustering.The fetcher (“robot” or “web crawler”) has been written from scratch specifically for this project.
Find Top 10
Apache Nutch
Alternatives
# | Image | App Name | Features | Platforms | Price | Website Link |
2 | ProxyCrawl | Web | Freemium | Website | ||
3 | StormCrawler | Mac Windows Linux |
Free | Website | ||
4 | Scrapy | Mac Windows BSD Linux |
Free | Website | ||
5 | Mixnode | Web | Commercial | Website | ||
6 | Heritrix | Mac Windows Linux |
Free | Website | ||
7 | ACHE Crawler | Mac Windows Linux |
Free | Website |