Star AlbumentationsX on GitHub — it powers this leaderboard

Star on GitHub
← Back to leaderboard
apache

nutch

apache/nutch

Apache Nutch is an extensible and scalable web crawler

3,136stars
Forks
1,264
Open issues
11
Watchers
3,136
Size
135.6 MB
JavaApache License 2.0
apachecrawlinghadoopjavanutchweb-crawler
Created: May 21, 2009
Updated: Feb 27, 2026
Last push: Feb 27, 2026