Star AlbumentationsX on GitHub — it powers this leaderboard
scrapy/dirbot
Scrapy project to scrape public web directories (educational) [DEPRECATED]