Star AlbumentationsX on GitHub — it powers this leaderboard
scrapy/scrapely
A pure-python HTML screen-scraping library