المعرفة:: البرمجة الحالة::مؤرشفة المراجع::
- https://github.com/apify/crawlee: Crawlee — A web scraping and browser automation library for Node.js that helps you build reliable crawlers. Fast.
- https://github.com/microsoft/playwright / https://github.com/microsoft/playwright-python: Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.
- https://github.com/scrapy/scrapy: Scrapy, a fast high-level web crawling & scraping framework for Python.
- https://github.com/SeleniumHQ/selenium: A browser automation framework and ecosystem.
- https://github.com/jhy/jsoup: the Java HTML parser, built for HTML editing, cleaning, scraping, and XSS safety.