| 1 |
browser-use
Frameworks
|
91,723 |
|
Web Scraping
|
→ |
|
Make websites accessible for AI agents with easy browser automation.
|
|
| 2 |
crawl4ai
Frameworks
|
64,916 |
|
Web Scraping
|
→ |
|
An open-source, LLM-friendly web crawler that provides lightning-fast, structured data extraction specifically designed for AI agents.
|
|
| 3 |
scrapy
Frameworks
|
61,530 |
|
Web Scraping
|
→ |
|
A fast high-level screen scraping and web crawling framework.
|
|
| 4 |
requests
HTTP & Scraping
|
53,950 |
|
HTTP Clients
|
→ |
|
HTTP Requests for Humans.
|
|
| 5 |
aiohttp
HTTP & Scraping
|
16,427 |
|
HTTP Clients
|
→ |
|
Asynchronous HTTP client/server framework for asyncio and Python.
|
|
| 6 |
httpx
HTTP & Scraping
|
15,251 |
|
HTTP Clients
|
→ |
|
A next generation HTTP client for Python.
|
|
| 7 |
trafilatura
Content Extraction
|
5,860 |
|
Web Scraping
|
→ |
|
A tool for gathering text and metadata from the web, with built-in content filtering.
|
|
| 8 |
mechanicalsoup
Frameworks
|
4,854 |
|
Web Scraping
|
→ |
|
A Python library for automating interaction with websites.
|
|
| 9 |
urllib3
HTTP & Scraping
|
4,020 |
|
HTTP Clients
|
→ |
|
A HTTP library with thread-safe connection pooling, file post support, sanity friendly.
|
|
| 10 |
sumy
Content Extraction
|
3,680 |
|
Web Scraping
|
→ |
|
A module for automatic summarization of text documents and HTML pages.
|
|
| 11 |
modoboa
HTTP & Scraping
|
3,480 |
|
Email
|
→ |
|
A mail hosting and management platform including a modern Web UI.
|
|
| 12 |
furl
HTTP & Scraping
|
2,800 |
|
HTTP Clients
|
→ |
|
A small Python library that makes parsing and manipulating URLs easy.
|
|
| 13 |
yagmail
HTTP & Scraping
|
2,725 |
|
Email
|
→ |
|
Yet another Gmail/SMTP client.
|
|
| 14 |
feedparser
Content Extraction
|
2,362 |
|
Web Scraping
|
→ |
|
|
|
| 15 |
html2text
Content Extraction
|
2,147 |
|
Web Scraping
|
→ |
|
Convert HTML to Markdown-formatted text.
|
|
| 16 |
micawber
Content Extraction
|
674 |
|
Web Scraping
|
→ |
|
A small library for extracting rich content from URLs.
|
|
| 17 |
httptap
HTTP & Scraping
|
491 |
|
HTTP Clients
|
→ |
|
Dissects an HTTP request into DNS, TCP, TLS, wait, and transfer phases and renders the timings as a waterfall.
|
|