Apify

Put the web to work.

Apify is the largest ecosystem where developers build, deploy, and publish data extraction and web automation tools. We call them Actors.

Learn About Apify 🧑‍🎓

  • Find hundreds of ready-made Actors for your web scraping or automation project on Apify Store.
  • Learn everything about web scraping and automation with our free courses that will turn you into an expert scraping developer.
  • Publish your web scrapers as paid Actors on the Apify platform, attract people who need these solutions, and get regular passive income.
  • Watch our livestreams and video content on the Apify YouTube channel.
  • Learn more through tutorials and thought leadership content about web scraping on the Apify Blog and the Crawlee Blog.

We are hiring! 🕸️

Check out the open positions at Apify and help us make the web more programmable.

Pinned

  1. crawlee Public

    Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, an…

    TypeScript, 23.2k stars, 1.4k forks

  2. impit Public

    impit | Rust library for browser impersonation

    Rust, 466 stars, 39 forks

  3. crawlee-python Public

    Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Wo…

    Python, 9k stars, 732 forks

  4. apify-mcp-server Public

    The Apify MCP server enables your AI agents to extract data from social media, search engines, maps, e-commerce sites, or any other website using thousands of ready-made scrapers, crawlers, and aut…

    TypeScript, 1.2k stars, 164 forks

  5. mcpc Public

    A universal CLI client for MCP. mcpc supports persistent sessions, stdio/HTTP, OAuth 2.1, tasks, JSON output for code mode, proxy for AI sandboxes, x402, and more.

    TypeScript, 591 stars, 56 forks

  6. proxy-chain Public

    Node.js implementation of a proxy server (think Squid) with support for SSL, HTTP/HTTPS, SOCKS5, authentication, and upstream proxy chaining.

    JavaScript, 987 stars, 164 forks

Repositories

Showing 10 of 218 repositories
  • apify-docs Public

    This project is the home of Apify's documentation.

    JavaScript, 68 stars, Apache-2.0, 189 forks, 108 issues (1 issue needs help), 43 pull requests, updated May 10, 2026
  • apify-mcp-server Public

    The Apify MCP server enables your AI agents to extract data from social media, search engines, maps, e-commerce sites, or any other website using thousands of ready-made scrapers, crawlers, and automation tools available on the Apify Store.

    TypeScript, 1,202 stars, MIT, 164 forks, 106 issues (4 issues need help), 15 pull requests, updated May 9, 2026
  • mcpc Public

    A universal CLI client for MCP. mcpc supports persistent sessions, stdio/HTTP, OAuth 2.1, tasks, JSON output for code mode, proxy for AI sandboxes, x402, and more.

    TypeScript, 591 stars, Apache-2.0, 56 forks, 8 issues, 9 pull requests, updated May 9, 2026
  • crawlee Public

    Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.

    TypeScript, 23,161 stars, Apache-2.0, 1,354 forks, 139 issues (1 issue needs help), 39 pull requests, updated May 9, 2026
  • crawlee-python Public

    Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Parsel, BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.

    Python, 8,993 stars, Apache-2.0, 732 forks, 73 issues, 3 pull requests, updated May 9, 2026
  • workflows Public

    Apify's reusable GitHub workflows

    TypeScript, 15 stars, 6 forks, 3 issues (1 issue needs help), 2 pull requests, updated May 9, 2026
  • actor-ai-sandbox Public

    Open-source Actor that provides a sandbox environment for agentic AI and coding use cases 📦

    TypeScript, 5 stars, 3 forks, 10 issues, 2 pull requests, updated May 8, 2026
  • apify-shared-js Public

    Utilities and constants shared across Apify projects.

    TypeScript, 18 stars, Apache-2.0, 12 forks, 2 issues, 4 pull requests, updated May 8, 2026
  • proxy-chain Public

    Node.js implementation of a proxy server (think Squid) with support for SSL, HTTP/HTTPS, SOCKS5, authentication, and upstream proxy chaining.

    JavaScript, 987 stars, Apache-2.0, 164 forks, 21 issues (2 issues need help), 16 pull requests, updated May 8, 2026
  • langchain-apify Public

    Apify integration for LangChain 🦜🔗

    Python, 5 stars, Apache-2.0, 3 forks, 2 issues, 4 pull requests, updated May 8, 2026