    Repositories list

    • Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
      Python
      Apache License 2.0
      Updated Jan 31, 2025
    • Model Context Protocol (MCP) Client for Apify's Actors
      Updated Jan 31, 2025
    • crawlee
      Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
      TypeScript
      Apache License 2.0
      Updated Jan 31, 2025
    • workflows
      Apify's reusable GitHub workflows
      Python
      Updated Jan 30, 2025
    • Documentation site for the Actor Programming Model – a fresh take on serverless microapps. Built with Astro.
      MDX
      MIT License
      Updated Jan 30, 2025
    • Apify API client for Python
      Python
      Apache License 2.0
      Updated Jan 30, 2025
    • Apify ESLint preset to be shared between projects
      JavaScript
      Apache License 2.0
      Updated Jan 30, 2025
    • Model Context Protocol (MCP) Server for Apify's Actors
      TypeScript
      Apache License 2.0
      Updated Jan 30, 2025
    • Apify SDK monorepo
      TypeScript
      Apache License 2.0
      Updated Jan 30, 2025
    • Browser fingerprinting tools for anonymizing your scrapers. Developed by Apify.
      TypeScript
      Apache License 2.0
      Updated Jan 29, 2025
    • This project is the home of Apify's documentation.
      API Blueprint
      Apache License 2.0
      Updated Jan 29, 2025
    • apify-cli
      The Apify command-line interface helps you create, develop, build, and run Apify Actors, and manage the Apify cloud platform.
      TypeScript
      Updated Jan 29, 2025
    • Utilities and constants shared across Apify projects.
      TypeScript
      Apache License 2.0
      Updated Jan 29, 2025
    • actor-cmd
      TypeScript
      Updated Jan 29, 2025
    • The Apify SDK for Python is the official library for creating Apify Actors in Python. It provides useful features like actor lifecycle management, local storage emulation, and actor event handling.
      Python
      Apache License 2.0
      Updated Jan 29, 2025
    • impit
      impit | Rust library for browser impersonation
      Rust
      Updated Jan 29, 2025
    • Node.js implementation of a proxy server (think Squid) with support for SSL, authentication and upstream proxy chaining.
      JavaScript
      Apache License 2.0
      Updated Jan 29, 2025
    • JavaScript
      Updated Jan 28, 2025
    • rustls
      Patched fork of `rustls` for `impit`
      Rust
      Other
      Updated Jan 28, 2025
    • Apify API client for JavaScript / Node.js.
      TypeScript
      Apache License 2.0
      Updated Jan 28, 2025
    • The /llms.txt Generator Actor 🕸️📄 extracts website content to create an llms.txt file for AI apps 🤖✨ like LLM fine-tuning and indexing. Output is available 📥 in the Key-Value Store for easy download and integration into workflows. 🚀
      Python
      Apache License 2.0
      Updated Jan 27, 2025
    • Transfer data from Apify Actors to vector databases (Chroma, Milvus, Pinecone, PostgreSQL (PG-Vector), Qdrant, and Weaviate)
      Python
      Apache License 2.0
      Updated Jan 25, 2025
    • This project is the 🏠 home of Apify Actor templates to help users quickly get started. Contributions welcome!
      Python
      Updated Jan 23, 2025
    • Apify's fork of `docusaurus-plugin-typedoc-api`, customized for our Python documentation.
      TypeScript
      Updated Jan 22, 2025
    • RAG Web Browser is an Apify Actor to feed your LLM applications and RAG pipelines with up-to-date text content scraped from the web.
      TypeScript
      Apache License 2.0
      Updated Jan 17, 2025
    • A Homebrew tap for Apify tools
      Ruby
      Updated Jan 16, 2025
    • Base Docker images for Apify Actors.
      Dockerfile
      Apache License 2.0
      Updated Jan 14, 2025
    • h2
      Patched fork of h2 for impit
      Rust
      MIT License
      Updated Jan 14, 2025
    • A GitHub Action to push an Actor to the Apify platform
      Apache License 2.0
      Updated Jan 14, 2025
    • This whitepaper describes a new concept for building serverless microapps called Actors, which are easy to develop, share, integrate, and build upon. Actors are a reincarnation of the UNIX philosophy for programs running in the cloud.
      Apache License 2.0
      Updated Jan 8, 2025