
Apify MCP

Connect to Apify MCP to discover and run web scraping and data extraction Actors, retrieve Actor run status and output, search the web for AI pipelines, and browse Apify and Crawlee documentation — all from within your AI agent workflows.

Supports authentication: API Key

What you can build with this connector
| Use case | Tools involved |
| --- | --- |
| Run a web scraper | apifymcp_search_actors → apifymcp_fetch_actor_details → apifymcp_call_actor → apifymcp_get_actor_output |
| Long-running extraction jobs | apifymcp_call_actor (async) → apifymcp_get_actor_run (poll) → apifymcp_get_actor_output |
| Real-time web research for RAG | apifymcp_rag_web_browser → feed Markdown content into LLM context |
| Find the right Actor for a task | apifymcp_search_actors with keywords → apifymcp_fetch_actor_details for input schema |
| Look up Apify or Crawlee docs | apifymcp_search_apify_docs → apifymcp_fetch_apify_docs for full page content |

Key concepts:

  • Actors: Serverless cloud applications on the Apify platform. Each Actor has a specific input schema — always call apifymcp_fetch_actor_details with output: { inputSchema: true } before calling an Actor.
  • Sync vs async: apifymcp_call_actor runs synchronously by default and waits for the result. Pass async: true for long-running tasks, then poll with apifymcp_get_actor_run and retrieve output with apifymcp_get_actor_output.
  • Datasets: Actor output is stored in a dataset. Use apifymcp_get_actor_output with fields and pagination (limit, offset) to retrieve large result sets efficiently.
  • RAG web browser: apifymcp_rag_web_browser is a purpose-built tool for AI pipelines — it queries Google Search, scrapes the top N pages, and returns clean Markdown content ready for LLM grounding.
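The sync/async split above reduces to a simple poll loop. A minimal sketch, assuming a caller-supplied `fetch_status` callback that stands in for an apifymcp_get_actor_run call (the `wait_for_run` helper is illustrative, not part of the connector; the status strings are Apify's standard run states):

```python
import time

# Terminal statuses for an Apify Actor run (standard Apify status values).
TERMINAL_STATUSES = {"SUCCEEDED", "FAILED", "ABORTED", "TIMED-OUT"}

def wait_for_run(fetch_status, interval=5.0, max_polls=120):
    """Poll until the run reaches a terminal state and return that status.

    fetch_status is a caller-supplied function standing in for an
    apifymcp_get_actor_run call; it returns the run's status string.
    """
    for _ in range(max_polls):
        status = fetch_status()
        if status in TERMINAL_STATUSES:
            return status
        time.sleep(interval)
    raise TimeoutError("Actor run did not finish within the polling budget")
```

Once `wait_for_run` returns "SUCCEEDED", retrieve the results with apifymcp_get_actor_output using the run's defaultDatasetId.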

Register your Apify API token with Scalekit so it can authenticate and proxy Actor requests on behalf of your users. Unlike OAuth connectors, Apify MCP uses API token authentication — there is no redirect URI or OAuth flow.

  1. Get an Apify API token

    • Go to console.apify.com and sign in or create a free account.

    • In the left sidebar, click your avatar → Settings → API & Integrations → API tokens.

    • Click + Create new token. Give it a name (e.g., Agent Auth) and click Create token.

    • Copy the token immediately — it will not be shown again.

  2. Create a connection in Scalekit

    • In the Scalekit dashboard, go to Agent Auth → Connections. Find Apify MCP and click Create.

    • Note the Connection name — you will use this as connection_name in your code (e.g., apifymcp).

  3. Add a connected account

    Connected accounts link a specific user identifier in your system to an Apify API token. Add them via the dashboard for testing, or via the Scalekit API in production.

    Via dashboard (for testing)

    • Open the connection you created and click the Connected Accounts tab → Add account.

    • Fill in:

      • Your User’s ID — a unique identifier for this user in your system (e.g., user_123)
      • Apify Token — the token you copied in step 1
    • Click Save.

    Via API (for production)

    await scalekit.actions.upsertConnectedAccount({
      connectionName: 'apifymcp',
      identifier: 'user_123', // your user's unique ID
      credentials: { token: 'apify_api_...' },
    });

Connect a user’s Apify account and run web scraping and data extraction Actors through Scalekit. Scalekit handles token storage and tool execution automatically.

Apify MCP is primarily used through Scalekit tools. Use scalekit_client.actions.execute_tool() to discover Actors, fetch their input schemas, run them, and retrieve output — without handling Apify credentials in your application code.

Tool calling

Use this connector when you want an agent to run web scraping or data extraction tasks using Apify Actors.

  • Use apifymcp_search_actors to discover Actors for a specific platform or use case before deciding which to run.
  • Use apifymcp_fetch_actor_details to retrieve an Actor’s input schema before calling it — always pass output: { inputSchema: true } to keep the response concise.
  • Use apifymcp_call_actor to run an Actor synchronously, or with async: true for long-running tasks.
  • Use apifymcp_get_actor_run to poll the status of an async run, and apifymcp_get_actor_output to retrieve results once complete.
  • Use apifymcp_rag_web_browser when you need real-time web content for LLM grounding — it returns clean Markdown from the top search result pages.
examples/apifymcp_fetch_actor_details.py

import os

from scalekit.client import ScalekitClient

scalekit_client = ScalekitClient(
    client_id=os.environ["SCALEKIT_CLIENT_ID"],
    client_secret=os.environ["SCALEKIT_CLIENT_SECRET"],
    env_url=os.environ["SCALEKIT_ENV_URL"],
)

connected_account = scalekit_client.actions.get_or_create_connected_account(
    connection_name="apifymcp",
    identifier="user_123",
)

tool_response = scalekit_client.actions.execute_tool(
    tool_name="apifymcp_fetch_actor_details",
    connected_account_id=connected_account.connected_account.id,
    tool_input={
        "actor": "apify/web-scraper",
        # Request only the input schema to keep the response concise.
        "output": {"inputSchema": True},
    },
)

print("Actor details:", tool_response)
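Building on the example above, the full discover → inspect → run sequence can be sketched as the ordered `execute_tool` payloads an agent would send. The keywords, Actor name, and placeholder IDs below are illustrative; fetch the real input schema before constructing the `input` object:

```python
# Ordered (tool_name, tool_input) pairs for a discover-then-run workflow.
# Each pair would be passed to scalekit_client.actions.execute_tool().
workflow = [
    # 1. Find candidate Actors for the task.
    ("apifymcp_search_actors",
     {"keywords": "google maps scraper", "limit": 5}),
    # 2. Inspect only the input schema of the chosen Actor.
    ("apifymcp_fetch_actor_details",
     {"actor": "apify/google-maps-scraper",
      "output": {"inputSchema": True}}),
    # 3. Start a long-running run without blocking.
    ("apifymcp_call_actor",
     {"actor": "apify/google-maps-scraper",
      "input": {"searchStrings": ["coffee shops in Austin"]},
      "async": True}),
    # 4. Poll the run until it completes (runId comes from step 3's response).
    ("apifymcp_get_actor_run", {"runId": "<run-id-from-step-3>"}),
    # 5. Fetch the results (datasetId is the run's defaultDatasetId).
    ("apifymcp_get_actor_output",
     {"datasetId": "<default-dataset-id>", "limit": 100}),
]
```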

apifymcp_search_actors

Search the Apify Store to discover Actors for a given use case or platform. Returns Actor names, IDs, descriptions, and usage stats. Does not run any scraping — use this to find the right Actor before calling it.

| Name | Type | Required | Description |
| --- | --- | --- | --- |
| keywords | string | No | Search terms (e.g., "instagram scraper", "google maps"). Leave empty to browse popular Actors. Default: "" |
| limit | integer | No | Number of results to return (1–100). Default: 5 |
| offset | integer | No | Number of results to skip for pagination. Default: 0 |
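The `limit`/`offset` pair follows the usual pagination pattern, shared by several tools in this connector. A small hypothetical helper to advance a payload by one page:

```python
def next_page(params):
    """Return a copy of a tool_input payload advanced by one page.

    Works for any payload that paginates with limit/offset, e.g. the
    inputs for apifymcp_search_actors or apifymcp_search_apify_docs.
    The default limit of 5 matches this tool's documented default.
    """
    return {**params, "offset": params.get("offset", 0) + params.get("limit", 5)}

first = {"keywords": "instagram scraper", "limit": 10, "offset": 0}
second = next_page(first)  # same keywords, offset advanced to 10
```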

apifymcp_fetch_actor_details

Retrieve detailed information about an Actor, including its input schema, README, pricing, and output schema. Always call this before apifymcp_call_actor to understand required and optional input parameters.

| Name | Type | Required | Description |
| --- | --- | --- | --- |
| actor | string | Yes | The Actor ID or name (e.g., apify/instagram-scraper) |
| output.description | boolean | No | Include a short description of the Actor |
| output.inputSchema | boolean | No | Include the full JSON input schema — use this before calling the Actor |
| output.mcpTools | boolean | No | Include MCP tool definitions for the Actor |
| output.metadata | boolean | No | Include Actor metadata (version, author, categories) |
| output.outputSchema | boolean | No | Include the output data schema |
| output.pricing | boolean | No | Include pricing information |
| output.rating | boolean | No | Include user ratings and review count |
| output.readme | boolean | No | Include the full README (can be very large — use sparingly) |
| output.stats | boolean | No | Include usage statistics (total runs, users) |

apifymcp_call_actor

Run an Actor from the Apify Store with the specified input. By default runs synchronously and waits for the result. Use async: true for long-running tasks, then track progress with apifymcp_get_actor_run and retrieve output with apifymcp_get_actor_output.

| Name | Type | Required | Description |
| --- | --- | --- | --- |
| actor | string | Yes | The Actor ID or name to run (e.g., apify/web-scraper) |
| input | object | Yes | Input object matching the Actor's input schema. Fetch the schema first with apifymcp_fetch_actor_details |
| async | boolean | No | Set to true to start the run and return immediately without waiting for results. Default: false |
| previewOutput | boolean | No | Set to true to include a preview of the output dataset in the response (sync mode only) |
| callOptions.memory | integer | No | Memory limit for the run in megabytes (e.g., 256, 512, 1024) |
| callOptions.timeout | integer | No | Timeout for the run in seconds |
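As a sketch, a synchronous call with resource limits might use a payload like the following. The `startUrls` key is typical of apify/web-scraper but is an assumption here; always confirm the keys against the schema returned by apifymcp_fetch_actor_details:

```python
tool_input = {
    "actor": "apify/web-scraper",
    # Must match the Actor's input schema (fetch it first); the
    # startUrls shape below is an assumption for illustration.
    "input": {"startUrls": [{"url": "https://example.com"}]},
    "previewOutput": True,      # sync mode only: preview the output dataset
    "callOptions": {
        "memory": 512,          # cap the run at 512 MB
        "timeout": 300,         # abort the run after 5 minutes
    },
}
```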

apifymcp_get_actor_run

Get the current status and metadata for a specific Actor run. Use this to poll an async run until it completes. Returns run status, timestamps, performance stats, and storage resource IDs.

| Name | Type | Required | Description |
| --- | --- | --- | --- |
| runId | string | Yes | The ID of the Actor run to check (returned by apifymcp_call_actor when async: true) |

apifymcp_get_actor_output

Retrieve output dataset items from a completed Actor run. Supports field selection to reduce response size, and pagination for large datasets.

| Name | Type | Required | Description |
| --- | --- | --- | --- |
| datasetId | string | Yes | The dataset ID to fetch output from (found in the apifymcp_call_actor response or apifymcp_get_actor_run result as defaultDatasetId) |
| fields | string | No | Comma-separated list of fields to include, with dot notation for nested fields (e.g., "title,url,metadata.description"). Returns all fields by default |
| limit | number | No | Maximum number of items to return. Default: 100 |
| offset | number | No | Number of items to skip for pagination. Default: 0 |
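For datasets larger than one page, `limit`/`offset` support a simple accumulation loop. A minimal sketch, assuming a `fetch_page` callback that stands in for repeated apifymcp_get_actor_output calls:

```python
def fetch_all_items(fetch_page, page_size=100):
    """Accumulate every item in a dataset by paging with limit/offset.

    fetch_page(limit, offset) stands in for an apifymcp_get_actor_output
    call and must return a list of dataset items.
    """
    items, offset = [], 0
    while True:
        page = fetch_page(limit=page_size, offset=offset)
        items.extend(page)
        if len(page) < page_size:  # a short page means we reached the end
            return items
        offset += page_size
```

Combine this with a `fields` selection (e.g., "title,url") to keep each page small when items are large.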

apifymcp_rag_web_browser

Search Google and scrape the top N result pages, returning clean content for use in AI pipelines and RAG (Retrieval-Augmented Generation) workflows. Can also scrape a specific URL directly by passing it as the query.

| Name | Type | Required | Description |
| --- | --- | --- | --- |
| query | string | Yes | A Google Search query (e.g., "best vector databases 2025") or a specific URL to scrape directly |
| maxResults | integer | No | Number of top search result pages to scrape (default: 3) |
| outputFormats | array | No | Content formats to return. Options: "text", "markdown", "html". Default: ["markdown"] |
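Downstream of this tool, the returned Markdown pages are typically concatenated into the LLM prompt. A hypothetical helper with a rough character budget (the payload keys match the table above; `build_context` and the budget value are illustrative, not part of the connector):

```python
tool_input = {
    "query": "best vector databases 2025",  # or a direct URL to scrape one page
    "maxResults": 3,
    "outputFormats": ["markdown"],
}

def build_context(markdown_pages, max_chars=8000):
    """Join scraped Markdown pages into one grounding block for an LLM,
    truncated to a rough character budget."""
    joined = "\n\n---\n\n".join(markdown_pages)
    return joined[:max_chars]
```

A token-aware truncation (counting model tokens rather than characters) would be more precise, but a character budget is often a good enough first cut.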

apifymcp_search_apify_docs

Search Apify and Crawlee documentation using full-text search. Returns matching page titles, URLs, and snippets. Follow up with apifymcp_fetch_apify_docs to retrieve the full content of a specific page.

| Name | Type | Required | Description |
| --- | --- | --- | --- |
| query | string | Yes | The search query (e.g., "dataset pagination", "proxy configuration") |
| docSource | string | No | Documentation source to search. Options: "apify" (default), "crawlee-js", "crawlee-py" |
| limit | number | No | Maximum number of results to return (1–20). Default: 5 |
| offset | number | No | Number of results to skip for pagination. Default: 0 |

apifymcp_fetch_apify_docs

Fetch the full content of an Apify or Crawlee documentation page by URL. Use after finding a relevant page with apifymcp_search_apify_docs.

| Name | Type | Required | Description |
| --- | --- | --- | --- |
| url | string | Yes | The full URL of the documentation page to fetch (e.g., https://docs.apify.com/platform/actors) |