Supadata MCP connector

OAuth 2.1/DCRAIAnalyticsSearchTranscription

Connect with Supadata MCP to extract transcripts, metadata, and structured content from YouTube, social media, and the web using AI.

Supadata MCP connector

Install the SDK
Section titled “Install the SDK”
- Node.js
- Python
Terminal window
1 npm install @scalekit-sdk/node
Terminal window
1 pip install scalekit
Full SDK reference: Node.js | Python
Set your credentials
Section titled “Set your credentials”

Add your Scalekit credentials to your .env file. Find values in app.scalekit.com > Developers > API Credentials.
.env
```
SCALEKIT_ENVIRONMENT_URL=<your-environment-url>
SCALEKIT_CLIENT_ID=<your-client-id>
SCALEKIT_CLIENT_SECRET=<your-client-secret>
```

1
import { ScalekitClient } from '@scalekit-sdk/node'
2
import 'dotenv/config'
3

4
const scalekit = new ScalekitClient(
5
  process.env.SCALEKIT_ENV_URL,
6
  process.env.SCALEKIT_CLIENT_ID,
7
  process.env.SCALEKIT_CLIENT_SECRET,
8
)
9
const actions = scalekit.actions
10

11
const connector = 'supadatamcp'
12
const identifier = 'user_123'
13

14
// Generate an authorization link for the user
15
const { link } = await actions.getAuthorizationLink({ connectionName: connector, identifier })
16
console.log('Authorize Supadata MCP:', link)
17
process.stdout.write('Press Enter after authorizing...')
18
await new Promise(r => process.stdin.once('data', r))
19

20
// Make your first call
21
const result = await actions.executeTool({
22
  connector,
23
  identifier,
24
  toolName: 'supadatamcp_supadata_check_crawl_status',
25
  toolInput: { id: 'YOUR_ID' },
26
})
27
console.log(result)

1
import os
2
from scalekit.client import ScalekitClient
3
from dotenv import load_dotenv
4
load_dotenv()
5

6
scalekit_client = ScalekitClient(
7
    env_url=os.getenv("SCALEKIT_ENV_URL"),
8
    client_id=os.getenv("SCALEKIT_CLIENT_ID"),
9
    client_secret=os.getenv("SCALEKIT_CLIENT_SECRET"),
10
)
11
actions = scalekit_client.actions
12

13
connection_name = "supadatamcp"
14
identifier = "user_123"
15

16
# Generate an authorization link for the user
17
link_response = actions.get_authorization_link(
18
    connection_name=connection_name,
19
    identifier=identifier,
20
)
21
print("Authorize Supadata MCP:", link_response.link)
22
input("Press Enter after authorizing...")
23

24
# Make your first call
25
result = actions.execute_tool(
26
    tool_input={"id":"YOUR_ID"},
27
    tool_name="supadatamcp_supadata_check_crawl_status",
28
    connection_name=connection_name,
29
    identifier=identifier,
30
)
31
print(result)

What you can do

Connect this agent connector to let your agent:

Transcript supadata — Extract transcript from a video or file URL
Scrape supadata — Scrape a single web page and return its content
Metadata supadata — Fetch metadata from a media URL (YouTube, TikTok, Instagram, Twitter)
Map supadata — Discover URLs on a website
Extract supadata — Extract structured data from a video URL using AI
Crawl supadata — Create a crawl job to extract content from all pages on a website

Tool list

Use the exact tool names from the Tool list below when you call execute_tool. If you’re not sure which name to use, list the tools available for the current user first.

supadatamcp_supadata_check_crawl_status#Check crawl job status and retrieve results. Returns status: scraping, completed, failed, or cancelled.1 param

Check crawl job status and retrieve results. Returns status: scraping, completed, failed, or cancelled.

NameTypeRequiredDescription

idstringrequiredThe job ID returned by supadata_crawl when the crawl job was created.

supadatamcp_supadata_check_extract_status#Check extract job status and retrieve results. Returns status: queued, active, completed, or failed.1 param

Check extract job status and retrieve results. Returns status: queued, active, completed, or failed.

NameTypeRequiredDescription

idstringrequiredThe job ID returned by supadata_extract when the extraction job was created.

supadatamcp_supadata_check_transcript_status#Check transcript job status and retrieve results. Returns status: queued, active, completed, or failed.1 param

Check transcript job status and retrieve results. Returns status: queued, active, completed, or failed.

NameTypeRequiredDescription

idstringrequiredThe job ID returned by supadata_transcript for a large file transcription job.

supadatamcp_supadata_crawl#Create a crawl job to extract content from all pages on a website. Returns a jobId - use supadata_check_crawl_status with that jobId to poll for results.2 params

Create a crawl job to extract content from all pages on a website. Returns a jobId - use supadata_check_crawl_status with that jobId to poll for results.

NameTypeRequiredDescription

urlstringrequiredThe full URL of the website to crawl.

limitnumberoptionalMaximum number of pages to crawl. Optional.

supadatamcp_supadata_extract#Extract structured data from a video URL using AI. Provide a prompt for what to extract, a JSON Schema for the output format, or both. Returns a jobId for async processing.3 params

Extract structured data from a video URL using AI. Provide a prompt for what to extract, a JSON Schema for the output format, or both. Returns a jobId for async processing.

NameTypeRequiredDescription

urlstringrequiredThe full URL of the video to extract structured data from.

promptstringoptionalNatural language prompt describing what to extract from the video. Optional if schema is provided.

schemaobjectoptionalJSON Schema defining the structure of the extracted output. Optional if prompt is provided.

supadatamcp_supadata_map#Discover URLs on a website1 param

Discover URLs on a website

NameTypeRequiredDescription

urlstringrequiredThe full URL of the website to map.

supadatamcp_supadata_metadata#Fetch metadata from a media URL (YouTube, TikTok, Instagram, Twitter). Returns platform info, title, description, author details, engagement stats, media details, tags, and creation date.1 param

Fetch metadata from a media URL (YouTube, TikTok, Instagram, Twitter). Returns platform info, title, description, author details, engagement stats, media details, tags, and creation date.

NameTypeRequiredDescription

urlstringrequiredThe full URL of the media content to fetch metadata for.

supadatamcp_supadata_scrape#Scrape a single web page and return its content. Fetches and extracts the text content from the specified URL, with optional link removal and language filtering.3 params

Scrape a single web page and return its content. Fetches and extracts the text content from the specified URL, with optional link removal and language filtering.

NameTypeRequiredDescription

urlstringrequiredThe full URL of the web page to scrape.

langstringoptionalLanguage code to filter or request content in (e.g. 'en', 'fr'). Optional.

noLinksbooleanoptionalWhen true, strips all hyperlinks from the scraped content. Optional, defaults to false.

supadatamcp_supadata_transcript#Extract transcript from a video or file URL. For large files, returns a jobId instead of the transcript directly - use supadata_check_transcript_status with that jobId to poll for results.5 params

Extract transcript from a video or file URL. For large files, returns a jobId instead of the transcript directly - use supadata_check_transcript_status with that jobId to poll for results.

NameTypeRequiredDescription

urlstringrequiredThe full URL of the video or file to transcribe.

chunkSizenumberoptionalSize of text chunks to split the transcript into. Optional.

langstringoptionalLanguage code for the transcript (e.g. 'en', 'fr'). Optional.

modestringoptionalTranscription mode to use. Optional.

textbooleanoptionalWhen true, returns plain text instead of structured transcript data. Optional.

Supadata MCP connector

Supadata MCP connector

Install the SDK

Set your credentials

Authorize and make your first call

What you can do

Tool list