Supadata MCP connector
OAuth 2.1/DCRAIAnalyticsSearchTranscriptionConnect with Supadata MCP to extract transcripts, metadata, and structured content from YouTube, social media, and the web using AI.
Supadata MCP connector
-
Install the SDK
Section titled “Install the SDK”Terminal window npm install @scalekit-sdk/nodeTerminal window pip install scalekit -
Set your credentials
Section titled “Set your credentials”Add your Scalekit credentials to your
.envfile. Find values in app.scalekit.com > Developers > API Credentials..env SCALEKIT_ENVIRONMENT_URL=<your-environment-url>SCALEKIT_CLIENT_ID=<your-client-id>SCALEKIT_CLIENT_SECRET=<your-client-secret> -
Authorize and make your first call
Section titled “Authorize and make your first call”quickstart.ts import { ScalekitClient } from '@scalekit-sdk/node'import 'dotenv/config'const scalekit = new ScalekitClient(process.env.SCALEKIT_ENV_URL,process.env.SCALEKIT_CLIENT_ID,process.env.SCALEKIT_CLIENT_SECRET,)const actions = scalekit.actionsconst connector = 'supadatamcp'const identifier = 'user_123'// Generate an authorization link for the userconst { link } = await actions.getAuthorizationLink({ connectionName: connector, identifier })console.log('Authorize Supadata MCP:', link)process.stdout.write('Press Enter after authorizing...')await new Promise(r => process.stdin.once('data', r))// Make your first callconst result = await actions.executeTool({connector,identifier,toolName: 'supadatamcp_supadata_check_crawl_status',toolInput: { id: 'YOUR_ID' },})console.log(result)quickstart.py import osfrom scalekit.client import ScalekitClientfrom dotenv import load_dotenvload_dotenv()scalekit_client = ScalekitClient(env_url=os.getenv("SCALEKIT_ENV_URL"),client_id=os.getenv("SCALEKIT_CLIENT_ID"),client_secret=os.getenv("SCALEKIT_CLIENT_SECRET"),)actions = scalekit_client.actionsconnection_name = "supadatamcp"identifier = "user_123"# Generate an authorization link for the userlink_response = actions.get_authorization_link(connection_name=connection_name,identifier=identifier,)print("Authorize Supadata MCP:", link_response.link)input("Press Enter after authorizing...")# Make your first callresult = actions.execute_tool(tool_input={"id":"YOUR_ID"},tool_name="supadatamcp_supadata_check_crawl_status",connection_name=connection_name,identifier=identifier,)print(result)
What you can do
Section titled “What you can do”Connect this agent connector to let your agent:
- Transcript supadata — Extract transcript from a video or file URL
- Scrape supadata — Scrape a single web page and return its content
- Metadata supadata — Fetch metadata from a media URL (YouTube, TikTok, Instagram, Twitter)
- Map supadata — Discover URLs on a website
- Extract supadata — Extract structured data from a video URL using AI
- Crawl supadata — Create a crawl job to extract content from all pages on a website
Tool list
Section titled “Tool list”Use the exact tool names from the Tool list below when you call execute_tool. If you’re not sure which name to use, list the tools available for the current user first.
supadatamcp_supadata_check_crawl_status#Check crawl job status and retrieve results. Returns status: scraping, completed, failed, or cancelled.1 param
Check crawl job status and retrieve results. Returns status: scraping, completed, failed, or cancelled.
idstringrequiredThe job ID returned by supadata_crawl when the crawl job was created.supadatamcp_supadata_check_extract_status#Check extract job status and retrieve results. Returns status: queued, active, completed, or failed.1 param
Check extract job status and retrieve results. Returns status: queued, active, completed, or failed.
idstringrequiredThe job ID returned by supadata_extract when the extraction job was created.supadatamcp_supadata_check_transcript_status#Check transcript job status and retrieve results. Returns status: queued, active, completed, or failed.1 param
Check transcript job status and retrieve results. Returns status: queued, active, completed, or failed.
idstringrequiredThe job ID returned by supadata_transcript for a large file transcription job.supadatamcp_supadata_crawl#Create a crawl job to extract content from all pages on a website. Returns a jobId - use supadata_check_crawl_status with that jobId to poll for results.2 params
Create a crawl job to extract content from all pages on a website. Returns a jobId - use supadata_check_crawl_status with that jobId to poll for results.
urlstringrequiredThe full URL of the website to crawl.limitnumberoptionalMaximum number of pages to crawl. Optional.supadatamcp_supadata_extract#Extract structured data from a video URL using AI. Provide a prompt for what to extract, a JSON Schema for the output format, or both. Returns a jobId for async processing.3 params
Extract structured data from a video URL using AI. Provide a prompt for what to extract, a JSON Schema for the output format, or both. Returns a jobId for async processing.
urlstringrequiredThe full URL of the video to extract structured data from.promptstringoptionalNatural language prompt describing what to extract from the video. Optional if schema is provided.schemaobjectoptionalJSON Schema defining the structure of the extracted output. Optional if prompt is provided.supadatamcp_supadata_map#Discover URLs on a website1 param
Discover URLs on a website
urlstringrequiredThe full URL of the website to map.supadatamcp_supadata_metadata#Fetch metadata from a media URL (YouTube, TikTok, Instagram, Twitter). Returns platform info, title, description, author details, engagement stats, media details, tags, and creation date.1 param
Fetch metadata from a media URL (YouTube, TikTok, Instagram, Twitter). Returns platform info, title, description, author details, engagement stats, media details, tags, and creation date.
urlstringrequiredThe full URL of the media content to fetch metadata for.supadatamcp_supadata_scrape#Scrape a single web page and return its content. Fetches and extracts the text content from the specified URL, with optional link removal and language filtering.3 params
Scrape a single web page and return its content. Fetches and extracts the text content from the specified URL, with optional link removal and language filtering.
urlstringrequiredThe full URL of the web page to scrape.langstringoptionalLanguage code to filter or request content in (e.g. 'en', 'fr'). Optional.noLinksbooleanoptionalWhen true, strips all hyperlinks from the scraped content. Optional, defaults to false.supadatamcp_supadata_transcript#Extract transcript from a video or file URL. For large files, returns a jobId instead of the transcript directly - use supadata_check_transcript_status with that jobId to poll for results.5 params
Extract transcript from a video or file URL. For large files, returns a jobId instead of the transcript directly - use supadata_check_transcript_status with that jobId to poll for results.
urlstringrequiredThe full URL of the video or file to transcribe.chunkSizenumberoptionalSize of text chunks to split the transcript into. Optional.langstringoptionalLanguage code for the transcript (e.g. 'en', 'fr'). Optional.modestringoptionalTranscription mode to use. Optional.textbooleanoptionalWhen true, returns plain text instead of structured transcript data. Optional.