> **Building with AI coding agents?** If you're using an AI coding agent, install the official Scalekit plugin. It gives your agent full awareness of the Scalekit API — reducing hallucinations and enabling faster, more accurate code generation.
>
> - **Claude Code**: `/plugin marketplace add scalekit-inc/claude-code-authstack` then `/plugin install <auth-type>@scalekit-auth-stack`
> - **GitHub Copilot CLI**: `copilot plugin marketplace add scalekit-inc/github-copilot-authstack` then `copilot plugin install <auth-type>@scalekit-auth-stack`
> - **Codex**: run the bash installer, restart, then open Plugin Directory and enable `<auth-type>`
> - **Skills CLI** (Windsurf, Cline, 40+ agents): `npx skills add scalekit-inc/skills --list` then `--skill <skill-name>`
>
> `<auth-type>` / `<skill-name>`: `agent-auth`, `full-stack-auth`, `mcp-auth`, `modular-sso`, `modular-scim` — [Full setup guide](https://docs.scalekit.com/dev-kit/build-with-ai/)

---

# Diarize

<div class="grid grid-cols-5 gap-4 items-center">
 <div class="col-span-4">
  Connect to Diarize to transcribe and diarize audio and video content from YouTube, X, Instagram, and TikTok. Submit transcription jobs and retrieve results in JSON, TXT, SRT, or VTT format with speaker labels and word-level timestamps.
 </div>
 <div class="flex justify-center">
  <img src="https://cdn.scalekit.com/sk-connect/assets/provider-icons/diarize.svg" width="64" height="64" alt="Diarize logo" />
 </div>
</div>

Supports authentication: API Key

![Diarize connector card shown in Scalekit's Create Connection search](@/assets/docs/agent-connectors/diarize/scalekit-search-diarize.png)

## Set up the agent connector

<SetupDiarizeSection />

## Usage

<UsageDiarizeSection />

## Tool list

## `diarize_create_transcription_job`

Submit a new transcription and diarization job for an audio or video URL (YouTube, X, Instagram, TikTok). Returns a job ID that can be used to check status and download results.

| Name | Type | Required | Description |
| --- | --- | --- | --- |
| `url` | string | Yes | The URL of the audio or video content to transcribe (e.g. YouTube, X, Instagram, TikTok link) |
| `language` | string | No | Language code for transcription (e.g. `en`, `es`, `fr`). Defaults to auto-detection if not provided. |
| `num_speakers` | integer | No | Expected number of speakers in the audio. Helps improve diarization accuracy. |

## `diarize_get_job_status`

Retrieve the current status of a transcription job by its job ID. Returns job state (`PENDING`, `PROCESSING`, `COMPLETED`, `FAILED`), metadata, and an `estimatedTime` field (in seconds) indicating how long processing is expected to take. Use `estimatedTime` to determine polling frequency and max wait duration — for example, a 49-minute episode may have an `estimatedTime` of ~891 s (~15 mins), so the agent should wait at least that long before giving up.

| Name | Type | Required | Description |
| --- | --- | --- | --- |
| `job_id` | string | Yes | The unique ID of the transcription job to check |

## `diarize_download_transcript`

Download the transcript output for a completed transcription job in JSON, TXT, SRT, or VTT format, including speaker diarization, segments, and word-level timestamps.

| Name | Type | Required | Description |
| --- | --- | --- | --- |
| `job_id` | string | Yes | The unique ID of the completed transcription job |
| `format` | string | No | Output format: `json` (default), `txt`, `srt`, or `vtt` |

---

## More Scalekit documentation

| Resource | What it contains | When to use it |
|----------|-----------------|----------------|
| [/llms.txt](/llms.txt) | Structured index with routing hints per product area | Start here — find which documentation set covers your topic before loading full content |
| [/llms-full.txt](/llms-full.txt) | Complete documentation for all Scalekit products in one file | Use when you need exhaustive context across multiple products or when the topic spans several areas |
| [sitemap-0.xml](https://docs.scalekit.com/sitemap-0.xml) | Full URL list of every documentation page | Use to discover specific page URLs you can fetch for targeted, page-level answers |