An MCP server that lets Claude, GPT, and Gemini edit videos via tool calls. 17 tools. YAML projects. Apple Silicon native. Think CapCut for AI.
No timeline dragging. No manual edits. Just tell your AI what you want.
Your AI agent (Claude, GPT, Gemini) receives a natural language brief describing the video you want. It plans the edit using Vidya's 17 MCP tools.
Vidya ingests your clips, transcribes audio with Whisper, detects scenes, and builds a .vidya YAML project the AI can read, reason about, and modify.
The agent assembles the timeline, adds captions, transitions, and music. A self-critique loop reviews the edit. Then Vidya renders the final video.
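The flow above revolves around the `.vidya` project file. A minimal sketch of what such a file might look like is below; the schema and field names are illustrative assumptions, not the actual format:

```yaml
# Illustrative sketch -- field names are assumptions, not the real schema
name: product-demo
clips:
  - id: intro
    src: footage/intro.mov
  - id: demo
    src: footage/screen.mp4
timeline:
  - clip: intro
    trim: [0.0, 4.5]
    transition: dissolve      # one of the built-in GPU transitions
  - clip: demo
captions:
  preset: tiktok_bold         # auto-synced via Whisper timestamps
export:
  preset: youtube_shorts      # 9:16, encoding handled
```

Because it is plain YAML, the agent can diff, review, and patch the edit like any other text file.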
Built for machines, readable by humans. Every feature designed for agent workflows.
17 tools exposed via the Model Context Protocol. Any MCP-compatible agent (Claude, GPT, Gemini) can call them directly. No custom API, no SDK, no glue code.
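Under MCP, a tool invocation is a standard JSON-RPC `tools/call` request. The tool name and arguments below are hypothetical, shown only to illustrate the shape of a call any MCP client can make:

```json
{
  "jsonrpc": "2.0",
  "id": 1,
  "method": "tools/call",
  "params": {
    "name": "add_clip",
    "arguments": { "src": "footage/intro.mov", "trim": [0.0, 4.5] }
  }
}
```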
Every edit lives in a .vidya YAML file the AI can read, reason about, and modify. Version-controlled, diffable, and human-readable. No opaque binary formats.
Hardware-accelerated via Metal and VideoToolbox on M1/M2/M3/M4. Up to 2x faster than generic FFmpeg rendering. CoreML Whisper for real-time transcription.
tiktok_bold, cinematic, karaoke, minimal, news_ticker, subtitle. Auto-synced to speech via Whisper timestamps. Styled and positioned automatically.
dissolve, glitch, whip_pan, zoom_in, zoom_out, slide_left, slide_right, fade_black, cut. GPU-rendered on Metal for smooth real-time playback.
After assembly, the AI reviews its own edit: checks pacing, audio levels, caption sync, and transition coherence. Fixes issues before you even see the output.
One-click export presets for TikTok, Instagram Reels, YouTube Shorts, YouTube Long, LinkedIn, Twitter/X, and custom resolution. Aspect ratios and encoding handled.
Fully open source under the MIT license. No vendor lock-in, no black boxes. Fork it, extend it, self-host it. The core MCP server is free forever.
The MCP server is fully open source. Pay only for the native app and cloud features.
Open source MCP server
macOS app for power users
Teams + cloud rendering
All paid plans include a 14-day free trial. Cancel anytime.
Everything you need to know about Vidya.
MCP (Model Context Protocol) is an open standard by Anthropic that lets AI models call external tools. Vidya exposes 17 video editing tools via MCP, so any compatible agent (Claude, GPT, Gemini) can edit videos by calling these tools directly -- no custom integration needed.
No. The free open-source MCP server runs on any platform with FFmpeg (macOS, Linux, Windows). Apple Silicon (M1/M2/M3/M4) unlocks hardware acceleration via Metal and VideoToolbox for up to 2x faster rendering, plus on-device Whisper transcription via CoreML. These accelerated features are included in the Solo plan.
Yes. Vidya uses the MCP standard protocol, which is agent-agnostic. Any AI that supports MCP tool calls can use Vidya. This includes Claude (native MCP), GPT (via MCP bridge), and Gemini (via MCP bridge). You are not locked into any single AI provider.
The MCP server is fully open source under the MIT license and free forever. All 17 tools, all caption presets, all transitions, FFmpeg rendering, and .vidya project format are included. The paid plans add the native macOS app, Metal GPU rendering, cloud features, and team collaboration.
Vidya supports all formats FFmpeg can handle as input: MP4, MOV, MKV, AVI, WebM, and more. For export, you get optimized presets for TikTok (9:16 H.264), Instagram Reels (9:16 H.264), YouTube Shorts (9:16), YouTube Long (16:9), LinkedIn (16:9 or 1:1), Twitter/X (16:9), and custom resolutions. ProRes export is available in the Studio plan.
Clone the repo, configure your MCP client, and let your AI agent handle the rest. Open source. Free forever.
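For a client like Claude Desktop, configuration is typically one entry in `claude_desktop_config.json`; the `vidya-mcp` command below is a placeholder for wherever you installed the server:

```json
{
  "mcpServers": {
    "vidya": {
      "command": "vidya-mcp",
      "args": []
    }
  }
}
```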
Star on GitHub