Open Source -- MIT License

The video editing engine
built for AI agents.

An MCP server that lets Claude, GPT, and Gemini edit videos via tool calls. 17 tools. YAML projects. Apple Silicon native. Think CapCut for AI.

MCP ProtocolFFmpegMetalWhisperYAML
vidya-mcp-server
$ claude "Edit my vlog"
vidya Analyzing 3 clips...
vidya Transcribing with Whisper...
vidya Adding captions: tiktok_bold
vidya Transition: whip_pan
vidya Self-critique pass...
OK Exported: vlog_final_1080p.mp4
$ _
17 Tools
MCP Native
Apple Silicon
Metal + VideoToolbox
How It Works

From brief to final cut
in one conversation

No timeline dragging. No manual edits. Just tell your AI what you want.

01

Agent Receives Brief

Your AI agent (Claude, GPT, Gemini) receives a natural language brief describing the video you want. It plans the edit using Vidya's 17 MCP tools.

02

Vidya Analyzes Footage

Vidya ingests your clips, transcribes audio with Whisper, detects scenes, and builds a .vidya YAML project the AI can read, reason about, and modify.

03

AI Edits and Exports

The agent assembles the timeline, adds captions, transitions, and music. A self-critique loop reviews the edit. Then Vidya renders the final video.

Features

Everything an AI needs to
edit professional video

Built for machines, readable by humans. Every feature designed for agent workflows.

MCP Native

17 tools exposed via the Model Context Protocol. Any MCP-compatible agent (Claude, GPT, Gemini) can call them directly. No custom API, no SDK, no glue code.

Stateful Projects

Every edit lives in a .vidya YAML file the AI can read, reason about, and modify. Version-controlled, diffable, and human-readable. No opaque binary formats.

Apple Silicon Native

Hardware-accelerated via Metal and VideoToolbox on M1/M2/M3/M4. 2x faster than generic FFmpeg. CoreML Whisper for real-time transcription.

6 Caption Presets

tiktok_bold, cinematic, karaoke, minimal, news_ticker, subtitle. Auto-synced to speech via Whisper timestamps. Styled and positioned automatically.

9 Transitions

dissolve, glitch, whip_pan, zoom_in, zoom_out, slide_left, slide_right, fade_black, cut. GPU-rendered on Metal for smooth real-time playback.

Self-Critique Loop

After assembly, the AI reviews its own edit: checks pacing, audio levels, caption sync, and transition coherence. Fixes issues before you even see the output.

7 Platform Exports

One-click export presets for TikTok, Instagram Reels, YouTube Shorts, YouTube Long, LinkedIn, Twitter/X, and custom resolution. Aspect ratios and encoding handled.

Open Source MIT

Fully open source under the MIT license. No vendor lock-in, no black boxes. Fork it, extend it, self-host it. The core MCP server is free forever.

Pricing

Open core. Free forever.

The MCP server is fully open source. Pay only for the native app and cloud features.

Free
0
forever

Open source MCP server

  • MCP server + all 17 tools
  • All caption presets
  • All transitions
  • FFmpeg rendering
  • .vidya YAML projects
  • Self-critique loop
  • Metal HW acceleration
  • Whisper CoreML
  • ProRes export
Get Started
Most Popular
Solo
9
per month

macOS app for power users

  • Everything in Free
  • Native macOS app
  • Metal GPU rendering
  • Whisper CoreML (on-device)
  • Premium caption styles
  • Priority FFmpeg pipeline
  • Team collaboration
  • Cloud rendering
  • ProRes export
Coming Soon
Studio
29
per month

Teams + cloud rendering

  • Everything in Solo
  • Team workspaces
  • Cloud rendering
  • Premium presets library
  • ProRes / DNxHR export
  • Priority support
  • Custom branding
  • SSO / SAML
  • SLA guarantee
Coming Soon
APIUsage-based pricing:0.05/min rendered

All paid plans include a 14-day free trial. Cancel anytime.

FAQ

Got questions?

Everything you need to know about Vidya.

MCP (Model Context Protocol) is an open standard by Anthropic that lets AI models call external tools. Vidya exposes 17 video editing tools via MCP, so any compatible agent (Claude, GPT, Gemini) can edit videos by calling these tools directly -- no custom integration needed.

No. The free open-source MCP server runs on any platform with FFmpeg (macOS, Linux, Windows). Apple Silicon (M1/M2/M3/M4) unlocks hardware acceleration via Metal and VideoToolbox for 2x faster rendering, plus on-device Whisper transcription via CoreML. These are available in the Solo plan.

Yes. Vidya uses the MCP standard protocol, which is agent-agnostic. Any AI that supports MCP tool calls can use Vidya. This includes Claude (native MCP), GPT (via MCP bridge), and Gemini (via MCP bridge). You are not locked into any single AI provider.

The MCP server is fully open source under the MIT license and free forever. All 17 tools, all caption presets, all transitions, FFmpeg rendering, and .vidya project format are included. The paid plans add the native macOS app, Metal GPU rendering, cloud features, and team collaboration.

Vidya supports all formats FFmpeg can handle as input: MP4, MOV, MKV, AVI, WebM, and more. For export, you get optimized presets for TikTok (9:16 H.264), Instagram Reels (9:16 H.264), YouTube Shorts (9:16), YouTube Long (16:9), LinkedIn (16:9 or 1:1), Twitter/X (16:9), and custom resolutions. ProRes export is available in the Studio plan.

Start editing videos
with AI today.

Clone the repo, configure your MCP client, and let your AI agent handle the rest. Open source. Free forever.

Star on GitHub
MIT LicenseNo vendor lock-in