Free Video to Text
Transcribe any video to text, free and online. Drop in an MP4, MOV, or WEBM — we pull out the audio and turn it into a timestamped transcript you can copy or download as .txt, .srt, or .vtt. 99 languages, 100% in your browser. Files never leave your device. No sign-up.
Drop an audio or video file
or click to browse. Best with files under 30 min on most browsers. Cap at 60 min — split longer files first with our Audio Splitter.
Don't have a file? Record one with our voice recorder to test how transcription works.
100% in your browser. Audio stays on your device. The Whisper AI model downloads once (~40 MB) from our servers, then runs locally for every transcription. We can't access your audio because it never leaves your computer. Privacy policy.
Runs free in your browser. Keep this tab open while it runs — we'll chime if you switch tabs. Models cache after first download. Need translation? Use the dedicated Audio Translator.
Transcript
Free video-to-text transcription — how it works
Drop in a video and SnipSound extracts its audio track in your browser, then transcribes it with OpenAI's open-source Whisper model running locally via WebAssembly. The first time you click Transcribe, your browser downloads a ~40 MB model file from our servers; after that, every video you transcribe is fully local. Your video never gets uploaded to any server — not ours, not OpenAI's, not anyone's. Supports MP4, MOV, WEBM, MKV and more.
Need a subtitle file instead of plain text? Use the SRT Generator. Transcribing audio files? See Audio & Video Transcription.
What it's good for
- Transcribing podcast interviews, meeting recordings, voice memos, lectures, or any clear-speech audio.
- Generating subtitles for video — .srt and .vtt downloads with accurate timestamps that drop into YouTube, Vimeo, or any editor.
- A free, private alternative to Otter, Rev, and Descript for journalists, researchers, students, and creators — no subscription, no upload, no per-file time cap.
- Privacy-sensitive audio you don't want on a third-party server — therapy notes, confidential interviews, internal meetings.
Edit the transcript — fix mistakes, find & replace
The transcript is fully editable. Hit ✎ Edit and click any line to retype a misheard word — changes save automatically and flow into every export. Use Find to jump to every place a word appears and Replace all to fix it everywhere at once (e.g. a name Whisper misspelled throughout). Matching is whole-word, and you can flip between the Original and Edited tabs anytime.
What it's not so good for
- Heavy background noise, music behind voice, or multiple overlapping speakers — tiny Whisper struggles with these.
- Heavy accents or non-mainstream dialects — bigger Whisper models handle these better but are too heavy for a browser.
- Speaker diarization ("who said what") — not supported by Whisper-tiny.
- Files longer than 60 minutes — we cap input length to keep browser RAM under control.
Translate audio to English
Tick "Translate to English" and Whisper renders any non-English audio as English text. Spanish podcast → English transcript. Mandarin interview → English notes. Dedicated Audio Translator tool here if translation is your primary need.