Transcription That's Actually Free

No per-minute fees. No cloud uploads. No usage limits. Runs on your machine, forever.

100% FREE - Runs Locally

You just recorded 30 minutes of content. Now you need captions, descriptions, tags, and a title. It all starts here.

Transcription is the backbone of Loki Studio. From one transcript, Loki generates your captions, descriptions, tags, and thumbnail text. A 30-minute video transcribes in about 3 minutes on your GPU. Your videos never leave your computer.

  • Free forever. No API keys, no cloud fees, no usage caps
  • GPU-accelerated. 2-22x faster than CPU
  • Word-level timestamps for karaoke-style captions
  • Separate mic from game audio (up to 4 tracks)
  • Strips filler words ("um", "ah", "like") automatically
Transcription Interface
Transcription with multi-track audio support

GPU-Accelerated Speed

See the difference CUDA makes

~60 min
CPU Processing

For a 30-minute video

~3 min
GPU (CUDA)

RTX 3060 or better

Even modest GPUs see 5-10x speedups. CPU fallback available for systems without CUDA.

What You Get

🌍

99+ Languages

Whisper supports English, Japanese, Chinese, Spanish, French, German, and dozens more. Auto-detect or manually specify.

⏱️

Word-Level Timestamps

Every word gets precise timing, enabling karaoke-style captions that highlight each word as it's spoken.

🎚️

Multi-Track Support

Configure up to 4 audio tracks. Transcribe just your mic, or include game audio. Each track is processed separately.

🚫

Filler Word Removal

The "Ah-Counter" strips verbal hesitations like "um", "ah", "like", and "you know" from your transcriptions.

🔄

Batch Processing

Select multiple videos and transcribe them all. Process your entire backlog while you sleep.

🌐

Built-in Translation

Translate transcriptions to 34+ languages with NLLB-200 neural translation. Word-level timing preserved.

Whisper Model Sizes

Choose the right balance of speed vs. accuracy

Model Size Speed Accuracy
tiny ~75 MB Fastest Basic
base ~140 MB Fast Good
small ~460 MB Moderate Better
medium ~1.5 GB Slower Great
large-v3-turbo ~800 MB Fast Best
large-v3 ~3 GB Slowest Excellent

Recommended: large-v3-turbo offers the best balance of speed and accuracy for most creators.

Stop Paying Per Minute. Start Transcribing for Free.

Download Loki Studio and process your entire backlog while you sleep. No limits. No fees.

Download Free Read the Guide
Buy me a coffee