Powered by Whisper & Gemma 3 AI

Easy-to-Use Transcription Tool for the Web

Transform your audio and video content into searchable transcripts, AI summaries, and organized notes in minutes.
Supports MP3, MP4, M4A, MOV, YouTube, and more.

YouTube Import

Paste any YouTube URL and get instant transcriptions with speaker identification.

Browser Recording

Record audio directly in your browser with our built-in recording tool.

AI Summaries

Get intelligent summaries and key insights powered by Gemma 3 AI.

Transform Audio & Video into Actionable Content

Your Complete Transcription Toolbox

From YouTube videos to audio recordings, ScribeFlow turns any content into searchable transcripts, AI summaries, and organized notes.

Auto-Transcribe Audio/Video

Upload MP3/MP4 or import directly from YouTube. Our Whisper-powered engine delivers accurate transcripts in minutes.

95%+ accuracy
Multiple formats
Speaker identification
Learn more

AI-Generated Summaries & Notes

Instant AI summaries, bullet points, and a Notes pane for each transcript—perfect for sharing with your team.

Powered by Gemma 3
Key insights
Action items
Learn more

Organize in One Library

Your transcripts, summaries, and notes live in one searchable library. Filter by date, language, or model with ease.

Smart search
Advanced filters
Cloud sync
Learn more

Import & Record with Ease

Seamlessly import YouTube links or record audio directly in your browser. No plugins required—just click and go.

YouTube import
Browser recording
Drag & drop
Learn more
Three Simple Ways to Get Started

Import & Record Made Simple

Whether it's YouTube, a local file, or live audio—get started in seconds.

Import from YouTube

Paste any YouTube URL (e.g., https://youtube.com/watch?v=...) and choose your Whisper model to transcribe instantly.

In-Browser Audio Recorder

Record lectures or meetings right in your browser. Pause, resume, and upload—all without leaving the page.

Drag-and-Drop Upload

Simply drag your MP3, MP4, or WAV files (up to 128 MB each) into the upload area, then choose a Whisper model from Tiny to Large.

Choose Your Plan

Simple Pricing

Choose the plan that's right for you. No hidden fees, no surprises.

Free Tier

$0

Perfect for discovering ScribeFlow

  • Convert audio files to text
  • Edit transcripts
  • One audio file at a time
  • Limited to 250 minutes of audio processing
  • Limited to 1 GB of storage
Get Started

Personal Tier

$9.99/month

Perfect for individuals

  • All free tier features
  • Edit transcripts
  • Multiple audio files at once
  • Limited to 5,000 minutes of audio processing
  • Limited to 10 GB of storage
  • Basic support with a 72-hour response time
Get Started
Most Popular

Pro Tier

$29.99/month

Ideal for power users

  • All personal tier features
  • Unlimited audio files
  • Limited to 5,000 minutes of audio processing
  • Limited to 100 GB of storage
  • Priority support with a 48-hour response time

Business Tier

$99.99/month

Built for users who need large quotas

  • All pro tier features
  • Limited to 25,000 minutes of audio processing
  • Limited to 1 TB of storage
  • Priority support with a 24-hour response time
Get Started
Everything You Need to Know

Frequently Asked Questions

Everything you need to know about ScribeFlow's transcription capabilities, features, and how to get the most out of the platform.

Getting Started

ScribeFlow is a comprehensive transcription platform that converts audio and video content into searchable text. You can import YouTube videos, upload audio files (MP3, WAV, M4A), record directly in your browser, and get AI-powered summaries and notes. Everything is organized in your personal library with powerful search and filtering capabilities.

Sign up for an account and you can start transcribing immediately. Paste a YouTube URL, upload an audio file of up to 128 MB, or record directly in your browser. Choose your preferred Whisper model and language, then let ScribeFlow handle the rest.

ScribeFlow supports all major audio and video formats, including MP3, WAV, M4A, FLAC, OGG, and MP4. You can also import directly from YouTube by pasting any video URL. Files can be up to 128 MB in size.
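If you want to check files before uploading, the format and size limits above can be sketched as a small pre-flight check. This is an illustrative sketch only, not part of ScribeFlow: the function name `check_upload` is hypothetical, the extension list covers only the formats named above (the platform may accept others), and the limit is assumed to mean 128 × 1024 × 1024 bytes.

```python
from pathlib import Path

# Formats named in the answer above; the real list may be longer (assumption).
SUPPORTED_EXTENSIONS = {".mp3", ".wav", ".m4a", ".flac", ".ogg", ".mp4"}

# Assumes "128 MB" means 128 * 1024 * 1024 bytes.
MAX_UPLOAD_BYTES = 128 * 1024 * 1024


def check_upload(filename: str, size_bytes: int) -> list[str]:
    """Return a list of problems; an empty list means the file looks acceptable."""
    problems = []
    ext = Path(filename).suffix.lower()
    if ext not in SUPPORTED_EXTENSIONS:
        problems.append(f"unsupported format: {ext or '(none)'}")
    if size_bytes > MAX_UPLOAD_BYTES:
        problems.append("file exceeds the 128 MB limit")
    return problems
```

A file such as `lecture.MP3` at 40 MB returns an empty list, while a 200 MB `.mp4` returns a single size warning.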

Transcription & Models

Whisper models are AI transcription engines with different accuracy and speed trade-offs. Tiny is fastest for short audio, Medium offers the best balance of speed and accuracy (recommended for English and well-supported languages), and Large V3 provides the highest accuracy for longer content and works significantly better for languages that are less well-supported. We also offer optimized variants like Turbo and Distil models for specific use cases.

ScribeFlow uses OpenAI's Whisper models, which are among the most accurate transcription engines available. Accuracy depends on audio quality, speaker clarity, and the model chosen. The Medium and Large models typically achieve 95%+ accuracy on clear audio. We also include speaker diarization to identify different speakers automatically.

ScribeFlow supports 90+ languages including English, French, Spanish, German, Italian, Portuguese, Russian, Japanese, Chinese, Korean, Arabic, Hindi, and many more. You can either specify the language or let our system auto-detect it for you.

Transcription speed depends on the audio length and model chosen. Typically, a 10-minute audio file takes 1-3 minutes to transcribe with the Medium model. Tiny models are faster (30 seconds to 1 minute), while Large models may take 3-5 minutes but provide higher accuracy.
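To get a rough feel for longer recordings, the figures above can be scaled by audio length. The sketch below assumes processing time grows linearly with duration, which is a simplification and not a guarantee. The function name `estimate_minutes` and the model keys are hypothetical.

```python
# Processing minutes per 10 minutes of audio, taken from the figures quoted above.
PER_TEN_MINUTES = {
    "tiny": (0.5, 1.0),
    "medium": (1.0, 3.0),
    "large": (3.0, 5.0),
}


def estimate_minutes(audio_minutes: float, model: str) -> tuple[float, float]:
    """Return a (low, high) estimate in minutes, assuming linear scaling."""
    low, high = PER_TEN_MINUTES[model.lower()]
    scale = audio_minutes / 10.0
    return (low * scale, high * scale)
```

Under that assumption, a 60-minute recording on a Large model would take roughly 18 to 30 minutes. Actual times will vary with audio quality and server load.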

Features & Capabilities

Yes! Simply paste any YouTube URL and ScribeFlow will automatically extract the audio and transcribe it. This works with individual videos, live streams, and even private videos (if you have access). The original video metadata is preserved in your library.

ScribeFlow automatically generates AI-powered summaries that extract key points, main topics, and important insights from your transcripts. The Notes feature allows you to add your own annotations, comments, and observations about the content. AI summaries are perfect for quickly understanding long recordings, while notes help you capture your thoughts and share highlights with your team.

Absolutely! ScribeFlow includes a built-in browser recorder that lets you capture lectures, meetings, or any audio directly. You can pause, resume, and upload without leaving the page. No additional software or plugins required.

ScribeFlow uses advanced speaker diarization technology to automatically identify and label different speakers in your audio. Each speaker gets a unique identifier (Speaker 1, Speaker 2, etc.) and you can rename them for easier reference. This is especially useful for meetings, interviews, and multi-person recordings.

Library & Organization

All your transcripts, summaries, and notes are stored in your personal library. You can filter by date, language, transcription model, content type, or status. The search function works across all text content, making it easy to find specific topics or quotes across all your recordings.

Yes, ScribeFlow includes sharing capabilities. You can generate shareable links for specific transcripts, allowing others to view the content, transcript, and AI summaries without needing a ScribeFlow account. You maintain full control over what's shared and can revoke access anytime.

Transcript editing capabilities are planned for a future release. Currently, you can view and download your transcripts, but direct editing within ScribeFlow is not yet available. We're working on adding features like inline editing, timestamp adjustments, and speaker label corrections.

Technical & Pricing

Absolutely. ScribeFlow takes privacy seriously. Your audio files and transcripts are encrypted in transit. We don't share your content with third parties, and you maintain full ownership of your data. Audio files are processed securely and can be deleted after transcription if desired.

A comprehensive REST API is planned for a future release. The API will support all major features including file upload, YouTube import, and model selection. Currently, ScribeFlow is available through the web interface only.

ScribeFlow offers different plans with varying limits on monthly transcription minutes, storage, and features. Free accounts include a generous allowance to get started, while paid plans offer higher transcription and storage limits, priority support, and advanced features like custom models and team collaboration.

ScribeFlow is a cloud-based service that requires an internet connection for transcription processing. However, once your content is transcribed, you can download transcripts for offline viewing and editing. We're exploring offline capabilities for future releases.

Start transcribing in seconds

Ready to Get Started?

Transform your audio and video content into searchable transcripts, AI summaries, and organized notes in minutes.

No credit card required
Upload your first file in under 30 seconds
Free tier includes 60 minutes of transcription
Cancel anytime
95%+ Accuracy Rate
90+ Languages Supported
9 Whisper Models Available

Powered by Whisper & Gemma 3 • Runs locally on our servers • No vendor lock-in

✓ YouTube Import • ✓ Browser Recording • ✓ AI Summaries • ✓ Speaker Identification