Add cloud-based Whisper support for transcription #111

psmoros · 2025-03-09T18:27:37Z

This PR adds support for OpenAI's cloud-based Whisper API as an alternative to the local Whisper model. This enhancement provides several benefits:

Key Features:

Cloud-based Whisper integration via OpenAI's API
Accurate word-level timestamp generation for better animation synchronization
Compatible with ARM64 architectures (e.g., Apple Silicon)
No need to install large local Whisper models

Implementation Details:

Added use_cloud_whisper flag to enable cloud-based transcription
Created custom entry point script (manim_cloud_whisper.py) for easy usage
Updated SpeechService class to handle cloud-based transcription
Added environment variable support for configuration
Improved word boundary processing for accurate timing

Usage:
Users can enable cloud-based Whisper in three ways:

Using the custom entry point: python manim_cloud_whisper.py
Setting environment variable: MANIM_VOICEOVER_USE_CLOUD_WHISPER=1
Programmatically: use_cloud_whisper=True in OpenAIService

Example:
Added a comprehensive linear regression demo showcasing the cloud-based Whisper functionality with synchronized voiceovers and animations.

Note: Requires OpenAI API key to be set in environment variables.

psmoros added 3 commits March 9, 2025 17:40

feat: Add cloud-based Whisper support for ARM64 architectures

71f999f

cloud whisper works

3b51312

example

dfe2ad0

psmoros requested a review from osolmaz as a code owner March 9, 2025 18:27

psmoros closed this Mar 9, 2025

psmoros added 3 commits March 9, 2025 19:45

whisper cloud on by default

d1d97e0

robust whisper check

af08226

whisper cloud default

5c78d82

psmoros reopened this Mar 10, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add cloud-based Whisper support for transcription #111

Add cloud-based Whisper support for transcription #111

Uh oh!

psmoros commented Mar 9, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Add cloud-based Whisper support for transcription #111

Are you sure you want to change the base?

Add cloud-based Whisper support for transcription #111

Uh oh!

Conversation

psmoros commented Mar 9, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant