Whishper

By Pluja
Updated: 27 Jan 2025
2289 Stars

Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by Whisper models!

Overview

Whishper is an open-source, 100% local audio transcription and subtitling suite with a full-featured web UI. It transcribes any media to text, lets you download transcriptions in multiple formats, translates transcriptions into other languages, and includes a powerful subtitle editor. Whishper is fast and runs on both CPU and GPU. Planned features include using a local folder as media input, full-text search, user authentication, audio recording, and non-NVIDIA GPU support. The project is structured into three main pieces: Transcription-API, Whishper-Backend, and Whishper-Frontend.

Features

  • Transcribe any media to text: Supports transcribing audio, video, and more.
  • Download transcriptions in many formats: Offers TXT, JSON, VTT, SRT formats, and allows copying raw text to clipboard.
  • Translate transcriptions: Can translate transcriptions to any language supported by LibreTranslate.
  • Powerful subtitle editor: Includes features like transcription highlighting, CPS warnings, segment splitting, segment insertion, and subtitle language selection.
  • 100% Local: Transcription, translation, and editing happen locally on the user’s machine.
  • Fast: Uses FasterWhisper as the Whisper backend, resulting in faster transcription times, especially on CPUs.
  • Quick and easy setup: Provides a quick start script and simplified setup steps.
  • GPU support: Utilizes NVIDIA GPUs for even faster transcription times.
  • CPU support: Can run on CPUs for users without GPUs.
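To make the export formats and the CPS (characters per second) warnings above more concrete, here is a minimal sketch of how timed segments could be rendered as SRT and checked for reading speed. It is not taken from Whishper's code: the `Segment` class, the function names, and the 20 CPS threshold are all assumptions chosen for illustration. Only the SRT layout itself (index, `HH:MM:SS,mmm --> HH:MM:SS,mmm`, text, blank line) follows the standard format.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """A hypothetical transcription segment (not Whishper's actual data model)."""
    start: float  # start time in seconds
    end: float    # end time in seconds
    text: str


def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp: HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments: list[Segment]) -> str:
    """Render segments as SRT text: numbered cues separated by blank lines."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        timing = f"{srt_timestamp(seg.start)} --> {srt_timestamp(seg.end)}"
        blocks.append(f"{index}\n{timing}\n{seg.text.strip()}\n")
    return "\n".join(blocks)


def chars_per_second(seg: Segment) -> float:
    """Reading speed of a segment; zero-length segments count as infinitely fast."""
    duration = seg.end - seg.start
    if duration <= 0:
        return float("inf")
    return len(seg.text.strip()) / duration


def cps_warnings(segments: list[Segment], max_cps: float = 20.0) -> list[int]:
    """Return 1-based cue numbers whose CPS exceeds max_cps (assumed threshold)."""
    return [
        index
        for index, seg in enumerate(segments, start=1)
        if chars_per_second(seg) > max_cps
    ]


if __name__ == "__main__":
    demo = [
        Segment(0.0, 1.5, "Hello there."),
        Segment(1.5, 2.0, "This line is far too long for half a second."),
    ]
    print(to_srt(demo))
    print("Cues over the CPS limit:", cps_warnings(demo))
```

A cue flagged this way is a natural candidate for the editor's segment splitting: dividing the text across two shorter cues lowers the CPS of each.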

Installation

To install Whishper, follow these steps:

  1. Clone the repository using the command:
     git clone <repository-url>
  2. Install the necessary dependencies.
  3. Set up the Transcription-API by navigating to the transcription-api folder and following the instructions provided in the documentation.
  4. Set up the Whishper-Backend by navigating to the backend folder and following the instructions provided in the documentation.
  5. Set up the Whishper-Frontend by navigating to the frontend folder and following the instructions provided in the documentation.
  6. Configure the LibreTranslate container for translation.
  7. Set up MongoDB as the database for storing transcriptions.
  8. Set up Nginx as the proxy to run the application from a single domain.

Summary

Whishper is an open-source audio transcription and subtitling suite. It lets users transcribe audio and video, download transcriptions in TXT, JSON, VTT, and SRT formats, translate them into other languages, and edit subtitles in a dedicated editor. Transcription, translation, and editing all happen locally on the user's machine. Whishper runs on CPUs and can use NVIDIA GPUs for faster transcription. With a quick setup process and a roadmap of additional features, it is a practical option for local audio transcription and subtitling.