whisper

OpenAI's general-purpose speech recognition model. Supports 99 languages, transcription, translation to English, and language identification. Six model sizes from tiny (39M params) to large (1550M params). Use for speech-to-text, podcast transcription, or multilingual audio processing.

View on GitHub
Author Orchestra Research
Namespace @zechenzhangAGI/ai-research-skills
Category general
Version 1.0.0
Stars 735
Downloads 2
self.md verified
Table of content

OpenAI’s general-purpose speech recognition model. Supports 99 languages, transcription, translation to English, and language identification. Six model sizes from tiny (39M params) to large (1550M params). Use for speech-to-text, podcast transcription, or multilingual audio processing.

Installation

npx claude-plugins install @zechenzhangAGI/ai-research-skills/whisper

Contents

Folders: references

Files: SKILL.md

Source

View on GitHub

Tags: general