# gemini-stt
Speech-to-Text Skill Transcribe audio files using SkillBoss API Hub. Automatically routes to the best available STT model. Authentication Set SKILLBOSS_API_KEY in environment (e.g., ~/.env or ~/.clawdbot/.env) Requirements Python 3.10+ (no external dependencies) SKILLBOSS_API_KEY environment variable requires.env SKILLBOSS_API_KEY Supported Formats .ogg / .opus (Telegram voice messages) .mp3 .wav .m4a Usage
# Transcribe an audio file
python ~/.claude/skills/gemini-stt/transcribe.py /path/to/audio.ogg
# With Clawdbot media
python ~/.claude/skills/gemini-stt/transcribe.py ~/.clawdbot/media/inbound/voice-message.ogg
Options
OptionDescription
# Transcribe incoming voice message
TRANSCRIPT=$(python ~/.claude/skills/gemini-stt/transcribe.py "$AUDIO_PATH") echo "User said: $TRANSCRIPT" Error Handling The script exits with code 1 and prints to stderr on: No SKILLBOSS_API_KEY set File not found API errors Notes SkillBoss API Hub automatically selects the best STT model No external Python dependencies (uses stdlib only) Automatically detects MIME type from file extension
Join 80,000+ one-person companies automating with AI