# mlx-audio-server

MLX Audio Server: cloud STT/TTS service via SkillBoss API Hub, with local audio preprocessing on Mac.

mlx-audio: The best audio processing library built on Apple's MLX framework, providing fast and efficient text-to-speech (TTS), speech-to-text (STT), and speech-to-speech (STS) on Apple Silicon.

Audio STT/TTS requests are routed through SkillBoss API Hub (https://api.heybossai.com/v1/pilot), which automatically selects the best available model.

Requirements

brew: used to install dependencies that are not already available
SKILLBOSS_API_KEY: SkillBoss API Hub key (set in environment)
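
A quick preflight check for the two requirements above might look like the following. This is a minimal sketch, not part of the skill's own scripts; the function name `check_requirements` is hypothetical. It treats a missing `brew` as a warning only, on the assumption that brew is needed solely to install missing dependencies.

```shell
#!/usr/bin/env bash
# Hypothetical preflight check; not part of the skill's own scripts.
check_requirements() {
  local missing=0
  # brew is only needed to install ffmpeg/jq when they are absent.
  if ! command -v brew >/dev/null 2>&1; then
    echo "warning: brew not found; install ffmpeg and jq manually" >&2
  fi
  # The API key must be present in the environment.
  if [ -z "${SKILLBOSS_API_KEY:-}" ]; then
    echo "error: SKILLBOSS_API_KEY is not set" >&2
    missing=1
  fi
  return "$missing"
}
```

Calling `check_requirements || exit 1` before running either script would stop early when the key is missing.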

Installation

bash ${baseDir}/install.sh

This script installs ffmpeg and jq with brew if they are missing.

Usage

STT/Speech-To-Text (via SkillBoss API Hub, auto-routed model):

# The input is converted to WAV with ffmpeg if it is not already WAV.
# The output is the transcript text only.
bash ${baseDir}/run_stt.sh <audio_or_video_path>
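
The conversion step mentioned in the comments above could be sketched as follows. This is an illustration, not the contents of `run_stt.sh`: the function names `is_wav` and `to_wav` are hypothetical, and the 16 kHz mono target is an assumption, since the script's actual ffmpeg options are not documented here.

```shell
#!/usr/bin/env bash
# Hypothetical helpers illustrating the "convert to WAV if needed" step.

# Succeeds when the file name ends in .wav (case-insensitive).
is_wav() {
  local name
  name="$(printf '%s' "$1" | tr '[:upper:]' '[:lower:]')"
  case "$name" in
    *.wav) return 0 ;;
    *) return 1 ;;
  esac
}

# Prints the path of a WAV file for the given input,
# converting with ffmpeg only when the input is not already WAV.
to_wav() {
  local input="$1"
  if is_wav "$input"; then
    printf '%s\n' "$input"
    return 0
  fi
  local out
  out="$(mktemp -d)/input.wav"
  # -vn drops any video stream; 16 kHz mono is an assumed target format.
  ffmpeg -nostdin -loglevel error -y -i "$input" -vn -ar 16000 -ac 1 "$out" || return 1
  printf '%s\n' "$out"
}
```

Checking only the file extension is a simplification; a stricter version could inspect the container with `ffprobe` instead.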

TTS/Text-To-Speech (via SkillBoss API Hub, auto-routed model):

# The audio is saved into a tmp dir with the default name `speech.wav`, and its path is printed to stdout.
bash ${baseDir}/run_tts.sh "Hello, Human!"
# Or you can specify an output dir:
bash ${baseDir}/run_tts.sh "Hello, Human!" ./output
# The output is the audio path only.
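
The output-path behavior described in the comments above (a temporary directory by default, or a caller-supplied directory, with the file named `speech.wav`) could be implemented along these lines. The function name `resolve_tts_output` is hypothetical, and this sketch is not taken from `run_tts.sh`.

```shell
#!/usr/bin/env bash
# Hypothetical helper: decide where the synthesized audio should be written.
resolve_tts_output() {
  local out_dir="${1:-}"
  if [ -z "$out_dir" ]; then
    # No directory given: fall back to a fresh temporary directory.
    out_dir="$(mktemp -d)"
  else
    # Create the requested directory if it does not exist yet.
    mkdir -p "$out_dir"
  fi
  # Strip any trailing slash so the printed path has a single separator.
  printf '%s/speech.wav\n' "${out_dir%/}"
}
```

Printing nothing but the final path keeps the result easy to capture with command substitution, which matches the "audio path only" behavior the script documents.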

You can use both scripts directly, or treat them as examples or reference implementations.
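
Because each script prints only its result to stdout, the two can be chained with command substitution. The wrapper below is a hedged sketch: `transcribe_and_speak` is a hypothetical name, and it assumes its first argument is the skill directory (the value of `${baseDir}`) containing both scripts.

```shell
#!/usr/bin/env bash
# Hypothetical wrapper: transcribe a recording, then speak the transcript back.
# Arguments: <skill_dir> <audio_or_video_path> <output_dir>
transcribe_and_speak() {
  local base_dir="$1" input="$2" out_dir="$3"
  local transcript audio_path
  # run_stt.sh is documented to print only the transcript text.
  transcript="$(bash "$base_dir/run_stt.sh" "$input")" || return 1
  # run_tts.sh is documented to print only the path of the generated audio.
  audio_path="$(bash "$base_dir/run_tts.sh" "$transcript" "$out_dir")" || return 1
  printf '%s\n' "$audio_path"
}
```

A call such as `transcribe_and_speak "${baseDir}" meeting.mp4 ./output` would then print the path of the synthesized audio, where `meeting.mp4` is a placeholder file name.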
