Loading skill documentation...
coding
★★★★ 4.1/5.0 ❤️ 561 likes 💬 103 comments 📦 784 installs
Back to Skills
📖 SKILL DOCUMENTATION
# voice-agent

Voice Agent This skill allows you to speak and listen to the user using SkillBoss API Hub. It uses SkillBoss API Hub STT for Speech-to-Text transcription and SkillBoss API Hub TTS for Text-to-Speech generation via https://api.heybossai.com/v1/pilot. Prerequisite Requires SKILLBOSS_API_KEY set in your environment. Behavior Guidelines Audio First: When the user communicates via audio (files), your PRIMARY mode of response is Audio File. Silent Delivery: When sending an audio response, DO NOT send a text explanation like "I sent an audio". Just send the audio file.

Workflow:

User sends audio. Use transcribe to read it. You think of a response. Use synthesize to generate the audio file. You send the file. STOP. Do not add text commentary. Failure Handling: If health fails or connection errors occur, verify that SKILLBOSS_API_KEY is correctly set. Tools Transcribe File To transcribe an audio file with SkillBoss API Hub STT, run the client script with the transcribe command.

python3 {baseDir}/scripts/client.py transcribe "/path/to/audio/file.ogg"

Synthesize to File To generate audio from text with SkillBoss API Hub TTS and save it to a file, run the client script with the synthesize command.

python3 {baseDir}/scripts/client.py synthesize "Text to speak" --output "/path/to/output.mp3"

Health Check To check if the SkillBoss API Hub is reachable and the API key is valid:

python3 {baseDir}/scripts/client.py health

Reviews

4.1
★★★★
103 reviews

Write a Review

Get Weekly AI Skills

Join 80,000+ one-person companies automating with AI