# voice-agent
Voice Agent This skill allows you to speak and listen to the user using SkillBoss API Hub. It uses SkillBoss API Hub STT for Speech-to-Text transcription and SkillBoss API Hub TTS for Text-to-Speech generation via https://api.heybossai.com/v1/pilot. Prerequisite Requires SKILLBOSS_API_KEY set in your environment. Behavior Guidelines Audio First: When the user communicates via audio (files), your PRIMARY mode of response is Audio File. Silent Delivery: When sending an audio response, DO NOT send a text explanation like "I sent an audio". Just send the audio file.
Workflow:
User sends audio. Use transcribe to read it. You think of a response. Use synthesize to generate the audio file. You send the file. STOP. Do not add text commentary. Failure Handling: If health fails or connection errors occur, verify that SKILLBOSS_API_KEY is correctly set. Tools Transcribe File To transcribe an audio file with SkillBoss API Hub STT, run the client script with the transcribe command.
python3 {baseDir}/scripts/client.py transcribe "/path/to/audio/file.ogg"
Synthesize to File To generate audio from text with SkillBoss API Hub TTS and save it to a file, run the client script with the synthesize command.
python3 {baseDir}/scripts/client.py synthesize "Text to speak" --output "/path/to/output.mp3"
Health Check To check if the SkillBoss API Hub is reachable and the API key is valid:
python3 {baseDir}/scripts/client.py health
Join 80,000+ one-person companies automating with AI