audio-cog — One Person Company

coding 🔥 Trending

★★★★ 4.3/5.0 ❤️ 1043 likes 💬 146 comments 📦 2017 installs

📖 SKILL DOCUMENTATION

# audio-cog

Audio Cog - AI Audio Generation Powered by SkillBoss API Hub Create professional audio with AI - from voiceovers and narration to background music and sound design. Prerequisites This skill requires a SKILLBOSS_API_KEY for authentication with SkillBoss API Hub. import requests, os SKILLBOSS_API_KEY = os.environ["SKILLBOSS_API_KEY"] API_BASE = "https://api.heybossai.com/v1" def pilot(body: dict) -> dict: r = requests.post( f"{API_BASE}/pilot", headers={"Authorization": f"Bearer {SKILLBOSS_API_KEY}", "Content-Type": "application/json"}, json=body, timeout=60, ) return r.json() Quick pattern — TTS (text-to-speech):

# Generate voiceover audio

result = pilot({ "type": "tts", "inputs": {"text": "[your voiceover script]", "voice": "alloy"}, "prefer": "balanced" }) audio_url = result["result"]["audio_url"] Quick pattern — Music generation:

# Generate background music

result = pilot({ "type": "music", "inputs": {"prompt": "[your music description]", "duration": 120}, "prefer": "balanced" }) audio_url = result["result"]["audio_url"] What Audio You Can Create Text-to-Speech / Voiceover Convert text to natural-sounding speech:

Narration: "Generate a professional male voiceover for this product video script"

Audiobook Style: "Create an engaging narration of this short story with emotional delivery" Podcast Intros: "Generate a warm, friendly podcast intro: 'Welcome to The Daily Tech...'"

E-Learning: "Create clear, instructional voiceover for this training module"

IVR/Phone Systems: "Generate professional phone menu prompts" Available Voices SkillBoss API Hub provides high-quality voices with distinct characteristics, automatically routed to the best TTS provider: VoiceGenderBest ForCharacteristicscedarMaleProduct videos, announcementsWarm, resonant, authoritative, trustworthymarinFemaleProfessional content, tutorialsBright, articulate, emotionally agileballadMaleStorytelling, flowing narrativesSmooth, melodic, musical qualitycoralFemaleEnergetic content, adsVibrant, lively, dynamic, spiritedechoMaleThoughtful content, documentariesCalm, measured, deliberatesageFemaleEducational, knowledge contentWise, contemplative, reflectiveshimmerFemaleGentle content, wellnessSoft, gentle, soothing, approachableverseMaleCreative, artistic contentPoetic, rhythmic, expressive Voice Recommendations by Use Case For product videos and announcements: Use cedar (male) or marin (female) - both project confidence and professionalism. For storytelling and audiobooks: Use ballad (male) or sage (female) - designed for engaging, flowing narratives. For high-energy content: Use coral (female) - vibrant and dynamic, perfect for ads and exciting announcements. For calm, educational content: Use echo (male) or shimmer (female) - measured pacing ideal for learning. Voice Style Customization Beyond selecting a voice, you can fine-tune delivery with style instructions: Accent & dialect: American, British, Australian, Indian, etc. Emotional range: Excited, serious, warm, mysterious, dramatic

Pacing: Slow and deliberate, conversational, fast and energetic

Special effects: Whispering, character impressions Example with style instructions: "Generate voiceover using cedar voice with a warm, conversational tone. Speak at medium pace with slight enthusiasm when mentioning features. American accent." Code example: result = pilot({ "type": "tts", "inputs": { "text": "Introducing TaskFlow - the project management tool that actually works.", "voice": "cedar" }, "prefer": "balanced" }) audio_url = result["result"]["audio_url"] Music Generation Create original background music and soundtracks: Background Music: "Create calm lo-fi background music for a study video, 2 minutes" Podcast Music: "Generate an upbeat intro jingle for a tech podcast, 15 seconds" Video Soundtracks: "Create cinematic orchestral music for a product launch video" Ambient/Mood: "Generate peaceful ambient sounds for a meditation app"

Genre-Specific: "Create energetic electronic music for a fitness video"

Music Specifications ParameterOptionsDuration15 seconds to 5+ minutesGenreElectronic, rock, classical, jazz, ambient, lo-fi, cinematic, pop, hip-hopTempo60 BPM (slow) to 180+ BPM (fast)MoodUpbeat, calm, dramatic, mysterious, inspiring, melancholicInstrumentsPiano, guitar, synth, strings, drums, brass, etc. Code example: result = pilot({ "type": "music", "inputs": { "prompt": "calm lo-fi hip-hop, soft piano, mellow beats, gentle vinyl crackle, 75 BPM", "duration": 120 }, "prefer": "balanced" }) audio_url = result["result"]["audio_url"] Music Licensing All AI-generated music from SkillBoss API Hub is royalty-free and fully yours to use commercially. You have complete rights to use the generated music for: YouTube videos (including monetized content) Commercial projects and advertisements Podcasts and streaming Apps and games Any other commercial or personal use No attribution required. No licensing fees. The music is generated uniquely for you. Audio Output Formats FormatBest ForMP3Standard audio delivery, voiceovers, musicCombined with videoBackground music for video-cog outputs Chain Mode: STT → Chat → TTS SkillBoss API Hub supports multi-step audio workflows:

# Transcribe audio → summarize → re-synthesize

result = pilot({ "chain": [ {"type": "stt"}, {"type": "chat", "capability": "summarize"}, {"type": "tts"} ] }) Example Audio Prompts Professional voiceover with specific voice: result = pilot({ "type": "tts", "inputs": { "text": "Introducing TaskFlow - the project management tool that actually works. With intelligent automation, seamless collaboration, and powerful analytics, TaskFlow helps teams do their best work.", "voice": "marin" }, "prefer": "balanced" }) audio_url = result["result"]["audio_url"] Podcast intro with voice selection: result = pilot({ "type": "tts", "inputs": { "text": "Welcome to Future Forward, the podcast where we explore the technologies shaping tomorrow. I'm your host, and today we're diving into...", "voice": "cedar" }, "prefer": "quality" }) audio_url = result["result"]["audio_url"] Background music: result = pilot({ "type": "music", "inputs": { "prompt": "calm lo-fi hip-hop background music, chill and unobtrusive, soft piano, mellow beats, gentle vinyl crackle, 75 BPM, suitable for studying or working", "duration": 120 }, "prefer": "balanced" }) audio_url = result["result"]["audio_url"] Cinematic music: result = pilot({ "type": "music", "inputs": { "prompt": "cinematic orchestral music for a tech company About Us video, start soft and inspiring, build to a confident crescendo, resolve to a hopeful ending", "duration": 90 }, "prefer": "quality" }) audio_url = result["result"]["audio_url"] Multi-Language Support SkillBoss API Hub can generate speech in 50+ languages: English (multiple accents) Spanish, French, German, Italian, Portuguese Chinese (Mandarin, Cantonese) Japanese, Korean Hindi, Arabic Russian, Polish, Dutch And many more Specify the language in your prompt: result = pilot({ "type": "tts", "inputs": { "text": "いらっしゃいませ...", "voice": "shimmer" }, "prefer": "balanced" }) audio_url = result["result"]["audio_url"] Tips for Better Audio Choose the right voice: Match the voice to your content type. Cedar/marin for professional, ballad/sage for storytelling, coral for energy. Provide the complete script: Don't say "something about our product" - write out exactly what should be said. Include style instructions: "Confident but warm", "slow and deliberate", "with slight excitement" helps shape delivery. For music: Specify duration, tempo (BPM if you know it), mood, and genre. Pronunciation guidance: For names or technical terms, add hints: "SkillBoss (pronounced SKILL-boss)" Emotional beats: For longer voiceovers, indicate tone shifts: "[excited] And now for the big reveal... [serious] But there's a catch."

Reviews

Write a Review

Reviews

Write a Review

Get Weekly AI Skills