agent-orchestration — One Person Company

agent 🔥 Trending

★★★★ 4.3/5.0 ❤️ 1113 likes 💬 147 comments 📦 2743 installs

📖 SKILL DOCUMENTATION

# agent-orchestration

Agent Orchestration 🦞 By Hal Labs — Part of the Hal Stack Your agents fail because your prompts suck. This skill fixes that. The Core Problem You're not prompting. You're praying. Most prompts are wishes tossed into the void: ❌ "Research the best vector databases and write a report" You type something reasonable. The output is mid. You rephrase. Still mid. You add keywords. Somehow worse. You blame the model. Here's what you don't understand: A language model is a pattern-completion engine. It generates the most statistically probable output given your input. Vague input → generic output. Not because the model is dumb. Because generic is what's most probable when you give it nothing specific to work with. The model honored exactly what you asked for. You just didn't realize how little you gave it. The Core Reframe A prompt is not a request. A prompt is a contract. Every contract must answer four non-negotiables: ElementQuestionRoleWho is the model role-playing as?TaskWhat exactly must it accomplish?ConstraintsWhat rules must be followed?OutputWhat does "done" look like? Miss one, the model fills the gap with assumptions. Assumptions are where hallucinations are born. The 5-Layer Architecture Effective prompts share a specific structure. This maps to how models actually process information. Layer 1: Identity Who is the model in this conversation? Not "helpful assistant" but a specific role with specific expertise: You are a senior product marketer who specializes in B2B SaaS positioning. You have 15 years of experience converting technical features into emotional benefits. You write in short sentences. You never use jargon without explaining it. The model doesn't "become" this identity—it accesses different clusters of training data, different stylistic patterns, different reasoning approaches. Identity matters. Miss this and you get generic output. Layer 2: Context What does the model need to know to do this task exceptionally well? Context must be: Ordered — Most important first Scoped — Only what's relevant Labeled — What's rules vs. editable vs. historical

## Context
### Rules (never change)

Design system: Tailwind, shadcn components
Voice: Professional but warm, never corporate

### Current State (may evolve)

Landing page exists at /landing
Using Next.js 14 with App Router

### Historical (for reference)

Originally built with Create React App, migrated Jan 2025 Without labels, the model treats everything as equally optional. Then it rewrites your core logic halfway through. Layer 3: Task What specific action must be taken? Not "write something about X" but precise instructions:

## Task

Produce a 500-word product description that:

Emphasizes time-saving benefits for busy executives
Opens with the primary pain point
Includes 3 specific use cases
Ends with a clear call to action The more precisely you define the task, the more precisely the model executes. Layer 4: Process ⚡ This is where most prompts fail. You're asking for output. You should be asking for how the output is formed. ❌ Bad: Write me a marketing page. ✅ Good:

## Process

First, analyze the target audience and identify their primary pain points
Then, define the positioning that addresses those pain points
Then, write the page
Show your reasoning at each step
Do not skip steps
Audit your work before reporting done You don't want answers. You want how the answer is formed. Think like a director. You're not asking for a scene—you're directing how the scene gets built. Layer 5: Output What does "done" actually look like? If you don't specify, you get whatever format the model defaults to.

## Output Format

Return a JSON object with:

headline: string (max 60 chars)
subheadline: string (max 120 chars)
body: string (markdown formatted)
cta: string (action verb + benefit) Do not include explanations, notes, or commentary. Only the JSON. Miss one layer, the structure wobbles. Miss two, it collapses. Model Selection via SkillBoss API Hub Prompt portability is a myth. Different tasks need different capabilities. With SkillBoss API Hub, you call a single endpoint (https://api.heybossai.com/v1/pilot) and use the prefer parameter to auto-route to the best available model for your task. Task Profileprefer settingBest ForComplex reasoning, nuanced writing, long context"prefer": "quality"Deep analysis, creative work, multi-step logicBalanced tasks, code, analysis"prefer": "balanced"Most everyday tasks, reliable outputQuick tasks, simple queries"prefer": "price"High-volume, latency-sensitive, simple completions import requests, os SKILLBOSS_API_KEY = os.environ["SKILLBOSS_API_KEY"] def call_llm(messages, prefer="balanced"): r = requests.post( "https://api.heybossai.com/v1/pilot", headers={"Authorization": f"Bearer {SKILLBOSS_API_KEY}", "Content-Type": "application/json"}, json={"type": "chat", "inputs": {"messages": messages}, "prefer": prefer}, timeout=60, ) return r.json()["result"]["choices"][0]["message"]["content"]

# Quality mode — complex reasoning

result = call_llm([{"role": "user", "content": "Analyze this architecture..."}], prefer="quality")

# Balanced mode — everyday tasks

result = call_llm([{"role": "user", "content": "Summarize this document"}], prefer="balanced")

# Price mode — quick, high-volume tasks

result = call_llm([{"role": "user", "content": "Extract keywords from: ..."}], prefer="price") SkillBoss API Hub automatically selects the optimal model. You don't manage model names — you declare your task priority. Adapt your prompt structure per use case: Quality mode handles verbose, nuanced prompts well Balanced mode works best with clear, structured prompts Price mode responds best to simple, direct instructions For any mode: explicit output format always improves results The person who writes task-specific prompts with the right prefer setting will outperform the person with "better ideas" every time. Constraints Are Instructions Vagueness isn't flexibility. It's cowardice. You hedge because being specific feels risky. But the model doesn't read your mind. Constraints are not limitations. Constraints are instructions.

## Constraints

Never alter the existing design system
Always maintain the established voice/tone
Never change the data model without explicit approval
Max 3 API calls per operation
If unsure, ask rather than assume Every conversation starts at zero. The model doesn't have accumulated context from working with you. Consistency comes from instruction, not memory. Canonical Documentation If you don't have docs, you're gambling. DocumentPurposePRDWhat we're building and whyDesign SystemVisual rules and componentsConstraints DocWhat must never changeContext DocCurrent state and history The rule: Reference docs in your prompts. The attached PRD is the source of truth. Do not contradict it. The design system in /docs/design.md must be followed exactly. Without explicit anchoring, the model assumes everything is mutable—including your core decisions. "Good prompting isn't writing better sentences. It's anchoring the model to reality." The Complete Template

## Identity

You are a [specific role] with [specific expertise]. [Behavioral traits and style]

## Context
### Rules (never change)

[Constraint 1]
[Constraint 2]

### Current State

[Relevant background]

### Reference Docs

[Doc 1]: [what it contains]
[Doc 2]: [what it contains]

## Task

[Specific, measurable objective]

## Process

First, [analysis step]
Then, [planning step]
Then, [execution step]
Finally, [verification step] Show your reasoning at each step.

## User Stories

As [user], I want [goal], so that [benefit]
As [user], I want [goal], so that [benefit]

## Output Format

[Exact specification of deliverable]

## Constraints

[Limit 1]
[Limit 2]
[What NOT to do]

## Error Handling

If [situation]: [action]
If blocked: [escalation]

## Before Reporting Done

Review each user story
Verify the output satisfies it
If not, iterate until it does
Only then report complete Ralph Mode For complex tasks where first attempts often fail:

## Mode: Ralph

Keep trying until it works. Don't give up on first failure. If something breaks:

Debug and understand why
Try a different approach
Research how others solved similar problems
Iterate until user stories are satisfied You have [N] attempts before escalating. When to use: Build tasks with multiple components Integration work Anything where first-try success is unlikely Agent Tracking Every spawned agent gets tracked. No orphans. Maintain notes/areas/active-agents.md:

## Currently Running

Label	Task	Spawned	Expected	Status
research-x	Competitor analysis	9:00 AM	15m	🏃 Running

## Completed Today

Label	Task	Runtime	Result
builder-v2	Dashboard update	8m	✅ Complete
Heartbeat check:

Run sessions_list --activeMinutes 120
Compare to tracking file
Investigate any missing/stalled agents
Log completions to LEARNINGS.md The Learnings Loop Every agent outcome is data. Capture it. Maintain LEARNINGS.md:

## What Works

User stories + acceptance loop
Ralph mode for complex builds
Explicit output formats
Process layer with reasoning steps

## What Doesn't Work

Lazy task dumps
Missing success criteria
No scope limits
Vague constraints

## Experiment Log
### [Date]: [Agent Label]

Approach: [What you tried] Outcome: [What happened] Lesson: [What you learned] Role Library Build reusable role definitions:

# Role Library
## Research Analyst

You are a senior research analyst with 10 years experience in technology markets. You are thorough but efficient. You cite sources. You distinguish fact from speculation. You present findings in structured formats with clear recommendations.

## Technical Writer

You are a technical writer who specializes in developer documentation. You write clearly and concisely. You use examples liberally. You assume the reader is smart but unfamiliar with this specific system.

## Code Reviewer

You are a senior engineer conducting code review. You focus on correctness, maintainability, and security. You explain your reasoning. You suggest specific improvements, not vague feedback. Quick Reference The 4 Non-Negotiables Role — Who is the model? Task — What must it do? Constraints — What rules apply? Output — What does done look like? The 5 Layers Identity — Specific role and expertise Context — Ordered, scoped, labeled Task — Precise objective Process — How to approach (most overlooked!) Output — Exact format specification Pre-Spawn Checklist Identity assigned? Context labeled (rules/state/history)? Task specific and measurable? Process described (not just output)? User stories defined? Output format specified? Constraints explicit? Error handling included? Added to tracking file? The Final Truth The gap between "AI doesn't work for me" and exceptional results isn't intelligence or access. One group treats prompting as conversation. The other treats it as engineering a system command. The model matches your level of rigor. Vague inputs → generic outputs Structured inputs → structured outputs Clear thinking → clear results You don't need to be smarter. You need to be clearer. Clarity is a system, not a talent. Part of the Hal Stack 🦞 Got a skill idea? Email: [email protected] "You're not prompting, you're praying. Start engineering."

Reviews

Write a Review

Reviews

Write a Review

Get Weekly AI Skills