Quick Start

This guide will walk you through importing your first voice agent into Relyable, generating test cases, and running your first automated tests.

Prerequisites: You need an existing Vapi or Retell AI voice agent. If you don’t have one yet, create one first in your platform of choice.

Step 1: Create Your Relyable Account

Visit app.relyable.ai and create your account. You’ll be taken to your workspace dashboard.

Create a Workspace

If you’re starting fresh, you’ll create your first workspace. Think of workspaces as separate environments for different clients or projects.

Step 2: Import Your Voice Agent

Navigate to Agents

In your Relyable dashboard, click on Agents in the top navigation, then click Create New Agent.

Select Your Platform

Choose your AI provider:

Vapi
Retell AI

Get Your Credentials

You’ll need two things from your voice platform:For Vapi:

Go to your Vapi Dashboard
Navigate to Settings → API Keys
Copy your Private Key
Go to your assistant and copy the Assistant ID from the URL or settings

For Retell:

Go to your Retell dashboard
Get your API key from Settings
Copy your agent ID

Import the Agent

Paste your API key and Assistant/Agent ID into Relyable. Click Next.Relyable will automatically:

Import your agent configuration
Pull in your system prompt
Sync your phone number (if configured)
Load your agent settings

Verify Import

You should see your agent’s prompt displayed in Relyable. Review it to make sure everything synced correctly, then click Create Agent.

You can sync your prompt at any time by clicking the Sync Prompt button in your agent settings. This is useful when you update your prompt in Vapi/Retell.

Step 3: Generate Test Cases

Test cases are the criteria Relyable uses to evaluate whether your agent is following your instructions correctly.

Navigate to Test Cases

Click on Test Cases in your agent’s navigation menu.

Generate AI Test Cases

Click Generate Test Cases.Relyable’s AI will analyze your prompt and suggest 15-20 test cases. For example:

“Agent must introduce itself as Emily from Inflate Real Estate”
“Agent must ask for the caller’s full name and spell it back letter by letter”
“Agent must confirm the address by reading it back to the caller”

Review and Customize

Go through each test case and:

Verify it’s accurate for your use case
Adjust the priority level (Critical, High, Medium, Low)
Edit the description if needed
Add custom test cases for specific scenarios

Critical test cases have the biggest impact on your score. Use this priority for must-have behaviors like emergency call routing or compliance requirements.

Example Test Cases

Here are some examples of well-written test cases: Critical Priority:

“Agent must transfer emergency calls to a human operator within 10 seconds”
“Agent must capture and confirm the customer’s email address before ending the call”

High Priority:

“Agent must spell back the customer’s name letter by letter for confirmation”
“Agent must follow the conversation flow step-by-step without skipping questions”

Medium Priority:

“Agent should maintain a friendly and professional tone throughout the call”
“Agent should handle interruptions gracefully and not lose context”

Low Priority:

“Agent should use the caller’s name at least once during the conversation”

Step 4: Create a Persona

Personas simulate different types of callers to stress-test your agent with realistic scenarios.

Navigate to Personas

Go to the Simulation section and click on the Personas tab.

Generate a Persona

Click Generate Personality and describe the type of caller you want to simulate.Examples:

“A 65-year-old male who is frustrated and has no patience”
“A 28-year-old female professional who speaks quickly and is tech-savvy”
“A non-native English speaker with a strong accent who speaks slowly”

Review the Generated Persona

Relyable will create a detailed persona with:

Name and demographics
Personality traits
Communication style
Speaking patterns
Accent/voice characteristics

You can edit any of these details to fine-tune the persona.

Create 3-5 diverse personas representing your actual customer base. This helps you discover issues across different user types.

Step 5: Run Your First Test

Create a Scenario

Go to the Run tab in the Simulation section.Click Generate Tests and:

Select a persona (like the one you just created)
Select which test cases to evaluate
Let AI generate a realistic scenario, or write your own

Example AI-generated scenario:

“You are Stanley Miller, a 79-year-old retired mechanic. Your kids are forcing you to sell your house you’ve lived in for 50 years because it’s getting too hard to manage. You’re calling the agency reluctantly and annoyed by the whole process.”

Run the Test

Select your scenario(s) and click Run Test.You can:

Run a single scenario multiple times
Run multiple different scenarios at once
Give your test run a descriptive name

Wait for Results

Relyable will make actual phone calls to your agent using the personas and scenarios. This typically takes 2-3 minutes per call.You can see the progress in real-time on the Results page.

Step 6: Analyze Your Results

View Your Score

Once testing completes, you’ll see:

Overall Score (aim for 70%+ for production-ready)
Average Latency (response time)
Words Per Minute (speaking speed)
Test Case Results (which ones passed/failed)

Review Failed Test Cases

Click on any failed test case to see:

The exact conversation where it failed
Why it failed (AI explanation)
Suggestions for fixing it

For example:

“Agent asked ‘Would you like to book a walkthrough?’ but skipped the required question ‘What is your timeline for moving in?’ The prompt requires asking about the timeline BEFORE offering the walkthrough.”

Listen to Recordings

Click on any conversation to:

Listen to the full audio recording
Read the full transcript
See which test cases passed/failed in that specific call

Step 7: Iterate and Improve

Fix Your Prompt

Based on the failures, update your prompt in Vapi or Retell. Common fixes:

Add step-by-step numbering to enforce question order
Add explicit confirmation steps (repeat name, address back)
Add handling for edge cases discovered in testing

Sync to Relyable

After updating your prompt in Vapi/Retell, go to your agent in Relyable and click Sync Prompt to pull the latest version.

Test Again

Run your tests again with the same scenarios to see if your score improved. Keep iterating until you consistently score 70%+.

Pro tip: Don’t try to fix everything at once. Pick the 2-3 most critical failures, fix those, test again, and repeat. This helps you isolate which changes actually improve performance.

Step 8: Enable Live Monitoring (Optional)

Once you’re happy with your test scores, enable live monitoring for production calls:

Enable Call Monitoring

Go to your agent’s Settings and toggle Enable Call Monitoring.This adds a webhook to your Vapi/Retell agent that sends call data to Relyable.

Monitor Real Calls

Every real call to your agent will now be:

Evaluated against your test cases
Scored automatically
Visible in your Relyable dashboard under the Monitoring tab

Get Alerted

Set up alerts for critical failures so you know immediately if your agent starts having issues in production.

What’s a Good Score?

Here’s what scores typically mean:

Score	Status	What It Means
90%+	Excellent	Production-ready, minimal failures
70-89%	Good	Acceptable for production with monitoring
50-69%	Fair	Needs improvement before production
Below 50%	Poor	Significant issues, not production-ready

In the demonstration from the video, an agent that worked perfectly in manual testing scored only 51-61% in automated testing. Don’t trust manual testing alone!

Next Steps

Deep Dive Development Guide

Learn advanced prompt engineering and testing strategies

API Integration

Automate testing with the Relyable API

Best Practices

Learn from production voice AI experts

Join Community

Connect with other Relyable users

Common Issues

My agent won't import

Make sure you’re using the correct API key and Assistant/Agent ID. For Vapi, you need the Private Key (not Public Key). Check that your agent is published in your voice platform.

Test calls aren't working

Verify that your agent has a phone number configured in Vapi/Retell. Relyable needs a phone number to call for testing.

Scores seem too low

This is normal! Most agents that seem to work fine in manual testing score 50-60% initially. This is why automated testing is so valuable - it finds issues you wouldn’t discover manually.

How do I improve my score?

Focus on test cases marked as Critical or High priority first. Read the AI explanations for why tests failed, update your prompt, sync, and test again. It’s an iterative process.

Need Help?

Check out the full development guide for detailed prompt engineering tips
Review the API documentation for programmatic access
Reach out on X/Twitter @relyableai

Get Started

API Documentation

Agents

Personalities

Evaluations

Runs

Webhooks

Step 1: Create Your Relyable Account

Step 2: Import Your Voice Agent

Step 3: Generate Test Cases

Example Test Cases

Step 4: Create a Persona

Step 5: Run Your First Test

Step 6: Analyze Your Results

Step 7: Iterate and Improve

Step 8: Enable Live Monitoring (Optional)

What’s a Good Score?

Next Steps

Deep Dive Development Guide

API Integration

Best Practices

Join Community

Common Issues

Need Help?

Get Started

API Documentation

Agents

Personalities

Evaluations

Runs

Webhooks

​Step 1: Create Your Relyable Account

​Step 2: Import Your Voice Agent

​Step 3: Generate Test Cases

​Example Test Cases

​Step 4: Create a Persona

​Step 5: Run Your First Test

​Step 6: Analyze Your Results

​Step 7: Iterate and Improve

​Step 8: Enable Live Monitoring (Optional)

​What’s a Good Score?

​Next Steps

Deep Dive Development Guide

API Integration

Best Practices

Join Community

​Common Issues

​Need Help?

Step 1: Create Your Relyable Account

Step 2: Import Your Voice Agent

Step 3: Generate Test Cases

Example Test Cases

Step 4: Create a Persona

Step 5: Run Your First Test

Step 6: Analyze Your Results

Step 7: Iterate and Improve

Step 8: Enable Live Monitoring (Optional)

What’s a Good Score?

Next Steps

Common Issues

Need Help?