AI Agents Prototype · Dapper × SoluteLabs

Section 01 / What we understood

What the brief asked for.

Three observations from the brief that shape everything below.

1.1 An operating layer, not a chatbot

Dapper's brief is precise: an AI-powered layer that removes friction between physicians, patients, pharmacies, and fulfillment. One coherent surface, met in whichever channel fits the moment.

1.2 Three audiences, three different shapes

A1 Physician

Prescribing assistant

Formulary, Rx generation, EHR pull, prior auth

Highest clinical stakes, deepest integrations. Strategically the most valuable, and the most expensive to prototype credibly.

A2 Patient

Care companion · prototype slice

Medication Q&A, onboarding, refills, escalation

Largest audience, most visible value, shallowest integration. Demoable on synthetic data and a curated KB.

A3 Front Desk

Operations enablement · prototype slice

SOP Q&A, account walkthroughs, Rx and shipping lookups

Lowest clinical risk, RAG-heavy, internal. Shares the same brain and KB as the Patient surface.

1.3 The constraint

A $15-25K prototype budget covers six to ten weeks for a senior pair. Enough for a credible demo across two surfaces and three modalities. Not enough to prototype the physician surface honestly, because the valuable capabilities live behind EHR integration.

Section 02 / The bounded slice

What fits the budget.

Two agents, three modalities, one conversational brain. The Physician surface is deferred to Phase 2 on purpose.

2.1 In scope

Two agents on one brain and one knowledge base, served through a Next.js web app with text, voice, and avatar. Patient handles Q&A, onboarding, refill and shipment lookups on mock data, education, and in-app escalation. Front Desk handles SOPs, account walkthroughs, and the same lookups for staff. Same voice, same avatar, same brand across modalities.

2.2 Out of scope, on purpose

Phase 2 territory

Physician agent. Rx generation, EHR/Epic, and prior auth need real integration and clinical-grade evals.
EHR, payments, insurance, FSA/HSA. Each is its own project. Mock data carries the demo.
Auth, HIPAA BAAs, PHI. Synthetic data behind a password-protected URL.
Email, SMS, real CareTeam routing. Escalation is an in-app UX state in the prototype.
Observability, eval harness, prompt evaluation. Console logs only at prototype quality.
Voice cloning of Dapper staff. Library voice or Voice Design; cloning needs consent and a recording session.

2.3 What "prototype" means

A working web app, two agents, three modalities, real RAG over real content, password-protected URL. Credible enough to put in front of physicians, clinics, and investors. Productionization cost is called out item by item above, not folded into the price.

Section 03 / Use Case 1

The Patient agent.

Thirty-plus patient capabilities in the brief, narrowed to a focused slice. The agent is the same one whether the patient types, talks, or sees a face.

3.1 The problem

A Dapper patient on a GLP-1 or HRT protocol has dozens of small questions every week. Where is my shipment? What is this side effect? Can I take this with coffee? How do I auto-refill? Each one currently stalls the patient or lands on a human.

3.2 Capabilities in the prototype

Education	Medication explainers in plain language. GLP-1, HRT, peptides. Timelines, dosing, side effects.
Onboarding	Account, delivery, payment, tracking, answered conversationally. Scripted walkthroughs.
Refill & shipment	Read-only lookups on mock data. "When does my refill ship?" Demonstrates the shape without production wiring.
Plain-language labs	Paste a result, get a non-clinical explanation with "talk to your provider" guardrails.
Escalation	An honest handoff. Sensitivity, frustration, or "talk to a human" swaps the UI to a CareTeam state.
Nudge demo	One refill reminder queued in-app. Real email and SMS belong to Phase 2.
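The read-only lookups above could be served from a small in-memory fixture. A minimal sketch, assuming hypothetical names (`MockOrder`, `MOCK_ORDERS`, `lookupRefill`) and invented synthetic records; none of these are specified in the brief.

```typescript
// Hypothetical shape for a synthetic order record; field names are assumptions.
type ShipmentStatus = "processing" | "shipped" | "delivered";

interface MockOrder {
  patientId: string;
  medication: string;
  nextRefillShipDate: string; // ISO date, synthetic
  status: ShipmentStatus;
}

// Synthetic fixture data only; no PHI.
const MOCK_ORDERS: ReadonlyArray<MockOrder> = [
  { patientId: "demo-001", medication: "GLP-1 (demo)", nextRefillShipDate: "2030-01-15", status: "processing" },
  { patientId: "demo-002", medication: "HRT (demo)", nextRefillShipDate: "2030-01-20", status: "shipped" },
];

// Read-only lookup: returns a plain-language answer, or null when no record matches.
function lookupRefill(patientId: string): string | null {
  const matches = MOCK_ORDERS.filter((o) => o.patientId === patientId);
  if (matches.length === 0) return null;
  const order = matches[0];
  return `Your ${order.medication} refill is ${order.status} and scheduled to ship on ${order.nextRefillShipDate}.`;
}
```

Returning `null` for an unknown patient leaves the "I could not find that order" wording to the agent, so the fixture stays pure data.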

3.3 Where it shows up

Text for everyday questions. Voice for hands-free moments, with native barge-in. Avatar for high-trust demos. One brain, one KB, one escalation rule set across all three.

3.4 Out of scope

Payments and PCI. Real insurance, FSA, HSA. CareTeam routing through Slack or Twilio. Adherence tracking on real patient data. Subscription management. All named in the brief, all Phase 2.

Section 04 / Use Case 2

The Front Desk agent.

Front desk staff decide whether Dapper feels useful inside a partner clinic. Same brain as the Patient agent, different audience and retrieval filter.

4.1 The problem

A front-desk team fields the same dozen questions every week. Where is this order? How do I create a Dapper account? What's our SOP for X? Answers live in a training manual, in Slack, in someone's head, but rarely where the team can actually use them mid-shift.

4.2 Capabilities in the prototype

Internal enablement	Q&A over the Dapper training manual and SOPs. Retrieval-grounded, with citations on the source section.
Account walkthroughs	Guided scripts for creating and troubleshooting patient accounts. Agent prompts the next field.
Rx & shipping	The same mock-data lookups as the Patient agent, surfaced through the staff UI.
Pharmacy credibility	Talking points on credentials, sourcing, post-visit care. Lifts staff confidence when patients ask.
Escalation	The same in-app handoff as the Patient surface. One pattern across both agents.

4.3 Where it shows up

Same three modalities as Patient, gated to the staff view by URL. Text for desk-bound queries. Voice for hands-busy moments. Avatar as the same brand face the patient sees.

4.4 Out of scope

Real account creation against production systems. Payment collection. Translation. Operational dashboards (Rx queues, incomplete-order monitors, on-call routing). Conversational layer here, data layer in Phase 2.

Why one brain

Same KB, same voice, same escalation logic. Two agents on one brain is what makes the budget work. Audience tuning lives in prompt + retrieval filter, not in a second implementation.
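One way to picture "prompt + retrieval filter" is a single base prompt with a small per-audience variant table. This is a sketch under stated assumptions: the names (`BASE_PROMPT`, `VARIANTS`, `buildTurnConfig`) and the prompt wording are hypothetical, not the production prompts.

```typescript
type Audience = "patient" | "staff";

interface AudienceVariant {
  promptVariant: string;                   // appended to the shared base prompt
  retrievalFilter: { audience: Audience }; // handed to the retrieval layer
}

// Placeholder wording; real prompts would be written with Dapper.
const BASE_PROMPT = "You are the Dapper assistant. Answer only from retrieved content.";

const VARIANTS: Record<Audience, AudienceVariant> = {
  patient: {
    promptVariant: "Use plain language and suggest talking to a provider for clinical questions.",
    retrievalFilter: { audience: "patient" },
  },
  staff: {
    promptVariant: "Cite the SOP or manual section behind every answer.",
    retrievalFilter: { audience: "staff" },
  },
};

// Same agent, same base prompt; only the variant and the filter change per audience.
function buildTurnConfig(audience: Audience): { systemPrompt: string; filter: { audience: Audience } } {
  const variant = VARIANTS[audience];
  return {
    systemPrompt: `${BASE_PROMPT}\n${variant.promptVariant}`,
    filter: variant.retrievalFilter,
  };
}
```

Adding a third audience later would mean one more entry in the table, not a second agent.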

Section 05 / Architecture

One brain, three surfaces.

The same conversational agent powers text, voice, and avatar. The same retrieval pipeline grounds every answer. The avatar vendor is a swappable visual layer, not a structural commitment.

5.1 The principle

One ElevenLabs agent across all three modalities. Text and voice are native modes. The avatar consumes the same audio stream. Modality is presentation, not a different brain.

5.2 Layered architecture & request flow

User · Patient or Front Desk Staff

01 · request

L1 Presentation

Text · Streaming chat in patient and staff UIs.

shadcn/ui · elevenlabs-ui · Next.js 16 · React 19

Voice · Hands-free with native interruption.

Flash v2.5 TTS · Scribe STT · Barge-in · Push-to-talk

Avatar · Audio-driven lip sync, vendor-swappable.

Simli · HeyGen · Tavus · WebRTC

02 · turn

L2 Orchestration

ElevenLabs Conversational AI · single agent. STT → LLM → TTS in one pipeline. One prompt, one persona, one tool set. Audience switches by retrieval filter and prompt variant, not by a second agent.

OpenAI GPT-4o · GPT-4o-mini · Scribe STT · Flash v2.5 TTS · Custom LLM endpoint · Server tools · Turn-taking

03 · tool call

L3 Knowledge

Retrieval webhook · Embed query, audience filter, rerank, return ranked chunks + citations.

Next.js route · OpenAI embeddings · Cohere rerank · Audience filter

Qdrant Cloud · Audience-filtered collections. Our retrieval, not vendor-locked.

Qdrant Cloud · text-embedding-3-small · Patient / Staff filters · Free tier 1 GB
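The filter-then-rank step of the Knowledge layer can be illustrated without any vendor calls. The sketch below is an in-memory stand-in, not the Qdrant or Cohere integration: it assumes embeddings are already computed, uses plain cosine similarity in place of vector search, and leaves out the rerank step. All type and function names are hypothetical.

```typescript
type Audience = "patient" | "staff";

interface Chunk {
  text: string;
  source: string;        // section title, returned as the citation
  audiences: Audience[]; // who is allowed to see this chunk
  vector: number[];      // precomputed embedding (toy values in the demo)
}

interface RankedChunk {
  text: string;
  citation: string;
  score: number;
}

// Cosine similarity between two equal-length vectors; 0 if either is all zeros.
function cosine(a: number[], b: number[]): number {
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  return normA === 0 || normB === 0 ? 0 : dot / Math.sqrt(normA * normB);
}

// Audience filter first, then rank by similarity, then keep the top K with citations.
function retrieve(queryVector: number[], kb: Chunk[], audience: Audience, topK: number = 3): RankedChunk[] {
  return kb
    .filter((c) => c.audiences.indexOf(audience) !== -1)
    .map((c) => ({ text: c.text, citation: c.source, score: cosine(queryVector, c.vector) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, topK);
}
```

Filtering before ranking is the point of the design: a staff-only SOP chunk can never surface in a patient answer, however similar it is to the query.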

Escalation branch · Trigger keywords, frustration cues, or explicit "talk to a human" swap the UI to a CareTeam handoff state and log the conversation.

CareTeam handoff · Keyword trigger · Sentiment cue · Conversation log
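The escalation branch can be read as a small rule table evaluated on every user turn. A minimal sketch, assuming placeholder trigger phrases and hypothetical names (`RULES`, `evaluateTurn`); the real trigger lists would be agreed with Dapper, and a sentiment model could replace the frustration patterns.

```typescript
type EscalationReason = "explicit_request" | "sensitive_topic" | "frustration";
type UiState = "chat" | "careteam_handoff";

interface EscalationRule {
  reason: EscalationReason;
  patterns: RegExp[];
}

interface TurnOutcome {
  uiState: UiState;
  reason: EscalationReason | null;
}

// Placeholder patterns, ordered by priority. These are illustrative only.
const RULES: EscalationRule[] = [
  { reason: "explicit_request", patterns: [/\b(talk|speak) (to|with) (a |an )?(human|person|someone)\b/i] },
  { reason: "sensitive_topic", patterns: [/\b(chest pain|allergic reaction|trouble breathing)\b/i] },
  { reason: "frustration", patterns: [/\b(useless|ridiculous|fed up)\b/i] },
];

// First matching rule wins; no match keeps the conversation in the normal chat state.
function evaluateTurn(message: string): TurnOutcome {
  for (const rule of RULES) {
    if (rule.patterns.some((p) => p.test(message))) {
      // Prototype-quality logging, matching the console-logs-only scope.
      console.log(`[escalation] reason=${rule.reason}`);
      return { uiState: "careteam_handoff", reason: rule.reason };
    }
  }
  return { uiState: "chat", reason: null };
}
```

Because the same function runs for both agents and all three modalities, the handoff behaves identically whether the trigger was typed or spoken.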

5.3 Why this shape

Consistency. One agent, one persona across modalities. Cost. Two agents on one brain fits the budget. Optionality. Avatar vendor sits at the edge, swappable without re-architecting.

Section 06 / The stack

The stack we'd build on.

Eight layers, every choice with a one-line rationale and a documented escape hatch.

Web framework	Next.js 16 + React 19 + Tailwind + shadcn/ui. One codebase, both surfaces, route handlers for RAG and mock data.
Conversation UI	elevenlabs-ui registry. Voice button, orb, waveform, transcript. Copied in, restyled to brand.
Agent runtime	ElevenLabs Conversational AI. One agent for text and voice. Turn-taking and barge-in native.
LLM	OpenAI GPT-4o for answers, GPT-4o-mini for routing. Via ElevenLabs Custom LLM. Swappable.
Vector database	Qdrant Cloud, free tier. HTTP API fits serverless. 1 GB covers the prototype KB.
Embeddings	OpenAI text-embedding-3-small. Cheap, accurate. Voyage or Cohere as swap-ins.
Avatar	Simli, audio-driven. Cleanest pairing with ElevenLabs audio. HeyGen and Tavus are documented fallbacks.
Hosting	Vercel, password-protected preview. No auth code in prototype.

Not in the stack on purpose: observability, email/SMS, payments, real auth. Real production needs, real Phase 2 line items.

Section 07 / Role matrix

The team.

A two-person build core with light PM and QA. Tech Lead owns the agent, RAG, and vendor wiring. Fullstack Engineer owns the web app and deploy.

Role	Responsibility	Allocation	Rate
Tech Lead / AI Engineer	ElevenLabs agent, prompts, tools. RAG on Qdrant. Avatar wiring. Owns conversational quality.	75%	$55/hr
Fullstack Engineer	Next.js app, route handlers, mock data, deploy. Builds the chat, voice, and avatar shells.	100%	$45/hr
Project Manager	Cadence, demo coordination, KB-content handoff, change-requests.	15%	$55/hr
QA Engineer	Regression on the three modality paths. Smoke tests on answer quality and escalation triggers.	20%	$30/hr

2.1 FTE-equivalent across six weeks. The team that scopes the work is the team that ships it. No staffing churn between discovery and delivery.
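The 2.1 figure is the sum of the allocation column in the role matrix. A quick check, using only the percentages stated in the table (variable names are illustrative):

```typescript
// Allocation percentages copied from the role matrix.
const allocations: { role: string; pct: number }[] = [
  { role: "Tech Lead / AI Engineer", pct: 75 },
  { role: "Fullstack Engineer", pct: 100 },
  { role: "Project Manager", pct: 15 },
  { role: "QA Engineer", pct: 20 },
];

// Sum in whole percents to avoid floating-point drift, then convert to FTE.
const totalPct = allocations.reduce((sum, a) => sum + a.pct, 0);
const fteEquivalent = totalPct / 100;

console.log(fteEquivalent); // 2.1
```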

Section 08 / Schedule and investment

Six weeks. One fixed price.

Five milestones, both agents, three modalities. Price fixed at signing. Anything outside the locked scope routes through a change-request.

#	Milestone	Description	Duration	Investment
01	Foundation & RAG	Next.js shell up. Qdrant seeded with initial Dapper KB. ElevenLabs agent wired to OpenAI. Retrieval webhook live.	1.0 wk	$3,800
02	Patient agent · text + voice	Patient surface in elevenlabs-ui + shadcn. RAG-grounded chat and voice. Escalation UX. Mock refill/shipment lookups.	1.5 wk	$5,400
03	Front Desk agent	Staff surface on the same brain. SOP Q&A, account walkthroughs, shared lookups.	1.0 wk	$3,600
04	Avatar integration	Simli in audio-driven mode against the ElevenLabs stream. Mode toggle in both UIs. Documented HeyGen/Tavus fallback.	1.0 wk	$3,600
05	Polish, demo scripts & handover	Three vetted demo scripts. Password-protected deploy. Walkthrough video. QA pass.	1.5 wk	$5,500
→	Fixed price · prototype	Six weeks, two agents, three modalities. Tooling credits included.	6 wk	$22,500

What you walk away with

A password-protected web app, two agents on one brain, three modalities, KB grounded in Dapper content. Three demo scripts the team can run unaccompanied. A written Phase 2 plan, priced item by item.

Section 09 / Case studies

We've shipped this shape of work before.

Four engagements. Three map directly onto the Dapper prototype: one near-twin on multi-agent conversational AI, one HIPAA-compliant patient platform at lean-team scale, one AI intake POC delivered in two months. The fourth shows the regulated-scale path a successful prototype graduates into.

Multi-agent conversational AI for drug denial & patient education

Client · confidential US healthcare provider · telemedicine, pharmacy benefits

RAG-based voicebot and chatbot, built on ElevenLabs over GCP. Multi-agent flows with PHI-safe orchestration, identity verification, and modular integration with existing EHR and CRM systems. The near-twin to what we are proposing for Dapper.

Challenges

Patients lacked clarity on drug denial reasons, PGx testing, and insurance coverage.
Manual, fragmented workflows delayed treatment decisions and risked patient attrition.
Repetitive Q&A burdened staff, raising operational cost and burnout risk.

Solution

Multi-agent conversational AI for providers and patients across voice and chat.
Agentic flows with PHI-safe orchestration and EHR/CRM integration.
RAG pipelines grounded in PGx reports and SOPs, with tool orchestration for kit requests, appointment scheduling, and PharmD consults.

Impact

Reduced average handling time for drug denial cases.
Increased patient clarity on denial reasons and test benefits.
Improved provider engagement via automated callbacks and multi-contact fallback logic.

Architecture diagram for the drug-denial conversational AI. PGx outbound call scheduler triggers a Twilio telephony system protected by a HIPAA Business Associate Agreement. Calls flow through ElevenLabs platform components: ASR, Voice Activity Detection, an LLM layer with multiple model options, and TTS. Function calls fetch provider details from a PGx report system and APIs, then send back PGxReport summaries and post-call webhook transcripts. ElevenLabs platform layers: Agents, Prompts, Tooling, Vector Database. HIPAA-compliant with Zero Data Retention. — Architecture · ElevenLabs + Twilio + RAG with HIPAA BAA and zero data retention

ElevenLabs · RAG · Multi-agent · GCP · PHI-safe orchestration · EHR / CRM integration

Ollie Pediatrics · concierge HIPAA-compliant platform

Client · Ollie Pediatrics · pediatric primary care · US

A modern, integrated, AI-powered pediatric experience uniting parents, doctors, and technology under one HIPAA-compliant hub. "Clinician Ask AI" answers provider questions about a single patient in scope. Validates the lean-team, fixed-scope shape we are proposing for Dapper.

Scope

Product discovery, UX/UI, and full-stack development.
All-in-one hub for child wellness: birth through adolescence.
Real-time care access: telehealth, clinic, home visits, callbacks.

Delivery

4-month agile engagement with continuous delivery.
4-member lean team.
HIPAA-compliant, mobile-first iOS/Android, modular architecture.

Outcome

"Clinician Ask AI" in production answering provider questions per-patient.
"We've aligned on the goals and ways of working that suit Ollie, and our partnership is growing stronger."
Ramses Rivero, Founder, Ollie Pediatrics.

Ollie · provider dashboard + parent app

HIPAA-compliant · iOS & Android · Telehealth · Clinician AI · 4-month MVP

Ambient AI intake companion · clinician note generation

Client · confidential US venture-backed elderly-care platform

Real-time transcription with speaker separation, converted into schema-aligned JSON with provenance and confidence scoring. Pre-fills intake forms and generates clinician-ready SOAP and HPI summaries. Validates the prototype shape: lean team, fixed scope, demoable MVP in two months.

Scope

Ambient capture of nurse-patient conversations.
Schema-aligned intake form pre-fill with provenance and confidence.
Reviewer-centric queue + side-panel field validation, one-click autofill.

Delivery

2-month agile engagement, fixed scope, MVP delivery.
3-member lean team.
HIPAA-conscious POC with documented hardening plan (BAA, encryption, IAM).

Outcome

Draft SOAP/HPI summaries linked back to source evidence for rapid verification.
Clinician-ready notes generated inside the existing intake workflow.
POC accepted as the basis for the productionization roadmap.

AWS-based architecture for the ambient AI intake companion. Clinical-data VPC boundary holds Amazon Cognito for auth, an API Gateway, AWS Lambda compute, Amazon S3 storage, Amazon HealthLake for FHIR data, Amazon Transcribe Medical for speaker-separated transcription, and Amazon CloudFront. Surrounding the boundary: AWS Amplify, AWS Config, AWS KMS for encryption, Amazon CloudWatch, and AWS CloudTrail. Encounter and client actors connect from outside the boundary. — Architecture · AWS-native, HIPAA-conscious clinical-data boundary

Real-time STT · Speaker separation · Structured extraction · HIPAA-conscious · 2-month POC

alldayDr · UK telemedicine & online pharmacy platform

Client · alldayDr · UK telemedicine + pharmacy · nationwide

Built and scaled a UK-compliant telemedicine and pharmacy platform from concept to nationwide rollout. Demonstrates the scale path the Dapper prototype is designed to graduate into: real regulatory posture, multi-app surface area, multi-year engagement.

Scope

End-to-end delivery from ideation to execution.
8 mobile and web apps developed across patient, provider, and pharmacy surfaces.
SNOMED and dm+d standards integrated for clinical and prescription data.

Compliance

Awarded a place on the NHS GPIT framework.
CQC and ISO 27001 certified for data security.
UK-healthcare-aware SDLC across the engagement.

Outcome

2+ years of agile engagement with continuous delivery.
25% reduction in time and cost across release cycles.
"The team quickly grasped and understood the UK Healthcare system and always came up with new ideas." Suhel Ahmed, Founder & CEO.

alldayDr patient telemedicine app screens. Patient flow from triage questions through online doctor consultation, prescription issue, and pharmacy fulfilment. UK-compliant clinical UX with SNOMED and dm+d standards integrated. — Patient app · triage, consult, prescription

alldayDr pharmacy ops dashboard. Welcome screen for a pharmacy operator showing Private Orders, NHS Uploaded, Messages, Payments, Statistics, Organization, and Intranet. Order queue with status filters (New, Approved, Awaiting Shipment, Shipped, Rejected, Queried, Cancelled) and a per-order table with customer, product, timeline, and cost. — Pharmacy dashboard · NHS order queue + ops

NHS GPIT · CQC certified · ISO 27001 · SNOMED · dm+d · 8 apps · Nationwide rollout

Pattern match for Dapper

The first case study is the technical pattern (ElevenLabs + RAG + multi-agent in healthcare). The second is the HIPAA-compliant lean-team pattern. The third is the prototype-shaped MVP pattern. The fourth is the regulated-scale path a successful prototype graduates into. Dapper is the first three today, the fourth tomorrow.

Section 10 / Why SoluteLabs · The ask

Why SoluteLabs, and what's next.

A senior pair as the build core. Same faces from day one through demo day.

SoluteLabs ships text and voice AI agents in production at enterprise scale. The Tech Lead owns ElevenLabs end-to-end. The Fullstack Engineer ships Next.js + shadcn every week. Not a side project for a generalist team.

US-hours overlap	The build pair works in significant overlap with US business hours. Daily async updates, weekly working demos.
ElevenLabs depth	Production agent work is our primary practice. Custom LLM routing, webhook tools, voice tuning, audio-driven avatars.
BYO RAG pattern	We own the retrieval layer rather than handing it to a vendor. Qdrant + audience filters is our standard shape.
Next.js & shadcn	Next.js latest, React 19, shadcn/ui, Tailwind. The tools we ship every week.
Regulated B2B	Experience with HIPAA-adjacent products informs the Phase 2 list in Section 02.
12 years · US entity	50+ team. Founded 2014. US entity in Delaware.

The ask

Confirm scope, vendors, and the $22.5K fixed price. You send the KB seed content in week one. We ship the demo six weeks later. If anything here should be re-shaped first, that is the next conversation.

Two agents. One AI brain. Built for Dapper.
