Skip to contentSoluteLabs × Dapper / AI Agents Prototype ProposalConfidential / 21 May 2026 · Fixed-bid SoW
SoluteLabs · AI-Native Product Engineering for Healthcare
Two agents. One AI brain. Built for Dapper.
Text, voice, and avatar across both surfaces. One six-week sprint, one fixed price, one demo your team can put in front of physicians, patients, and partner clinics.
Fixed-price prototype$22.5K
Six weeks, two agents, three modalitiesOne conversational brain powering Patient and Front DeskDemo-ready, password-protected, vendor-swappable
A working demo, not a production system. Synthetic data, no PHI, no real CareTeam routing. The aim is to put a credible artifact in front of stakeholders so the productionization decision is informed, not theoretical.
Section 01 / What we understood
What the brief asked for.
Three observations from the brief that shape everything below.
1.1 An operating layer, not a chatbot
Dapper's brief is precise: an AI-powered layer that removes friction between physicians, patients, pharmacies, and fulfillment. One coherent surface, met in whichever channel fits the moment.
1.2 Three audiences, three different shapes
A1Physician
Prescribing assistant
Formulary, Rx generation, EHR pull, prior auth
Highest clinical stakes, deepest integrations. Strategically the most valuable, and the most expensive to prototype credibly.
A2Patient
Care companion prototype slice
Medication Q&A, onboarding, refills, escalation
Largest audience, most visible value, shallowest integration. Demoable on synthetic data and a curated KB.
A3Front Desk
Operations enablement prototype slice
SOP Q&A, account walkthroughs, Rx and shipping lookups
Lowest clinical risk, RAG-heavy, internal. Shares the same brain and KB as the Patient surface.
1.3 The constraint
A $15-25K prototype budget covers six to ten weeks for a senior pair. Enough for a credible demo across two surfaces and three modalities. Not enough to prototype the physician surface honestly, because the valuable capabilities live behind EHR integration.
Section 02 / The bounded slice
What fits the budget.
Two agents, three modalities, one conversational brain. The Physician surface is deferred to Phase 2 on purpose.
2.1 In scope
Two agents on one brain and one knowledge base, served through a Next.js web app with text, voice, and avatar. Patient handles Q&A, onboarding, refill and shipment lookups on mock data, education, and in-app escalation. Front Desk handles SOPs, account walkthroughs, and the same lookups for staff. Same voice, same avatar, same brand across modalities.
2.2 Out of scope, on purpose
Phase 2 territory
Physician agent. Rx generation, EHR/EPIC, prior auth need real integration and clinical-grade evals.
EHR, payments, insurance, FSA/HSA. Each is its own project. Mock data carries the demo.
Auth, HIPAA BAAs, PHI. Synthetic data behind a password-protected URL.
Email, SMS, real CareTeam routing. Escalation is an in-app UX state in the prototype.
Observability, eval harness, prompt evaluation. Console logs only at prototype quality.
Voice cloning of Dapper staff. Library voice or Voice Design; cloning needs consent and a recording session.
2.3 What "prototype" means
A working web app, two agents, three modalities, real RAG over real content, password-protected URL. Credible enough to put in front of physicians, clinics, and investors. Productionization cost is called out item by item above, not folded into the price.
Section 03 / Use Case 1
The Patient agent.
Thirty-plus patient capabilities in the brief, narrowed to a focused slice. The agent is the same one whether the patient types, talks, or sees a face.
3.1 The problem
A Dapper patient on a GLP-1 or HRT protocol has dozens of small questions every week. Where is my shipment. What is this side effect. Can I take this with coffee. How do I auto-refill. Each one currently stalls the patient or lands on a human.
3.2 Capabilities in the prototype
Education
Medication explainers in plain language. GLP-1, HRT, peptides. Timelines, dosing, side effects.
Read-only lookups on mock data. "When does my refill ship?" Demonstrates the shape without production wiring.
Plain-language labs
Paste a result, get a non-clinical explanation with "talk to your provider" guardrails.
Escalation
An honest handoff. Sensitivity, frustration, or "talk to a human" swaps the UI to a CareTeam state.
Nudge demo
One refill reminder queued in-app. Real email and SMS belong to Phase 2.
3.3 Where it shows up
Text for everyday questions. Voice for hands-free moments, with native barge-in. Avatar for high-trust demos. One brain, one KB, one escalation rule set across all three.
3.4 Out of scope
Payments and PCI. Real insurance, FSA, HSA. CareTeam routing through Slack or Twilio. Adherence tracking on real patient data. Subscription management. All named in the brief, all Phase 2.
Section 04 / Use Case 2
The Front Desk agent.
Front desk staff decide whether Dapper feels useful inside a partner clinic. Same brain as the Patient agent, different audience and retrieval filter.
4.1 The problem
A front-desk team fields the same dozen questions every week. Where is this order. How do I create a Dapper account. What's our SOP for X. Answers live in a training manual, in Slack, in someone's head, but rarely where the team can actually use them mid-shift.
4.2 Capabilities in the prototype
Internal enablement
Q&A over the Dapper training manual and SOPs. Retrieval-grounded, with citations on the source section.
Account walkthroughs
Guided scripts for creating and troubleshooting patient accounts. Agent prompts the next field.
Rx & shipping
The same mock-data lookups as the Patient agent, surfaced through the staff UI.
Pharmacy credibility
Talking points on credentials, sourcing, post-visit care. Lifts staff confidence when patients ask.
Escalation
The same in-app handoff as the Patient surface. One pattern across both agents.
4.3 Where it shows up
Same three modalities as Patient, gated to the staff view by URL. Text for desk-bound queries. Voice for hands-busy moments. Avatar as the same brand face the patient sees.
4.4 Out of scope
Real account creation against production systems. Payment collection. Translation. Operational dashboards (Rx queues, incomplete-order monitors, on-call routing). Conversational layer here, data layer in Phase 2.
Why one brain
Same KB, same voice, same escalation logic. Two agents on one brain is what makes the budget work. Audience tuning lives in prompt + retrieval filter, not in a second implementation.
Section 05 / Architecture
One brain, three surfaces.
The same conversational agent powers text, voice, and avatar. The same retrieval pipeline grounds every answer. The avatar vendor is a swappable visual layer, not a structural commitment.
5.1 The principle
One ElevenLabs agent across all three modalities. Text and voice are native modes. The avatar consumes the same audio stream. Modality is presentation, not a different brain.
5.2 Layered architecture & request flow
User · Patient or Front Desk Staff
01 · request
L1Presentation
TextStreaming chat in patient and staff UIs.
shadcn/uielevenlabs-uiNext.js 16React 19
VoiceHands-free with native interruption.
Flash v2.5 TTSScribe STTBarge-inPush-to-talk
AvatarAudio-driven lip sync, vendor-swappable.
SimliHeyGenTavusWebRTC
02 · turn
L2Orchestration
ElevenLabs Conversational AI · single agentSTT → LLM → TTS in one pipeline. One prompt, one persona, one tool set. Audience switches by retrieval filter and prompt variant, not by a second agent.
Consistency. One agent, one persona across modalities. Cost. Two agents on one brain fits the budget. Optionality. Avatar vendor sits at the edge, swappable without re-architecting.
Section 06 / The stack
The stack we'd build on.
Eight layers, every choice with a one-line rationale and a documented escape hatch.
Web framework
Next.js 16 + React 19 + Tailwind + shadcn/ui. One codebase, both surfaces, route handlers for RAG and mock data.
Conversation UI
elevenlabs-ui registry. Voice button, orb, waveform, transcript. Copied in, restyled to brand.
Agent runtime
ElevenLabs Conversational AI. One agent for text and voice. Turn-taking and barge-in native.
LLM
OpenAI GPT-4o for answers, GPT-4o-mini for routing. Via ElevenLabs Custom LLM. Swappable.
Vector database
Qdrant Cloud, free tier. HTTP API fits serverless. 1 GB covers the prototype KB.
Embeddings
OpenAI text-embedding-3-small. Cheap, accurate. Voyage or Cohere as swap-ins.
Avatar
Simli, audio-driven. Cleanest pairing with ElevenLabs audio. HeyGen and Tavus are documented fallbacks.
Hosting
Vercel, password-protected preview. No auth code in prototype.
Not in the stack on purpose: observability, email/SMS, payments, real auth. Real production needs, real Phase 2 line items.
Section 07 / Role matrix
The team.
A two-person build core with light PM and QA. Tech Lead owns the agent, RAG, and vendor wiring. Fullstack Engineer owns the web app and deploy.
Regression on the three modality paths. Smoke tests on answer quality and escalation triggers.
20%
$30/hr
2.1 FTE-equivalent across six weeks. The team that scopes the work is the team that ships it. No staffing churn between discovery and delivery.
Section 08 / Schedule and investment
Six weeks. One fixed price.
Five milestones, both agents, three modalities. Price fixed at signing. Anything outside the locked scope routes through a change-request.
#
Milestone
Description
Duration
Investment
01
Foundation & RAG
Next.js shell up. Qdrant seeded with initial Dapper KB. ElevenLabs agent wired to OpenAI. Retrieval webhook live.
1.0 wk
$3,800
02
Patient agent · text + voice
Patient surface in elevenlabs-ui + shadcn. RAG-grounded chat and voice. Escalation UX. Mock refill/shipment lookups.
1.5 wk
$5,400
03
Front Desk agent
Staff surface on the same brain. SOP Q&A, account walkthroughs, shared lookups.
1.0 wk
$3,600
04
Avatar integration
Simli in audio-driven mode against the ElevenLabs stream. Mode toggle in both UIs. Documented HeyGen/Tavus fallback.
1.0 wk
$3,600
05
Polish, demo scripts & handover
Three vetted demo scripts. Password-protected deploy. Walkthrough video. QA pass.
1.5 wk
$5,500
→
Fixed price · prototype
Six weeks, two agents, three modalities. Tooling credits included.
6 wk
$22,500
What you walk away with
A password-protected web app, two agents on one brain, three modalities, KB grounded in Dapper content. Three demo scripts the team can run unaccompanied. A written Phase 2 plan, priced item by item.
Section 09 / Case studies
We've shipped this shape of work before.
Three engagements that map directly onto the Dapper prototype: one near-twin on multi-agent conversational AI, one HIPAA-compliant patient platform at lean-team scale, one AI intake POC delivered in two months.
Multi-agent conversational AI for drug denial & patient education
RAG-based voicebot and chatbot, built on ElevenLabs over GCP. Multi-agent flows with PHI-safe orchestration, identity verification, and modular integration with existing EHR and CRM systems. The near-twin to what we are proposing for Dapper.
Challenges
Patients lacked clarity on drug denial reasons, PGx testing, and insurance coverage.
Manual, fragmented workflows delayed treatment decisions and risked patient attrition.
Repetitive Q&A burdened staff, raising operational cost and burnout risk.
Solution
Multi-agent conversational AI for providers and patients across voice and chat.
Agentic flows with PHI-safe orchestration and EHR/CRM integration.
RAG pipelines grounded in PGx reports and SOPs, with tool orchestration for kit requests, appointment scheduling, and PharmD consults.
Impact
Reduced average handling time for drug denial cases.
Increased patient clarity on denial reasons and test benefits.
Improved provider engagement via automated callbacks and multi-contact fallback logic.
Architecture · ElevenLabs + Twilio + RAG with HIPAA BAA and zero data retention
A modern, integrated, AI-powered pediatric experience uniting parents, doctors, and technology under one HIPAA-compliant hub. "Clinician Ask AI" answers provider questions about a single patient in scope. Validates the lean-team, fixed-scope shape we are proposing for Dapper.
Scope
Product discovery, UX/UI, and full-stack development.
All-in-one hub for child wellness: birth through adolescence.
Real-time care access: telehealth, clinic, home visits, callbacks.
Delivery
4 months agile engagement with continuous delivery.
Real-time transcription with speaker separation, converted into schema-aligned JSON with provenance and confidence scoring. Pre-fills intake forms and generates clinician-ready SOAP and HPI summaries. Validates the prototype shape: lean team, fixed scope, demoable MVP in two months.
Scope
Ambient capture of nurse-patient conversations.
Schema-aligned intake form pre-fill with provenance and confidence.
Reviewer-centric queue + side-panel field validation, one-click autofill.
alldayDr · UK telemedicine & online pharmacy platform
alldayDrUK telemedicine + pharmacy · nationwide
Built and scaled a UK-compliant telemedicine and pharmacy platform from concept to nationwide rollout. Demonstrates the scale path the Dapper prototype is designed to graduate into: real regulatory posture, multi-app surface area, multi-year engagement.
Scope
End-to-end delivery from ideation to execution.
8 mobile and web apps developed across patient, provider, and pharmacy surfaces.
SNOMED and dm+d standards integrated for clinical and prescription data.
Compliance
Awarded a place on the NHS GPIT framework.
CQC and ISO 27001 certified for data security.
UK-healthcare-aware SDLC across the engagement.
Outcome
2+ years agile engagement with continuous delivery.
25% reduction in time and cost across release cycles.
"The team quickly grasped and understood the UK Healthcare system and always came up with new ideas." Suhel Ahmed, Founder & CEO.
The first case study is the technical pattern (ElevenLabs + RAG + multi-agent on healthcare). The second is the HIPAA-compliant lean-team pattern. The third is the prototype-shaped MVP pattern. The fourth is the regulated-scale path a successful prototype graduates into. Dapper is the first three today, the fourth tomorrow.
Section 10 / Why SoluteLabs · The ask
Why SoluteLabs, and what's next.
A senior pair as the build core. Same faces from day one through demo day.
SoluteLabs ships text and voice AI agents in production at enterprise scale. The Tech Lead owns ElevenLabs end-to-end. The Fullstack Engineer ships Next.js + shadcn every week. Not a side project for a generalist team.
US-hours overlap
The build pair works in significant overlap with US business hours. Daily async updates, weekly working demos.
ElevenLabs depth
Production agent work is our primary practice. Custom LLM routing, webhook tools, voice tuning, audio-driven avatars.
BYO RAG pattern
We own the retrieval layer rather than handing it to a vendor. Qdrant + audience filters is our standard shape.
Next.js & shadcn
Next.js latest, React 19, shadcn/ui, Tailwind. The tools we ship every week.
Regulated B2B
Experience with HIPAA-adjacent products informs the Phase 2 list in Section 02.
12 years · US entity
50+ team. Founded 2014. US entity in Delaware.
The ask
Confirm scope, vendors, and the $22.5K fixed price. You send the KB seed content in week one. We ship the demo six weeks later. If anything here should be re-shaped first, that is the next conversation.