PrimeAssist
// the trusted AI support layer

Answers grounded in your knowledge. Actions governed by your policies. Escalations the moment confidence drops.

Gartner predicts agentic AI will autonomously resolve 80% of common customer service issues by 2029. Today, only 46% of people trust AI — even as 66% use it regularly. PrimeAssist closes that gap.

Prefer concierge onboarding? Join the waitlist

$1.3T

Worldwide AI IT spending by 2029

IDC, Worldwide AI IT Spending Forecast (Aug 2025) →

78%

of organizations now use AI in at least one business function

Stanford HAI, AI Index Report (Apr 2025) →

53%

of companies are deploying AI agents in customer service

BCG, Agentic AI in Customer Service (Dec 2025) →

// the four pillars

Trust is built one constraint at a time.

Four constraints, enforced together — each one a promise the agent keeps on every answer.

  • 01 / 04

    Grounded

    Every answer cites a source. Confidence is first-class.

    PrimeAssist answers only from documents you have uploaded and approved. Every claim carries a citation back to the originating page. When confidence drops below your threshold, the agent says I do not know instead of guessing.

  • 02 / 04

    Controlled

    Topic allowlists, policy packs, and refusal templates that explain why.

    PrimeAssist enforces your topic boundaries before the model ever runs. Policy packs codify what the agent can say about pricing, claims, eligibility, refunds. Refusals explain to the customer why — never a blank stare.

  • 04 / 04

    Escalatable

    The agent knows when to hand off. The human gets full context.

    PrimeAssist watches its own confidence in real time. When it dips below your threshold, when a topic falls outside policy, when the customer explicitly asks for a human — the agent hands off with the full conversation, the relevant policies, and a clean summary. No please describe your issue again.

// how it works

Four steps. Every step traced, cited, and governed.

A real PrimeAssist trace for one customer turn. Each step is a structured event the agent must explain — to your compliance team, to a human handoff, or back to the customer who asked.

  1. Retrieve

    Search your approved knowledge — never the open web.

    The agent searches across the documents you have uploaded and approved (PDFs, Word, Markdown, scraped sites, API specs) and ranks the top relevant chunks. A confidence value rides every retrieval. Nothing the agent does not have grounding for can be claimed.

  2. Reason

    Draft an answer inside the topics and policies you set.

    The model drafts an answer using only the retrieved chunks, inside your topic allowlist and your policy pack. Refusal templates fire when the question is out of scope; PII gets masked before it ever reaches the model; confidence is checked against your threshold.

  3. Act

    Call your APIs — with dry-run, confirmation, and audit.

    When the answer requires real work — a lookup, a refund, a booking — the agent calls your API. Destructive operations dry-run by default. The customer confirms anything irreversible. Every call is audited with the policy that authorized it, the inputs, and the result.

  4. Escalate

    Hand off the moment confidence drops — never silently.

    Below your confidence threshold? Outside your topics? Customer asks? The agent hands off to a human with the conversation, the relevant policies, the candidate answer it was about to send, and the confidence score. No please describe your issue again.

primeassist · trace.log
# user · how do I cancel my policy?
[01] retrieve
policy-handbook.pdf §4.2 rel 0.91
policy-handbook.pdf §4.3 rel 0.88
faq.md cancellation rel 0.71
[02] reason confidence 0.94
topic policy.cancellation allow
policy pii.mask applied
refusal pack policy.unsupported n/a
[03] act
api://policies/lookup(p_491c) -> active
tool.policy read-only no-confirm
audit call#t8a1c2 stored
[04] respond
"You can cancel your active policy in three..."
cites §4.2, §4.3, faq.md conf 0.94 >= 0.85
# handoff path stays armed for next turn

// what the platform does

Grounded, controlled, actionable — built as platform capabilities.

The pillars are not slogans — they are the surfaces a tenant configures. Eight capabilities, each one a working part of the platform.

  • Grounded citations

    Every reply is drawn from retrieved passages of the tenant's own knowledge, with those sources surfaced inline. An answer that cannot be grounded is never shown as confident.

  • Topic guardrails & policy packs

    Each agent is bounded by configured topics and policy packs. Out-of-scope questions are declined or routed, and sensitive triggers force a human handoff.

  • Tool / action calling

    Tools let the agent read and write through the tenant's existing APIs. Writes run behind dry-run, customer confirmation, and approval queues so every action is deliberate and audited.

  • Knowledge ingestion

    Tenants bring their own handbooks, plan documents, and product docs. PrimeAssist chunks and embeds them into a per-tenant vector namespace, isolated from every other tenant.

  • Evals & quality

    Golden sets run against every agent version and A/B comparisons measure each change. Citation-regression alerts catch grounding drift before it reaches customers.

  • Operator console

    The platform-side surface for running PrimeAssist as a product: cross-tenant management, plan administration, and impersonation backed by a full audit trail.

  • Per-tenant theming

    The widget and dashboard take on each tenant's brand colours and radii. Theming is driven entirely by CSS custom properties, with no runtime style scripting.

  • Vertical templates

    Starter packs for insurance, banking, healthcare, SaaS, and telecom — each a curated knowledge base, policy pack, tool surface, and golden set. Day one is a configuration, not a build.

// one agent, every channel

The same grounded agent, wherever customers reach you.

Chat, voice, phone, SMS, WhatsApp, and email all run the same agent loop — same knowledge, same policies, same audit trail, same handoff path. Pick a channel to see how it answers.

Chat widget

An embeddable script drop-in that streams grounded answers with citations, on any site.

  • One <script> tag, around 30 kB gzipped, no framework runtime.
  • Streamed token-by-token answers with inline source citations.
  • Adopts each tenant's brand through scoped CSS custom properties.

// peer-reviewed productivity

An academic study of 5,179 customer-support agents found generative AI lifted resolutions per hour by 14% on average — and 34% for novice agents.

+14% +34%

14% on average. 34% for novice agents. n = 5,179.


PrimeAssist applies the same productivity math to your team. For the answers AI handles, your humans handle the rest — with more context and less repetition.
Brynjolfsson, Li & Raymond — NBER Working Paper 31161 (2023), Quarterly Journal of Economics vol. 140(2), 2025. →

// what this looks like in production

Speed where the agent is grounded. Humans where it is not. The discipline PrimeAssist designs in by default.

case fintech

Klarna, first month

Conversations handled
2.3M
~two-thirds of total chats
Resolution time
11 → <2 min
avg time to resolve
Repeat inquiries
−25%
vs. human-only baseline

Klarna also revised parts of its all-AI stance in 2025 and rehired support staff for complex cases. That tradeoff — fast where the agent is grounded, human where it is not — is precisely what PrimeAssist treats as the default, not the lesson learned the hard way.

Klarna press release, Feb 2024 →

// regulator-grade by design

The same controls regulators are starting to require — built in from day one.

  • Transparent by mandate.

    EU AI Act Article 13 requires high-risk AI systems to be sufficiently transparent for deployers to interpret output and use it appropriately. PrimeAssist makes citations and per-answer confidence first-class.

    EU AI Act, Article 13 (in force Aug 2026) →
  • Confabulation, named as the risk.

    NIST AI 600-1 names confabulation in current-generation LLMs as pervasive. PrimeAssist's grounded retrieval and confidence threshold are the prescribed mitigation.

    NIST AI 600-1 (July 2024) →
  • Plain-language sources.

    OECD AI Principle 1.3 requires plain, easy-to-understand information on the sources of data, inputs, and logic that led to a prediction. PrimeAssist surfaces them inline with the answer.

    OECD AI Principles, updated May 2024 →

// be among the first

We are onboarding policy-bound businesses by hand.

Drop your details. We will reach out personally when we have a slot that fits your vertical.

The hosted waitlist form is not wired in yet. Reach out directly and we will add you to the list by hand.

Email to join

Operators: set PUBLIC_WAITLIST_PROVIDER and the matching form ID in .env to drop in a Tally or ConvertKit embed without touching code.

No backend yet — your details go straight to our inbox and stay there.