Engineering

Building Universal Memory: Stateful AI at Scale

Gilad Gabay · December 28, 2025 · 1 min read

How we designed Universal Memory to give your AI persistent context across sessions, built on PostgreSQL, pgvector, and intelligent memory extraction.


LLMs are stateless by design: every conversation starts fresh, with no memory of the last. Universal Memory is our solution, giving each user persistent context that carries across sessions.

The Architecture

  1. Memory Extraction - an LLM pass pulls memorable facts out of each conversation
  2. Vector Storage - facts are embedded and stored in pgvector with HNSW indexing (schema sketched below)
  3. Context Injection - relevant memories are retrieved and injected into future prompts
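
To make stage 2 concrete, here is a minimal sketch of what that storage layer can look like, assuming psycopg 3 and the pgvector extension. The table layout, names, and 1536-dimension embeddings are illustrative assumptions, not our production schema:

```python
# A minimal sketch of the storage layer (stage 2), assuming psycopg 3 and
# the pgvector extension. Table layout, names, and the 1536-dim embedding
# size are illustrative assumptions, not our production schema.
import psycopg

with psycopg.connect("postgresql://localhost/memory") as conn:
    conn.execute("CREATE EXTENSION IF NOT EXISTS vector")
    conn.execute("""
        CREATE TABLE IF NOT EXISTS memories (
            id        bigserial PRIMARY KEY,
            user_id   bigint NOT NULL,
            kind      text NOT NULL,          -- one of the seven memory types
            content   text NOT NULL,          -- the extracted fact itself
            embedding vector(1536) NOT NULL   -- dimension depends on the model
        )
    """)
    # HNSW index on cosine distance: approximate nearest-neighbour lookups
    # stay fast even as the table grows.
    conn.execute("""
        CREATE INDEX IF NOT EXISTS memories_embedding_idx
        ON memories USING hnsw (embedding vector_cosine_ops)
    """)
```

HNSW trades slower index builds for fast approximate reads, which is the right trade-off when every prompt triggers a retrieval and writes are comparatively rare.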

7 Memory Types

  • Instructions - standing system guidance
  • Preferences - user choices
  • Projects - active work
  • Skills - user capabilities
  • Facts - known information
  • Relationships - connections between people and entities
  • Corrections - learnings from past mistakes (modeled as an enum below)
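
One way the seven types could be modeled in code; the enum and its values are an illustrative sketch, not our internal implementation:

```python
# One way the seven memory types might be modeled; an illustrative
# sketch, not our internal code.
from enum import Enum

class MemoryType(str, Enum):
    INSTRUCTION  = "instruction"   # standing system guidance
    PREFERENCE   = "preference"    # user choices
    PROJECT      = "project"       # active work
    SKILL        = "skill"         # user capabilities
    FACT         = "fact"          # known information
    RELATIONSHIP = "relationship"  # connections between people and entities
    CORRECTION   = "correction"    # learnings from past mistakes
```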

Performance

  • Retrieval Latency: less than 10ms (query sketched below)
  • Storage: ~1KB per memory
  • Memory Cap: 1,000 per user
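
For a sense of where the latency number comes from: retrieval is a single indexed nearest-neighbour query inside Postgres. A hedged sketch, assuming the schema above and the pgvector Python adapter; embed() is a hypothetical stand-in for the embedding model:

```python
# A sketch of the retrieval path (stage 3), assuming the schema above and
# the pgvector Python adapter. embed() is a hypothetical stand-in for the
# embedding model; everything else uses real psycopg / pgvector APIs.
import numpy as np
from pgvector.psycopg import register_vector

def embed(text: str) -> np.ndarray:
    """Hypothetical embedding call; in practice, an embedding-model API."""
    raise NotImplementedError

def recall(conn, user_id: int, message: str, k: int = 5) -> list[str]:
    register_vector(conn)  # lets psycopg send numpy arrays as pgvector values
    rows = conn.execute(
        """
        SELECT content
        FROM memories
        WHERE user_id = %s
        ORDER BY embedding <=> %s  -- cosine distance, served by the HNSW index
        LIMIT %s
        """,
        (user_id, embed(message), k),
    ).fetchall()
    return [content for (content,) in rows]
```

The top-k memories returned here are what stage 3 injects into the prompt, scoped per user so one user's context never leaks into another's.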
#memory #architecture #pgvector

Gilad Gabay

Co-Founder & Chief Architect
