Vora IQ
Product story

Building a Foundationally Strong AI Agents Platform: How We Think About Architecture

By Khalel Dumaz

The engineering philosophy behind Vora IQ. Why most AI agent platforms collapse under their own weight, and the principles we use to avoid that fate without giving away how we do it.

  • engineering
  • architecture
  • ai-agents
  • platform

Building a Foundationally Strong AI Agents Platform: How We Think About Architecture

Most AI startups in 2026 are prompt engineering shops with a wrapper. That is fine for a demo. It is fatal for a platform. The companies that survive the next 24 months will be the ones who treated agents as software systems with state, contracts, and failure modes, not as conversations dressed up as products.

This post is about how we think at Vora IQ. It is not a code dump. We do not give away trade secrets here. But the principles below are the difference between a platform that scales and a platform that becomes a support ticket factory.

Principle 1: Determinism boundaries are non-negotiable

Every agent has to know which parts of its behavior are deterministic and which parts are probabilistic. Routing, validation, scoring, security checks, and audit logging are deterministic code. Generation, summarization, and judgment calls are model calls. If you let the model do routing, you have built a slot machine. If you let deterministic code do generation, you have built a Mad Libs generator.

The discipline is drawing the line and enforcing it. We do this with explicit handoff protocols, schema validation on every model output, and structured fallbacks when the model returns something the system cannot parse. The model is a contributor, not the controller.

Principle 2: Context is a layer, not a parameter

If you treat context as something you append to a prompt, you will always be one update behind. We treat context as a live data layer that every agent reads from at invocation and writes to at completion. This is the Business Context Layer. The architecture decision that makes this work is per-user key isolation with AES-256-GCM encryption and a write contract that prevents one agent from corrupting another agent's view of the world.

We are not the first to talk about persistent memory. We are saying the implementation matters more than the marketing. The features built on this layer — viability score, roadmap, documents — are load-bearing infrastructure, not chat messages.

Principle 3: Agents are services, not personalities

When you ship a feature called Brief or Scribe or Echo, it is tempting to think of it as a character. Internally, it cannot be. Each agent is a service with a contract: defined inputs, defined outputs, defined cost ceiling, defined latency budget, defined failure mode. The personality lives in the system prompt and the UI. The reliability lives in the contract.

This is why we can iterate Sarrif from v4.1 to v5.0 without breaking everything downstream. The contract held. The behavior inside it changed.

Principle 4: Multi-tenancy from day one or never

Retrofitting multi-tenancy into an agent platform is a rewrite. We architected for it from the first commit. Every query is scoped, every cache is keyed by tenant, every log line carries a tenant identifier. This is unglamorous work that nobody on Twitter rewards you for. It is also the difference between scaling to 10,000 founders and getting paged at 3 AM because two users saw each other's data.

Principle 5: Stack choices are bets, and we placed ours intentionally

Azure for backend and database. Next.js for web. Langbase for AI orchestration. SwiftUI for iOS. These are not the trendy picks. They are the picks that gave us the best balance of throughput, cost predictability, and team velocity given the people we have and the customers we serve.

Trendy stacks lose founders. Boring stacks ship. Jeff and I have shipped together for over twelve years — from Loot Crate to Amazon to Ring. That history is in our story.

What we will not tell you

The exact schema of the Business Context Layer. The retrieval ordering. The agent routing logic. The cost guardrails. The specific prompts. None of that is in this post and none of it will be. Two patents are filed on the BCL architecture for a reason.

What we will tell you is the philosophy, because the philosophy is what most platforms are missing. If you are building in this space and you have not drawn your determinism boundaries, defined your context layer, and committed to multi-tenancy, you do not have an architecture. You have a prototype with a brand on it.

Build the foundation. Everything else is downstream. For the product story behind the 13 agents, read why I built 13 AI agents.

Meet the team →

← Back to Founders Log