Skip to content
LavX Managed Systems
HomePlatformSolutions
Case studiesResearchTrust Center
AboutWorkshopContact
EN | HUGet in touch

We use cookies to keep this site working, remember your language, and (if you opt in) measure how it's used. You decide.

LavX Managed Systems

EU-resident AI engineering for European business. RAG, LLMOps, agents, custom software. Model-agnostic, no vendor lock-in.

Budapest · EU · EU-resident AI engineering

Product

PlatformSolutions

Proof

Case studiesResearchTrust Center

Company

AboutWorkshopContact
© 2026 LavX Managed Systems · Budapest · EU
Privacy policyImprintEN | HU
PlatformEU-resident data handling · Auditable
The platform

An auditable AI layerover your systems,with the right model for the task.

Retrieval, a model gateway, guardrails, evaluation and an audit log, in one system. Fitted to your existing systems, with data residency tailored to your needs, up to full on-prem.

Architecture ↓The full system →Workflows →
Live routingEU
requestreceived
classificationAI agent
modelauto · best-fit
answerwith sources
approvalhuman
logwritten ✓
Many workflows, one gatewayProvider-independent modelsHuman in the loopFrom BPMN to a runnable process
Architecture

How the platform works

From the question to the answer, every step is traceable. Data stays inside the EU as needed, and every answer names its source.

01

Source

Your documents, systems and APIs, together with their permissions.

02

Retrieval

The relevant passages surfaced, always with a citation.

03

Model gateway

The right model for the task, across providers, with no lock-in.

04

Guardrails

Typed tools and strict stopping conditions at every step.

05

Evaluation

Caching and an automatic check on every answer.

06

Audit log

Every step traceable and exportable.

07

Data residency

Data and inference inside the EU as needed, up to on-prem.

Orchestration

Workflows, connected

Many workflows, one shared gateway. Each gets the right model for the task, the human is there at the decision points, and your systems work together end to end.

SystemAI agentHumanModel / gateway
1 · CUSTOMER REQUEST2 · DOCUMENTMODELSOUTPUTIncoming requestSystem · CRM, APIClassificationAI agent, with toolsDocumentSystem · uploadExtractionAI agentModel gatewaythe right modelfor the task, no lock-inFrontier model AFrontier model BLocal modelthe right provider for the taskApprovalHuman in the loopActionSystem actionAudit logExport, trail
Security, riskFrontier model A
Code quality, reviewFrontier model B
Fast, high-volume workLocal model
The right provider for the task, not tied to a single model. For high-volume, frequent work we train a locally running model: we fine-tune and distill on your data with our own evaluation harness, so it is cheaper and faster, and the data never leaves your environment.
A

System integration

We connect your existing systems: ERP, CRM, ticketing, custom databases and APIs. One process instead of islands.

B

AI in the workflow

We fit the AI step into your existing process, with an explicit human approval point where it is needed.

C

BPMN, into a runnable process

We translate your existing BPMN flowcharts into a runnable AI workflow, with guardrails and a human control point.

The full system

An institutional digital asset desk: trading, clearing, custody and treasury in one system, with AI where it truly matters

The client trades, and the platform executes across many venues and chains, holds in custody, nets, then settles on both bank and on-chain rails at once. Behind it run a real-time ledger, risk, KYC/AML and treasury, and the AI tasks are routed by a model gateway always to the right model for the task. The AI does not suggest, it acts within guardrails, and at the points where money moves, human approval is there.

System topology

EXTERNAL SYSTEMSCENTRAL PLATFORMOUTPUTSG1 Trading venuesCentralized exchanges (FIX 4.4 / REST / WS)OTC and RFQ desksDEX aggregator (RPC)G2 Financial railsSWIFT / SEPAStablecoin on/off-rampG3 ChainsMultiple L1 and L2 chainsRPC nodes, indexersG4 CustodyMPC custody, cold storageHot walletsG5 Data providersKYC / identity, sanctions, PEPMarket dataChain analyticsG6 Enterprise systemsCRM, ERP / ledgerTicketing, BPMN engineData warehouseGATEWAY BUSC1Gateway & normalizationFIX / REST / WSwebhook / KafkaC2Execution & SORsmart order routingmultiple venuesC3Ledger, positionreal-time limitmargin, exposureC4Clearing & nettingnovationT+0 / T+1C5Settlement orch.fiat rail + on-chaincross-chain DvPC6Custody orchestrationMPC policy enginewithdrawal approvalC7Client & KYC/AMLonboarding, portalinstitutional APIC8Treasury & liquidityreallocationcollateral managementClient portal +institutional APISettlement outboundfiat + on-chainCompliance &regulatory reportAudit log /trail12345678AIAIordermarket datasettlementon-chainAI inferenceAI touchpoint + human approval
Figure 1: system topology. On the left the six external system groups, in the center the central process from C1 to C8, on the right the four outputs. The numbered gold diamonds mark the eight AI touchpoints on their respective service.

What we integrate

A platform like this is worth as much as it can connect into your existing financial and chain infrastructure, so the gateway layer is built to accept heterogeneous venues and protocols uniformly, normalized. On the trading side, centralized exchanges connect over FIX 4.4 sessions, REST and WebSocket channels, alongside OTC and RFQ desks, plus DEX aggregators over chain RPC. Money moves on two rails: the fiat side over SWIFT and SEPA messages, the digital side over stablecoin on- and off-ramps. The chain connection covers multiple L1 and L2 networks, over our own RPC nodes and indexers, so that finality and balance are not seen from a single external source. Custody is implemented with MPC threshold signing, cold storage and hot wallets, with withdrawal approval driven by a policy engine. The input for risk and compliance comes from external data providers: KYC and identity checks, sanctions and PEP lists, market data and chain analytics. Finally, enterprise systems (CRM, ERP and ledger, ticketing, BPMN process engine, data warehouse) connect event-based, over Kafka streams, so every transaction stays accounting- and audit-ready. The integrations are brand-independent: we work with categories and standard protocols, we do not tie the system to specific providers.

  • FIX 4.4
  • REST / WebSocket
  • OTC / RFQ
  • DEX aggregator (RPC)
  • SWIFT / SEPA
  • Stablecoin on/off-ramp
  • L1 / L2 RPC + indexer
  • MPC custody
  • Cold / hot wallet
  • KYC / identity
  • Sanctions / PEP
  • Chain analytics
  • Market data
  • CRM / ERP / ledger
  • Ticketing
  • BPMN engine
  • Data warehouse
  • Kafka stream
Model routing

One gateway, every task to its own model

Every AI task enters through a single gateway, and the router decides where it goes along four axes: capability, cost, latency and data residency. We do not tie to one model, we always pick the one that fits the task.

The router first classifies every call, then routes it to one of four layers: high-volume, sensitive work runs locally, the rare case that needs judgment goes to a frontier model, code review to a specialized model, document processing to a vision model. A verify layer checks the output: if it is weak or uncertain, the fallback automatically goes to a higher tier. Where money or risk moves, a human approval gate stands in front of the call. For frequent tasks we run a distillation loop, training a locally running model from the frontier model's samples, measured by our own evaluation harness to confirm the switch holds the quality. Every call goes into the audit log: which model, on what input, what the decision was. This makes the system cheaper and faster, the sensitive data never leaves your environment, and it is not tied to any single vendor.

INPUT & ROUTINGMODEL LAYERSVERIFICATION & TRAILIncoming AI task8 touchpoints,one entry pointClassifier& routerCAPABILITYCOSTLATENCYADAT­REZIDENCIAR1Local / fine-tunedon-prem GPU, high-volume + sensitive,cheapest, data stays inR2Frontier reasoningEU region or global, judgment,rarely called, with a human gateR3Specialized codeengine review on every PR,a real AI review routineR4Visiondocument / KYC,vision + reasoningdistillationVERIFYSelf-checkeval-harness + fallback to another tierHUMAN GATEApprovalat the money & risk pointsAUDITAudit logevery call traceableAI inferencehuman gateaudit loghuman approval

Figure 2: the model routing fabric. The incoming task goes into a classifier and router decision node, from there it branches into one of the four model layers (R1 to R4), then verify, a human approval gate and the audit log follow. The distillation loop trains a locally running model from the frontier layer.

  • R1

    Local / fine-tuned

    A model running on an on-prem GPU, fine-tuned on your data for high-volume and sensitive work. The cheapest and fastest layer, the data stays in.

  • R2

    Frontier reasoning

    For rare cases that need judgment. In the EU region or globally, always through a human gate, at the money and risk points.

  • R3

    Specialized code

    A model tuned for code review: it takes every engine change under review and catches the real bugs before release.

  • R4

    Vision

    Document and KYC processing: reading scanned documents, IDs and forms into structured data.

Where AI helps

Eight points where AI really works

AI is not a good idea everywhere. We only bring it in where it takes measurable work off your plate or catches risk, and every step that touches money movement or a regulatory decision is approved by a human. Each task runs on the model layer that fits it.

  • T1Local specialized (chain analytics)

    On-chain AML and sanctioned-address screening

    C5 Settlement orchestration / C6 Custody orchestration

    It screens every deposit and withdrawal before the transaction is finalized: against sanctions lists, known mixers and risky address clusters. On a hit, the withdrawal automatically stops and waits for human review.

    Why: High-volume, repetitive screening where a local model tuned for chain analytics runs cheaply and fast, and the sensitive address data stays in.

  • T2Frontier vision and reasoning, human in the loop

    KYC document and PEP adjudication

    C7 Client (onboarding, KYC/AML)

    It reads and cross-checks the onboarding documents, flags the contradictions and PEP or sanctions matches, then prepares a decision proposal. The final approval is always the compliance officer's.

    Why: It needs document understanding and judgment at once, so the frontier layer with visual and reasoning ability, but the decision stays with a human.

  • T3Local fine-tuned, human approves

    Settlement and reconciliation discrepancy detection

    C4 Clearing and netting / C5 Settlement orchestration

    It compares expected and actual settlements, surfaces the unmatched item and discrepancy, then proposes a concrete resolution step. The proposal is approved by the clearing team before execution.

    Why: A well-contextualized, repetitive reconciliation pattern that a fine-tuned local model handles accurately and cheaply.

  • T4Pattern model and frontier case write-up

    Trade surveillance across venues

    C2 Execution / C3 Position and risk

    From the order flow arriving from multiple venues, it flags suspicious patterns, for example wash trading or spoofing. The frontier layer takes the suspicious cases and prepares them with a reasoned write-up for supervisory review.

    Why: The high-volume screening goes to a fast pattern model, the case write-up to the rarely, more expensively called frontier layer: so cost follows the signal.

  • T5Frontier reasoning, a human approves every step

    Treasury and liquidity copilot

    C8 Treasury and liquidity

    It sees the positions and the liquidity situation, proposes reallocation between rails and wallets, and justifies the move. Execution only happens after human approval.

    Why: It needs judgment and seeing the connections, with rare calls, so frontier reasoning, but with a strict human gate because money moves.

  • T6Frontier code (a real AI review routine)

    Engine change-risk review on every PR

    Core (platform engine)

    It runs on every change request, reviews the change for risk and regression, and gives concrete comments to the developer. The merge decision is made by the code owner.

    Why: Our live AI code review routine caught real security bugs in open-source projects, which needs the frontier layer tuned for code.

  • T7Mid-tier and RAG

    Support and client communication triage

    C7 Client (portal and API)

    It understands and categorizes incoming requests, assembles a draft answer from the internal knowledge base, and escalates the urgent case. The answer is always sent by a human.

    Why: Our internal RAG assistant saves about 100 work hours a week across 24 users, and for triage the cost-effective mid-tier and source-bound RAG are enough.

  • T8Agentic frontier, a human executes

    Operations incident copilot

    Ops (observability)

    From the observability data it assembles the picture of the incident, correlates the signals and proposes a concrete intervention plan step by step. The intervention is carried out by the on-call human.

    Why: It calls for multi-step, tool-using reasoning, so an agentic frontier layer, but the actual action is triggered by a human.

The clearing core

The ledger, the risk and the settlement: this is what we build and operate

This core grew out of the daily operation of a European FX clearing house. Real-time ledger and position, limit and margin control at the moment of the order, novation netting under T+0/T+1, then settlement on bank and on-chain rails, with treasury and liquidity management in the background. Not a demo: auditable infrastructure operated at 99.9%+ availability, every item of which can be traced back to its source.

The clearing core ties together four services. Ledger, position and risk (C3) runs the limits, the margin and the exposure in real time on every order, so risk does not come to light after the fact but at the moment of the order. Clearing and netting (C4) steps between the trades with novation, and nets under T+0/T+1, so that the number of items to settle and the counterparty risk both drop. Settlement orchestration (C5) closes the same item either on the fiat rail or on-chain, with atomic DvP in the cross-chain case, so that the two legs settle together or neither does. Treasury and liquidity (C8) makes sure the collateral is where and when settlement needs it. At the money and risk points a human approves, and every step goes into the audit log.

  • real-time ledger
  • limit & margin
  • exposure in real time
  • novation
  • netting T+0/T+1
  • atomic DvP
  • fiat & on-chain settlement
  • treasury & liquidity
  • human in the loop
  • audit log
  • 99.9%+ availability
Web3 & custody

Many chains, one clearing logic

The digital asset part is still a reference design: this is what we can design and operate, not delivered work. The core (ledger, netting, settlement, risk) is grounded by operating a European FX clearing house, the chain side we draw up with precise cryptographic and clearing concepts.

Settlement runs on both rails: bank transfer (fiat) and on-chain finality, between them cross-chain atomic DvP, so the asset and the consideration either move together or neither does. Custody orchestration is built on MPC threshold signing: a policy engine gates the withdrawal (thresholds, address whitelist, time window, human approval at the money points), and a rule decides what goes where between cold and hot storage. On every deposit and withdrawal the T1 on-chain AML and sanctioned-address screening runs, with a local, specialized chain analytics model: the sensitive data stays in, and we call the frontier models only at the judgment points, with a human gate. Compliance is thus not an after-the-fact report but a programmable rule wired into the withdrawal path. Novation, T+0/T+1 netting and real-time limit and margin calculation are the same engine we already operate in the FX world, only extended toward more venues and more chains.

  • MPC threshold signing
  • cross-chain atomic DvP
  • on-chain finality
  • on-chain attestation
  • policy engine for withdrawals
  • programmable compliance
  • sanctioned-address screening
  • HD wallets (cold/hot)
  • multiple L1 and L2 chains
  • two rails: fiat + on-chain

Reference design: the web3, custody and on-chain layer is built on precise cryptographic concepts, but there is no public crypto case study behind it. We name no specific exchange, chain, custodian, data provider or model vendor, the concreteness comes from the protocols and the routing logic.

This is a reference system that we can design and operate. Clearing, netting, settlement, ledger and risk are built on real foundations: on our experience gained operating a European FX clearing house at 99.9%+ availability. The part covering digital assets, the multiple chains, MPC custody and on-chain settlement is a planned system, not a delivered project: the concepts (MPC threshold signing, HD wallets, atomic DvP, on-chain finality) are precise, but we do not present web3 work here. Two AI touchpoints rest on our own, real work: an internal RAG assistant that saves about 100 work hours a week for 24 users, and the AI code review that found real security bugs in open-source projects.

Process

From the pilot to live operation

01

Mapping

We look at your data, your systems and the concrete use case.

02

Prototype

A working demo on your data, with a measurable output, in one to two weeks.

03

Go-live

A system that passes the audit, with EU hosting and citations.

04

Operation

Monitoring, evaluation and continuous improvement in live operation.

Let's talk

Let's connect your systems.

Show us your process, and we will see where AI fits into it. On the call you speak with an engineer, not a salesperson.

Request a callbackSee the process →