For Financial Services

Reasoning agents,
purpose-built for the financial stack.

Pokee deploys RL-trained AI agents on your terms — on-prem, on-device, or inside your VPC — so investment, risk, and back-office teams can move at the speed of their data without ever leaving the perimeter.

Request a demo See the engine

0M+

Token context window — single GPU, no chunking required

Private deployment — model weights and data never leave your perimeter

RL/native

Agents trained with reinforcement learning, not prompt-engineered chains

Three capabilities, one engine

Move from chatbot demos to production agents that earn their seat at the desk.

Pokee gives front-, middle-, and back-office teams the same set of primitives — research, reason, and act — wired into the systems where finance actually happens.

Research

Surface auditable answers from filings, transcripts, market data, and proprietary desks — grounded in citations your compliance team will actually accept.

Reason

Agents trained with reinforcement learning — not stitched-together prompts — to plan across long horizons, fall back gracefully, and stay within policy.

Act

Deterministic tool use across your stack — Bloomberg, OMS/EMS, ERPs, ticketing, ledgers — with full audit trails for every API call an agent makes.

Use cases

Workflows that compound, not chatbots that conversate.

Five places Pokee agents are already replacing brittle scripts and outsourced labor inside banks, asset managers, and accounting platforms.

For analysts and PMs who can't afford to wait for the next analyst note.

Real-time due diligence on a target and its competitor set
Source SEC, EDGAR, and EDINET filings with summarized takeaways
Draft memos that cite primary sources, not hallucinated abstracts
Update positions overnight as filings, prints, and macro data land

Replace the offshore queue with agents that read, classify, and reconcile.

Classify journal entries, invoices, and receipts at scale
Reconcile against source ledgers with explainable mappings
Flag anomalies for human review with full document trace
Process multilingual artifacts — JP, KR, ZH, EN — in one pipeline

Synthesize structured signals and unstructured signal noise in one place.

Score counterparties using public filings, news, and proprietary feeds
Detect fraud patterns across transaction memos and ticket narratives
Generate suspicious-activity drafts with cited evidence
Adapt to new patterns via continuous RL fine-tuning, not redeploys

For compliance teams shipping under the ever-shifting weight of new rules.

Monitor regulator websites and rulemaking notices in near-real-time
Map regulatory changes back to internal policies and procedures
Draft initial 10-K/10-Q narrative sections from internal data
Maintain immutable audit trails — every output traceable to source

The data-sensitive support work generic LLMs simply cannot touch.

Equip agents with PII-aware copilots that handle escalations
Auto-summarize each customer relationship before every interaction
Resolve account disputes with policy-grounded responses
Route complex cases with full context — never start from zero

AGENT_TRACE · INVESTMENT_RESEARCH

Pull latest 10-Q from EDGAR; cross-check against transcript Q&A

↓

Compute segment-level deltas vs. consensus; surface contradictions

↓

Draft three-paragraph memo with primary-source citations

↓

Post to research portal · log audit trace · notify PM via Slack

Engine

Inside the Pokee Engine

The reasoning core that lets one GPU hold an entire client relationship.

The Pokee Engine is our proprietary inference architecture — a black box for now — that pushes context windows past ten million tokens on a single GPU. No chunking, no retrieval gymnastics, no lossy summaries. The whole portfolio fits in memory.

Context

0M+tokens

Footprint

0GPU

Latency

Sub-sectool call

Training

RLnative

Read the technical overview

Verified, not vibes

We beat the frontier on the benchmark finance teams actually trust.

FinanceBench is the industry-standard evaluation built on 150+ real 10-Ks, 10-Qs, and earnings transcripts. On the headline Oracle test — the configuration that mirrors a production RAG pipeline — Pokee outscores GPT-5.4 outright.

Benchmark: FinanceBench
Corpus: 150+ filings & transcripts
Comparison: Pokee vs. GPT-5.4
Evaluation date: Q1 2026, internal

Winner

Headline · Oracle accuracy

0.0%vs. GPT-5.4 at 0.0%

On the Oracle test — where the model is given the relevant document and must extract, compute, and reason — Pokee edges out the frontier model by +0.4 points. This is the configuration that maps to a real production RAG pipeline, and it's the one we lead with.

Pokee

0.0%

GPT-5.4

0.0%

Oracle config · n=150 · higher is better

Long-context accuracy, by reasoning category

PokeeGPT-5.4

0.0

Extraction

+3.0 pts

0.0

Numerical

Closing

0.0

Domain

Parity

0.0

Logical

Frontier gap

Source: FinanceBench public corpus, Pokee internal evaluationLong-context config · higher is better

Where Pokee leads outright

+0.0pts on extraction

Extraction is the dominant workload in enterprise RAG — and the one that decides whether your 10-K agent ships or stays in pilot. Pokee 93.9% vs GPT-5.4 90.9%.

Long-context overall

0.0vs 0.0

On the long-context aggregate, GPT-5.4 edges Pokee by 1.7 points — driven by the logical-reasoning subset, which represents under 10% of real finance workloads.

Cost & deployability

1GPU, on-prem

Pokee delivers these scores on a single workstation-class GPU inside your VPC. GPT-5.4 doesn't ship that way — at all.

Built for the regulated stack

Production-ready the day procurement finishes their checklist.

Pokee was designed from day zero to live behind your firewall. We don't ship a SaaS that pretends to be enterprise — we ship infrastructure your security team can actually own.

Privately deployable

Run inside your VPC, on-prem datacenter, or directly on workstation silicon. Weights and data never leave your perimeter.

Regulator-ready

Every agent action is logged, traceable, and replayable. Built-in usage monitoring, output auditing, and policy guardrails.

Fully customizable

Fine-tune on your tickets, memos, and ledgers. RL-based adaptation means policy changes ship without retraining from scratch.

Forward-deployed

Pokee engineers embed with your team for the first deployment. We ship to production — not slides — inside the first quarter.

Get started

Move your AI roadmap from pilot to production.

Connect with our team to scope an agent that fits your stack, your data, and your regulator's appetite for risk.

Request a demo

Align an agent to a real workflow within two weeks

Choose between VPC, on-prem, or on-device deployment

Ship to production, safely and securely, with our forward-deployed team

Move from chatbot demos to production agents that earn their seat at the desk.

Pokee gives front-, middle-, and back-office teams the same set of primitives — research, reason, and act — wired into the systems where finance actually happens.

Research

Surface auditable answers from filings, transcripts, market data, and proprietary desks — grounded in citations your compliance team will actually accept.

Reason

Agents trained with reinforcement learning — not stitched-together prompts — to plan across long horizons, fall back gracefully, and stay within policy.

Act

Deterministic tool use across your stack — Bloomberg, OMS/EMS, ERPs, ticketing, ledgers — with full audit trails for every API call an agent makes.

Workflows that compound, not chatbots that conversate.

Five places Pokee agents are already replacing brittle scripts and outsourced labor inside banks, asset managers, and accounting platforms.

For analysts and PMs who can't afford to wait for the next analyst note.

Real-time due diligence on a target and its competitor set
Source SEC, EDGAR, and EDINET filings with summarized takeaways
Draft memos that cite primary sources, not hallucinated abstracts
Update positions overnight as filings, prints, and macro data land

Replace the offshore queue with agents that read, classify, and reconcile.

Classify journal entries, invoices, and receipts at scale
Reconcile against source ledgers with explainable mappings
Flag anomalies for human review with full document trace
Process multilingual artifacts — JP, KR, ZH, EN — in one pipeline

Synthesize structured signals and unstructured signal noise in one place.

Score counterparties using public filings, news, and proprietary feeds
Detect fraud patterns across transaction memos and ticket narratives
Generate suspicious-activity drafts with cited evidence
Adapt to new patterns via continuous RL fine-tuning, not redeploys

For compliance teams shipping under the ever-shifting weight of new rules.

Monitor regulator websites and rulemaking notices in near-real-time
Map regulatory changes back to internal policies and procedures
Draft initial 10-K/10-Q narrative sections from internal data
Maintain immutable audit trails — every output traceable to source

The data-sensitive support work generic LLMs simply cannot touch.

Equip agents with PII-aware copilots that handle escalations
Auto-summarize each customer relationship before every interaction
Resolve account disputes with policy-grounded responses
Route complex cases with full context — never start from zero

AGENT_TRACE · INVESTMENT_RESEARCH

Pull latest 10-Q from EDGAR; cross-check against transcript Q&A

↓

Compute segment-level deltas vs. consensus; surface contradictions

↓

Draft three-paragraph memo with primary-source citations

↓

Post to research portal · log audit trace · notify PM via Slack

The reasoning core that lets one GPU hold an entire client relationship.

Context

0M+tokens

Footprint

0GPU

Latency

Sub-sectool call

Training

RLnative

Production-ready the day procurement finishes their checklist.

Pokee was designed from day zero to live behind your firewall. We don't ship a SaaS that pretends to be enterprise — we ship infrastructure your security team can actually own.

Privately deployable

Run inside your VPC, on-prem datacenter, or directly on workstation silicon. Weights and data never leave your perimeter.

Regulator-ready

Every agent action is logged, traceable, and replayable. Built-in usage monitoring, output auditing, and policy guardrails.

Fully customizable

Fine-tune on your tickets, memos, and ledgers. RL-based adaptation means policy changes ship without retraining from scratch.

Forward-deployed

Pokee engineers embed with your team for the first deployment. We ship to production — not slides — inside the first quarter.

Reasoning agents,
purpose-built for the financial stack.

Move from chatbot demos to production agents that earn their seat at the desk.

Research

Reason

Act

Workflows that compound, not chatbots that conversate.

Investment research & due diligence

Back-office automation & journal classification

Counterparty risk & fraud monitoring

Regulatory & disclosure intelligence

Operational customer service

The reasoning core that lets one GPU hold an entire client relationship.

We beat the frontier on the benchmark finance teams actually trust.

Long-context accuracy, by reasoning category

Production-ready the day procurement finishes their checklist.

Privately deployable

Regulator-ready

Fully customizable

Forward-deployed

Move your AI roadmap from pilot to production.

Reasoning agents,
purpose-built for the financial stack.

Move from chatbot demos to production agents that earn their seat at the desk.

Research

Reason

Act

Workflows that compound, not chatbots that conversate.

Investment research & due diligence

Back-office automation & journal classification

Counterparty risk & fraud monitoring

Regulatory & disclosure intelligence

Operational customer service

The reasoning core that lets one GPU hold an entire client relationship.

We beat the frontier on the benchmark finance teams actually trust.

Long-context accuracy, by reasoning category

Production-ready the day procurement finishes their checklist.

Privately deployable

Regulator-ready

Fully customizable

Forward-deployed

Move your AI roadmap from pilot to production.

Reasoning agents, purpose-built for the financial stack.

Move from chatbot demos to production agents that earn their seat at the desk.

Research

Reason

Act

Workflows that compound, not chatbots that conversate.

Investment research & due diligence

Back-office automation & journal classification

Counterparty risk & fraud monitoring

Regulatory & disclosure intelligence

Operational customer service

The reasoning core that lets one GPU hold an entire client relationship.

We beat the frontier on the benchmark finance teams actually trust.

Long-context accuracy, by reasoning category

Production-ready the day procurement finishes their checklist.

Privately deployable

Regulator-ready

Fully customizable

Forward-deployed

Move your AI roadmap from pilot to production.

Reasoning agents, purpose-built for the financial stack.

Move from chatbot demos to production agents that earn their seat at the desk.

Research

Reason

Act

Workflows that compound, not chatbots that conversate.

Investment research & due diligence

Back-office automation & journal classification

Counterparty risk & fraud monitoring

Regulatory & disclosure intelligence

Operational customer service

The reasoning core that lets one GPU hold an entire client relationship.

We beat the frontier on the benchmark finance teams actually trust.

Long-context accuracy, by reasoning category

Production-ready the day procurement finishes their checklist.

Privately deployable

Regulator-ready

Fully customizable

Forward-deployed

Move your AI roadmap from pilot to production.

Reasoning agents,
purpose-built for the financial stack.

Reasoning agents,
purpose-built for the financial stack.