05Platform / RL Red-Team Engine

A learning attacker that hardens a deterministic defender.

The only machine learning in Phorvex is an adversarial reinforcement-learning red-team that stress-tests the decision core around the clock. It is strictly walled off from decisions and human-gated.

/01

Continuous co-training

The red-team trains against the live defender, probing for structural weaknesses a human team would take months to find.

/02

Walled off from decisions

Learning never touches the decision path. The defender stays deterministic, reproducible, and explainable. The red-team only attacks it.

/03

Human-gated

Findings become fixes only through human review. Each weakness is closed as an auditable change and re-verified to neutralization.

02Track record

It has already earned its keep.

The red-team has found two real structural weaknesses in the decision core, and we fixed both. Each was closed as an auditable, regression-gated change and re-verified to neutralization.

That is the loop working as designed. A learning attacker makes the deterministic defender stronger, without ever becoming part of it.

Why no ML in the defender?

A learned defender can be probed, drifted, and fooled, and it cannot explain itself in a postmortem. We put the learning where it belongs: on the attacking side, where its only job is to break our assumptions before a real adversary does.

Deterministic core. The defender remains a single deterministic control loop. No LLM, no model, no surprises in the decision path.

Ask us what the red-team found.

The two structural findings, and how each was closed, are covered in the technical briefing.