Moving-target defense, extended onto the GPU.
AI inference created a whole new class of persistable attacker state: models, KV-caches, LoRA adapters, GPU partitions, inference routes. Phorvex is the first MTD brain that can see and evict all of them.
Attacker state that survives a pod restart.
Model theft through side-channels. Prompt-injection persistence that outlives a restart. Poisoned fine-tuned adapters. GPU side-channels and co-residency. Pod-level tools address none of it.
Cache and session eviction
Flush a poisoned prefix or KV cache in place, without a restart. The foothold dies. The service does not.
Adapter rotation
Evict a poisoned LoRA adapter while preserving the clean base model, reloading only from a trusted, pinned source.
Request draining and guardrail rotation
Evict in-flight footholds and rotate the safety tier underneath an attacker mid-session.
MIG re-binding and warm model swap
Re-bind GPU partitions and warm-swap models. Change the GPU ground under the attacker.
A completeness invariant
A proof obligation that every class of persistable serving-stack state has an evictor. No orphaned attacker state, by construction.
An AI-aware kill-chain
Models the attacker stages unique to AI serving and times moves to preempt the next one.
All mechanisms built behind mockable interfaces and CI-tested. GPU efficacy pending.
Implemented. Tested. Not yet claimed.
Every mechanism above is implemented and tested with mock serving stacks. The magnitudes, meaning how much each move actually disrupts a real attacker on real GPUs, are validated on hardware with DCGM telemetry as part of our roadmap. They are not claimed until measured.
Why partner on validation?
Design partners get the first measured GPU results on their own serving stack, direct influence on the validation matrix, and the same honesty guarantee. If a number is not measured on your hardware class, we will not claim it.
The same brain. A new battleground.
One decision core defends both your clusters and your inference fleet. We will prove the second the same way we proved the first.