LlamaCppEx.Server.Strategy.Balanced (LlamaCppEx v0.8.13)

Balanced batching strategy.

Splits the token budget equally between decode and prefill operations. Decode tokens always use 1 token per slot, so the decode half is capped at the number of generating slots. The prefill half gets the remainder.

Fair under mixed workloads where both generation latency and prefill throughput matter equally.