Process-Level Evaluation (PRMs)

Evaluate agent behavior step-by-step, not just final outcomes.