PolicyGuardBench Introduces Guardrails for Web Agent Policy Compliance
PolicyGuardBench releases a dataset of about 60,000 labeled web-agent trajectories and a lightweight guardrail model, PolicyGuard-4B, that detects policy violations with fast inference. getnews.me/policyguardbench-introdu... #policyguard #webagents
0
0
0
0