Tag: model evaluation

Petri: Revolutionizing AI Safety Audits with Automated Agent-Based Testing

Anthropic introduces Petri, an open-source framework leveraging AI agents for automated auditing of AI models. This tool streamlines the testing of complex behaviors, accelerating AI safety research and enabling broader community participation in model evaluation.

1
0
Read More