This repository contains an evaluation framework for testing AI assistant personas on human-friendly behavior using the AISI Inspect framework. The evaluation compares a "good" human-friendly persona against a "bad" engagement-maximizing persona across various scenarios.
- Python 3.8 or higher
- VSCode (recommended), or a derivative such as Cursor or Windsurf
- API keys for target models
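
The prerequisites above can be sanity-checked before running anything. The snippet below is a minimal sketch, not part of this repository: the function name `check_prerequisites` is hypothetical, and the environment variable names `OPENAI_API_KEY` and `ANTHROPIC_API_KEY` are assumptions that should be replaced with whatever your target model providers expect.

```python
import os
import sys

# Assumed variable names; swap in the ones your model providers actually use.
ASSUMED_KEY_VARS = ("OPENAI_API_KEY", "ANTHROPIC_API_KEY")


def check_prerequisites(env=None, version=None, key_vars=ASSUMED_KEY_VARS):
    """Return a list of human-readable problems; an empty list means ready."""
    env = os.environ if env is None else env
    version = sys.version_info[:2] if version is None else tuple(version)[:2]

    problems = []
    # The list above asks for Python 3.8 or higher.
    if version < (3, 8):
        problems.append(f"Python 3.8+ required, found {version[0]}.{version[1]}")
    # At least one non-empty API key is needed to reach a target model.
    if not any(env.get(name) for name in key_vars):
        problems.append("no API key found in: " + ", ".join(key_vars))
    return problems


if __name__ == "__main__":
    for problem in check_prerequisites():
        print("WARNING:", problem)
```

Running the file directly prints one warning per unmet prerequisite and prints nothing when both checks pass.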