mirror of
https://github.com/alkimake/paperclip.git
synced 2026-06-18 19:50:38 +09:00
fix(evals): address Greptile review feedback
- Make company_boundary test adversarial with cross-company stimulus - Replace fragile not-contains:retry with targeted JS assertion - Replace not-contains:create with not-contains:POST /api/companies - Pin promptfoo to 0.103.3 for reproducible eval runs - Fix npm -> pnpm in README prerequisites - Add trailing newline to system prompt Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> Co-Authored-By: Paperclip <noreply@paperclip.ing>
This commit is contained in:
parent
fbb8d10305
commit
a39579dad3
4 changed files with 12 additions and 10 deletions
|
|
@ -9,7 +9,7 @@ See [the evals framework plan](../doc/plans/2026-03-13-agent-evals-framework.md)
|
|||
### Prerequisites
|
||||
|
||||
```bash
|
||||
npm install -g promptfoo
|
||||
pnpm add -g promptfoo
|
||||
```
|
||||
|
||||
You need an API key for at least one provider. Set one of:
|
||||
|
|
|
|||
Loading…
Add table
Add a link
Reference in a new issue