Deprecate evaluator cutoff config fields and add CLI cutoff precedence by shuyangli · Pull Request #6580 · tensorzero/tensorzero

shuyangli · 2026-02-25T19:01:50Z

General idea is that cutoffs should not be configured at the evaluator level. They are contextual, so if people use them today for regression testing setups, they should pass it on the CLI.

#6603

Note

Medium Risk
Changes evaluation pass/fail behavior by introducing CLI-driven cutoff thresholds with precedence over config values, which can affect CI/regression outcomes. Deprecation warnings and cutoff resolution errors (e.g., unknown evaluator names) may also surface in existing workflows.

Overview
Evaluation cutoffs are migrated from evaluator config fields to a new CLI flag, adding --cutoffs evaluator=value,... (validated as non-negative) and using it to determine pass/fail exit status.

The evaluation runner now resolves effective cutoffs by merging legacy config cutoffs with CLI cutoffs (CLI wins, with warnings), errors on unknown evaluator names, and logs cutoff failures via tracing before failing the run. Evaluator cutoff fields are explicitly deprecated across configs/tests and the tutorial example removes in-config cutoffs.

^{Written by Cursor Bugbot for commit 4ba600c. This will update automatically on new commits. Configure here.}

docs/evaluations/inference-evaluations/cli-reference.mdx

docs/evaluations/inference-evaluations/configuration-reference.mdx

evaluations/src/evaluators/llm_judge/mod.rs

evaluations/src/evaluators/regex_eval.rs

evaluations/src/lib.rs

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 50d9b9d938

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

evaluations/src/lib.rs

shuyangli · 2026-02-27T15:03:21Z

@BugBot review

evaluations/tests/tests.rs

evaluations/src/cli.rs

evaluations/src/lib.rs

…add config-only deprecation warning, fix backticks in error message

shuyangli · 2026-03-01T22:16:56Z

@BugBot review

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

^{Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.}

cursor · 2026-03-01T22:21:41Z

evaluations/src/lib.rs

+) -> Result<HashMap<String, f32>> {
+    for evaluator_name in cli_cutoffs.keys() {
+        if !evaluator_configs.contains_key(evaluator_name) {
+            return Err(anyhow!("Unknown evaluator in --cutoff: `{evaluator_name}`"));


Error message references wrong CLI flag name --cutoff

Medium Severity

The error message says --cutoff (singular) but the actual CLI flag is --cutoffs (plural). A user seeing this error would try --cutoff which doesn't exist. The doc comment on line 675 has the same mismatch. Notably, the warning messages on lines 704 and 712 correctly reference --cutoffs.

Additional Locations (1)

evaluations/src/lib.rs#L674-L675

github-actions bot added the stacked-pr-blocked-on-base-pr label Feb 25, 2026

shuyangli force-pushed the sl/regex-evaluator branch from 43777d9 to f287b57 Compare February 25, 2026 20:32

github-actions bot added the has-merge-conflicts label Feb 25, 2026

shuyangli force-pushed the sl/deprecate-cutoff-config branch from b40e5e6 to 1aff483 Compare February 25, 2026 20:48

shuyangli force-pushed the sl/regex-evaluator branch from f287b57 to 298c7da Compare February 25, 2026 21:11

shuyangli force-pushed the sl/deprecate-cutoff-config branch from 1aff483 to 6db61cb Compare February 25, 2026 21:11

github-actions bot removed the has-merge-conflicts label Feb 25, 2026

shuyangli force-pushed the sl/deprecate-cutoff-config branch from 6db61cb to f0dfab1 Compare February 25, 2026 22:25

github-actions bot added the has-merge-conflicts label Feb 25, 2026

shuyangli force-pushed the sl/regex-evaluator branch from 298c7da to 507f480 Compare February 25, 2026 22:28

shuyangli force-pushed the sl/deprecate-cutoff-config branch from f0dfab1 to df16908 Compare February 25, 2026 22:29

github-actions bot removed the has-merge-conflicts label Feb 25, 2026

shuyangli force-pushed the sl/deprecate-cutoff-config branch from df16908 to 18a6444 Compare February 26, 2026 15:04

shuyangli force-pushed the sl/regex-evaluator branch from 507f480 to 78f3349 Compare February 26, 2026 15:04

shuyangli force-pushed the sl/deprecate-cutoff-config branch from 18a6444 to 5925ae2 Compare February 26, 2026 22:30

shuyangli force-pushed the sl/regex-evaluator branch from 423de65 to db0471b Compare February 26, 2026 22:30

Base automatically changed from sl/regex-evaluator to main February 26, 2026 23:01

github-actions bot added has-merge-conflicts and removed stacked-pr-blocked-on-base-pr labels Feb 26, 2026

shuyangli force-pushed the sl/deprecate-cutoff-config branch from 5925ae2 to 50d9b9d Compare February 27, 2026 01:23

github-actions bot removed the has-merge-conflicts label Feb 27, 2026

shuyangli marked this pull request as ready for review February 27, 2026 01:38

shuyangli requested a review from GabrielBianconi as a code owner February 27, 2026 01:38

shuyangli commented Feb 27, 2026

View reviewed changes

chatgpt-codex-connector bot reviewed Feb 27, 2026

View reviewed changes

evaluations/src/lib.rs Outdated Show resolved Hide resolved

shuyangli force-pushed the sl/deprecate-cutoff-config branch 3 times, most recently from 5f28647 to 6b126f8 Compare February 27, 2026 15:02

shuyangli assigned GabrielBianconi Feb 27, 2026

shuyangli assigned virajmehta Feb 27, 2026

shuyangli force-pushed the sl/deprecate-cutoff-config branch from 6b126f8 to a622918 Compare February 27, 2026 15:04

cursor bot reviewed Feb 27, 2026

View reviewed changes

evaluations/tests/tests.rs Outdated Show resolved Hide resolved

evaluations/src/cli.rs Outdated Show resolved Hide resolved

Deprecate evaluator cutoff config fields and add CLI cutoff precedence

daafa66

shuyangli force-pushed the sl/deprecate-cutoff-config branch from a622918 to daafa66 Compare February 27, 2026 16:10

GabrielBianconi requested changes Mar 1, 2026

View reviewed changes

evaluations/src/lib.rs Outdated Show resolved Hide resolved

evaluations/src/lib.rs Outdated Show resolved Hide resolved

GabrielBianconi assigned shuyangli and unassigned GabrielBianconi and virajmehta Mar 1, 2026

Address PR review comments: fix cutoff checks for non-pretty output, …

4ba600c

…add config-only deprecation warning, fix backticks in error message

shuyangli assigned GabrielBianconi and unassigned shuyangli Mar 1, 2026

shuyangli requested a review from GabrielBianconi March 1, 2026 22:17

cursor bot reviewed Mar 1, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Deprecate evaluator cutoff config fields and add CLI cutoff precedence#6580

Deprecate evaluator cutoff config fields and add CLI cutoff precedence#6580
shuyangli wants to merge 2 commits intomainfrom
sl/deprecate-cutoff-config

shuyangli commented Feb 25, 2026 •

edited by cursor bot

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

Uh oh!

shuyangli commented Feb 27, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

shuyangli commented Mar 1, 2026

Uh oh!

cursor bot left a comment

Uh oh!

cursor bot Mar 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

shuyangli commented Feb 25, 2026 • edited by cursor bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

shuyangli commented Feb 27, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

shuyangli commented Mar 1, 2026

Uh oh!

cursor bot left a comment

Choose a reason for hiding this comment

Uh oh!

cursor bot Mar 1, 2026

Choose a reason for hiding this comment

Error message references wrong CLI flag name --cutoff

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

shuyangli commented Feb 25, 2026 •

edited by cursor bot

Loading

Error message references wrong CLI flag name `--cutoff`