Add evaluation test scenarios and remove orphaned azure-ai-evaluation-py tests by imatiach-msft · Pull Request #106 · microsoft/skills

imatiach-msft · 2026-02-06T22:50:07Z

Summary

Adds comprehensive evaluation test scenarios to azure-ai-projects-py and removes the orphaned azure-ai-evaluation-py test folder.

Changes

Added to `tests/scenarios/azure-ai-projects-py/scenarios.yaml`

6 new evaluation scenarios (+514 lines):

| Scenario | Description |
| --- | --- |
| `evaluation_with_inline_data` | Complete eval workflow: inline JSONL data, create eval, run, poll, retrieve results |
| `agent_evaluation_with_sample` | Agent evaluation with `{{sample.output_text}}` and `{{sample.output_items}}` mapping |
| `custom_code_evaluator` | `CodeBasedEvaluatorDefinition` with `project_client.evaluators.create_version()` |
| `safety_evaluators` | `builtin.violence`, `builtin.sexual`, `builtin.hate_unfairness` |
| `openai_graders` | `label_model`, `string_check`, `text_similarity` grader types |
| `list_builtin_evaluators` | Evaluator discovery via `project_client.evaluators.list_latest_versions()` |
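The `evaluation_with_inline_data` scenario describes a create, run, poll, retrieve sequence. The polling step might look roughly like the sketch below. It is not the SDK's API: `poll_until_terminal`, the `fetch_status` callable, and the status strings are all assumptions made for illustration. In real code, `fetch_status` would wrap whatever call retrieves the run.

```python
import time
from typing import Callable

# Assumed terminal states; the real service may use different names.
TERMINAL_STATUSES = frozenset({"completed", "failed", "canceled"})


def poll_until_terminal(
    fetch_status: Callable[[], str],
    interval_s: float = 2.0,
    timeout_s: float = 300.0,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Call fetch_status() until it returns a terminal status or the timeout expires."""
    deadline = time.monotonic() + timeout_s
    while True:
        status = fetch_status()
        if status in TERMINAL_STATUSES:
            return status
        if time.monotonic() >= deadline:
            raise TimeoutError(f"run still {status!r} after {timeout_s}s")
        sleep(interval_s)


# Usage with a fake status source standing in for a real run lookup.
_fake_statuses = iter(["queued", "in_progress", "completed"])
final_status = poll_until_terminal(lambda: next(_fake_statuses), sleep=lambda _: None)
```

Passing `sleep` as a parameter keeps the loop testable without real delays.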

Deleted

`tests/scenarios/azure-ai-evaluation-py/` - orphaned after the skill was merged into azure-ai-projects-py in PR #93 ("fix: Skills grid cropping and documentation cards consistency")

Test Results

All 18 scenarios pass with a 100% score:

Scenarios: 18, Passed: 18, Failed: 0, Pass Rate: 100.0%, Average Score: 100.0
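For reference, the summary figures follow from simple arithmetic over per-scenario results. The sketch below is a hypothetical reconstruction, not the repository's test harness: the `ScenarioResult` type, the `summarize` function, and the 0-100 score scale are assumptions.

```python
from dataclasses import dataclass


@dataclass
class ScenarioResult:
    name: str     # scenario identifier
    score: float  # assumed 0-100 scale
    passed: bool


def summarize(results):
    """Aggregate per-scenario results into the summary fields shown above."""
    total = len(results)
    passed = sum(1 for r in results if r.passed)
    return {
        "scenarios": total,
        "passed": passed,
        "failed": total - passed,
        # Guard against division by zero when no scenarios ran.
        "pass_rate": round(100.0 * passed / total, 1) if total else 0.0,
        "average_score": round(sum(r.score for r in results) / total, 1) if total else 0.0,
    }


# 18 placeholder results, all passing with a full score.
results = [ScenarioResult(f"scenario_{i}", 100.0, True) for i in range(18)]
summary = summarize(results)
```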

Notes

The new scenarios ensure the deprecated azure-ai-evaluation SDK patterns are flagged as forbidden:

- `from azure.ai.evaluation import` → forbidden
- `ViolenceEvaluator`, `@evaluator` decorator → forbidden
- Class-based graders like `AzureOpenAILabelGrader` → forbidden
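A forbidden-pattern check of this kind could be sketched as follows. This is an illustrative stand-in, not how the scenarios in `scenarios.yaml` are actually evaluated: the pattern names come from the notes above, while the regexes, the `find_forbidden` helper, and both sample snippets are assumptions.

```python
import re

# Pattern names are taken from the PR notes; the regexes are assumptions.
FORBIDDEN_PATTERNS = {
    "deprecated import": re.compile(
        r"^\s*from\s+azure\.ai\.evaluation\s+import\b", re.MULTILINE
    ),
    "ViolenceEvaluator class": re.compile(r"\bViolenceEvaluator\b"),
    "@evaluator decorator": re.compile(r"^\s*@evaluator\b", re.MULTILINE),
    "class-based grader": re.compile(r"\bAzureOpenAILabelGrader\b"),
}


def find_forbidden(code: str) -> list[str]:
    """Return the labels of all forbidden patterns found in the given source text."""
    return [label for label, pattern in FORBIDDEN_PATTERNS.items() if pattern.search(code)]


# Hypothetical generated snippets: one using the deprecated SDK, one not.
legacy = "from azure.ai.evaluation import ViolenceEvaluator\nev = ViolenceEvaluator()\n"
modern = 'evaluator_name = "builtin.violence"\n'

legacy_hits = find_forbidden(legacy)
modern_hits = find_forbidden(modern)
```

Here `legacy_hits` lists the deprecated import and the `ViolenceEvaluator` class, while `modern_hits` is empty.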

Commit e814719: Add evaluation test scenarios to azure-ai-projects-py and remove orphaned azure-ai-evaluation-py tests

imatiach-msft force-pushed the cleanup-azure-ai-evaluation-tests branch from a522b59 to e814719 on February 6, 2026 22:51

thegovind merged commit 843898b (1 commit from imatiach-msft:cleanup-azure-ai-evaluation-tests) into microsoft:main on Feb 7, 2026
2 checks passed
