Customer Service Agent Evaluation

This directory contains the customer service agent and its evaluation tests.

Running Evaluations

There are three ways to run evaluations:

1. Recommended: Programmatic Evaluation (Pytest)

This method is more robust because it avoids the naming conflicts that can occur with the ADK CLI.

From the project root directory (adk_eval/):

PYTHONPATH=. uv run pytest customer_service_agent/test_agent_eval.py

From this directory (customer_service_agent/):

PYTHONPATH=.. uv run pytest test_agent_eval.py
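
The two commands above differ only in how the project root and the test file are written relative to the current directory. As a rough illustration, the sketch below derives both values from the working directory. The helper name pytest_invocation and the /work/adk_eval location are hypothetical and not part of this repository.

```python
import os
from pathlib import Path
from typing import Dict, List, Tuple


def pytest_invocation(
    cwd: Path, project_root: Path, test_file: Path
) -> Tuple[List[str], Dict[str, str]]:
    """Return (argv, env) for running test_file with pytest from cwd.

    Hypothetical helper: it only mirrors the two commands shown above.
    """
    # The commands above point PYTHONPATH at the project root, presumably so
    # that the customer_service_agent package is importable by the tests.
    rel_root = os.path.relpath(project_root, cwd)
    rel_test = os.path.relpath(test_file, cwd)
    env = dict(os.environ, PYTHONPATH=rel_root)
    return ["uv", "run", "pytest", rel_test], env


root = Path("/work/adk_eval")  # assumed location of the project root
test_file = root / "customer_service_agent" / "test_agent_eval.py"
argv, env = pytest_invocation(root, root, test_file)
print(env["PYTHONPATH"], " ".join(argv))
```

Passing the result to subprocess.run(argv, env=env, cwd=cwd, check=True) would execute the tests from the chosen directory.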

2. ADK CLI Evaluation

From the project root directory (adk_eval/):

uv run adk eval customer_service_agent customer_service_agent/eval.test.json --config_file_path customer_service_agent/test_config.json

Note: Running from this directory may encounter naming conflicts if the file is named agent.py. The Pytest method above is recommended instead.

From this directory (customer_service_agent/):

uv run adk eval . eval.test.json --config_file_path test_config.json
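
Since the naming conflict noted above is tied to running from inside the agent directory, one option is to always build the command relative to the project root. The sketch below assembles the same arguments as the project-root command in this section. The helper name adk_eval_from_root is hypothetical, and only the --config_file_path flag already shown here is used.

```python
from pathlib import PurePosixPath
from typing import List


def adk_eval_from_root(agent_name: str, eval_file: str, config_file: str) -> List[str]:
    """Build the `adk eval` argv as it would be run from the project root.

    Hypothetical helper: eval_file and config_file are file names that are
    assumed to live inside the agent directory.
    """
    agent_dir = PurePosixPath(agent_name)
    return [
        "uv", "run", "adk", "eval",
        agent_name,
        str(agent_dir / eval_file),
        "--config_file_path", str(agent_dir / config_file),
    ]


argv = adk_eval_from_root("customer_service_agent", "eval.test.json", "test_config.json")
print(" ".join(argv))
```

The list can be handed to subprocess.run(argv, cwd=project_root, check=True), where project_root is the path to adk_eval/.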

3. Golden Dataset Evaluation

You can also run evaluations against the golden dataset generated by the ADK web UI.

CLI Method:

uv run adk eval customer_service_agent customer_service_agent/evalset780045.evalset.json --config_file_path customer_service_agent/test_config.json

Pytest Method:

PYTHONPATH=. uv run pytest customer_service_agent/test_golden_eval.py
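
The eval set file name used above (evalset780045.evalset.json) looks auto-generated, so it may differ between exports. Assuming exported files keep the .evalset.json suffix, which is inferred only from the file name shown here, a small sketch like the following could list the candidates before one is passed to the CLI. The helper name find_evalsets is hypothetical.

```python
from pathlib import Path
from typing import List


def find_evalsets(agent_dir: Path) -> List[Path]:
    """List files ending in .evalset.json in agent_dir, sorted by name.

    Hypothetical helper; the suffix is an assumption based on the
    file name used in this README.
    """
    return sorted(p for p in agent_dir.glob("*.evalset.json") if p.is_file())


# Run from the project root (adk_eval/); prints nothing if no file matches.
for path in find_evalsets(Path("customer_service_agent")):
    print(path.name)
```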
