Skip to content

Improvement/238 tests for evaluation metrics#242

Merged
dkkdark merged 49 commits into
mainfrom
improvement/238-tests-for-evaluation-metrics
Apr 21, 2026
Merged

Improvement/238 tests for evaluation metrics#242
dkkdark merged 49 commits into
mainfrom
improvement/238-tests-for-evaluation-metrics

Conversation

@dkkdark

@dkkdark dkkdark commented Mar 24, 2026

Copy link
Copy Markdown
Collaborator

No description provided.

@dkkdark dkkdark self-assigned this Mar 24, 2026
@dkkdark dkkdark requested a review from NoB0 March 24, 2026 11:01
@github-actions

Copy link
Copy Markdown
Current Branch Main Branch
Coverage Badge Coverage Badge

Comment thread tests/evaluation/test_quality_metric.py Outdated
Comment thread tests/evaluation/test_quality_metric.py Outdated
Comment thread tests/evaluation/test_quality_metric.py Outdated
Comment thread tests/evaluation/test_quality_metric.py Outdated
Comment thread tests/evaluation/test_satisfaction_metric.py Outdated
Comment thread tests/evaluation/test_utility_metrics.py Outdated
Comment thread tests/evaluation/test_utility_metrics.py Outdated
Comment thread tests/evaluation/test_utility_metrics.py Outdated
Comment thread tests/evaluation/test_utility_metrics.py Outdated
Comment thread tests/evaluation/test_satisfaction_metric.py Outdated
@dkkdark dkkdark requested a review from NoB0 March 24, 2026 16:39

@NoB0 NoB0 left a comment

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

General comment on format. In the codebase we typically use types from the typing package. For docstrings the verb is conjugated. I commented on a few of places to change (please check for others that I may have skipped).

Left some comments with clarification questions.

Comment thread tests/conftest.py Outdated
Comment thread tests/evaluation/test_quality_metric.py Outdated
Comment thread tests/evaluation/test_quality_metric.py Outdated
Comment thread tests/conftest.py Outdated
Comment thread tests/evaluation/test_quality_metric.py Outdated
Comment thread tests/evaluation/test_utility_metrics.py Outdated
Comment thread tests/evaluation/test_utility_metrics.py Outdated
Comment thread tests/evaluation/test_utility_metrics.py Outdated
Comment thread tests/evaluation/test_utility_metrics.py Outdated
Comment thread tests/evaluation/test_utility_metrics.py Outdated
@dkkdark dkkdark requested a review from NoB0 March 31, 2026 08:55
Comment thread tests/evaluation/test_quality_metric.py Outdated
Comment thread tests/evaluation/test_success_rate_metric.py Outdated
Comment thread tests/evaluation/test_successful_recommendation_round_ratio_metric.py Outdated
@dkkdark dkkdark requested a review from NoB0 April 14, 2026 13:19
Comment thread tests/evaluation/test_success_rate_metric.py
Comment thread usersimcrs/evaluation/dialogue_annotation.py Outdated
@dkkdark dkkdark requested a review from NoB0 April 21, 2026 09:24

@NoB0 NoB0 left a comment

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM (just one nit to address)

Comment thread tests/evaluation/test_success_rate_metric.py Outdated
@dkkdark dkkdark merged commit 1e7d339 into main Apr 21, 2026
4 of 5 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants