Improvement/234 create main evaluation script by dkkdark · Pull Request #243 · iai-group/UserSimCRS

dkkdark · 2026-03-24T15:00:31Z

No description provided.

…tps://github.com/iai-group/UserSimCRS into improvement/233-create-classes-for-metrics

…mprovement/233-create-classes-for-metrics

…://github.com/iai-group/UserSimCRS into improvement/234-create-main-evaluation-script

…github.com/iai-group/UserSimCRS into improvement/234-create-main-evaluation-script

github-actions · 2026-03-24T15:02:45Z

Current Branch	Main Branch

dkkdark · 2026-04-14T15:08:18Z

+    annotate_dialogues(dialogues, user_nlu, agent_nlu)
+
+
+def get_summary_by_agent(


We can’t skip calling it when there is only one agent because we won’t get a summary for it

NoB0

Some points need to be clarified

NoB0

For now LGTM (please check comments on confuse key access). Although, a follow-up should be created to investigate if build_metric_registry and evaluate_metric can be optimized with regards to maintenance when new metrics are supported.

NoB0 · 2026-05-04T16:20:25Z

+        )
+
+    if config["annotate_dialogues"].get():
+        if not config["user_nlu"].get(None):


The get in confuse behaves bit differently than the get for a dictionary, i.e., it expects a template. I would suggest to look at the documentation, but I think that simply checking if the key is in the configuration and is not empty should work.

NoB0 · 2026-05-04T16:20:39Z

+            raise ValueError(
+                "`user_nlu` is required when `annotate_dialogues` is True."
+            )
+        if not config["agent_nlu"].get(None):


Same as above.

Ksenia Blokhina added 30 commits February 17, 2026 15:13

create base metric class

ceee927

return pre commit config versions

46b7456

#233 add new classes

8c96457

#232 fix class

950964c

#232 fix class

33624b2

Merge branch 'improvement/232-create-abstract-class-for-metric' of ht…

1053b57

…tps://github.com/iai-group/UserSimCRS into improvement/233-create-classes-for-metrics

#232 add aggregation

27f7888

#232 fix methods

c38331a

#232 fix nits

901dd5a

improvement/233-create-classes-for-metrics add classes

c4efd79

improvement/232-create-abstract-class-for-metric fixes

800f8c0

improvement/232-create-abstract-class-for-metric fixes

2014e22

improvement/232-create-abstract-class-for-metric remove agent evaluation

363551c

Merge branch 'improvement/232-create-abstract-class-for-metric' of ht…

9ab36e2

…tps://github.com/iai-group/UserSimCRS into improvement/233-create-classes-for-metrics

improvement/233-create-classes-for-metrics edit classes

a253f74

improvement/233-create-classes-for-metrics remove old files

777e00e

improvement/233-create-classes-for-metrics remove extra files

5a55b43

improvement/233-create-classes-for-metrics remove changes

c5bb100

improvement/233-create-classes-for-metrics remove changes

a1fac9f

move file

4ed5727

move file

e6f0ef3

refactoring

431ae40

add script

6f74153

fixes

773822d

fixes

5e1b3a6

made fixes

558b588

new structure

d78bda3

remove class from other pr

9146421

changes after new structure

fe8ea9c

remove main

d749156

Ksenia Blokhina added 13 commits March 10, 2026 17:10

changes

8f333e7

changes

6e6e8b2

changes

1c802ff

simplyfying

8335005

remove utility class

f006930

fix get_recommendation_rounds

e0f3c56

fixes

4157fdc

resolve issues

3b068b4

Merge branch 'main' of https://github.com/iai-group/UserSimCRS into i…

d7efc12

…mprovement/233-create-classes-for-metrics

fixes

8944621

Merge branch 'improvement/234-create-main-evaluation-script' of https…

335d3bd

…://github.com/iai-group/UserSimCRS into improvement/234-create-main-evaluation-script

Merge branch 'improvement/233-create-classes-for-metrics' of https://…

98564af

…github.com/iai-group/UserSimCRS into improvement/234-create-main-evaluation-script

234-create-main-evaluation-script add eval script

8be81e4

dkkdark requested a review from NoB0 March 24, 2026 15:17

Merge branch 'main' into improvement/234-create-main-evaluation-script

a637a40

NoB0 reviewed Apr 13, 2026

View reviewed changes

fixes

b3cd18f

dkkdark commented Apr 14, 2026

View reviewed changes

Comment thread usersimcrs/run_evaluation.py Outdated

dkkdark requested a review from NoB0 April 14, 2026 15:17

NoB0 reviewed Apr 21, 2026

View reviewed changes

NoB0 mentioned this pull request Apr 21, 2026

246 update evaluation docs #247

Open

fix evaluation

938ccac

dkkdark requested a review from NoB0 April 21, 2026 12:35

NoB0 approved these changes May 4, 2026

View reviewed changes

NoB0 requested a review from kbalog May 4, 2026 16:27

		annotate_dialogues(dialogues, user_nlu, agent_nlu)


		def get_summary_by_agent(

Conversation

dkkdark commented Mar 24, 2026

Uh oh!

github-actions Bot commented Mar 24, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

dkkdark Apr 14, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

NoB0 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

NoB0 left a comment

Choose a reason for hiding this comment

Uh oh!

NoB0 May 4, 2026

Choose a reason for hiding this comment

Uh oh!

NoB0 May 4, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants