[BUG]: final_env_score is always 0 #74

@L0wang

Description of the bug

    info = {} # Initialize info as empty dict, as StepOutput doesn't explicitly return it
    print(f"[Agent._run_single_rollout][{task_idx}][Turn {t+1}] Env Step Result: Reward={reward}, Done={done}, Info={info}")

    # Store the reward from this specific step
    step_rewards.append(reward)
    final_reward = reward # Keep track of the reward from the last executed step
    final_env_score = info.get('score', 0.0) # Use .get for safety

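Based on these lines, won't final_env_score always be 0 when it reads the score from info? info is unconditionally set to {} just before the lookup, so info.get('score', 0.0) can only ever return the default.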
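The concern can be illustrated with a minimal, self-contained sketch. The `StepOutput` dataclass below is a hypothetical stand-in (the project's real class may have different fields, and per the quoted comment it may not expose `info` at all), and both helper functions are invented for illustration. The sketch only shows that looking up `'score'` in a freshly created empty dict always yields the default, while reading it from data carried by the step result would not.

```python
from dataclasses import dataclass, field
from typing import Any, Dict


@dataclass
class StepOutput:
    """Hypothetical stand-in for the project's StepOutput; real fields may differ."""
    reward: float
    done: bool
    info: Dict[str, Any] = field(default_factory=dict)


def score_as_in_issue(step: StepOutput) -> float:
    # Mirrors the quoted snippet: info is reset to {} on every step,
    # so the lookup can only return the default value.
    info: Dict[str, Any] = {}
    return info.get("score", 0.0)


def score_from_step(step: StepOutput) -> float:
    # Assumption: the step result carries an info-like mapping.
    # Fall back to an empty dict if the attribute is missing or None.
    info = getattr(step, "info", None) or {}
    return info.get("score", 0.0)


step = StepOutput(reward=1.0, done=True, info={"score": 0.75})
print(score_as_in_issue(step))  # 0.0
print(score_from_step(step))    # 0.75
```

If the environment really does not return any info, one possible alternative is to derive the final score from the reward or another field that the step result does provide, instead of from an always-empty dict.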

Steps To Reproduce

xx

Additional Information

No response


Labels

bug (Something isn't working)
