[Question] Trajectory data used for training the agent #40

@whyseu

Question Category

Usage / How-to

Your Question

How was the trajectory data used to train the agent collected and cleaned, and how were its rewards scored?

Context / Background

No response

Labels: question (Further information is requested)