Benchmarking strategy #12

@lukaskellerstein

Description

  1. Define configurations for agents
  • Choose the scenarios that need to be tested
  • Translate scenarios into instructions for agents
  • Define the OS state
  • Define the app state
  • Define the evaluation condition (the definition of success)
  2. Record (Teams) telemetry for scenarios
  • Create a distilled version of what happened
  3. Define a strategy for benchmarking
  • How many times will each config run?
  • How will we measure or calculate success?
  4. Benchmarks
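
One way the plan above could fit together is sketched below. This is only an illustration under stated assumptions, not a decided design: the names `AgentConfig`, `BenchmarkResult`, `run_benchmark`, and `make_fake_agent`, the field names, and the example scenario are all hypothetical. It assumes success is measured as the fraction of successful runs per config, which is one possible answer to the open questions in step 3.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, Iterable


@dataclass
class AgentConfig:
    """Hypothetical config for one scenario (step 1 of the plan)."""
    scenario: str                      # name of the scenario under test
    instruction: str                   # scenario translated into an agent instruction
    os_state: Dict[str, Any]           # required OS state before the run
    app_state: Dict[str, Any]          # required app state before the run
    is_success: Callable[[Dict[str, Any]], bool]  # evaluation condition


@dataclass
class BenchmarkResult:
    """Aggregated outcome of repeated runs of one config."""
    scenario: str
    runs: int
    successes: int

    @property
    def success_rate(self) -> float:
        # Assumed metric: fraction of runs that met the evaluation condition.
        return self.successes / self.runs if self.runs else 0.0


def run_benchmark(
    config: AgentConfig,
    run_agent: Callable[[AgentConfig], Dict[str, Any]],
    repeats: int,
) -> BenchmarkResult:
    """Run one config `repeats` times and count successful outcomes."""
    successes = 0
    for _ in range(repeats):
        outcome = run_agent(config)    # a real agent run would happen here
        if config.is_success(outcome):
            successes += 1
    return BenchmarkResult(config.scenario, repeats, successes)


def make_fake_agent(outcomes: Iterable[bool]) -> Callable[[AgentConfig], Dict[str, Any]]:
    """Stand-in agent that replays scripted outcomes, for demonstration only."""
    scripted = iter(outcomes)

    def run(config: AgentConfig) -> Dict[str, Any]:
        return {"message_sent": next(scripted)}

    return run


config = AgentConfig(
    scenario="send_chat_message",
    instruction="Open Teams and send 'hello' to the test channel.",
    os_state={"logged_in": True},
    app_state={"teams_running": False},
    is_success=lambda outcome: bool(outcome.get("message_sent", False)),
)

result = run_benchmark(config, make_fake_agent([True, False, True, True]), repeats=4)
print(f"{result.scenario}: {result.successes}/{result.runs} = {result.success_rate}")
# prints "send_chat_message: 3/4 = 0.75"
```

Keeping the evaluation condition as a callable on the config means each scenario carries its own definition of success, so the benchmark loop stays the same regardless of how many scenarios are added. The number of repeats is left as a parameter because the issue has not yet decided it.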
