AISBench

Popular repositories

  1. benchmark (Public)

    AISBench Benchmark is a model evaluation tool built on OpenCompass, compatible with OpenCompass’s configuration system, dataset structure, and model backend implementation, while extending support …

    Python · 97 stars · 42 forks

  2. benchmark-mindie-old (Public)

    Plugin for AISBench/benchmark on Gitee

    Python · 1 star · 1 fork

  3. datasets (Public)

    Special dataset generation methods for the benchmark

    Python · 1 star · 2 forks

  4. ci_test (Public)

    Test CI

    1

  5. mini-swe-agent (Public)

    Forked from SWE-agent/mini-swe-agent

    The 100-line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepo, but scores >74% on SWE-bench Verified!

    Python

  6. terminal-bench-2 (Public)

    Forked from harbor-framework/terminal-bench-2

    Presets all environments in Docker images

    Shell
