Introduction

Overview

The AI-innovator suite aims to endow the AI agents with the ability to conduct innovative LLM research.

The AI-innovator suite includes:

  • A benchmark of 20 LLM research tasks
  • A ResearchGym execution environment
  • A interactive human-agent interface in real-time
  • A model registry for storing and sharing models

Model Support

OpenAI (through Azure)

  • GPT-5

Anthropic (through OpenRouter)

  • Claude Sonnet 4

Zhipu AI

  • GLM-4.5

Kimi

  • Kimi-K2

Want to contribute?

Visit our InnovatorBench’s GitHub repository to contribute to the project.