GameCraft-Bench: Can Agents Build Playable Games End-to-End in a Real Game Engine?
Researchers created GameCraft-Bench, a benchmark for evaluating AI agents' ability to build playable games end-to-end in a real game engine. The benchmark uses a popular open-source game engine and provides a dataset for training and testing AI models. You can explore the project on GitHub and Hugging Face. The benchmark aims to assess the capabilities of AI agents in game development.
- GameCraft-Bench evaluates AI agents building playable games in a real game engine.
- The benchmark includes a dataset for training and testing AI models.
- Project resources are available on GitHub and Hugging Face.