Fireworker: Home

giphy

Our project aims to train an AI agent to play Hanabi and achieve high scores using reinforcement learning. We will use PPO and A2C to optimize decision-making in a cooperative, partially observable environment.

Github: Source code

Resources links:

Hanabi Learning Environment
Stable baselines3
Hanabi rule
Learning and Reproduction

Fireworker HANABI AI

Pages:

Resources links: