
RL Snake Agents

MoE and DQL approaches to train RL agents for playing Snake.


🔗 Table of Contents

  • 📍 Overview
  • 📂 Repository Structure
  • 🧩 Modules
  • 🚀 Getting Started
  • 🤖 Usage

📍 Overview

This project showcases the development of a Reinforcement Learning (RL) agent capable of playing a modified version of the classic Snake game. The modifications, sketched in code after the list, include:

  1. Collisions with walls result in a negative reward but do not reset the board.
  2. When the snake eats itself, the portion of the body from the collision point to the tail is erased, with a negative reward issued, and the game continues.
  3. Filling up the board grants a strong positive reward and resets the board.
  4. The snake can legally stay in its current position.
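To make the reward scheme concrete, here is a minimal, self-contained sketch of these rules. The board size, reward magnitudes, helper names, and the (row, col) snake representation are illustrative assumptions, not the repository's values; the actual logic lives in environments_fully_observable.py.

```python
import random

SIZE = 7  # assumed board size; the real value is set in environments_fully_observable.py
ACTIONS = {0: (-1, 0), 1: (1, 0), 2: (0, -1), 3: (0, 1), 4: (0, 0)}

def new_fruit(snake):
    """Place a fruit on a random free cell."""
    free = [(r, c) for r in range(SIZE) for c in range(SIZE) if (r, c) not in snake]
    return random.choice(free)

def step(snake, fruit, action):
    """One transition; snake is a list of (row, col) cells, head first."""
    dr, dc = ACTIONS[action]
    if (dr, dc) == (0, 0):
        return snake, fruit, 0.0                      # rule 4: staying put is legal
    head = (snake[0][0] + dr, snake[0][1] + dc)
    if not (0 <= head[0] < SIZE and 0 <= head[1] < SIZE):
        return snake, fruit, -1.0                     # rule 1: wall hit penalised, no reset
    if head in snake:
        cut = snake.index(head)                       # rule 2: body from the collision
        return [head] + snake[:cut], fruit, -0.5      # point to the tail is erased
    snake = [head] + (snake if head == fruit else snake[:-1])
    if len(snake) == SIZE * SIZE:
        reset = [snake[0]]                            # rule 3: strong reward, then reset
        return reset, new_fruit(reset), 10.0
    if head == fruit:
        return snake, new_fruit(snake), 1.0
    return snake, fruit, 0.0
```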

Agents Implemented

  • Random Agent: A baseline agent that selects actions randomly, serving as a benchmark.
  • Baseline Agent: A rule-based agent that prioritizes moving towards the fruit and, if stuck, triggers self-collision to clear space.
  • Deep Q-Learning Agent (DQL): A neural network-based agent that learns optimal strategies, especially in late-game scenarios.
  • Hybrid Agent: Combines the strengths of the Baseline and DQL agents, using rule-based decisions in the early game and DQL strategies in the late game (see the sketch below).
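As a rough illustration of how these pieces fit together, the sketch below pairs a greedy fruit-seeking step with a length-based switch between the two policies. The Manhattan-distance heuristic, the switching threshold, and the method names are assumptions; the actual logic lives in agents/BaselineAgent.py and agents/HybridDQNAgent.py.

```python
import random

def baseline_action(head, fruit, legal_moves):
    """Hypothetical greedy step: among legal (action_id, (dr, dc)) moves, pick
    the one that brings the head closest to the fruit (Manhattan distance);
    fall back to a random move if nothing is legal."""
    if not legal_moves:
        return random.randrange(4)
    def dist(move):
        dr, dc = move[1]
        return abs(head[0] + dr - fruit[0]) + abs(head[1] + dc - fruit[1])
    return min(legal_moves, key=dist)[0]

class HybridSketch:
    """Rule-based play while the body is short, learned play once it grows."""
    def __init__(self, baseline, dqn, switch_length=10):  # threshold is an assumption
        self.baseline, self.dqn, self.switch_length = baseline, dqn, switch_length

    def get_action(self, board, body_length):
        if body_length < self.switch_length:
            return self.baseline.get_action(board)  # early game: cheap heuristic
        return self.dqn.get_action(board)           # late game: learned Q-values
```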

Results

The Hybrid Agent outperformed the other agents by combining the early-game efficiency of the Baseline Agent with the late-game strategic strength of the DQL Agent. It maximized fruit collection, minimized wall collisions, and achieved the best results across all benchmarks.

Issues

While the Hybrid Agent excels at the task-specific goals, its behavior diverges from conventional Snake play, where self-collision typically ends the game. Adjusting the reward structure could encourage more traditional playstyles, but that is outside the scope of this project.

Conclusion

This project demonstrates the importance of setting appropriate rewards and goals in Reinforcement Learning, leading to innovative and effective strategies. The Hybrid Agent is a strong performer within the defined parameters, highlighting the potential of combining rule-based heuristics with deep learning techniques.


📂 Repository Structure

└── /
    ├── README.md
    ├── Report - RL Snake.pdf
    ├── agents
    │   ├── BaseAgent.py
    │   ├── BaselineAgent.py
    │   ├── DQNAgent.py
    │   ├── HybridDQNAgent.py
    │   ├── RandomAgent.py
    │   └── __init__.py
    ├── environments_fully_observable.py
    ├── evaluation.py
    ├── main.ipynb
    ├── requirements.txt
    ├── saved_models
    │   ├── dqn_model.weights.h5
    │   └── hybrid_model.weights.h5
    └── training.py

🧩 Modules

.

  • requirements.txt: Pins the libraries required to train and evaluate the agents, giving contributors and users a consistent environment.
  • environments_fully_observable.py: Implements the fully observable Snake environment: board initialization, movement mechanics, and reward calculation. All agents are trained and evaluated against it.
  • main.ipynb: The core of the project. It contains both the training and the evaluation of the agents; look here for in-depth documentation of the code.
  • training.py: A standalone copy of the training code. Run it to reproduce the results yourself and, optionally, save an updated model version.
  • evaluation.py: A standalone copy of the evaluation code. Run it to reproduce the final results yourself.

agents

  • BaseAgent.py: Defines the abstract BaseAgent class for game agents. It holds constants for game elements (snake's head, body, fruit) and movement directions, and declares two methods, get_actions and get_action, that subclasses must override; the base implementations raise NotImplementedError, so the class cannot be used directly.
  • RandomAgent.py: A RandomAgent inheriting from BaseAgent that selects actions uniformly at random. Its single attribute, output_size, is the number of possible actions; get_actions generates random actions for a batch of boards, get_action for a single board.
  • BaselineAgent.py: A rule-based BaselineAgent extending BaseAgent. It uses a direction array to compute candidate head positions, clips them to the board boundaries, and picks the move that gets closest to the fruit while avoiding walls and body parts; if every move is illegal, it selects a random move. Provides get_actions for batches of boards and get_action for a single board.
  • DQNAgent.py: A DQNAgent implementing a Deep Q-Network (DQN): a neural network predicts Q-values for actions from the board state. Its hyperparameters are the learning rate (alpha), discount factor (gamma), exploration rate (epsilon), and an epsilon decay rate. get_actions selects actions for a batch of boards with an epsilon-greedy strategy; learn updates the Q-values from observed rewards and next board states.
  • HybridDQNAgent.py: The HybridDQNAgent combines the baseline and DQN strategies, following the baseline policy while the snake's body is short and switching to the DQN policy once it grows, while avoiding wall collisions. It includes methods for selecting actions, adjusting for collisions, learning from experience, and loading model weights.
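For intuition, here is a minimal sketch of the epsilon-greedy selection and temporal-difference update that DQNAgent.py describes. The network architecture, hyperparameter values, action count, and board encoding are all assumptions; the real implementation is in agents/DQNAgent.py.

```python
import numpy as np
import tensorflow as tf

class DQNSketch:
    """Illustrative DQN, not the repository's exact network or settings."""
    def __init__(self, n_actions=5, alpha=1e-3, gamma=0.99,
                 epsilon=1.0, epsilon_decay=0.995):
        self.n_actions = n_actions
        self.gamma = gamma
        self.epsilon = epsilon
        self.epsilon_decay = epsilon_decay
        self.model = tf.keras.Sequential([
            tf.keras.layers.Flatten(),
            tf.keras.layers.Dense(64, activation="relu"),
            tf.keras.layers.Dense(n_actions),  # one Q-value per action
        ])
        self.model.compile(optimizer=tf.keras.optimizers.Adam(alpha), loss="mse")

    def get_actions(self, boards):
        """Epsilon-greedy over a batch of boards."""
        q = self.model.predict(boards, verbose=0)
        greedy = np.argmax(q, axis=1)
        explore = np.random.rand(len(boards)) < self.epsilon
        random_a = np.random.randint(self.n_actions, size=len(boards))
        return np.where(explore, random_a, greedy)

    def learn(self, boards, actions, rewards, next_boards):
        """One TD update: target = r + gamma * max_a' Q(s', a')."""
        q_next = self.model.predict(next_boards, verbose=0)
        targets = self.model.predict(boards, verbose=0)
        targets[np.arange(len(actions)), actions] = (
            rewards + self.gamma * q_next.max(axis=1))
        self.model.fit(boards, targets, verbose=0)
        self.epsilon *= self.epsilon_decay  # shift from exploration to exploitation
```

The target r + gamma * max Q(s', a') is the standard Q-learning bootstrap; decaying epsilon gradually moves the agent from random exploration toward its learned policy.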

🚀 Getting Started

🔖 Prerequisites

  • Python: version 3.12.4
  • TensorFlow: version 2.7.0
  • NumPy: version 1.21.2
  • Matplotlib: version 3.4.3

📦 Installation

Build the project from source:

  1. Clone the repository:
❯ git clone https://github.com/gp-1108/snake_rl
  2. Navigate to the project directory:
❯ cd snake_rl
  3. Install the required dependencies:
❯ pip install -r requirements.txt

🤖 Usage

To run the project, open and run the main.ipynb notebook; it contains both the training and the evaluation of the agents.

If you want to run training and evaluation separately, run the training.py and evaluation.py scripts respectively:

❯ python training.py
❯ python evaluation.py
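Pre-trained weights are included under saved_models. Assuming the agent exposes its Keras network as a .model attribute and takes no constructor arguments (both assumptions; the class name and path come from the repository layout), restoring the shipped weights would look roughly like this:

```python
from agents.DQNAgent import DQNAgent

# Hypothetical sketch: the no-argument constructor and the .model attribute
# are assumptions; load_weights itself is the standard Keras API and requires
# a network whose architecture matches the saved file.
agent = DQNAgent()
agent.model.load_weights("saved_models/dqn_model.weights.h5")
```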
