AI-Car-Racing

Overview

Autonomous driving in continuous-action environments remains a challenging problem in Reinforcement Learning. This project uses Gymnasium's CarRacing-V3 environment as it provides a controlled, simulated platform where an agent must learn to navigate a procedurally generated racetrack. The agent's goal is to maximize cumulative reward while avoiding collisions. This project aims to explore and benchmark three well-established learning approaches: Deep Q-Learning (DQN), Proximal Policy Optimization (PPO) with Advantage Actor-Critic (A2C) framework, and Behavioral Cloning (BC).

Trained Agent Driving Samples

PPO + A2C

PPO+A2C.mp4

Behavioral Cloning

BehavioralCloning.mp4

Deep Q-Learning Network

DQN.mp4

Methods

We experimented with multiple Reinforcement Learning methods such as Behavioral Cloning, Deep Q-Learning (with Discrete Action Space) and Proximal Policy Optimization through Actor-Critic Framework (PPO-A2C).

Results

According to the literature, we discovered that a mean reward of 800 or more is considered good. Our best performing DQN and PPO + A2C methods are able to achieve a mean reward greater than 800. The Behavioral Cloning method is not too far behind, with a mean reward of 743.54, which is also close to the target of 800.

The DQN method was the best-performing solution with a mean reward of 866.8 and a standard deviation of 44.8. However, this cannot be directly compared to other RL methods we tried, since it's only using a Discrete Action space.

Amongst the methods using Continuous Action space, the PPO + A2C method performed the best with a mean reward of 829.35, but it showed a high standard deviation of 145.8. As noted from the performance of the Behavioral Cloning method, the dataset used for training appears to be promising in capturing effective expert demonstrations, although on its own, it does not achieve competitive performance. We believe that the PPO + A2C method can be combined with pretraining using the Behavioral Cloning (expert) dataset in future work to improve mean reward performance and reduce the high variance, potentially offering strong competition to DQN’s performance in the discrete action space.

Setup Instructions

Prerequisites (Linux / Ubuntu)

Install system build tools and the Python headers before creating the venv:

sudo apt update
sudo apt install -y build-essential swig python3.11-dev python3.11-venv

Prerequisites (Windows)

Download swig and set up system path: https://swig.org/Doc1.3/Windows.html#Windows_examples

Visual Studios Build Tools 2026:

Initial Setup

https://visualstudio.microsoft.com/visual-cpp-build-tools/ (make sure to check the "MSVC v143 - VS 2022 C++ build tools" package)

Download uv

Please download uv (Ultra-Violet) for Python Project Dependency Management: https://docs.astral.sh/uv/getting-started/installation/#installation-methods

Initializing a uv virtual env

Run following commands by navigating to the project directory:

cd /path/to/your/project
uv sync

Activating the virtual env

In the same project directory, execute the following (if virtual env is not already active):

source .venv/bin/activate

Windows

.\.venv\Scripts\Activate.ps1

Adding any Libraries / Dependencies

To add any new dependencies (libraries):

uv add <library_name>

Playing the Car Racing Game Manually

Please run the following command from the project directory:

For MacOS / Linux:

uv run .venv/lib/python3.11/site-packages/gymnasium/envs/box2d/car_racing.py

For Windows:

uv run .venv/lib/site-packages/gymnasium/envs/box2d/car_racing.py

Name		Name	Last commit message	Last commit date
Latest commit History 100 Commits
bc		bc
data		data
ppo_implementation		ppo_implementation
.gitignore		.gitignore
.python-version		.python-version
README.md		README.md
Train_DQN.py		Train_DQN.py
Wrapper_DQN.py		Wrapper_DQN.py
environment.py		environment.py
environment_framestacking.py		environment_framestacking.py
experiment_sb3_run.py		experiment_sb3_run.py
main.py		main.py
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI-Car-Racing

Overview

Trained Agent Driving Samples

PPO + A2C

Behavioral Cloning

Deep Q-Learning Network

Methods

Results

Setup Instructions

Prerequisites (Linux / Ubuntu)

Prerequisites (Windows)

Initial Setup

Download uv

Initializing a uv virtual env

Activating the virtual env

Windows

Adding any Libraries / Dependencies

Playing the Car Racing Game Manually

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AI-Car-Racing

Overview

Trained Agent Driving Samples

PPO + A2C

Behavioral Cloning

Deep Q-Learning Network

Methods

Results

Setup Instructions

Prerequisites (Linux / Ubuntu)

Prerequisites (Windows)

Initial Setup

Download uv

Initializing a uv virtual env

Activating the virtual env

Windows

Adding any Libraries / Dependencies

Playing the Car Racing Game Manually

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages