An educational interface for building intuition about reinforcement learning fundamentals. The backend is built on environments from the Gymnasium library.
Current Implementation:
- Environments
  - Gymnasium FrozenLake-v1 (4x4) with is_slippery=True
  - Gymnasium FrozenLake-v1 (4x4) with is_slippery=False
- Algorithms
  - Q-learning (custom implementation)
Already have git and Docker installed? Get started in 3 commands:
Prerequisites: Docker Desktop running or Docker Engine
- Clone the repository:
  git clone https://github.com/aihpi/workshop-rl1-introduction.git
- Navigate inside:
  cd workshop-rl1-introduction
- Is Docker running? Then start the app (detached mode):
  docker compose up -d
- Open your browser at http://localhost:3030
First-time setup takes ~1-2 minutes (downloads pre-built images).
Note: Running in detached mode (-d) keeps your terminal free. To view logs if needed for debugging, open a separate terminal and run docker compose logs -f
New to programming or Docker? Follow the installation guides:
- Windows 10/11: docs/INSTALLATION_WINDOWS.md (~10-15 minutes)
- macOS 10.15+: docs/INSTALLATION_MACOS.md (~10-15 minutes)
- Ubuntu/Debian: docs/INSTALLATION_LINUX.md (~15-20 minutes)
Once installed, here are some helpful commands:
docker compose up -d # Start the application (detached mode)
docker compose down # Stop the application
docker compose logs -f # View live logs (for debugging, in separate terminal)
docker compose logs backend # View only backend logs
docker compose logs frontend # View only frontend logs
docker compose ps # Check container status
docker compose restart          # Restart services
- Open the application in your browser at http://localhost:3030
- Adjust parameters using the sliders:
  - Number of Episodes: Training duration
  - Exploration Rate (ε): Probability of random exploration
  - Learning Rate (α): How fast the agent learns
  - Discount Factor (γ): Importance of future rewards
- Start training: Click "Start Training" and watch real-time visualizations:
  - Environment viewer: Shows the agent's last position in a training episode
  - Reward chart: Tracks training progress with statistics
  - Q-table heatmap: Visualizes learned action values (4×4 grid)
- Play policy: After training completes, click "Play Policy" to watch the trained agent execute its learned behavior step-by-step.
Backend (8 tests, 41% coverage):
# Locally
cd backend && uv run pytest
# In Docker
docker compose exec backend pytest

Frontend (12 tests):
cd frontend && npm test

workshop-rl1-introduction/
├── backend/ # Python Flask backend
│ ├── algorithms/ # RL algorithm implementations
│ │ ├── base_algorithm.py # Abstract base class
│ │ └── q_learning.py # Q-Learning implementation
│ ├── environments/ # Gymnasium environment handling
│ ├── training/ # Session management
│ ├── tests/ # Backend test suite
│ └── app.py # Flask API server
├── frontend/ # React frontend
│ ├── src/
│ │ ├── components/ # React components
│ │ │ ├── ParameterPanel.jsx
│ │ │ ├── EnvironmentViewer.jsx
│ │ │ ├── RewardChart.jsx
│ │ │ ├── LearningVisualization.jsx
│ │ │ └── ControlButtons.jsx
│ │ ├── App.js # Main application
│ │ └── api.js # Backend communication
│ └── src/components/__tests__/ # Frontend test suite
├── docs/
│ ├── DEVELOPMENT.md # Local development setup (without Docker)
│ ├── INSTALLATION_LINUX.md # Linux installation guide
│ ├── INSTALLATION_MACOS.md # macOS installation guide
│ ├── INSTALLATION_WINDOWS.md # Windows installation guide
│ └── screenshots/ # Documentation screenshots
└── docker-compose.yml # Multi-container orchestration
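As orientation for how the pieces talk to each other: app.py exposes the training functionality over HTTP, and the frontend's api.js calls it. A hypothetical, minimal Flask endpoint in that style (the real routes, names, and payloads in app.py will differ):

```python
from flask import Flask, jsonify

app = Flask(__name__)

# Hypothetical endpoint for illustration only;
# the actual routes live in backend/app.py
@app.route("/api/status")
def status():
    # A real handler would report the live training session's state
    return jsonify({"training": False, "episode": 0})
```

During development the frontend and backend run as separate containers, so api.js addresses the backend by its exposed host port rather than importing it directly.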
MIT License - Free to use for educational purposes
