This repository contains an implementation of Point Goal navigation using Proximal Policy Optimization (PPO) for a two-wheeled robot in Webots.
The robot is a two-wheeled differential-drive platform that navigates maze environments. It is equipped with:
- GPS sensor for position tracking
- Compass for orientation
The robot learns to navigate efficiently to goal positions using a neural network policy trained with PPO reinforcement learning.
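Before reaching the policy, the raw GPS and compass readings are typically reduced to a goal-relative observation. The sketch below illustrates one common formulation; the function name, the exact observation layout, and the way the yaw angle is derived from the compass vector are assumptions, not the repository's actual code:

```python
import math

def goal_observation(position, heading, goal):
    """Build a goal-relative observation from GPS position and compass heading.

    position: (x, y) from the GPS sensor
    heading:  robot yaw in radians (derived from the compass vector)
    goal:     (x, y) goal position
    Returns (distance, heading_error) with heading_error wrapped to [-pi, pi].
    """
    dx = goal[0] - position[0]
    dy = goal[1] - position[1]
    distance = math.hypot(dx, dy)
    bearing = math.atan2(dy, dx)  # world-frame angle toward the goal
    # Wrap the difference into [-pi, pi] so turning direction is unambiguous
    error = (bearing - heading + math.pi) % (2 * math.pi) - math.pi
    return distance, error
```

A compact (distance, heading-error) observation like this is a common choice for point-goal tasks because it is invariant to where the episode starts.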
Requirements:

- Python 3.6+
- PyTorch
- NumPy
- Webots simulator (R2023b or newer)
To set up the project:

- Install Webots from the official website
- Clone this repository: `git clone https://github.com/yourusername/point-goal-ppo.git`, then `cd point-goal-ppo`
- Install the required Python packages: `pip install -r requirements.txt`
To train the robot:

- Open Webots
- From Webots, open the training world file: `File > Open > [path-to-repo]/worlds/train.wbt`
- The training script runs automatically, using PPO to teach the robot to navigate from the start point [0, 0] to the goal point [-5, 10]
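At the heart of PPO is the clipped surrogate objective. A minimal plain-Python sketch of the per-batch loss (illustrative only; the repository's training code uses PyTorch tensors and autograd):

```python
import math

def ppo_clip_loss(logp_new, logp_old, advantages, clip_eps=0.2):
    """PPO clipped surrogate loss (the quantity the optimizer minimizes)."""
    terms = []
    for ln, lo, a in zip(logp_new, logp_old, advantages):
        ratio = math.exp(ln - lo)  # pi_new(a|s) / pi_old(a|s)
        clipped = min(max(ratio, 1.0 - clip_eps), 1.0 + clip_eps)
        # Pessimistic bound: take the smaller of the two surrogate terms
        terms.append(min(ratio * a, clipped * a))
    return -sum(terms) / len(terms)
```

The clipping keeps each policy update close to the behavior policy that collected the data, which is what makes PPO stable without a trust-region solver.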
Training progress will be displayed in the console with metrics including:
- Episode reward
- Success rate (percentage of successful navigation attempts)
- SPL (Success weighted by Path Length)
- Path length
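SPL weights each successful episode by the ratio of the shortest-path length to the path the robot actually took, so inefficient successes score lower. A plain-Python sketch of the metric:

```python
def spl(successes, shortest_paths, actual_paths):
    """Success weighted by Path Length, averaged over episodes.

    successes:      1 if the episode reached the goal, else 0
    shortest_paths: shortest obstacle-aware path length to the goal
    actual_paths:   length of the path the robot actually traveled
    """
    total = 0.0
    for s, l, p in zip(successes, shortest_paths, actual_paths):
        total += s * l / max(p, l)  # max() guards against p < l or p = 0
    return total / len(successes)
```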
How it works:

- The robot learns to navigate to randomly placed goals in the maze environment
- The policy network maps GPS coordinates and orientation to movement actions
- Reward shaping is used to encourage progress toward the goal
- Early stopping is implemented when the robot achieves a 95% success rate with 80% SPL
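The reward-shaping idea above can be sketched as a distance-progress reward plus a small per-step cost and a terminal bonus. The coefficients below are illustrative assumptions, not the repository's actual values:

```python
def shaped_reward(prev_dist, curr_dist, reached_goal,
                  progress_scale=1.0, step_penalty=0.01, goal_bonus=10.0):
    """Dense shaped reward: pay for progress toward the goal each step,
    charge a small time penalty, and add a bonus on reaching the goal."""
    reward = progress_scale * (prev_dist - curr_dist) - step_penalty
    if reached_goal:
        reward += goal_bonus
    return reward
```

Shaping on the *change* in distance (rather than the distance itself) gives the agent a useful gradient on every step without biasing it toward loitering near the goal.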
During training, models and metrics are saved automatically:
- Models are stored in a timestamped directory: `controllers/drive_robot/output/YYYYMMDD_HHMMSS/`
- The best-performing model is saved as `navigation_policy_best.pt`
- The final model is saved as `navigation_policy_final.pt`
- Training rewards are logged in `rewards.txt`
To run the trained navigation policy:
- Open Webots
- Load the run world: `File > Open > [path-to-repo]/worlds/run.wbt`
- This automatically runs the controller script
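At inference time, the policy's action has to be converted into left/right motor velocities for the differential drive. A sketch of one common mapping (the (linear, angular) action format and the motor speed limit are assumptions, not taken from the repository):

```python
def wheel_speeds(linear, angular, max_speed=6.28):
    """Map a (linear, angular) command to (left, right) wheel velocities,
    clamped to the motor's velocity limit."""
    def clamp(v):
        return max(-max_speed, min(max_speed, v))
    # Turning left (positive angular) slows the left wheel, speeds the right
    return clamp(linear - angular), clamp(linear + angular)
```

In a Webots controller these two values would be passed to the left and right wheel motors each timestep.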
Project structure:

```
.
├── controllers/
│   ├── drive_robot/                  # Training controller
│   │   ├── drive_robot.py            # Main training script
│   │   ├── maze_env.py               # Environment wrapper
│   │   └── output/                   # Saved models and metrics
│   │       └── YYYYMMDD_HHMMSS/
│   │           ├── navigation_policy_best.pt
│   │           ├── navigation_policy_final.pt
│   │           └── rewards.txt
│   └── run/                          # Inference controller
│       ├── drive_robot.py            # Policy implementation
│       ├── maze_env.py               # Environment wrapper
│       └── run.py                    # Inference script
├── worlds/
│   ├── train.wbt                     # Training world
│   └── run.wbt                       # Evaluation world
├── protos/                           # Robot definition files
│   └── Astra.proto                   # Robot model definition
├── requirements.txt                  # Python dependencies
└── README.md                         # This file
```
This project is licensed under the MIT License. See the LICENSE file for details.
