Portfolio Optimizer

Reinforcement learning portfolio optimizer with custom TD3 and DDPG agents, an EIIE-style model reference, and market data utilities for financial trading experiments.

This project is a research codebase for experimenting with portfolio allocation agents rather than a packaged library. It is useful for studying how actor-critic methods behave in a trading environment with transaction costs and historical market data.

Highlights

Custom TD3 implementation for continuous portfolio allocation
Custom DDPG implementation
EIIE-style portfolio model reference based on Jiang et al.
Trading environment and wrapper modules
Market data utilities for Yahoo Finance and Binance-style data workflows
Replay buffer, noise, and exploration helpers for RL experiments

Project Structure

agent/          TD3, DDPG, and shared agent logic
data/           Data loading utilities
environment/    Trading environment and wrappers
networks/       Neural network models
utils/          Replay buffer, noise, and scheduling helpers
main.py         Example TD3 training entry point

Installation

git clone https://github.com/novis10813/Portfolio-Optimizer.git
cd Portfolio-Optimizer
pip install -r requirements.txt

The original environment was developed around Python 3.7-3.9 era packages and PyTorch 1.11. If you are running on a newer Python or CUDA version, expect to adjust the pinned dependencies.

Usage

main.py is the current example entry point. It builds a TradingEnv, wraps it, configures TD3 hyperparameters, and starts training.

python main.py

Before running, check the paths and hardware settings in main.py:

TradingEnv('data/history', ...) expects historical data under data/history.
device is set to cuda by default.
Training length, replay buffer size, and exploration settings are configured in the params dictionary.

Research Context

The implementation is inspired by portfolio management and continuous-control reinforcement learning papers:

Jiang et al., "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem". arXiv:1706.10059
Fujimoto et al., "Addressing Function Approximation Error in Actor-Critic Methods". arXiv:1802.09477
Lillicrap et al., "Continuous Control with Deep Reinforcement Learning". arXiv:1509.02971

Status

This is an experimental research repository. The code is best read as a portfolio RL prototype and reference implementation, not as production trading software.

Name		Name	Last commit message	Last commit date
Latest commit History 124 Commits
__pycache__		__pycache__
agent		agent
data		data
environment		environment
networks		networks
utils		utils
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt
test.ipynb		test.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Portfolio Optimizer

Highlights

Project Structure

Installation

Usage

Research Context

Status

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Portfolio Optimizer

Highlights

Project Structure

Installation

Usage

Research Context

Status

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages