This project applies machine learning techniques( random forest & ridge ) to forecast monthly excess returns of the CRSP value-weighted market index using the Welch and Goyal financial indicators dataset. Covering data from 1927 to 2022, the analysis involves processing the dataset, calculating various financial indicators, and developing predictive models.
- pandas
- numpy
- scikit-learn
- seaborn
- matplotlib
Dataset Preview - Summary Statistics - Missing Data - Correlation Analysis - Visualization - Feature Distribution - Feature Engnieering - Data Splitting
Model Selection ( Ridge - Random Forest) - Time-series Validation - Hyperparameter Tuning - Regularization - Residual Analysis - Economic Interpretation
Performance Metrics - Out-of-sample Performance - Visualization of Results - Comparative Analysis
Key Predictor Analysis - Predictor Stability - Investment Strategies - Visualization of Insights