Evaluating Time Series Augmentation Techniques for Deep Learning-Based Solar Flare Prediction

This is the repository for our work titled "Evaluating Time Series Augmentation Techniques for Deep Learning-Based Solar Flare Prediction", which has been accepted to The Astrophysical Journal Supplement Series.

Abstract

Accurate forecasting of solar flares is crucial for mitigating their severe impacts on space-based and communication systems. Deep learning models have shown promise in predicting flare activity using magnetic field measurements of solar active regions. However, the scarcity and imbalance of flare occurrence data—particularly for major flares—pose significant challenges to model robustness and generalization. This study provides a comprehensive evaluation of time-series data augmentation techniques to improve deep learning-based solar flare prediction. Using the benchmark Space Weather Analytics for Solar Flares dataset, which offers multivariate magnetic field parameter time series, we assess 12 augmentation methods across three architectures: fully convolutional network (FCN), multivariate LSTM-FCN, and residual network. Our primary experiments address the binary classification of major (M- and X-class) versus minor (C-, B-, and FQ-class) flares, using metrics tailored for imbalanced data, including recall, true skill statistic, Heidke skill score, and the Gini coefficient. A case study on X versus M classification further examines performance under a simpler, more balanced setting. Results show that select augmentation strategies yield measurable gains across different models and scenarios, offering a viable path forward for addressing data scarcity in space weather forecasting tasks.

Data Download

Download the raw data from: https://dmlab.cs.gsu.edu/solar/data/data-comp-2020/

About

This project includes four parts: 1. Data preprocessing (Data preparation), 2. 12 Different data augmentation methods were used to generate synthetic samples, 3. Apply 3 deep learning models to evaluate the solar flare prediction in different scenarios: (1) original and undersampling comparison, (2) data augmentation impact, (3) synthetic/real ratios vs performances, (4). Case study on X vs. M Classification

Name		Name	Last commit message	Last commit date
Latest commit History 100 Commits
Data preprocessing		Data preprocessing
case_study_XvsM		case_study_XvsM
ori_under_aug_comparisom		ori_under_aug_comparisom
syn:real ratios		syn:real ratios
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Evaluating Time Series Augmentation Techniques for Deep Learning-Based Solar Flare Prediction

Abstract

Data Download

About

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Evaluating Time Series Augmentation Techniques for Deep Learning-Based Solar Flare Prediction

Abstract

Data Download

About

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages