Skip to content

jmiravet/Sinusoidal-Initialization

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Sinusoidal Initialization: Time for a New Start

Official PyTorch implementation of Sinusoidal Initialization, as introduced in:

Sinusoidal Initialization, Time for a New Start
Alberto Fernández-Hernández, Jose I. Mestre, Manuel F. Dolz, José Duato, Enrique S. Quintana-Ortí
NeurIPS 2025
[OpenReview Page][PDF]


📝 Abstract

Initialization plays a critical role in Deep Neural Network training, directly influencing convergence, stability, and generalization. Common approaches such as Glorot and He initializations rely on randomness, which can produce uneven weight distributions across layer connections. In this paper, we introduce the Sinusoidal initialization, a novel deterministic method that employs sinusoidal functions to construct structured weight matrices expressly to improve the spread and balance of weights throughout the network while simultaneously fostering a more uniform, well‑conditioned distribution of neuron activation states from the very first forward pass. Because Sinusoidal initialization begins with weights and activations that are already evenly and efficiently utilized, it delivers consistently faster convergence, greater training stability, and higher final accuracy across a wide range of models, including convolutional neural networks, vision transformers, and large language models. On average, our experiments show an increase of 4.8 % in final validation accuracy and 20.9 % in convergence speed. By replacing randomness with structure, this initialization provides a stronger and more reliable foundation for Deep Learning systems.


🧠 What is Sinusoidal Initialization?

Sinusoidal Initialization replaces random weight sampling with deterministic sinusoidal patterns.
Each neuron (or output channel) receives a unique wave defined by frequency and phase, producing:

  • Smooth, structured, and diverse filters
  • Balanced, variance-normalized weight distributions
  • Uniform activation states from the first forward pass

This yields better-conditioned networks that learn faster and more robustly.

Supported layers:

  • nn.Linear
  • nn.Conv1d, nn.Conv2d, nn.Conv3d
  • nn.MultiheadAttention

📄 Cite This Work

If you use Sinusoidal Initialization in your research, please cite:

@inproceedings{fernandez-hernandez2025sinusoidal,
      title={Sinusoidal Initialization, Time for a New Start},
      author={Alberto Fern{\'a}ndez-Hern{\'a}ndez and Jose I. Mestre and Manuel F. Dolz and Jos{\'e} Duato and Enrique S. Quintana-Orti},
      booktitle={The Thirty-ninth Annual Conference on Neural Information Processing Systems},
      year={2025},
      url={https://openreview.net/forum?id=FGliQVcrDZ}
}

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors