Split a single neural network into multiple smaller networks using weight splitting.
Even when the total number of operations (FLOPs) decreases, splitting a single large matrix multiplication (matmul) into several smaller ones often runs slower on modern hardware such as GPUs. Large matmuls let highly parallel, well-tuned kernels reach peak throughput; many small calls instead pay repeated kernel launch overhead and leave compute units underutilized.
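As a rough CPU analogue of this effect, here is a minimal timing sketch (assuming NumPy with a BLAS backend; the matrix size and block count are arbitrary choices) that compares one large matmul against the same computation split into many column blocks:

```python
import time
import numpy as np

rng = np.random.default_rng(0)
x = rng.standard_normal((2048, 2048), dtype=np.float32)
W = rng.standard_normal((2048, 2048), dtype=np.float32)

def bench(fn, reps=10):
    fn()  # warm-up call so setup costs don't pollute the timing
    t0 = time.perf_counter()
    for _ in range(reps):
        fn()
    return (time.perf_counter() - t0) / reps

# One large matmul: the BLAS kernel sees the whole problem at once.
t_large = bench(lambda: x @ W)

# The same FLOPs done as 64 smaller matmuls over column blocks of W.
blocks = np.split(W, 64, axis=1)
t_split = bench(lambda: np.concatenate([x @ b for b in blocks], axis=1))

print(f"one matmul: {t_large * 1e3:.1f} ms, 64 small matmuls: {t_split * 1e3:.1f} ms")
```

On most machines the split version is slower despite identical total FLOPs; on a GPU the gap is typically wider still, because each small call also pays a kernel launch.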
This project explores an approach to improve inference efficiency in neural networks by decomposing a large model into smaller sub-networks based on weight significance.
In many neural network tasks, not all inputs strongly influence all outputs. When certain weights are close to zero, their contribution to the final output becomes negligible.
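A quick numeric illustration of this point, using a hypothetical weight matrix that stands in for trained weights: zeroing every weight below a magnitude threshold leaves the layer's output almost unchanged.

```python
import numpy as np

rng = np.random.default_rng(1)
# Stand-in for a trained weight matrix: ~5% strong connections, the rest near zero.
W = rng.standard_normal((256, 256)) * 0.01
strong = rng.random((256, 256)) < 0.05
W[strong] = rng.standard_normal(int(strong.sum()))

x = rng.standard_normal(256)

# Drop every weight below a magnitude threshold and compare layer outputs.
threshold = 0.05
W_pruned = np.where(np.abs(W) >= threshold, W, 0.0)

rel_err = np.linalg.norm(W @ x - W_pruned @ x) / np.linalg.norm(W @ x)
kept = np.mean(np.abs(W) >= threshold)
print(f"kept {kept:.1%} of weights, relative output error {rel_err:.4f}")
```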
The core idea is:
- Identify weights that have minimal impact (near-zero values)
- Split the network into smaller sub-networks by grouping significant weights
- Reduce unnecessary computation during inference by ignoring weak connections
This can reduce inference cost in trained models, especially in scenarios where sparsity emerges naturally during training.
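One way to make this concrete is sketched below (a hypothetical helper, assuming NumPy and SciPy are available): treat significant weights as edges of a bipartite input-output graph, and turn each connected component into its own smaller matmul.

```python
import numpy as np
from scipy.sparse import csr_matrix
from scipy.sparse.csgraph import connected_components

def split_into_subnetworks(W, threshold):
    """Partition one layer into independent sub-networks via its significant weights.

    Nodes 0..n_out-1 are outputs, nodes n_out..n_out+n_in-1 are inputs; an edge
    exists where |W[i, j]| >= threshold. Each connected component becomes a
    smaller weight matrix over only the inputs and outputs it touches.
    """
    n_out, n_in = W.shape
    rows, cols = np.nonzero(np.abs(W) >= threshold)
    n = n_out + n_in
    graph = csr_matrix((np.ones_like(rows), (rows, n_out + cols)), shape=(n, n))
    _, labels = connected_components(graph, directed=False)

    subnets = []
    for c in np.unique(labels):
        outs = np.nonzero(labels[:n_out] == c)[0]
        ins = np.nonzero(labels[n_out:] == c)[0]
        if len(outs) and len(ins):
            # Keep the full sub-matrix, so within-group weights stay exact.
            subnets.append((outs, ins, W[np.ix_(outs, ins)]))
    return subnets

# Example: two independent 4x4 blocks buried in weak cross-connections.
rng = np.random.default_rng(2)
W = rng.standard_normal((8, 8)) * 0.001
W[:4, :4] += rng.standard_normal((4, 4))
W[4:, 4:] += rng.standard_normal((4, 4))

x = rng.standard_normal(8)
y = np.zeros(8)
for outs, ins, W_sub in split_into_subnetworks(W, threshold=0.05):
    y[outs] += W_sub @ x[ins]          # each sub-network is one small matmul

print(np.allclose(y, W @ x, atol=0.02))  # equal up to the pruned weak cross-connections
```

The threshold is the key trade-off knob: raising it yields smaller, more independent sub-networks at the cost of more approximation error.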
The workflow is:
- Train a standard neural network
- Analyze the learned weights
- Identify near-zero weights (low importance connections)
- Partition the network into smaller sub-networks
- Use these sub-networks independently or selectively during inference
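The last step is where the savings appear at inference time. A minimal sketch of the selective case (reusing the hypothetical `split_into_subnetworks` and the two-block example from the sketch above): when downstream code needs only some outputs, only the sub-networks that produce them have to run.

```python
import numpy as np

# Assumes `split_into_subnetworks` from the sketch above is in scope;
# rebuilds the same two-block example weight matrix and input.
rng = np.random.default_rng(2)
W = rng.standard_normal((8, 8)) * 0.001
W[:4, :4] += rng.standard_normal((4, 4))
W[4:, 4:] += rng.standard_normal((4, 4))
x = rng.standard_normal(8)

needed = {0, 2}  # hypothetical: a downstream layer consumes only outputs 0 and 2

for outs, ins, W_sub in split_into_subnetworks(W, threshold=0.05):
    if needed.intersection(outs.tolist()):   # run only the relevant sub-networks
        y_partial = W_sub @ x[ins]
        for i, out in enumerate(outs):
            if out in needed:
                print(f"output {out}: {y_partial[i]:.4f}")
```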
Expected benefits:
- Reduced computation during inference
- Potential speed improvements
- Better utilization of sparsity in trained models
- Modular network structure
Potential use cases:
- Edge devices with limited compute
- Real-time inference systems
- Sparse neural network optimization

