Complete GPT-2 mixed-precision quantization training implementation by Copilot · Pull Request #2 · Tanayshri123/GPT2Quantization

Copilot · 2025-12-25T21:28:26Z

Implements end-to-end training pipeline for GPT-2 with dynamic per-layer quantization using LoRA adapters, enabling parameter-efficient mixed-precision training where bit-widths (2/4/8/32-bit) are randomly assigned per batch.

Core Implementation

train.py - Training orchestration:

Injects SwitchableLinear layers into GPT-2 architecture
Applies QuantizationController to randomize layer bit-widths each batch
Trains LoRA adapters while keeping base weights frozen
~1% trainable parameters vs full fine-tuning

models/layers.py - Enhanced SwitchableLinear:

class SwitchableLinear(nn.Module):
    def __init__(self, base_linear_layer, supported_bits=[2, 4, 8], lora_rank=4):
        # ...
        self._current_bit_width = 32  # Stored state for seamless forward pass
        
    def forward(self, x, bit_width=None):
        if bit_width is None:
            bit_width = self._current_bit_width  # Falls back to stored value
        # Quantize and apply LoRA adapter for current bit-width

Controller sets _current_bit_width before each forward pass, eliminating need for monkey-patching or signature changes in GPT-2's call stack.

Infrastructure

requirements.txt - Dependencies with CVE fixes (torch 2.0→2.6, transformers 4.30→4.48)
.gitignore - Standard ML project exclusions
README.md - Architecture overview, usage, configuration

Security

Fixed heap buffer overflow in PyTorch <2.2.0
Fixed deserialization RCE in transformers <4.48.0
CodeQL: 0 alerts

Original prompt

continue

Created from VS Code via the GitHub Pull Request extension.

💡 You can make Copilot smarter by setting up custom instructions, customizing its development environment and configuring Model Context Protocol (MCP) servers. Learn more Copilot coding agent tips in the docs.

Co-authored-by: Tanayshri123 <43706966+Tanayshri123@users.noreply.github.com>

…ining loop Co-authored-by: Tanayshri123 <43706966+Tanayshri123@users.noreply.github.com>

Co-authored-by: Tanayshri123 <43706966+Tanayshri123@users.noreply.github.com>

Initial plan

400df45

Copilot AI assigned Copilot and Tanayshri123 Dec 25, 2025

Copilot started work on behalf of Tanayshri123 December 25, 2025 21:28 View session

Copilot AI and others added 6 commits December 25, 2025 21:31

Add requirements.txt, training script, and comprehensive README

f26bae9

Co-authored-by: Tanayshri123 <43706966+Tanayshri123@users.noreply.github.com>

Improve SwitchableLinear to support stored bit-width and simplify tra…

5af7f87

…ining loop Co-authored-by: Tanayshri123 <43706966+Tanayshri123@users.noreply.github.com>

Add .gitignore and remove unnecessary files from version control

0e98ccc

Co-authored-by: Tanayshri123 <43706966+Tanayshri123@users.noreply.github.com>

Fix epoch average loss calculation and improve .gitignore

65f28af

Co-authored-by: Tanayshri123 <43706966+Tanayshri123@users.noreply.github.com>

Update dependencies to fix security vulnerabilities

b8f0875

Co-authored-by: Tanayshri123 <43706966+Tanayshri123@users.noreply.github.com>

Clarify data loading comment for production vs development

ef22963

Co-authored-by: Tanayshri123 <43706966+Tanayshri123@users.noreply.github.com>

Copilot AI changed the title ~~[WIP] Continue feature development workflow~~ Complete GPT-2 mixed-precision quantization training implementation Dec 25, 2025

Copilot AI requested a review from Tanayshri123 December 25, 2025 21:42

Copilot finished work on behalf of Tanayshri123 December 25, 2025 21:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Complete GPT-2 mixed-precision quantization training implementation#2

Complete GPT-2 mixed-precision quantization training implementation#2
Tanayshri123 with Copilot wants to merge 7 commits into
mainfrom
copilot/continue-feature-development

Copilot AI commented Dec 25, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Copilot AI commented Dec 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Core Implementation

Infrastructure

Security

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Copilot AI commented Dec 25, 2025 •

edited

Loading