fix(kagent-feast-mcp): pin torch to CPU-only wheel to reduce image size by ~2.3GB by Kunal-Somani · Pull Request #199 · kubeflow/docs-agent

Kunal-Somani · 2026-04-03T17:08:50Z

Summary

Fixes #198

requirements.txt listed a bare torch requirement, which installs the full CUDA-enabled wheel (~2.5GB) by default. The Dockerfile uses python:3.10-slim, a CPU-only base image with no CUDA drivers. The embedding model (all-mpnet-base-v2) runs on CPU regardless, so the entire CUDA payload is dead weight.

Root Cause

# Before (requirements.txt)
torch   ← pip installs full CUDA wheel (~2.5GB) by default

# Dockerfile
FROM python:3.10-slim   ← CPU-only, no CUDA drivers available

Fix

# After
torch --index-url https://download.pytorch.org/whl/cpu
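
For reference, pip's requirements file format lists --index-url and --extra-index-url among its global options, which its documented examples place on their own line rather than after a package name. A conservative layout, sketched here as an assumption and not taken from this PR's diff, would be the following. It swaps in --extra-index-url so that the other packages in the file can still resolve from PyPI.

```text
# requirements.txt (illustrative sketch, not the PR's actual diff)
# Index option on its own line; PyPI stays available for other packages.
--extra-index-url https://download.pytorch.org/whl/cpu
torch
```

With this layout, whether pip picks the CPU build depends on how it ranks the versions offered by the two indexes, so it is worth confirming the installed build in the image.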

Impact

Metric	Before	After
torch wheel size	~2.5GB (CUDA)	~200MB (CPU)
Final image size	~3.5GB	~1GB
Runtime behavior	CPU inference	CPU inference (identical)
Cold-start time	3-4x slower	Baseline

No change in functionality — the model runs on CPU in both cases.
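
One way to confirm that the image ends up with a CPU-only build is to inspect torch's build metadata inside the container. The sketch below is not part of the PR, and the helper name is_cpu_only_build is hypothetical. It relies on torch.version.cuda being None for builds compiled without CUDA. torch is imported lazily, so the script still runs where torch is absent.

```python
from typing import Optional


def is_cpu_only_build(cuda_version: Optional[str]) -> bool:
    """Return True when the build metadata reports no CUDA toolkit.

    torch.version.cuda is None for builds compiled without CUDA and a
    version string such as "12.1" for CUDA builds. (Hypothetical helper;
    note that ROCm builds also report None here.)
    """
    return cuda_version is None


def main() -> None:
    try:
        import torch  # lazy import so the helper works without torch
    except ImportError:
        print("torch is not installed in this environment")
        return
    print("torch version:", torch.__version__)
    print("CPU-only build:", is_cpu_only_build(torch.version.cuda))
    print("CUDA available at runtime:", torch.cuda.is_available())


if __name__ == "__main__":
    main()
```

Running this inside the built container should report a CPU-only build if the pin took effect. On python:3.10-slim, CUDA is expected to be unavailable at runtime in either case, which is why the build metadata is the more telling check.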

Checklist

Commits are signed off (DCO)
Fixes bug(kagent-feast-mcp): requirements.txt installs full CUDA torch (~2.5GB) on a CPU-only base image #198
No change in runtime behavior — CPU inference is identical
Verified with PyTorch official CPU index URL

requirements.txt listed bare 'torch', which installs the full CUDA wheel (~2.5GB) by default. The Dockerfile uses python:3.10-slim, a CPU-only base image with no CUDA drivers. The embedding model runs on CPU regardless, so the CUDA payload is entirely dead weight.

Pin torch to the official CPU-only index URL, reducing the installed wheel from ~2.5GB to ~200MB and the final Docker image from ~3.5GB to under 1GB with no change in runtime behavior.

Fixes kubeflow#198

Signed-off-by: Kunal <kunal120222@gmail.com>

google-oss-prow · 2026-04-03T17:08:56Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
Once this PR has been reviewed and has the lgtm label, please assign franciscojavierarceo for approval. For more information see the Kubernetes Code Review Process.

The full list of commands accepted by this bot can be found here.

Details

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

google-oss-prow bot requested a review from franciscojavierarceo April 3, 2026 17:08

google-oss-prow bot added the size/XS label Apr 3, 2026

Kunal-Somani wants to merge 1 commit into kubeflow:main from Kunal-Somani:fix/issue-198-cpu-torch-dockerfile
