Change fine-tune to use llama-cpp, after getting llama-cpp to use the GPU for fine-tuning

this is a big task, but it's important.

the pytorch ecosystem is "dependency hell" at best, and rarely works well on platforms other than linux, especially for tasks with many deps like peft and bitsandbytes

the llama-cpp ecosystem uses CMake and is easy to get working on linux, windows, mac, and even WASM!

we're using pytorch for fine-tuning only because llama-cpp doesn't support the GPU for fine-tuning

the same will apply to stable-diffusion too!

this ticket is for

- updating llama-cpp downstream to support the gpu
  - see https://github.com/ggerganov/llama.cpp/issues/3458
- updating the worker to ditch pytorch while still supporting the same fine-tune job flow and status updates
- ok for the lora output to be gguf of course, not pth
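
As a rough illustration of the worker side, here is a minimal sketch of running an external fine-tune command and turning its output into status updates. Everything in it is an assumption for illustration: `run_finetune_job` and the `on_status` callback are hypothetical names, not the worker's existing API, and the command itself (binary path and flags) is passed in by the caller because this ticket does not specify it.

```python
import subprocess
import sys
from typing import Callable, Sequence


def run_finetune_job(
    cmd: Sequence[str],
    on_status: Callable[[str, str], None],
) -> int:
    """Run an external fine-tune command and forward its output as status updates.

    `cmd` is the full command line (hypothetical; e.g. a llama-cpp fine-tune
    binary plus its arguments). `on_status(state, detail)` is a hypothetical
    callback standing in for however the worker reports job status.
    """
    on_status("started", " ".join(cmd))
    proc = subprocess.Popen(
        list(cmd),
        stdout=subprocess.PIPE,
        stderr=subprocess.STDOUT,  # merge stderr so no progress lines are lost
        text=True,
    )
    assert proc.stdout is not None
    # Stream output line by line so status updates arrive while training runs.
    for line in proc.stdout:
        line = line.strip()
        if line:
            on_status("progress", line)
    code = proc.wait()
    on_status("done" if code == 0 else "failed", f"exit code {code}")
    return code


if __name__ == "__main__":
    # Stand-in command so the sketch runs without llama-cpp installed.
    demo_cmd = [sys.executable, "-c", "print('iter 1'); print('iter 2')"]
    run_finetune_job(demo_cmd, lambda state, detail: print(state, detail))
```

The point of the design is that the worker only depends on a subprocess and its output, so pytorch never has to be importable in the worker process, and the job flow stays the same regardless of which binary produces the gguf lora.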


Change fine-tune to use llama-cpp, after getting llama-cpp to use the GPU for fine-tuning #15