Conversation
@CoffeeVampir3 mind if I push the FlashAttention commit to this branch? That would put everything Llama-related in a single PR. It would also be great if you could include a quick start guide for Llama, to check that the implementation is running correctly.
Yeah! That sounds like a plan. I think this is a fine place to put anything related to modeling/optimization.
I do have a testing repo. I did not want to pollute this repo with test code that was unrelated to llm-foundry, but I did confirm the modeling is correct and trainable here: https://github.com/CoffeeVampir3/Llama-3-Clean-Minimized. For the llm-foundry-specific integration, there are a variety of hooks that need to be added to the modeling. I don't think it'll be too difficult, just a matter of doing it. The quick start for the foundry side is waiting on that.
@CoffeeVampir3 I can try getting it to work with llm-foundry in a manner similar to how they did it for their MPT model, if you haven't already started on that.
I haven't started, would be great if you're interested 👍
Initial modeling files for llama3 -- not hooked into llm-foundry yet.