rotate_ov_proj(layer, model_type, num_heads, head_dim)
Thanks for your great work!
In the original QuaRot paper, rotate_ov_proj is designed to rotate the V states so that 4-bit quantization of the value cache becomes easier. But in this repo, I think we only want to rotate the weights and the linear outputs; we do not want to quantize the V cache. So is it still necessary to call rotate_ov_proj here? If my understanding is incorrect, please point it out. I am looking forward to your reply. Thanks!
QQQ/QQQ/rotation/rotation.py, line 194 (commit e307d9f)
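For context, here is a minimal sketch of the head-wise V/O rotation that QuaRot describes: the same orthogonal (Hadamard) matrix is applied to each head's v_proj output channels and folded into the matching o_proj input channels, so the attention output is mathematically unchanged while the V activations become easier to quantize. Function names and shapes below are illustrative assumptions, not this repo's actual implementation.

```python
import torch

def hadamard(n: int) -> torch.Tensor:
    # Sylvester construction; n must be a power of two.
    H = torch.ones(1, 1)
    while H.shape[0] < n:
        H = torch.cat([torch.cat([H, H], dim=1),
                       torch.cat([H, -H], dim=1)], dim=0)
    return H / (n ** 0.5)  # orthonormal: H @ H.T == I

def rotate_ov(W_v: torch.Tensor, W_o: torch.Tensor,
              num_heads: int, head_dim: int):
    # Rotate each head's v_proj output rows by Q.T and the matching
    # o_proj input columns by Q; since Q @ Q.T == I, the composed
    # o_proj(v_proj(x)) output is unchanged.
    Q = hadamard(head_dim)
    hidden = W_v.shape[1]
    # W_v: (num_heads * head_dim, hidden) -> rotate per-head row blocks.
    Wv = W_v.reshape(num_heads, head_dim, hidden)
    Wv = torch.matmul(Q.T, Wv).reshape(num_heads * head_dim, hidden)
    # W_o: (out, num_heads * head_dim) -> rotate matching column blocks.
    out = W_o.shape[0]
    Wo = W_o.reshape(out, num_heads, head_dim)
    Wo = torch.matmul(Wo, Q).reshape(out, num_heads * head_dim)
    return Wv, Wo
```

If the V cache is quantized, the rotation spreads outliers across channels before quantization; if the V cache is kept in full precision, the rotation is an exact no-op on the model's outputs, which is the crux of the question above.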