Use `GGML_BACKEND_DL` and `GGML_CPU_ALL_VARIANTS` for x86_64 builds when it's ready #68

### Comment:

Hi maintainers!

llama.cpp added a cool feature: dynamic dispatch to CPU backends that are compiled for the various x86_64 microarchitecture levels. This can enable great performance benefits! It basically comes down to enabling `GGML_BACKEND_DL` and `GGML_CPU_ALL_VARIANTS`.

I created an issue for llama-cpp-python to add some missing bindings that are needed to really make use of this: https://github.com/abetlen/llama-cpp-python/issues/2069

When this has been taken care of and llama-cpp-python supports this, I think it would be really great if this feedstock would enable this for the x86_64 CPU builds.
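For reference, here is a minimal sketch of the CMake configure arguments this would probably involve, written as a small Python helper so it is easy to check. The helper name `ggml_dispatch_cmake_args` and the directory names are hypothetical. Passing `-DGGML_NATIVE=OFF` is my assumption (native tuning seems to conflict with a multi-variant build), so please verify it against the llama.cpp build documentation rather than taking it as the feedstock's actual recipe.

```python
import shlex


def ggml_dispatch_cmake_args(source_dir: str, build_dir: str) -> list[str]:
    """Hypothetical helper: build a CMake configure command for a CPU build
    with dynamically loadable backends and all CPU variants enabled."""
    return [
        "cmake",
        "-S", source_dir,
        "-B", build_dir,
        # The two options named in this issue.
        "-DGGML_BACKEND_DL=ON",
        "-DGGML_CPU_ALL_VARIANTS=ON",
        # Assumption: native tuning is turned off for a multi-variant build.
        "-DGGML_NATIVE=OFF",
    ]


if __name__ == "__main__":
    # Print the command as a single shell-quoted line for inspection.
    print(shlex.join(ggml_dispatch_cmake_args("llama.cpp", "build")))
```

The helper only assembles the argument list and does not run CMake, so the real build script would still need to pass these options through its own configure step.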
