Skip to content

[Question]: Multimodal not supported? #215

Description

@mpetruc

Question

When trying to upload an image i'm getting a pop-up saying: Multimodal not supported form the server of model. The model i'm using (ibm-granite-docling-258m) IS multimodal, and llama.ui was built from source.

What am ai doing wrong? Do i need to enable vision anywhere? Thanks.

Steps Taken

npm ci; npm run build; npm start
${llama_server} -m ${models}/granite-docling-258M-bf16.gguf --mmproj ${models}/mmproj-ibm-granite-docling-258M-f16.gguf

Environment

wsl in windows 11; Application Version: 2.36.0

Additional Context

No response

Metadata

Metadata

Assignees

No one assigned

    Labels

    questionFurther information is requested

    Type

    No type

    Fields

    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions