Question
When trying to upload an image i'm getting a pop-up saying: Multimodal not supported form the server of model. The model i'm using (ibm-granite-docling-258m) IS multimodal, and llama.ui was built from source.
What am ai doing wrong? Do i need to enable vision anywhere? Thanks.
Steps Taken
npm ci; npm run build; npm start
${llama_server} -m ${models}/granite-docling-258M-bf16.gguf --mmproj ${models}/mmproj-ibm-granite-docling-258M-f16.gguf
Environment
wsl in windows 11; Application Version: 2.36.0
Additional Context
No response
Question
When trying to upload an image i'm getting a pop-up saying: Multimodal not supported form the server of model. The model i'm using (ibm-granite-docling-258m) IS multimodal, and llama.ui was built from source.
What am ai doing wrong? Do i need to enable vision anywhere? Thanks.
Steps Taken
npm ci; npm run build; npm start
${llama_server} -m ${models}/granite-docling-258M-bf16.gguf --mmproj ${models}/mmproj-ibm-granite-docling-258M-f16.gguf
Environment
wsl in windows 11; Application Version: 2.36.0
Additional Context
No response