Skip to content

Enable support for Gemma 3n-E4B #138

@shahkarnav115-beep

Description

@shahkarnav115-beep

Name and Version

dev_backend_openvino

Operating systems

Windows

GGML backends

OpenVINO

Hardware

13th Gen Intel(R) Core(TM) i5-13450HX + Nvidia RTX 3050

Models

gemma-3n-E4B-it-Q4_K_M.gguf

Problem description & steps to reproduce

gemma 3n-E4B not running with openvino backend.

GGML_OPENVINO:ON

llama-cli.exe -m "path/to/gemma-3n-E4B-it-Q4_K_M.gguf"

First Bad Commit

No response

Relevant log output

OpenVINO: using device CPU

Loading model... /Doesn't handle node name: node_2 op: REPEAT                                                          -Doesn't handle node name: node_2 op: REPEAT
Doesn't handle node name: node_4 op: SQR
Doesn't handle node name: node_5 op: SUM_ROWS
Doesn't handle node name: node_6 op: SQRT
Doesn't handle node name: node_8 op: SQR
Doesn't handle node name: node_9 op: SUM_ROWS
Doesn't handle node name: node_10 op: SQRT
Doesn't handle node name: node_11 op: DIV
Doesn't handle node name: inp_stacked op: CONCAT
GGML OpenVINO backend ov::Exception: Check 'dynamic_dim_value == node->ne[m_node_dynamic_dims[node]]' failed at C:\Users\karnav\openvino.genai\thirdparty\llama.cpp\ggml\src\ggml-openvino\ggml-decoder.cpp:1142:
Dynamic dim value mismatch for node: inp_stacked (permuted) and its src[0]: inp_stacked

graph_compute: ggml_backend_sched_graph_compute_async failed with error -1
process_ubatch: failed to compute graph, compute status: -1
llama_decode: failed to decode, ret = -3

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    Status

    Backlog

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions