Skip to content

Higher latency for image embeddings for ColModernVBert compared to Colpali 1.3 #388

@aabbhishekksr

Description

@aabbhishekksr

on A100 40GB

Comparing latency for text and image both using colpali 1.3 and colmodernvbert. Lateny is lower on text, and way high on Image.

On further breakdown, noticed the processor in both the models are processing very differently . Colpali is using 1 tile of 448448 wheras colmodernvbert is using 13 tike of 512512

COLPALI

Model Load Time: 11.603 sec

Image Embedding:

  • Total images: 10
  • Embedding calls: 1
  • Avg per call: 835.43 ms
  • Avg per image: 83.54 ms
  • Total time: 0.835 sec
  • Throughput: 11.97 images/sec

Query Embedding:

  • Total queries: 4
  • Embedding calls: 4
  • Avg per call: 40.91 ms
  • Avg per query: 40.91 ms
  • Total time: 0.164 sec
  • Throughput: 24.45 queries/sec

Retrieval (Scoring):

  • Avg per retrieval: 2.46 ms
  • Total: 0.010 sec

Model Load Time: 4.651 sec

Image Embedding:

  • Total images: 10
  • Embedding calls: 1
  • Avg per call: 4250.66 ms
  • Avg per image: 425.07 ms
  • Total time: 4.251 sec
  • Throughput: 2.35 images/sec

Query Embedding:

  • Total queries: 4

  • Embedding calls: 4

  • Avg per call: 31.51 ms

  • Avg per query: 31.51 ms

  • Total time: 0.126 sec

  • Throughput: 31.73 queries/sec

Retrieval (Scoring):

  • Avg per retrieval: 2.99 ms
  • Total: 0.012 sec
  • Count: 4

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions