-
Notifications
You must be signed in to change notification settings - Fork 234
Open
Description
on A100 40GB
Comparing latency for text and image both using colpali 1.3 and colmodernvbert. Lateny is lower on text, and way high on Image.
On further breakdown, noticed the processor in both the models are processing very differently . Colpali is using 1 tile of 448448 wheras colmodernvbert is using 13 tike of 512512
COLPALI
Model Load Time: 11.603 sec
Image Embedding:
- Total images: 10
- Embedding calls: 1
- Avg per call: 835.43 ms
- Avg per image: 83.54 ms
- Total time: 0.835 sec
- Throughput: 11.97 images/sec
Query Embedding:
- Total queries: 4
- Embedding calls: 4
- Avg per call: 40.91 ms
- Avg per query: 40.91 ms
- Total time: 0.164 sec
- Throughput: 24.45 queries/sec
Retrieval (Scoring):
- Avg per retrieval: 2.46 ms
- Total: 0.010 sec
Model Load Time: 4.651 sec
Image Embedding:
- Total images: 10
- Embedding calls: 1
- Avg per call: 4250.66 ms
- Avg per image: 425.07 ms
- Total time: 4.251 sec
- Throughput: 2.35 images/sec
Query Embedding:
-
Total queries: 4
-
Embedding calls: 4
-
Avg per call: 31.51 ms
-
Avg per query: 31.51 ms
-
Total time: 0.126 sec
-
Throughput: 31.73 queries/sec
Retrieval (Scoring):
- Avg per retrieval: 2.99 ms
- Total: 0.012 sec
- Count: 4
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels