Skip to content
#

vision-llm

Here are 20 public repositories matching this topic...

A FastAPI-based backend service that extracts structured information from academic marksheets (images or PDFs) using OCR and an LLM, and returns a normalized JSON response with confidence scores.

  • Updated Jan 24, 2026
  • Python

This repository focuses on customizing the Qwen2.5-Vision model for specific tasks. It provides step-by-step guidance, scripts, and best practices for fine-tuning the model on custom datasets. Ideal for developers and researchers, it ensures optimal performance and accuracy tailored to unique use cases.

  • Updated Apr 22, 2025
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the vision-llm topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the vision-llm topic, visit your repo's landing page and select "manage topics."

Learn more