A lightweight FastAPI web app that generates AI captions from uploaded images, plus a social-ready caption mode with tone, emojis, and hashtags.
Explore the docs »
English
View Repository | Report Bug | Request Feature
Table of Contents
This project is a lightweight web app that generates AI captions from uploaded images. It supports a standard caption mode and a social caption mode with tone, emojis, and hashtags.
- Upload images and generate AI captions
- Social caption mode with tone, emojis, and hashtags
- Clean UI with preview and drag-and-drop upload
- FastAPI backend with vision-language models
- Python 3.11 (recommended)
- Conda or venv
- Clone the repo
git clone https://github.com/17addisonlin/AI-Powered-Image-Captioning-System.git cd AI-Powered-Image-Captioning-System - Create and activate an environment
conda create -n captioning python=3.11 -y conda activate captioning
- Install dependencies
pip install -r requirements.txt
- Start the server
uvicorn main:app --reload
- Open
http://127.0.0.1:8000
- Upload an image in Caption mode to get a short description.
- Switch to Social mode to generate a social-ready caption.
- Choose tone and toggle emojis/hashtags if desired.
- Slow startup: models download on first run; allow extra time.
- Dependency errors: verify you are using the
captioningenvironment. - Blank caption: try a different image or reduce image size.
- Images are processed locally by your server; nothing is uploaded unless you deploy it.
- If you deploy publicly, add authentication and rate limiting.
- Enhance UI, deploy two tabs with its features
- Add mutiple model support, try light and heavy model
- Add dark/white toggle background
- Improve speed of usage
See the open issues for a full list of proposed features and known issues.
Contributions are welcome. If you have improvements or bug fixes, please open an issue or submit a pull request.
- Fork the Project
- Create your Feature Branch (
git checkout -b feature/AmazingFeature) - Commit your Changes (
git commit -m 'Add some AmazingFeature') - Push to the Branch (
git push origin feature/AmazingFeature) - Open a Pull Request
Distributed under the MIT License. See LICENSE for more information.
Addison Lin - LinkedIn
Project Link: https://github.com/17addisonlin/AI-Powered-Image-Captioning-System

