Go to file
MSVstudios 14db6bcd61 Initial Commit 2026-03-16 11:43:26 +01:00
.vscode Initial Commit 2026-03-16 11:43:26 +01:00
src Initial Commit 2026-03-16 11:43:26 +01:00
tests Initial Commit 2026-03-16 11:43:26 +01:00
.gitignore Initial Commit 2026-03-16 11:43:26 +01:00
README.md Initial Commit 2026-03-16 11:43:26 +01:00
config.toml.example Initial Commit 2026-03-16 11:43:26 +01:00
requirements.txt Initial Commit 2026-03-16 11:43:26 +01:00

README.md

Florence-2 Captioning Pipeline

High-throughput asynchronous captioning pipeline using Florence-2 Base PromptGen.

Goals

  • Download images from S3/HTTP concurrently
  • Preprocess (resize/normalize)
  • Run batched caption generation on GPU
  • Persist captions back to a database (async)

Project structure

  • src/: implementation code
  • tests/: unit/integration tests
  • todo.md: tasks list
  • implementationPlanV2.md: architecture + design notes

Quickstart

  1. Install dependencies:
pip install -r requirements.txt
  1. Configure environment variables (see src/config.py for expected vars).

  2. Run the pipeline (example):

python -m src.pipeline --dry-run

Notes

This repo is intended as a foundation for building a fast, async dataset captioning tool.