content-cleaning

Here are 3 public repositories matching this topic...

chigwell / luminaweaver

A new package that processes user-submitted text descriptions of images or videos containing watermarks and returns structured, watermark-free descriptions. It uses an LLM to reinterpret the content w

text-processing archival structured-output video-description content-creation watermark-removal image-description llm-interpretation content-cleaning user-submitted-text text-reinterpretation watermark-free-descriptions

Updated Dec 21, 2025
Python

xsukax / xsukax-ReadClean-PDF

A privacy-focused, client-side web application that extracts clean, readable content from any webpage and converts it to PDF format. Built with pure HTML, CSS, and JavaScript—no backend required, no tracking, complete privacy.

Updated Oct 5, 2025
HTML

aaa-mvc / aca

Copy transcripts from YouTube, podcasts, or Feishu, paste into Obsidian — timestamps vanish instantly. Covers 8 formats across English and Chinese: colon-based seconds, Chinese fen-miao markers, mid-paragraph stamps. Three-pass regex engine: line-start anchoring, global sweep, whitespace normalization. You do one thing: Ctrl+V.

Updated Jun 14, 2026
JavaScript
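
The three passes named in the aca description (line-start anchoring, global sweep, whitespace normalization) could be sketched roughly as below. This is an illustration only, not the plugin's code: the plugin is written in JavaScript and its real expressions are not shown in this listing. The function name `strip_timestamps` and every pattern are assumptions, and only two of the eight advertised formats (colon-based stamps and Chinese fen-miao markers) are covered.

```python
import re

# Hypothetical patterns, chosen for illustration; not taken from the aca repository.
# A stamp is either colon-based (1:23, 01:02:03) or a fen-miao marker (3分15秒).
STAMP = r"(?:\d{1,2}:\d{2}(?::\d{2})?|\d{1,3}分\d{1,2}秒)"
# Optionally wrapped in ASCII or full-width brackets.
BRACKETED = rf"[\[(（]?{STAMP}[\])）]?"

LINE_START = re.compile(rf"^[ \t]*{BRACKETED}[ \t]*", re.MULTILINE)
ANYWHERE = re.compile(BRACKETED)
SPACE_RUNS = re.compile(r"[ \t]{2,}")
TRAILING = re.compile(r"[ \t]+$", re.MULTILINE)


def strip_timestamps(text: str) -> str:
    """Remove transcript timestamps in three regex passes."""
    # Pass 1: stamps anchored at the start of a line, with the spacing after them.
    text = LINE_START.sub("", text)
    # Pass 2: global sweep for stamps left in the middle of a paragraph.
    # Caveat: this also removes ordinary text shaped like a stamp, e.g. "a 1:23 ratio".
    text = ANYWHERE.sub("", text)
    # Pass 3: whitespace normalization for the gaps the first two passes leave behind.
    text = SPACE_RUNS.sub(" ", text)
    text = TRAILING.sub("", text)
    return text.strip()


if __name__ == "__main__":
    sample = "0:15 Hello there\n[01:02:03] and then 2:30 we move on\n3分15秒 你好"
    print(strip_timestamps(sample))
```

Running the anchored pass first lets it consume the spacing that follows a leading stamp, so the global sweep only has to deal with stamps embedded in running text.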
