Skip to content
#

content-cleaning

Here are 3 public repositories matching this topic...

Language: All
Filter by language

A new package that processes user-submitted text descriptions of images or videos containing watermarks and returns structured, watermark-free descriptions. It uses an LLM to reinterpret the content w

  • Updated Dec 21, 2025
  • Python
xsukax-ReadClean-PDF

A privacy-focused, client-side web application that extracts clean, readable content from any webpage and converts it to PDF format. Built with pure HTML, CSS, and JavaScript—no backend required, no tracking, complete privacy.

  • Updated Oct 5, 2025
  • HTML

Copy transcripts from YouTube, podcasts, or Feishu, paste into Obsidian — timestamps vanish instantly. Covers 8 formats across English and Chinese: colon-based seconds, Chinese fen-miao markers, mid-paragraph stamps. Three-pass regex engine: line-start anchoring, global sweep, whitespace normalization. You do one thing: Ctrl+V.

  • Updated Jun 14, 2026
  • JavaScript

Improve this page

Add a description, image, and links to the content-cleaning topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the content-cleaning topic, visit your repo's landing page and select "manage topics."

Learn more