Chinese scraping utilities -- date parsing, city extraction, SHA256 ID, UA pool, rate limiter, DeepSeek client
-
Updated
Jun 12, 2026 - Python
Chinese scraping utilities -- date parsing, city extraction, SHA256 ID, UA pool, rate limiter, DeepSeek client
🔍 基于BERT的MBA论文AIGC风险检测系统 | BERT-based MBA Thesis AIGC Detection System with 96.7% F1 Score. Complete S1-S8 pipeline for academic integrity.
Document processing toolkit for Word, PowerPoint, Excel, and CSV files. Bytes in, bytes out.
DocKit Raycast extension — fix formatting in Word, PowerPoint, and Excel files
中英双语AI欺诈文本检测引擎,支持MCP协议,为AI Agent提供内容安全检测服务。
A high-quality, anonymous text corpus focusing on undergraduate social dynamics and physiological hygiene awareness in Shandong universities. Optimized for LLM pre-training and SFT fine-tuning.
Copy transcripts from YouTube, podcasts, or Feishu, paste into Obsidian — timestamps vanish instantly. Covers 8 formats across English and Chinese: colon-based seconds, Chinese fen-miao markers, mid-paragraph stamps. Three-pass regex engine: line-start anchoring, global sweep, whitespace normalization. You do one thing: Ctrl+V.
KIT-2920 is a reading-layer and transliteration infrastructure for the Chinese Buddhist Canon (Taishō Tripiṭaka).
Add a description, image, and links to the chinese-text topic page so that developers can more easily learn about it.
To associate your repository with the chinese-text topic, visit your repo's landing page and select "manage topics."