ClippingBot

Turn Feishu bot messages into Markdown notes in your Obsidian vault.

Chinese documentation: README.zh-CN.md

Features

- Receive text messages that contain URLs
- Extract the first URL from the message body
- Fetch page content through Crawl4AI
- Save the result as a Markdown note with frontmatter
- Write notes directly into a local Obsidian-compatible directory
- Run in Feishu long-connection mode
- Run in local HTTP server mode for debugging or custom automation
- Run one-off clipping jobs from the CLI
- Support Crawl4AI fallback strategies for problematic sites
- Deploy with Docker Compose

How It Works

1. A message is sent to a Feishu bot, or posted to the local ingest endpoint.
2. ClippingBot extracts the first URL from the incoming text.
3. ClippingBot requests content from Crawl4AI.
4. ClippingBot renders a Markdown note with metadata.
5. The note is written to the configured vault directory.
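
The URL extraction step can be sketched as follows. This is a minimal illustration, not the project's actual code: the `URL_PATTERN` regex and the `extract_first_url` helper are hypothetical names, and the trailing-punctuation trimming is an assumption about what is convenient for chat text.

```python
import re
from typing import Optional

# Hypothetical pattern: an http(s) URL running up to the next whitespace.
URL_PATTERN = re.compile(r"https?://\S+")


def extract_first_url(text: str) -> Optional[str]:
    """Return the first http(s) URL found in text, or None if there is none."""
    match = URL_PATTERN.search(text)
    if match is None:
        return None
    # Assumption: punctuation directly after a URL belongs to the sentence,
    # not to the URL, so it is trimmed.
    return match.group(0).rstrip(".,;:!?)]}>\"'")
```

For the sample message used throughout this README, `extract_first_url("Example Domain https://example.com")` yields `https://example.com`; any later URLs in the same message are ignored.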

Quick Start

Local

git clone https://github.com/parap1uie-s/ClippingBot.git
cd ClippingBot
cp .env.example .env
python3 -m clippingbot.main

CLI

python3 -m clippingbot.cli "Example Domain https://example.com"

HTTP Ingest

curl -X POST 'http://127.0.0.1:8787/ingest' \
  -H 'Content-Type: application/json' \
  -d '{"text":"Example Domain https://example.com"}'
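
The same request can be sent from Python using only the standard library. The sketch below mirrors the curl command above; `build_ingest_request` and `send_to_ingest` are hypothetical helper names, and since the response format of `/ingest` is not documented here, the body is returned as raw text.

```python
import json
import urllib.request


def build_ingest_request(text: str, base_url: str = "http://127.0.0.1:8787") -> urllib.request.Request:
    """Build a POST request equivalent to the curl example above."""
    payload = json.dumps({"text": text}).encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/ingest",
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send_to_ingest(text: str) -> str:
    """Send the request to a locally running server and return the raw response body."""
    request = build_ingest_request(text)
    with urllib.request.urlopen(request, timeout=30) as response:
        return response.read().decode("utf-8")
```

Calling `send_to_ingest("Example Domain https://example.com")` requires ClippingBot to be running in local HTTP server mode.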

Configuration

Required:

- CLIPPINGBOT_OBSIDIAN_VAULT
- CLIPPINGBOT_CRAWL4AI_BASE_URL
- One of:
  - CLIPPINGBOT_CRAWL4AI_EMAIL
  - CLIPPINGBOT_CRAWL4AI_BEARER_TOKEN

Common optional settings:

- CLIPPINGBOT_OBSIDIAN_INBOX
- CLIPPINGBOT_CRAWL4AI_FILTER
- CLIPPINGBOT_CRAWL4AI_TIMEOUT_SECONDS
- CLIPPINGBOT_CRAWL4AI_MODE
- CLIPPINGBOT_CRAWL4AI_CRAWL_FALLBACK_DOMAINS
- CLIPPINGBOT_NOTE_TAGS
- CLIPPINGBOT_NOTE_OVERWRITE_EXISTING
- CLIPPINGBOT_FILENAME_MAX_LENGTH
- CLIPPINGBOT_FEISHU_APP_ID
- CLIPPINGBOT_FEISHU_APP_SECRET
- CLIPPINGBOT_FEISHU_DELIVERY_MODE
- CLIPPINGBOT_FEISHU_REPLY_ENABLED
- CLIPPINGBOT_FEISHU_REPLY_RECEIVE_ID_TYPE

See .env.example for a complete example.
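
The "required" rule above (two mandatory variables plus at least one Crawl4AI credential) can be checked before starting the bot. The following is a sketch only: `missing_settings` is a hypothetical helper, and ClippingBot's own startup validation may behave differently.

```python
from typing import List, Mapping

# Variable names are taken from the list above; the check itself is illustrative.
REQUIRED = ("CLIPPINGBOT_OBSIDIAN_VAULT", "CLIPPINGBOT_CRAWL4AI_BASE_URL")
AUTH_ONE_OF = ("CLIPPINGBOT_CRAWL4AI_EMAIL", "CLIPPINGBOT_CRAWL4AI_BEARER_TOKEN")


def missing_settings(env: Mapping[str, str]) -> List[str]:
    """Return descriptions of the required settings that are absent or blank."""
    problems = [name for name in REQUIRED if not env.get(name, "").strip()]
    # At least one of the two credential variables must be set.
    if not any(env.get(name, "").strip() for name in AUTH_ONE_OF):
        problems.append(" or ".join(AUTH_ONE_OF))
    return problems
```

Passing `os.environ` to `missing_settings` after loading `.env` returns an empty list when the configuration is complete.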

Runtime Modes

ClippingBot supports two runtime modes:

- longconn: Feishu long-connection mode
- webhook: Local HTTP server mode

Unified entrypoint:

python3 -m clippingbot.main

Crawl4AI Modes

ClippingBot supports three Crawl4AI fetch modes:

- md: Always use /md
- crawl: Always use /crawl
- auto: Use /md by default and fall back to /crawl when needed

This is useful for sites where /md may return an interstitial or verification page while /crawl can still extract the article body.
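
The three modes can be pictured as a small dispatch function. This is a sketch under stated assumptions, not the logic in `clippingbot/crawl4ai_client.py`: the callables `fetch_md`, `fetch_crawl`, and `needs_fallback` are hypothetical stand-ins supplied by the caller, and the rule for when a fallback is "needed" is left to that predicate because this README does not specify it.

```python
from typing import Callable


def fetch_markdown(
    url: str,
    mode: str,
    fetch_md: Callable[[str], str],
    fetch_crawl: Callable[[str], str],
    needs_fallback: Callable[[str, str], bool],
) -> str:
    """Pick the Crawl4AI endpoint according to the configured mode."""
    if mode == "md":
        return fetch_md(url)
    if mode == "crawl":
        return fetch_crawl(url)
    if mode == "auto":
        result = fetch_md(url)
        # Hypothetical predicate: decides from the URL and the /md result
        # (for example, an interstitial page) whether to retry with /crawl.
        if needs_fallback(url, result):
            return fetch_crawl(url)
        return result
    raise ValueError(f"unknown Crawl4AI mode: {mode!r}")
```

Injecting the fetchers keeps the mode logic testable without a running Crawl4AI instance.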

Output

Each clip is written as a Markdown file that includes:

- frontmatter
- source URL
- source channel
- clip timestamp
- original share text
- captured Markdown content
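
To make the note layout concrete, here is a rough sketch of a renderer that produces a file with those parts. The frontmatter keys (`source`, `channel`, `clipped_at`, `share_text`) and the `render_note` function are assumptions for illustration; the real keys written by `clippingbot/note_writer.py` may differ.

```python
import json
from datetime import datetime, timezone
from typing import Optional


def render_note(
    url: str,
    channel: str,
    share_text: str,
    content: str,
    clipped_at: Optional[datetime] = None,
) -> str:
    """Render a Markdown note: YAML frontmatter followed by the captured body."""
    clipped_at = clipped_at or datetime.now(timezone.utc)
    lines = [
        "---",
        # json.dumps yields double-quoted strings, which YAML also accepts.
        f"source: {json.dumps(url)}",
        f"channel: {json.dumps(channel)}",
        f"clipped_at: {json.dumps(clipped_at.isoformat())}",
        f"share_text: {json.dumps(share_text, ensure_ascii=False)}",
        "---",
        "",
        content.strip(),
        "",
    ]
    return "\n".join(lines)
```

Quoting every value avoids YAML surprises when the share text contains colons or other special characters.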

Docker Compose

cp .env.example .env
docker compose up -d --build

If ClippingBot and Crawl4AI both run in containers, put them on a shared Docker network and address Crawl4AI by its service name, for example:

CLIPPINGBOT_CRAWL4AI_BASE_URL=http://crawl4ai:11235

Feishu Event Support

ClippingBot accepts text messages that contain URLs, for example:

Example Domain https://example.com

For webhook-style delivery, the repository also supports standard Feishu im.message.receive_v1 payloads.
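
For orientation, the sketch below pulls the message text out of an `im.message.receive_v1` event. It assumes the usual Feishu event layout, in which `event.message.content` is a JSON-encoded string with a `text` field for text messages; `text_from_feishu_event` is a hypothetical helper, not a function from this repository.

```python
import json
from typing import Optional


def text_from_feishu_event(payload: dict) -> Optional[str]:
    """Return the text of a Feishu text-message event, or None if it does not apply."""
    if payload.get("header", {}).get("event_type") != "im.message.receive_v1":
        return None
    message = payload.get("event", {}).get("message", {})
    if message.get("message_type") != "text":
        return None
    try:
        # The content field is itself a JSON string, e.g. '{"text": "..."}'.
        content = json.loads(message.get("content", ""))
    except (json.JSONDecodeError, TypeError):
        return None
    text = content.get("text") if isinstance(content, dict) else None
    return text if isinstance(text, str) else None
```

The returned text can then go through the same first-URL extraction as any other incoming message.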

Project Structure

- clippingbot/main.py: runtime entrypoint
- clippingbot/feishu_longconn.py: Feishu long-connection listener
- clippingbot/server.py: HTTP server and ingest endpoint
- clippingbot/crawl4ai_client.py: Crawl4AI client and fallback logic
- clippingbot/note_writer.py: Markdown note rendering and persistence
- .env.example: example configuration
- docker-compose.yml: Docker deployment entry

Limitations

- Only the first URL in a message is processed
- Duplicate handling is filename-based using a URL hash
- Feishu bot behavior still depends on upstream app permissions and event subscription configuration
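
To illustrate what filename-based duplicate handling with a URL hash can look like, here is a sketch. The naming scheme (title slug plus the first eight hex characters of a SHA-256 digest) and the `note_filename` helper are assumptions for this example; the project's actual scheme may differ.

```python
import hashlib
import re


def note_filename(title: str, url: str, max_length: int = 80) -> str:
    """Derive a stable filename, so clipping the same URL again maps to the same file."""
    # Assumed scheme: a short digest of the URL makes the name deterministic.
    digest = hashlib.sha256(url.encode("utf-8")).hexdigest()[:8]
    # Replace anything that is not a word character or hyphen with a hyphen.
    slug = re.sub(r"[^\w\-]+", "-", title).strip("-") or "clip"
    suffix = f"-{digest}.md"
    # Truncate the slug so the whole name respects max_length.
    return slug[: max(1, max_length - len(suffix))] + suffix
```

Under such a scheme, two URLs that differ only in tracking parameters hash differently and are therefore stored as separate notes.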
