AutoPublish

English · العربية · Español · Français · 日本語 · 한국어 · Tiếng Việt · 中文 (简体) · 中文（繁體） · Deutsch · Русский

AutoPublish

Script-first, browser-driven multi-platform short-video publishing.
_{Canonical operations manual for setup, runtime, queue mode, and platform automation workflows.}

Jump to	Link
First-time setup	Start Here
Run with local watcher	Running the CLI pipeline (`autopub.py`)
Run via HTTP queue	Running the Tornado service (`app.py`)
Deploy as service	Raspberry Pi / Linux Service Setup
Shipinhao live ops	`docs/SHIPINHAO_PUBLISH_WORKFLOW.md`
Support the project	Support

Automation toolkit for distributing short-form video content to multiple Chinese and international creator platforms. The project combines a Tornado-based service, Selenium automation bots, and a local file-watcher workflow so that dropping a video into a folder eventually results in uploads to XiaoHongShu, Douyin, Bilibili, WeChat Channels (ShiPinHao), Instagram, and optionally YouTube.

The repository is intentionally low-level: most configuration lives in Python files and shell scripts. This document is an operational manual that covers setup, runtime, and extension points.

⚙️ Operational philosophy: this project favors explicit scripts and direct browser automation over hidden abstraction layers. ✅ Canonical policy for this README: preserve technical detail, then improve readability and discoverability. 🌍 Localization status (verified in this workspace on February 28, 2026): i18n/ currently includes Arabic, German, Spanish, French, Japanese, Korean, Russian, Vietnamese, Simplified Chinese, and Traditional Chinese variants.

Quick Navigation

I want to...	Go here
Run my first publish	Quick Start Checklist
Compare runtime modes	Runtime Modes at a Glance
Configure credentials and paths	Configuration
Launch API mode and queue jobs	Running the Tornado service (`app.py`)
Validate with copy/paste commands	Examples
Set up on Raspberry Pi/Linux	Raspberry Pi / Linux Service Setup

Start Here

If you are new to this repo, use this sequence:

Read Prerequisites and Installation.
Configure secrets and absolute paths in Configuration.
Prepare browser debug sessions in Preparing Browser Sessions.
Choose one runtime mode from Usage: autopub.py (watcher) or app.py (API queue).
Validate with commands from Examples.

Overview

AutoPublish currently supports two production runtime modes:

CLI watcher mode (autopub.py) for folder-based ingestion and publishing.
API queue mode (app.py) for ZIP-based publishing via HTTP (/publish, /publish/queue).

It is designed for operators who prefer transparent, script-first workflows over abstract orchestration platforms.

Runtime Modes at a Glance

Mode	Entry point	Input	Best for	Output behavior
CLI watcher	`autopub.py`	Files dropped into `videos/`	Local operator workflows and cron/service loops	Processes detected videos and publishes immediately to selected platforms
API queue service	`app.py`	ZIP upload to `POST /publish`	Integrations with upstream systems and remote triggering	Accepts jobs, enqueues them, and executes publishing in worker order

Platform Coverage Snapshot

Platform	Publisher module	Login helper	Control port	CLI mode	API mode
XiaoHongShu	`pub_xhs.py`	`login_xiaohongshu.py`	`5003`	✅	✅
Douyin	`pub_douyin.py`	`login_douyin.py`	`5004`	✅	✅
Bilibili	`pub_bilibili.py`	N/A	`5005`	✅	✅
ShiPinHao (WeChat Channels)	`pub_shipinhao.py`	`login_shipinhao.py`	`5006`	Optional	✅
Instagram	`pub_instagram.py`	`login_instagram.py`	`5007`	Optional	✅
YouTube	`pub_y2b.py`	N/A	`9222`	Optional	✅

Quick Snapshot

What	Value	Color cue
Primary language	Python 3.10+
Main runtimes	CLI watcher (`autopub.py`) + Tornado queue service (`app.py`)
Automation engine	Selenium + remote-debug Chromium sessions
Input formats	Raw videos (`videos/`) and ZIP bundles (`/publish`)
Current repo workspace path	`/home/lachlan/ProjectsLFS/AutoPublish`
Ideal users	Creators/ops engineers managing multi-platform short video pipelines

Operational Safety Snapshot

Topic	Current state	Action
Hard-coded paths	Present in multiple modules/scripts	Update path constants per host before production runs
Browser login state	Required	Keep persistent remote-debug profiles per platform
Captcha handling	Optional integrations available	Configure 2Captcha/Turing credentials if needed
License declaration	No top-level `LICENSE` file detected	Confirm usage terms with maintainer before redistribution

Compatibility & Assumptions

Item	Current assumption in this repo
Python	3.10+
Runtime environment	Linux desktop/server with GUI display available to Chromium
Browser control mode	Remote debugging sessions with persisted profile directories
Primary API port	`8081` (`app.py --port`)
Processing backend	`upload_url` + `process_url` must be reachable and return valid ZIP output
Workspace used for this draft	`/home/lachlan/ProjectsLFS/AutoPublish`

Start Here
Overview
Runtime Modes at a Glance
Platform Coverage Snapshot
Quick Snapshot
Operational Safety Snapshot
Compatibility & Assumptions
System Overview
Features
Project Structure
Repository Layout
Prerequisites
Installation
Configuration
Configuration Verification Checklist
Preparing Browser Sessions
Usage
Examples
Metadata & ZIP Format
Data & Artifact Lifecycle
Platform-Specific Notes
Raspberry Pi / Linux Service Setup
Legacy macOS Scripts
Troubleshooting & Maintenance
FAQ
Extending the System
Quick Start Checklist
Development Notes
Roadmap
Contributing
Security & Ops Checklist
License
Acknowledgements
❤️ Support

System Overview

🎯 End-to-end flow from raw media to published posts:

flowchart LR
    A[Video in videos/] --> B[autopub.py watcher]
    B --> C[process_video.py]
    C --> D[ZIP + metadata in transcription_data/]
    D --> E[pub_*.py Selenium publishers]
    F[POST /publish ZIP] --> G[app.py queue worker]
    G --> E
    E --> H[Platforms: XHS, Douyin, Bilibili, ShiPinHao, Instagram, YouTube]

Workflow at a glance:

Raw footage intake: Place a video inside videos/. The watcher (either autopub.py or a scheduler/service) notices new files using videos_db.csv and processed.csv.
Asset generation: process_video.VideoProcessor uploads the file to a content-processing server (upload_url and process_url) which returns a ZIP package containing:
- the edited/encoded video (<stem>.mp4),
- a cover image,
- {stem}_metadata.json with localized titles, descriptions, tags, etc.
Publishing: Metadata drives the Selenium publishers in pub_*.py. Each publisher attaches to an already-running Chromium/Chrome instance using remote debugging ports and persistent user-data directories.
Web control plane (optional): app.py exposes /publish, accepts pre-built ZIP bundles, unpacks them, and queues publish jobs to the same publishers. It can also refresh browser sessions and trigger login helpers (login_*.py).
Support modules: load_env.py loads secrets from the repo-local .env first and uses ~/.bashrc only as fallback, utils.py provides helpers (window focus, QR handling, mail utility helpers), and solve_captcha_*.py integrates with Turing/2Captcha when captchas appear.

Features

✨ Designed for pragmatic, script-first automation:

Multi-platform publishing: XiaoHongShu, Douyin, Bilibili, ShiPinHao (WeChat Channels), Instagram, YouTube (optional).
Two operating modes: CLI watcher pipeline (autopub.py) and API queue service (app.py + /publish + /publish/queue).
Per-platform temporary disable switches via ignore_* files.
Remote-debugging browser-session reuse with persistent profiles.
Optional QR/captcha automation and email notification helpers.
No frontend build requirement for the included PWA (pwa/) uploader UI.
Linux/Raspberry Pi automation scripts for service setup (scripts/).

Feature Matrix

Capability	CLI (`autopub.py`)	API (`app.py`)
Input source	Local `videos/` watcher	Uploaded ZIP via `POST /publish`
Queueing	Internal file-based progression	Explicit in-memory job queue
Platform flags	CLI args (`--pub-`) + `ignore_`	Query args (`publish_`) + `ignore_`
Best fit	Single-host operator workflow	External systems and remote triggering

Project Structure

High-level source/runtime layout:

AutoPublish/
├── README.md
├── app.py
├── autopub.py
├── process_video.py
├── load_env.py
├── utils.py
├── pub_*.py                  # platform publishers
├── login_*.py                # platform login/session helpers
├── solve_captcha_*.py
├── smtp.py
├── smtp_test_simple.py
├── send_email_qreader.py
├── requirements.txt
├── requirements.autopub.txt
├── .env.example
├── setup_raspberrypi.md
├── scripts/
├── pwa/
├── figs/
├── .github/FUNDING.yml
├── i18n/                     # multilingual READMEs
├── videos/                   # runtime input artifacts
├── logs/, logs-autopub/      # runtime logs
├── temp/, temp_screenshot/   # runtime temp artifacts
├── videos_db.csv
└── processed.csv

Note: transcription_data/ is used at runtime by processing/publishing flow and may appear after execution.

Repository Layout

🗂️ Key modules and what they do:

Path	Purpose
`app.py`	Tornado service exposing `/publish` and `/publish/queue`, with internal publish queue and worker thread.
`autopub.py`	CLI watcher: scans `videos/`, processes new files, and invokes publishers in parallel.
`process_video.py`	Uploads videos to processing backend and stores returned ZIP bundles.
`pub_xhs.py`, `pub_douyin.py`, `pub_bilibili.py`, `pub_shipinhao.py`, `pub_instagram.py`, `pub_y2b.py`	Selenium automation modules per platform.
`login_xiaohongshu.py`, `login_douyin.py`, `login_shipinhao.py`, `login_instagram.py`	Session checks and QR login flows.
`utils.py`	Shared automation helpers (window focus, QR/mail helper utilities, diagnostics helpers).
`load_env.py`	Loads env vars from shell profile (`~/.bashrc`) and masks sensitive logs.
`smtp.py`, `smtp_test_simple.py`, `send_email_qreader.py`	SMTP/SendGrid helper and test scripts.
`solve_captcha_2captcha.py`, `solve_captcha_turing.py`	Captcha solver integrations.
`scripts/`	Service setup and operations scripts (Raspberry Pi/Linux + legacy automation).
`pwa/`	Static PWA for ZIP preview and publish submission.
`setup_raspberrypi.md`	Step-by-step Raspberry Pi provisioning guide.
`.env.example`	Environment variable template (credentials, paths, captcha keys).
`.github/FUNDING.yml`	Sponsor/funding configuration.
`logs/`, `logs-autopub/`, `temp/`, `temp_screenshot/`, `videos/`	Runtime artifacts and logs (many are gitignored).

Prerequisites

🧰 Install these before first run.

Operating system and tools

Linux desktop/server with an X session (DISPLAY=:1 is common in provided scripts).
Chromium/Chrome and matching ChromeDriver.
GUI/media helpers: xdotool, ffmpeg, zip, unzip.
Python 3.10+ (venv or Conda).

Python dependencies

Minimal runtime set:

pip install selenium tornado requests requests-toolbelt sendgrid qreader opencv-python webdriver-manager

Repository parity:

python -m pip install -r requirements.txt

For lightweight service installs (used by setup scripts by default):

python -m pip install -r requirements.autopub.txt

requirements.autopub.txt contains:

selenium, webdriver-manager, tornado, requests, requests-toolbelt, sendgrid, qreader, opencv-python, numpy, pillow, twocaptcha.

Optional: create a sudo user

sudo useradd -m -s /bin/bash -G sudo <USERNAME> && echo "<USERNAME>:<PASSWORD>" | sudo chpasswd

Installation

🚀 Setup from a clean machine:

Clone the repository:

git clone https://github.com/lachlanchen/AutoPublish.git
cd AutoPublish

Create and activate an environment (example with venv):

python3 -m venv .venv
source .venv/bin/activate
python -m pip install -U pip
python -m pip install -r requirements.txt

Prepare environment variables:

cp .env.example .env
# fill values in .env (do not commit)

Load variables for scripts that need publishing credentials:

python load_env.py

By default, load_env.py reads the repo-local .env first. It only falls back to ~/.bashrc if required keys are still missing.

Configuration

🔐 Set credentials, then verify host-specific paths.

Environment variables

The project expects credentials and optional browser/runtime paths from environment variables. Start from .env.example:

Variable	Description
`FROM_EMAIL`, `TO_EMAIL`, `APP_PASSWORD`	SMTP credentials for QR/login notifications.
`SENDGRID_API_KEY`	SendGrid key for email flows that use SendGrid APIs.
`APIKEY_2CAPTCHA`	2Captcha API key.
`TULING_USERNAME`, `TULING_PASSWORD`, `TULING_ID`	Turing captcha credentials.
`DOUYIN_LOGIN_PASSWORD`	Douyin second verification helper.
`INSTAGRAM_`, `CHROME_`, `CHROMEDRIVER_PATH`	Instagram/browser driver overrides.
`AUTOPUBLISH_BROWSER_BIN`, `AUTOPUBLISH_CHROMEDRIVER`, `AUTOPUBLISH_DISPLAY`	Preferred global browser/driver/display overrides in `app.py`.

Path constants (important)

📌 Most common startup issue: unresolved hard-coded absolute paths.

Several modules still contain hard-coded paths. Update these for your host:

File	Constant(s)	Meaning
`app.py`	`logs_folder_root`, `autopublish_folder_root`, `videos_db_path`, `processed_path`, `transcription_root`, `upload_url`, `process_url`.	API service roots and backend endpoints.
`autopub.py`	`logs_folder_path`, `autopublish_folder_path`, `videos_db_path`, `processed_path`, `transcription_path`, `upload_url`, `process_url`, `chromedriver_path`.	CLI watcher roots and backend endpoints.
`scripts/run_autopub.sh`, `scripts/setup_autopub.sh`	Absolute paths to Python/Conda/repo/log locations.	Legacy/macOS-oriented wrappers.
`utils.py`	FFmpeg path assumptions in cover processing helpers.	Media tooling path compatibility.

Important repository note:

Current repository path in this workspace is /home/lachlan/ProjectsLFS/AutoPublish.
Some code and scripts still reference /home/lachlan/Projects/auto-publish or /Users/lachlan/....
Preserve and adjust these paths locally before production usage.

Platform toggles via `ignore_*`

🧩 Fast safety switch: touching an ignore_* file disables that publisher without code edits.

Publishing flags are also gated by ignore files. Create an empty file to disable a platform:

touch ignore_xhs ignore_douyin ignore_bilibili ignore_shipinhao ignore_instagram ignore_y2b

Remove the corresponding file to re-enable.

Configuration Verification Checklist

Run this quick validation after setting .env and path constants:

python -c "import os;print('AUTOPUBLISH_BROWSER_BIN=', os.getenv('AUTOPUBLISH_BROWSER_BIN'));print('AUTOPUBLISH_CHROMEDRIVER=', os.getenv('AUTOPUBLISH_CHROMEDRIVER'));print('DISPLAY=', os.getenv('DISPLAY') or os.getenv('AUTOPUBLISH_DISPLAY'))"
python -c "from load_env import load_env_from_bashrc; load_env_from_bashrc(); print('Environment load OK')"
python -c "import os; p=os.getenv('AUTOPUBLISH_CHROMEDRIVER') or os.getenv('CHROMEDRIVER_PATH') or '/usr/bin/chromedriver'; print(p, 'exists=', os.path.exists(p))"

If any value is missing, update .env, ~/.bashrc, or script-level constants before running publishers.

Preparing Browser Sessions

🌐 Session persistence is mandatory for reliable Selenium publishing.

Create dedicated profile folders:

mkdir -p ~/chromium_dev_session_{5003,5004,5005,5006,5007,9222}
mkdir -p ~/chromium_dev_session_logs

Launch browser sessions with remote debugging (example for XiaoHongShu):

DISPLAY=:1 chromium-browser \
  --remote-debugging-port=5003 \
  --user-data-dir="$HOME/chromium_dev_session_5003" \
  https://creator.xiaohongshu.com/creator/post \
  > "$HOME/chromium_dev_session_logs/chromium_xhs.log" 2>&1 &

Log in manually once for each platform/profile.
Verify Selenium can attach:

from selenium import webdriver
opts = webdriver.ChromeOptions()
opts.add_experimental_option("debuggerAddress", "127.0.0.1:5003")
driver = webdriver.Chrome(options=opts)
print(driver.title)
driver.quit()

Security note:

app.py currently contains a hard-coded sudo password placeholder (password = "1") used by browser restart logic. Replace this before real deployment.

Usage

▶️ Two runtime modes are available: CLI watcher and API queue service.

Running the CLI pipeline (`autopub.py`)

Put source videos in the watch directory (videos/ or your configured autopublish_folder_path).
Run:

python autopub.py --use-cache --pub-xhs --pub-douyin --pub-bilibili

Flags:

Flag	Meaning
`--pub-xhs`, `--pub-douyin`, `--pub-bilibili`	Restrict publishing to selected platforms. If none are passed, all three default to enabled.
`--test`	Test mode passed into publishers (behavior varies by platform module).
`--use-cache`	Reuse existing `transcription_data/<video>/<video>.zip` if available.

CLI flow per video:

Upload/process through process_video.py.
Extract ZIP to transcription_data/<video>/.
Launch selected publishers via ThreadPoolExecutor.
Append tracking state into videos_db.csv and processed.csv.

Running the Tornado service (`app.py`)

🛰️ API mode is useful for external systems that produce ZIP bundles.

Start server:

python app.py --refresh-time 1800 --port 8081

API endpoint summary:

Endpoint	Method	Purpose
`/publish`	`POST`	Upload ZIP bytes and enqueue a publish job
`/publish/queue`	`GET`	Inspect queue, job history, and publish state

`POST /publish`

📤 Queue a publish job by uploading ZIP bytes directly.

Header: Content-Type: application/octet-stream
Required query/form arg: filename (ZIP filename)
Optional booleans: publish_xhs, publish_douyin, publish_bilibili, publish_shipinhao, publish_instagram, publish_y2b, test
Body: raw ZIP bytes

Example:

curl -X POST "http://localhost:8081/publish?filename=demo.zip&publish_xhs=true&publish_instagram=true&publish_y2b=true" \
  --data-binary @demo.zip \
  -H "Content-Type: application/octet-stream"

Current behavior in code:

Request is accepted and queued.
Immediate response returns JSON including status: queued, job_id, and queue_size.
Worker thread serially processes queued jobs.

`GET /publish/queue`

📊 Observe queue health and in-flight jobs.

Returns queue status/history JSON:

curl "http://localhost:8081/publish/queue"

Response fields include:

status, jobs, queue_size, is_publishing.

Browser refresh thread

♻️ Periodic browser refresh reduces stale-session failures over long uptime windows.

app.py runs a background refresh thread using --refresh-time interval and hooks in login checks. Refresh sleep includes randomized delay behavior.

PWA frontend (`pwa/`)

🖥️ Lightweight static UI for manual ZIP uploads and queue inspection.

Run static UI locally:

cd pwa
python -m http.server 5173

Open http://localhost:5173 and set backend base URL (for example http://lazyingart:8081).

PWA capabilities:

Drag/drop ZIP preview.
Publish-target toggles + test mode.
Submits to /publish and polls /publish/queue.

Command Palette

🧷 Most-used commands in one place.

Task	Command
Install full dependencies	`python -m pip install -r requirements.txt`
Install lightweight runtime dependencies	`python -m pip install -r requirements.autopub.txt`
Load shell-based env vars	`source ~/.bashrc && python load_env.py`
Start API queue server	`python app.py --refresh-time 1800 --port 8081`
Start CLI watcher pipeline	`python autopub.py --use-cache --pub-xhs --pub-douyin --pub-bilibili`
Submit ZIP to queue	`curl -X POST "http://localhost:8081/publish?filename=demo.zip" --data-binary @demo.zip -H "Content-Type: application/octet-stream"`
Inspect queue status	`curl -s "http://localhost:8081/publish/queue"`
Serve local PWA	`cd pwa && python -m http.server 5173`

Examples

🧪 Copy/paste smoke-test commands:

Example 0: Load environment and start API server

source ~/.bashrc
python load_env.py
python app.py --refresh-time 1800 --port 8081

Example A: CLI publish run

python autopub.py --pub-xhs --pub-douyin --use-cache

Example B: API publish run (single ZIP)

curl -X POST "http://localhost:8081/publish?filename=my_bundle.zip&publish_bilibili=true&test=true" \
  --data-binary @my_bundle.zip \
  -H "Content-Type: application/octet-stream"

Example C: Check queue status

curl -s "http://localhost:8081/publish/queue"

Example D: SMTP helper smoke test

python smtp.py
python smtp_test_simple.py

Metadata & ZIP Format

📦 ZIP contract matters: keep filenames and metadata keys aligned with publisher expectations.

Expected ZIP contents (minimum):

<stem>_metadata.json
<video_filename>.mp4
<cover_filename>.jpg

metadata drives CN publishers; optional metadata["english_version"] feeds YouTube publisher.

Fields commonly used by modules:

title, brief_description, middle_description, long_description
tags (list of hashtags)
video_filename, cover_filename
platform-specific fields as implemented in individual pub_*.py files

If you generate ZIPs externally, keep keys and filenames aligned with module expectations.

Data & Artifact Lifecycle

The pipeline creates local artifacts that operators should retain, rotate, or clean up deliberately:

Artifact	Location	Produced by	Why it matters
Input videos	`videos/`	Manual drop or upstream sync	Source media for CLI watcher mode
Processing ZIP output	`transcription_data/<stem>/<stem>.zip`	`process_video.py`	Reusable payload for `--use-cache`
Extracted publish assets	`transcription_data/<stem>/...`	ZIP extraction in `autopub.py` / `app.py`	Publisher-ready files and metadata
Publish logs	`logs/`, `logs-autopub/`	CLI/API runtime	Failure triage and audit trail
Browser logs	`~/chromium_dev_session_logs/*.log` (or chrome prefix)	Browser startup scripts	Diagnose session/port/startup issues
Tracking CSVs	`videos_db.csv`, `processed.csv`	CLI watcher	Prevent duplicate processing

Housekeeping recommendation:

Add a periodic cleanup/archival job for old transcription_data/, temp/, and old logs to prevent disk-pressure failures.

Platform-Specific Notes

🧭 Port map + module ownership for each publisher.

Platform	Port	Module(s)	Notes
XiaoHongShu	5003	`pub_xhs.py`, `login_xiaohongshu.py`	QR re-login flow; title sanitization and hashtag usage from metadata.
Douyin	5004	`pub_douyin.py`, `login_douyin.py`	Upload completion checks and retry paths are platform-fragile; monitor logs closely.
Bilibili	5005	`pub_bilibili.py`	Captcha hooks available via `solve_captcha_2captcha.py` and `solve_captcha_turing.py`.
ShiPinHao (WeChat Channels)	5006	`pub_shipinhao.py`, `login_shipinhao.py`	Fast QR approval is important for session refresh reliability.
Instagram	5007	`pub_instagram.py`, `login_instagram.py`	Controlled in API mode with `publish_instagram=true`; env vars available in `.env.example`.
YouTube	9222	`pub_y2b.py`	Uses `english_version` metadata block; disable with `ignore_y2b`.

Raspberry Pi / Linux Service Setup

🐧 Recommended for always-on hosts.

For a full host bootstrap, follow setup_raspberrypi.md.

Quick pipeline setup:

export AUTOPUB_USER=<USERNAME>
export AUTOPUB_REPO=/home/<USERNAME>/Projects/autopub
sudo -E ./scripts/setup_autopub_pipeline.sh

This orchestrates:

scripts/setup_envs.sh
scripts/setup_virtual_desktop_service.sh
scripts/download_and_setup_driver.sh
scripts/setup_autopub_service.sh

Run service manually in tmux:

./scripts/start_autopub_tmux.sh

Validate services/ports:

systemctl status autopub.service autopub-vnc.service
sudo ss -ltnp | grep 590

Compatibility note:

Some older docs/scripts still refer to virtual-desktop.service; current setup scripts in this repository install autopub-vnc.service.

Legacy macOS Scripts

🍎 Legacy wrappers remain for compatibility with older local setups.

The repository still includes legacy macOS-oriented wrappers:

scripts/run_autopub.sh
scripts/setup_autopub.sh

These contain absolute /Users/lachlan/... paths and Conda assumptions. Keep them if you rely on that workflow, but update paths/venv/tooling for your host.

Troubleshooting & Maintenance

🛠️ If something fails, start here first.

Path drift across machines: if errors mention missing files under /Users/lachlan/... or /home/lachlan/Projects/auto-publish, align constants to your host path (/home/lachlan/ProjectsLFS/AutoPublish in this workspace).
Secrets hygiene: run ~/.local/bin/detect-secrets scan before push. Rotate any leaked credentials.
Processing backend errors: if process_video.py prints “Failed to get the uploaded file path,” verify upload response JSON contains file_path and processing endpoint returns ZIP bytes.
ChromeDriver mismatch: if DevTools connection errors appear, align Chrome/Chromium and driver versions (or switch to webdriver-manager).
Browser focus issues: bring_to_front relies on window title matching (Chromium/Chrome naming differences can break this).
Captcha interrupts: configure 2Captcha/Turing credentials and integrate solver outputs where needed.
Stale lock files: if scheduled runs never start, verify process state and remove stale autopub.lock (legacy script flow).
Logs to inspect: logs/, logs-autopub/, ~/chromium_dev_session_logs/*.log, plus service journal logs.

FAQ

Q: Can I run API mode and CLI watcher mode at the same time?
A: It is possible but not recommended unless you isolate inputs and browser sessions carefully. Both modes can compete for the same publishers, files, and ports.

Q: Why does /publish return queued but nothing appears published yet?
A: app.py enqueues jobs first, then a background worker processes them serially. Check /publish/queue, is_publishing, and service logs.

Q: Do I need load_env.py if I already use .env?
A: start_autopub_tmux.sh sources .env if present, while some direct runs rely on shell environment loading. Keeping both .env and shell exports consistent avoids surprises.

Q: What is the minimum ZIP contract for API uploads?
A: A valid ZIP with {stem}_metadata.json, plus video and cover filenames that match metadata keys (video_filename, cover_filename).

Q: Is headless mode supported?
A: Some modules expose headless-related variables, but this repository’s primary and documented operating mode is GUI-backed browser sessions with persistent profiles.

Extending the System

🧱 Extension points for new platforms and safer operations.

Adding a new platform: copy a pub_*.py module, update selectors/flows, add login_*.py if QR re-auth is needed, then wire flags and queue handling in app.py and CLI wiring in autopub.py.
Config abstraction: migrate scattered constants to structured config (config.yaml/.env + typed model) for multi-host operation.
Credential storage hardening: replace hard-coded or shell-exposed sensitive flows with secure secret management (sudo -A, keychain, vault/secret manager).
Containerization: package Chromium/ChromeDriver + Python runtime + virtual display into one deployable unit for cloud/server use.

Quick Start Checklist

✅ Minimal path to first successful publish.

Clone this repository and install dependencies (pip install -r requirements.txt or lightweight requirements.autopub.txt).
Update hard-coded path constants in app.py, autopub.py, and any script you will run.
Export required credentials in your shell profile or .env; run python load_env.py to validate loading.
Create remote-debug browser profile folders and launch each required platform session once.
Manually sign in on each target platform in its profile.
Start either API mode (python app.py --port 8081) or CLI mode (python autopub.py --use-cache ...).
Submit one sample ZIP (API mode) or one sample video file (CLI mode) and inspect logs/.
Run secrets scanning before every push.

Development Notes

🧬 Current development baseline (manual formatting + smoke testing).

Python style follows existing 4-space indentation and manual formatting.
No formal automated test suite currently; rely on smoke tests:
- process one sample video through autopub.py;
- post one ZIP to /publish and monitor /publish/queue;
- validate each target platform manually.
Include a small if __name__ == "__main__": entrypoint when adding new scripts for quick dry-runs.
Keep platform changes isolated where possible (pub_*, login_*, ignore_* toggles).
Runtime artifacts (videos/*, logs*/*, transcription_data/*, ignore_*) are expected to be local and are mostly ignored by git.

Roadmap

🗺️ Priority improvements reflected by current code constraints.

Planned/desired improvements (based on current code structure and existing notes):

Replace scattered hard-coded paths with central config (.env/YAML + typed models).
Remove hard-coded sudo password patterns and move process control to safer mechanisms.
Improve publish reliability with stronger retries and better UI-state detection per platform.
Expand platform support (for example Kuaishou or other creators platforms).
Package runtime into reproducible deployment units (container + virtual display profile).
Add automated integration checks for ZIP contract and queue execution.

Contributing

🤝 Keep PRs focused, reproducible, and explicit about runtime assumptions.

Contributions are welcome.

Fork and create a focused branch.
Keep commits small and imperative (example style in history: “Wait for YouTube checks before publishing”).
Include manual verification notes in PRs:
- environment assumptions,
- browser/session restarts,
- relevant logs/screenshots for UI flow changes.
Never commit real secrets (.env is ignored; use .env.example for shape only).

If introducing new publisher modules, wire all of the following:

pub_<platform>.py
optional login_<platform>.py
API flags and queue handling in app.py
CLI wiring in autopub.py (if needed)
ignore_<platform> toggle handling
README updates

Security & Ops Checklist

Before any production-like run:

Confirm .env exists locally and is not tracked by git.
Rotate/remove any credentials that may have been committed historically.
Replace placeholder sensitive values in code paths (for example the sudo password placeholder in app.py).
Verify platform ignore_* switches are intentional before batch runs.
Ensure browser profiles are isolated per platform and least-privilege accounts are used.
Confirm logs do not expose secrets before sharing issue reports.
Run detect-secrets (or equivalent) before push.

❤️ Support

Donate	PayPal	Stripe

License

No LICENSE file is currently present in this repository snapshot.

Assumption for this draft:

Treat usage and redistribution as undefined until the maintainer adds an explicit license file.

Recommended next action:

Add a top-level LICENSE (for example MIT/Apache-2.0/GPL-3.0) and update this section accordingly.

📝 Until a license file is added, treat commercial/internal redistribution assumptions as unresolved and confirm directly with the maintainer.

Acknowledgements

Maintainer and sponsor profile: @lachlanchen
Funding configuration source: .github/FUNDING.yml
Ecosystem services referenced in this repo: Selenium, Tornado, SendGrid, 2Captcha, Turing captcha APIs.

Name		Name	Last commit message	Last commit date
Latest commit History 202 Commits
.github		.github
.ipynb_checkpoints		.ipynb_checkpoints
archived		archived
bilibili_test		bilibili_test
docs		docs
figs		figs
geetest_screenshot		geetest_screenshot
i18n		i18n
logs-autopub		logs-autopub
logs		logs
pwa		pwa
scripts		scripts
temp		temp
temp_screenshot		temp_screenshot
videos		videos
.DS_Store		.DS_Store
._.DS_Store		._.DS_Store
._.git		._.git
.___pycache__		.___pycache__
._logs		._logs
._logs-autopub		._logs-autopub
._temp		._temp
._videos		._videos
.env.example		.env.example
.gitignore		.gitignore
AGENTS.md		AGENTS.md
README.md		README.md
app.py		app.py
automator.txt		automator.txt
autopub.py		autopub.py
load_env.py		load_env.py
login_douyin.py		login_douyin.py
login_instagram.py		login_instagram.py
login_shipinhao.py		login_shipinhao.py
login_xiaohongshu.py		login_xiaohongshu.py
process_video.py		process_video.py
processed.csv		processed.csv
pub_bilibili.py		pub_bilibili.py
pub_douyin.py		pub_douyin.py
pub_instagram.py		pub_instagram.py
pub_shipinhao.py		pub_shipinhao.py
pub_xhs.py		pub_xhs.py
pub_y2b.py		pub_y2b.py
requirements.autopub.txt		requirements.autopub.txt
requirements.txt		requirements.txt
send_email_qreader.py		send_email_qreader.py
setup_raspberrypi.md		setup_raspberrypi.md
smtp.py		smtp.py
smtp_test_simple.py		smtp_test_simple.py
solve_captcha_2captcha.py		solve_captcha_2captcha.py
solve_captcha_turing.py		solve_captcha_turing.py
test_json.py		test_json.py
test_xdo.py		test_xdo.py
utils.py		utils.py
videos_db.csv		videos_db.csv

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

AutoPublish

Quick Navigation

Start Here

Overview

Runtime Modes at a Glance

Platform Coverage Snapshot

Quick Snapshot

Operational Safety Snapshot

Compatibility & Assumptions

Table of Contents

System Overview

Features

Feature Matrix

Project Structure

Repository Layout

Prerequisites

Operating system and tools

Python dependencies

Optional: create a sudo user

Installation

Configuration

Environment variables

Path constants (important)

Platform toggles via ignore_*

Configuration Verification Checklist

Preparing Browser Sessions

Usage

Running the CLI pipeline (autopub.py)

Running the Tornado service (app.py)

POST /publish

GET /publish/queue

Browser refresh thread

PWA frontend (pwa/)

Command Palette

Examples

Example 0: Load environment and start API server

Example A: CLI publish run

Example B: API publish run (single ZIP)

Example C: Check queue status

Example D: SMTP helper smoke test

Metadata & ZIP Format

Data & Artifact Lifecycle

Platform-Specific Notes

Raspberry Pi / Linux Service Setup

Legacy macOS Scripts

Troubleshooting & Maintenance

FAQ

Extending the System

Quick Start Checklist

Development Notes

Roadmap

Contributing

Security & Ops Checklist

❤️ Support

License

Acknowledgements

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Sponsor this project

Uh oh!

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Platform toggles via `ignore_*`

Running the CLI pipeline (`autopub.py`)

Running the Tornado service (`app.py`)

`POST /publish`

`GET /publish/queue`

PWA frontend (`pwa/`)

Packages