A2Pod

Convert articles into audio you can listen to anywhere. Generates natural-sounding speech locally on Apple Silicon, publishes to a podcast feed, and supports either a local LAN server or AWS S3 as the podcast provider.

URL / file / text  →  Extract  →  Clean  →  Summarize  →  Chunk  →  TTS  →  Intro  →  M4A  →  Podcast Feed
                                (regex+LLM)  (LexRank+LLM)       (Kokoro)  (jingle)       (+ VTT transcript)

Disclaimer — This tool is designed for personal use with content you already have access to. Respect copyright: do not redistribute generated audio unless you own the source content or have permission to do so.

Features

Single provider — choose Local (LAN server) or S3 (public access) during setup; all operations target one provider
Any URL — articles, blog posts, newsletters, X/Twitter posts and long-form articles
Local text — convert .txt files or paste text directly (Telegram bot)
Episode intros — programmatic chime jingle + spoken title before content
Two-pass text cleaning — regex pass strips URLs, markdown, code, CTAs; LLM pass catches subtle patterns (parallel for cloud providers)
TTS pronunciation normalization — abbreviations, numbers, currencies, symbols, and acronyms converted to spoken words
Extractive summarization — LexRank selects key sentences from the full article before LLM generates a 2-3 sentence episode description
WebVTT transcripts — per-chunk timestamped transcript generated alongside every audio file
Apple Silicon TTS — Kokoro-82M via MLX Audio, 7 voices, parallel workers
Podcast feed — RSS 2.0 with iTunes and Podcast Index extensions; subscribe once in any podcast app
Telegram bot — send URLs, paste text, or upload .txt files; live progress updates, inline voice/model switching
Deduplication — skips URLs already in the podcast feed (override with --force)
Episode management — delete single episodes or bulk-clear the entire feed

Requirements

macOS with Apple Silicon (M1/M2/M3/M4)
Python 3.10–3.13 (see Python version below)
~500 MB disk for model + dependencies
X API bearer token (optional, for X/Twitter posts)
AWS account (optional, for S3 provider)
LLM provider (optional, for summaries and text cleaning): Ollama (local), OpenAI API, Anthropic API, or Google Gemini API

Python Version

The default macOS system Python (3.9) is too old — the X | Y type syntax and several dependencies require 3.10+. Python 3.14 is too new — spacy and blis do not yet support it.

Install a compatible version via Homebrew:

brew install python@3.13

The installer uses pip3 install which targets whichever python3 is first on your PATH. If that resolves to the system Python 3.9, packages will install but fail at runtime. Two ways to handle this:

Option A — Virtual environment (recommended):

python3.13 -m venv .venv
source .venv/bin/activate
./install.sh

Add the venv to your shell profile so a2pod always uses it:

echo 'export PATH="/path/to/a2pod/.venv/bin:$PATH"' >> ~/.zshrc

Option B — Homebrew Python on PATH:

Ensure /opt/homebrew/bin comes before /usr/bin in your PATH so that python3 resolves to the Homebrew version.

phonemizer Conflict

The TTS pipeline depends on phonemizer-fork (which provides EspeakWrapper.set_data_path). If both phonemizer and phonemizer-fork are installed, the original phonemizer takes precedence and TTS model loading fails with:

AttributeError: type object 'EspeakWrapper' has no attribute 'set_data_path'

Fix by removing the original and reinstalling the fork:

pip uninstall phonemizer -y
pip install phonemizer-fork --force-reinstall

Quick Start

git clone https://github.com/dyankov91/a2pod.git
cd a2pod
./install.sh

The installer handles dependencies, model download, PATH setup, podcast artwork, provider choice (Local or S3), and optional Telegram bot configuration.

If you already have a ~/.config/a2pod/config from another machine, copy it before running the installer — it will detect existing values and skip the interactive prompts.

Then:

a2pod https://example.com/some-article

Your podcast feed is immediately available at the URL shown during setup. Subscribe from any podcast app.

Usage

# Basic — converts and publishes to the podcast feed
a2pod https://example.com/article

# Custom voice
a2pod https://example.com/article --voice am_michael

# Faster speech
a2pod https://example.com/article --speed 1.2

# From a local text file
a2pod --file article.txt --title "My Article"

# Custom output path
a2pod https://example.com/article --output ~/Desktop/article.m4a

# Skip summary generation
a2pod https://example.com/article --no-summary

# Skip episode intro (jingle + spoken title)
a2pod https://example.com/article --no-intro

# Reprocess a URL already in the feed
a2pod https://example.com/article --force

# Use more parallel TTS workers
a2pod https://example.com/article --workers 4

# Override LLM model
a2pod https://example.com/article --model qwen3.5:9b

CLI Reference

Flag	Short	Description
`<url>`		Article URL to convert
`--file`	`-f`	Local text file instead of URL
`--title`	`-t`	Override article title
`--voice`	`-v`	TTS voice (default: `af_heart`)
`--speed`	`-s`	Speech speed (default: `1.0`)
`--output`	`-o`	Custom output path
`--model`	`-m`	LLM model override
`--workers`	`-w`	Parallel TTS workers (default: `2`)
`--no-summary`		Skip episode summary generation
`--no-intro`		Skip episode intro (jingle + spoken title)
`--force`		Reprocess even if already in the podcast feed
`--delete`		Delete episode matching title or URL
`--delete-all`		Delete all episodes from the feed

X/Twitter

Works with posts and long-form articles:

a2pod https://x.com/someuser/status/1234567890

Requires an X API bearer token. Add it to ~/.config/a2pod/config:

[x]
bearer_token = YOUR_TOKEN_HERE

The installer can also set this up for you during ./install.sh.

Podcast Setup

Every article you convert is automatically added to your podcast feed. During ./install.sh you choose a provider:

Local (default)

Runs a local HTTP server on your LAN. No cloud accounts needed.

Open a podcast app on your phone (Overcast, Pocket Casts, Castro, etc.)
Add by URL / Subscribe to URL:
```
http://<lan-ip>:8008/feed.xml
```
Every new article you convert will appear as an episode

Note: Your phone/tablet must be on the same Wi-Fi network as the Mac running the server.

Apple Podcasts limitation: Apple Podcasts requires a public HTTPS URL and will not work with LAN addresses. Use Overcast, Pocket Casts, Castro, or another podcast app that supports custom feed URLs for the local provider.

AWS S3

For public access from anywhere. When using S3 as the provider, local .m4a and .vtt files are automatically deleted after successful upload to save disk space.

[publisher]
provider = s3

[aws]
profile = default
bucket = my-podcast-feed
region = us-east-1

The public S3 feed URL is:

https://<your-bucket>.s3.<your-region>.amazonaws.com/feed.xml

Minimal IAM Policy

If you prefer not to use broad AWS credentials, create a dedicated IAM user with only the permissions a2pod needs:

{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Effect": "Allow",
      "Action": ["s3:GetObject", "s3:PutObject"],
      "Resource": [
        "arn:aws:s3:::YOUR-BUCKET/feed.xml",
        "arn:aws:s3:::YOUR-BUCKET/artwork.jpg"
      ]
    },
    {
      "Effect": "Allow",
      "Action": ["s3:PutObject", "s3:DeleteObject"],
      "Resource": "arn:aws:s3:::YOUR-BUCKET/audiobooks/*"
    },
    {
      "Effect": "Allow",
      "Action": "s3:ListBucket",
      "Resource": "arn:aws:s3:::YOUR-BUCKET",
      "Condition": {
        "StringLike": { "s3:prefix": "audiobooks/*" }
      }
    }
  ]
}

Local Server

When using the local provider, the installer sets up a launchd service (com.a2pod.server) that runs automatically whenever your Mac is on.

# Check status
launchctl print gui/$(id -u)/com.a2pod.server

# Restart
launchctl kickstart -k gui/$(id -u)/com.a2pod.server

# Stop
launchctl bootout gui/$(id -u)/com.a2pod.server

# View logs
tail -f ~/.config/a2pod/server.log

# Run manually
a2pod-server

Telegram Bot

Send article URLs to a Telegram bot and receive the audio file directly in chat. The bot shows live progress as each pipeline step runs.

Setup

The installer can configure this for you during ./install.sh. To set up manually:

Message @BotFather on Telegram and create a new bot
Get your numeric user ID by messaging @userinfobot
Add to ~/.config/a2pod/config:

[telegram]
bot_token = 7123456789:AAH...
allowed_users = 123456789,987654321

Multiple user IDs can be comma-separated. Only listed users can interact with the bot.

Commands

Command	Description
`/start`	Introduction and feature overview
`/help`	Detailed usage instructions
`/voice`	Show or switch TTS voice (inline keyboard)
`/model`	Show or switch LLM provider and model (inline keyboard)
`/speed`	Show or set speech speed
`/workers`	Show or set TTS worker count
`/feed`	Get the podcast feed URL
`/status`	Bot status, uptime, version, active jobs
`/delete`	Remove a single episode (with confirmation)
`/deleteall`	Remove all episodes
`/restart`	Restart the bot process

File and Text Input

Beyond URLs, the bot accepts:

Pasted text — paste 50+ words directly into the chat to generate audio
.txt file uploads — upload a text file (up to 5 MB) to convert to audio

Jobs are serialized per user — each user can run one conversion at a time.

Running as a Background Service

The installer offers to set up a launchd service that starts the bot automatically whenever your Mac is on and restarts it if it crashes.

Virtual environment note: If you use a venv, the launchd plist must reference the venv's Python binary (e.g. /path/to/a2pod/.venv/bin/python3) in ProgramArguments, and include the venv's bin directory in the PATH environment variable. The installer handles this automatically, but if you create the plist manually or move the venv, update the paths accordingly.

# Check status
launchctl print gui/$(id -u)/com.a2pod.bot

# Restart
launchctl kickstart -k gui/$(id -u)/com.a2pod.bot

# Stop
launchctl bootout gui/$(id -u)/com.a2pod.bot

# View logs
tail -f ~/.config/a2pod/bot.log

# Run manually
a2pod-bot

Configuration

All configuration lives in ~/.config/a2pod/config (INI format). The installer creates this file for you.

[publisher]
provider = local                      # 'local' or 's3'

[podcast]
name = A2Pod                   # Podcast title in feed and episode intros

[server]
port = 8008                            # Local HTTP server port (local provider only)
# hostname = 192.168.1.50              # Override auto-detected LAN IP

[llm]
provider = ollama                      # ollama, openai, anthropic, or gemini
model = llama3.2                       # Model name for the active provider
openai_api_key = sk-...                # OpenAI API key (if using OpenAI)
anthropic_api_key = sk-ant-...         # Anthropic API key (if using Anthropic)
gemini_api_key = AIza...               # Google Gemini API key (if using Gemini)

[tts]
voice = af_heart                       # Default TTS voice
workers = 2                            # Parallel TTS workers

[telegram]
bot_token = 7123456789:AAH...          # Telegram bot token
allowed_users = 123456789,987654321    # Comma-separated allowed user IDs

[x]
bearer_token = YOUR_TOKEN_HERE         # X/Twitter API v2 bearer token

[aws]                                  # Required when provider = s3
profile = default                      # AWS CLI profile name
bucket = my-podcast-feed               # S3 bucket name
region = us-east-1                     # AWS region

LLM Providers

An LLM is used for episode summaries and the second pass of text cleaning. If no provider is configured, Ollama is used by default. If the LLM is unavailable, summaries fall back to first-sentence extraction and text cleaning uses regex only.

Ollama (local, free):

[llm]
provider = ollama
model = llama3.2          # default; lightweight (~2GB)
# model = qwen3.5:9b      # higher quality (~6GB)

brew install ollama && ollama pull llama3.2
ollama pull qwen3.5:9b   # optional, recommended for better summaries

OpenAI:

[llm]
provider = openai
openai_api_key = sk-...
model = gpt-4o-mini

Anthropic:

[llm]
provider = anthropic
anthropic_api_key = sk-ant-...
model = claude-haiku-4-20250414

Google Gemini:

[llm]
provider = gemini
gemini_api_key = AIza...
model = gemini-2.5-flash-lite

You can store API keys for multiple providers and switch between them at runtime via the Telegram bot's /model command or by editing the config. Use --no-summary to skip summaries entirely, or --model <name> to override the model for a single run.

Voices

Voice	Gender	ID
Heart (default)	Female	`af_heart`
Bella	Female	`af_bella`
Nicole	Female	`af_nicole`
Sarah	Female	`af_sarah`
Sky	Female	`af_sky`
Adam	Male	`am_adam`
Michael	Male	`am_michael`

How It Works

Extract — trafilatura scrapes article text from URLs; X API v2 handles X/Twitter posts; also accepts local files and pasted text
Clean (regex) — strips URLs, markdown, HTML, code blocks, CTAs, and web artifacts; normalizes abbreviations, numbers, currencies, and symbols to spoken words
Summarize — LexRank (extractive) selects key sentences across the full article, then LLM generates a 2-3 sentence episode description from those sentences
Clean (LLM) — second pass catches subtle promotional language, visual references, and awkward transitions the regex missed (runs in parallel with summarization for cloud providers)
Chunk — splits text into ~2000-character segments at sentence boundaries
TTS — Kokoro-82M generates WAV audio for each chunk in parallel (configurable worker count)
Intro — synthesizes a C-major chime jingle + spoken "[Podcast Name] presents: [Title]" + brief silence
Assemble — ffmpeg concatenates all WAVs into a single M4A with metadata; builds a WebVTT transcript with timestamps
Publish — updates the podcast feed on the active provider; when using S3, uploads files and cleans up local copies

Project Structure

a2pod/
├── install.sh                 # One-time setup (deps, model, provider choice, Telegram)
├── bin/
│   ├── a2pod          # Main CLI
│   ├── a2pod-bot      # Telegram bot entry point
│   └── a2pod-server   # Local HTTP server entry point
├── lib/
│   ├── errors.py              # Shared PipelineError exception
│   ├── pipeline.py            # Orchestration (used by CLI and bot)
│   ├── extractor.py           # URL/file/text extraction (trafilatura + X API)
│   ├── cleaner.py             # Regex + LLM two-pass text cleaning
│   ├── llm.py                 # LLM abstraction (Ollama / OpenAI / Anthropic / Gemini)
│   ├── summarizer.py          # LexRank extraction + LLM episode summaries
│   ├── chunker.py             # Sentence-boundary text splitting
│   ├── tts.py                 # Kokoro-82M TTS via MLX Audio
│   ├── intro.py               # Episode intro (jingle + spoken title)
│   ├── assembler.py           # Audio concat + M4A encoding + VTT transcripts
│   ├── artwork.py             # Podcast cover image generation
│   ├── publisher.py           # Single-provider feed management
│   ├── server.py              # HTTP server for ~/A2Pod/
│   ├── telegram_bot.py        # Telegram bot handlers + polling
│   └── backends/
│       ├── __init__.py        # RemoteBackend ABC + get_active_backend()
│       └── s3.py              # AWS S3 backend implementation
└── README.md

Output

~/A2Pod/
├── feed.xml                       # Podcast feed (local provider only)
├── artwork.jpg                    # Podcast artwork
├── Episode_Title_20260302.m4a     # Audio files (local provider; deleted after S3 upload)
└── Episode_Title_20260302.vtt     # VTT transcripts (local provider; deleted after S3 upload)

When using the local provider, the server serves this directory on http://<lan-ip>:8008/. When using S3, files are uploaded to s3://<your-bucket>/audiobooks/ and local copies are removed.

Contributing

Contributions are welcome. Please open an issue to discuss larger changes before submitting a PR.

License

MIT

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A2Pod

Features

Requirements

Python Version

phonemizer Conflict

Quick Start

Usage

CLI Reference

X/Twitter

Podcast Setup

Local (default)

AWS S3

Minimal IAM Policy

Local Server

Telegram Bot

Setup

Commands

File and Text Input

Running as a Background Service

Configuration

LLM Providers

Voices

How It Works

Project Structure

Output

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 57 Commits
bin		bin
docs/plans		docs/plans
lib		lib
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
install.sh		install.sh

Folders and files

Latest commit

History

Repository files navigation

A2Pod

Features

Requirements

Python Version

phonemizer Conflict

Quick Start

Usage

CLI Reference

X/Twitter

Podcast Setup

Local (default)

AWS S3

Minimal IAM Policy

Local Server

Telegram Bot

Setup

Commands

File and Text Input

Running as a Background Service

Configuration

LLM Providers

Voices

How It Works

Project Structure

Output

Contributing

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages