This project converts web articles and text into audio files using Django and Django REST Framework. It allows users to submit URLs or text and receive audio versions through a private RSS feed, suitable for podcast applications.
| Area | Summary |
|---|---|
| Vision | Turn web articles, blog posts, or text into high-quality audio available through a private podcast-style RSS feed. |
| Technology Stack | Django 5 + DRF • Celery 6 + Redis • SQLite/PostgreSQL • OpenAI TTS • Bootstrap 5 for UI. |
For more details, see PROJECT_PLAN.md.
Each user manages one or more Feeds. A feed groups a user's converted articles and is accessed via a unique token-based URL, e.g. `/feeds/<feed_token>/`. The token is generated with UUID4 when the feed is created, so the RSS feed remains private.
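As a rough illustration, a UUID4 token can be attached to a feed with a `UUIDField` default. The model and field names below are assumptions, not necessarily the project's actual schema:

```python
# Illustrative sketch only: model and field names are assumptions.
import uuid

from django.conf import settings
from django.db import models


class Feed(models.Model):
    """A user's private collection of converted articles."""

    owner = models.ForeignKey(settings.AUTH_USER_MODEL, on_delete=models.CASCADE)
    title = models.CharField(max_length=200)
    # UUID4 token used in the private RSS URL: /feeds/<feed_token>/
    token = models.UUIDField(default=uuid.uuid4, unique=True, editable=False)

    def get_absolute_url(self) -> str:
        return f"/feeds/{self.token}/"
```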
Install dependencies in a virtual environment:
python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
Run the development server:
python manage.py runserver
The project uses environment variables for configuration. Copy the example file and adjust as needed:
cp .env.example .env
# Edit .env with your settings
Key environment variables:
- `DJANGO_SECRET_KEY`: Secret key for Django (required)
- `DJANGO_DEBUG`: Set to "True" for development
- `SQLITE_DATA_DIR`: Directory to store the SQLite database (optional)
- `CELERY_BROKER_URL`: Redis URL for Celery
- `SITE_URL`: Full URL of your site (e.g., "https://example.com") for RSS feed links
- `FIRECRAWL_API_KEY`: Optional API key for Firecrawl URL fetching
- `USE_FIRECRAWL_BY_DEFAULT`: Set to "True" to fetch pages via Firecrawl first
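As a minimal sketch of how `settings.py` might read these variables (the defaults shown are assumptions, not the project's actual values):

```python
# settings.py (sketch): exact defaults and variable handling are assumptions.
import os

SECRET_KEY = os.environ["DJANGO_SECRET_KEY"]  # required
DEBUG = os.environ.get("DJANGO_DEBUG", "False") == "True"

CELERY_BROKER_URL = os.environ.get("CELERY_BROKER_URL", "redis://localhost:6379/0")

# Used to build absolute enclosure links in the RSS feed
SITE_URL = os.environ.get("SITE_URL", "http://localhost:8000")

FIRECRAWL_API_KEY = os.environ.get("FIRECRAWL_API_KEY", "")
USE_FIRECRAWL_BY_DEFAULT = os.environ.get("USE_FIRECRAWL_BY_DEFAULT", "False") == "True"
```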
The application supports both SQLite (default for local development) and PostgreSQL (recommended for production).
By default, the application uses SQLite with zero configuration required. The database file is created automatically when you run migrations:
python manage.py migrate
To customize the SQLite database location, set:
SQLITE_DATA_DIR=data # Database will be stored at data/db.sqlite3
For production deployments, PostgreSQL is recommended. Configure it using either:
Option 1: DATABASE_URL (Recommended)
DATABASE_URL=postgresql://user:password@host:5432/dbname
Option 2: Explicit PostgreSQL Variables
POSTGRES_DB=your_db_name
POSTGRES_USER=your_db_user
POSTGRES_PASSWORD=your_db_password
POSTGRES_HOST=localhost
POSTGRES_PORT=5432
The application automatically detects which database to use based on available environment variables. Priority: DATABASE_URL > PostgreSQL vars > SQLite.
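A sketch of that priority logic, assuming it lives in `settings.py` and parses `DATABASE_URL` with the standard library (the real implementation may differ, e.g. it could use dj-database-url instead):

```python
# settings.py (sketch): priority DATABASE_URL > PostgreSQL vars > SQLite.
import os
from pathlib import Path
from urllib.parse import urlparse

BASE_DIR = Path(__file__).resolve().parent.parent

if os.environ.get("DATABASE_URL"):
    url = urlparse(os.environ["DATABASE_URL"])
    DATABASES = {
        "default": {
            "ENGINE": "django.db.backends.postgresql",
            "NAME": url.path.lstrip("/"),
            "USER": url.username,
            "PASSWORD": url.password,
            "HOST": url.hostname,
            "PORT": url.port or 5432,
        }
    }
elif os.environ.get("POSTGRES_DB"):
    DATABASES = {
        "default": {
            "ENGINE": "django.db.backends.postgresql",
            "NAME": os.environ["POSTGRES_DB"],
            "USER": os.environ.get("POSTGRES_USER", ""),
            "PASSWORD": os.environ.get("POSTGRES_PASSWORD", ""),
            "HOST": os.environ.get("POSTGRES_HOST", "localhost"),
            "PORT": os.environ.get("POSTGRES_PORT", "5432"),
        }
    }
else:
    data_dir = Path(os.environ.get("SQLITE_DATA_DIR", BASE_DIR))
    DATABASES = {
        "default": {
            "ENGINE": "django.db.backends.sqlite3",
            "NAME": data_dir / "db.sqlite3",
        }
    }
```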
We use Docker Compose for local development to ensure consistency across environments. The application uses Caddy as a reverse proxy to serve static MP3 files with proper byte-range support (required by Apple Podcasts) and to proxy other requests to Django.
# Build and start all services
docker-compose up -d
# View logs
docker-compose logs -f
# Stop all services
docker-compose down
# For production
# First, ensure you've set up your .env file with appropriate production values
cp .env.example .env
# Edit .env with production settings
docker-compose -f docker-compose.prod.yml up -d
The application consists of the following services:
- Caddy: Web server that handles:
  - Static MP3 file serving with automatic byte-range support at `/audio/<uuid>/`
  - Reverse proxy for all other requests to Django
  - Automatic HTTPS in production
- Web: Django application server
- Worker: Celery worker for processing audio conversion tasks
- Redis: Message broker for Celery
Both the web and worker services invoke `/app/start-web.sh`, which runs `python manage.py migrate` before launching. This ensures database migrations are applied automatically whenever new Docker images are deployed.
MP3 files are stored in the `./media/articles/` directory and served directly by Caddy for optimal performance. The files are:
- Saved by the worker as `./media/articles/{uuid}.mp3`
- Accessible via HTTP at `/audio/{uuid}/`
- Shared between containers via a bind mount
- Processed with a de-essing filter during export to reduce harsh "s" sounds for better playback quality
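For illustration only, a worker task could derive the storage path and public URL like this (the function name and constants below are hypothetical, used just to show the path/URL convention above):

```python
# Sketch: names are hypothetical; shows only the path/URL convention above.
import uuid
from pathlib import Path

MEDIA_ROOT = Path("media")
SITE_URL = "https://example.com"


def audio_paths(article_uuid: uuid.UUID) -> tuple[Path, str]:
    """Return the on-disk MP3 path and the public URL served by Caddy."""
    file_path = MEDIA_ROOT / "articles" / f"{article_uuid}.mp3"
    public_url = f"{SITE_URL}/audio/{article_uuid}/"
    return file_path, public_url
```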
To use Docker volumes instead of bind mounts for media files, uncomment the relevant sections in the docker-compose files.
Redis runs as a separate Docker service (`redis:` in docker-compose.yml) and is not bundled inside the worker container. Both the web and worker containers connect to Redis via the `REDIS_URL` environment variable (`redis://redis:6379/0`).
Redis data is persisted in the `../data/redis` directory on the host, which is mapped to `/data` inside the Redis container. The Redis service uses a custom configuration file (`./redis.conf`) for its settings.
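For reference, a Celery application wired to that broker setting might look roughly like the following (the module path and app name are assumptions, not the project's actual layout):

```python
# celery.py (sketch): the settings module and app name are assumptions.
import os

from celery import Celery

os.environ.setdefault("DJANGO_SETTINGS_MODULE", "config.settings")

app = Celery(
    "article_audio",
    broker=os.environ.get("REDIS_URL", "redis://redis:6379/0"),
)
app.autodiscover_tasks()
```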
Copy `.env.sample` to `.env` and update the values with your local secrets:
cp .env.sample .env
The application and Docker Compose will load environment variables from this file.
This project is configured to work with GitHub Codespaces:
- In GitHub, click the "Code" button on the repository
- Select the "Codespaces" tab
- Click "Create codespace on main"
The Codespace will automatically build the Docker environment and provide a full development setup with all dependencies installed.
We use several tools to maintain code quality:
- Black: For code formatting
- isort: For import sorting
- Flake8: For linting
- mypy: For type checking
Configuration for these tools lives in `setup.cfg`. Flake8 enforces a McCabe complexity limit of 20, and mypy reads its options from the same file.
Install pre-commit hooks to automatically run these tools. You can run the commands manually or execute the provided helper script. The script now also sets up a pre-push hook so Black formatting and mypy checks run before pushing changes to GitHub:
./setup_precommit.sh
The script installs the `pre-commit` package if needed and configures the git hooks for you.
We follow the PEP 8 style guide for Python code. All classes, methods and functions should include docstrings using the Google Python style. See CODING_STANDARDS.md for our complete coding guidelines.
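For example, a Google-style docstring on a (hypothetical) function looks like this:

```python
def convert_article(url: str, voice: str = "alloy") -> str:
    """Convert a web article into an MP3 file.

    Args:
        url: Address of the article to convert.
        voice: Name of the TTS voice to use.

    Returns:
        Path to the generated MP3 file.
    """
    ...
```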
- `main` is for production code
- `dev/main` is for development
- Feature branches should follow the format `feature/description`
- See AGENTS.md for more details on branch management
The application includes an advanced LLM-driven text chunking and voice assignment system called ChunkToneService. This feature can replace the traditional chunking pipeline with intelligent, context-aware processing.
Key Features:
- Intelligent Chunking: Uses OpenAI GPT models to break text into logical segments
- Voice Assignment: Automatically assigns appropriate voices for different speakers/narrators
- Character Recognition: Identifies dialogue and assigns character-specific voices
- Fallback Safety: Always produces valid output, even when LLM processing fails
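The fallback-safety point above boils down to a pattern roughly like the following sketch (the helper names and chunk shape are hypothetical, not the service's real API):

```python
# Sketch of the fallback idea only; names are hypothetical, not the real API.
from typing import Callable

Chunk = dict[str, str]  # e.g. {"text": ..., "voice": ...}


def chunk_text(text: str, llm_chunker: Callable[[str], list[Chunk]]) -> list[Chunk]:
    """Always return usable chunks, even if the LLM step fails."""
    try:
        return llm_chunker(text)  # LLM-driven chunking and voice assignment
    except Exception:
        # Fallback: plain paragraph splits read by the default narrator voice
        return [
            {"text": paragraph, "voice": "alloy"}
            for paragraph in text.split("\n\n")
            if paragraph.strip()
        ]
```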
Configuration:
# Enable ChunkToneService (default: false)
ENABLE_CHUNK_TONE_LLM=true
Available Voices:
- `alloy` - Default narrator voice
- `echo` - Character dialogue
- `fable` - Storytelling
- `onyx` - Bold/dramatic characters
- `nova` - Casual/conversational
- `shimmer` - News/formal content
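These are the standard OpenAI TTS voice names. As an example, a single chunk could be synthesized with the openai Python SDK roughly as follows; the model choice and output handling here are assumptions, not necessarily what this project does:

```python
# Sketch using the openai Python SDK; model and output path are assumptions.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

speech = client.audio.speech.create(
    model="tts-1",
    voice="onyx",  # any of the voices listed above
    input="To be, or not to be, that is the question.",
)

with open("chunk.mp3", "wb") as f:
    f.write(speech.content)
```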
For detailed documentation, see docs/chunk_tone_service.md.
Activate the virtual environment and run:
# Install test dependencies
pip install -r requirements-test.txt
# Run tests
python -m unittest discover -s tests
If you encounter issues with audio dependencies (pydub/audioop), you can use our test scripts with mocked audio dependencies:
# Run a specific test file
python run_single_test.py tests/test_models.py
# Run all tests with mocked audio dependencies
python run_all_tests_with_mock.py
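The idea behind the mocked runs is roughly the following sketch; the module stubbed out here (pydub) is only an example, and the project's actual mock scripts may differ:

```python
# Sketch: stub out pydub before importing project code, so tests can run
# without the real audio stack installed.
import sys
import unittest
from unittest import mock

sys.modules["pydub"] = mock.MagicMock()


class MockedAudioTests(unittest.TestCase):
    def test_pydub_is_stubbed(self):
        import pydub  # resolves to the MagicMock registered above

        self.assertIsInstance(pydub, mock.MagicMock)


if __name__ == "__main__":
    unittest.main()
```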
For more details on testing approach, see TESTING.md.
We use GitHub Actions for continuous integration. On each push to `main` or `dev/main`, and for pull requests to these branches, the CI will:
- Run all tests
- Check code formatting with Black
- Run linting with Flake8
- Perform type checking with mypy
See the `.github/workflows/` directory for the workflow files:
- `00-lint.yml`: Runs code formatting and linting
- `10-type-check.yml`: Performs type checking with mypy
- `20-test.yml`: Runs the test suite
- `90-pr-review.yml`: Provides automated PR reviews