rasulkireev/TuxSEO
TuxSEO

Automated Blog Content Creation for Founders Who Hate Writing


Overview

  • TuxSEO learns about your business and analyzes the market, which lets it...
  • ...generate content ideas for your business blog to drive traffic from search.
  • Stop wasting time and money on research and writing; let TuxSEO do it for you.
  • TuxSEO is open-source and self-hostable. Always.
  • Run it privately on your own machine, or try it on our cloud app.


Deployment

Render

Deploy to Render

The only required env vars are:

  • OPENAI_API_KEY
  • TAVILY_API_KEY
  • GEMINI_API_KEY
  • PERPLEXITY_API_KEY
  • JINA_READER_API_KEY
  • KEYWORDS_EVERYWHERE_API_KEY

The rest are optional.

Note: This should work out of the box with Render's free tier if you provide the AI API keys. Here's what you need to know about the limitations:

  • Worker Service Limitation: The worker service is not a dedicated worker type (those are only available on paid plans). For the free tier, I had to use a web service through a small hack, but it works fine for most use cases.

  • Memory Constraints: The free web service has a 512 MB RAM limit, which can cause issues with automated background tasks. When you add a project, TuxSEO runs a suite of background tasks to analyze your website and generate articles, keywords, and other content. These automated processes can hit the memory limit and fail.

  • Manual Tasks Work Fine: If you perform tasks manually (like generating a single article), they typically run on the web service rather than the worker and should work reliably, since it's one request at a time.

  • Upgrade Recommendation: If you do upgrade to a paid plan, use an actual worker service instead of the web-service workaround for better automated-task reliability.

Reality Check: The site is usable on the free tier; you'll only pay for API costs. Manual operations work fine, but automated background tasks (especially when adding multiple projects) may occasionally fail due to memory constraints. It's not comfortable for heavy automated use, but it's perfectly functional for manual content generation.

If you know of any other services like Render that allow deployment via a button and provide free Redis, Postgres, and web services, please let me know in the Issues section. I can try to create deployments for those. Bear in mind that free services are usually not large enough to run this application reliably.

Docker Compose

This should also be fairly streamlined. On your server, create a folder containing two files:

  1. .env

Copy the contents of .env.example into .env and update all the necessary values.

  2. docker-compose.yml

Copy the contents of docker-compose-prod.yml into docker-compose.yml and run the suggested command from the top of the docker-compose-prod.yml file.

How you expose the backend container is up to you. I usually do it via an Nginx reverse proxy with http://tuxseo-backend-1:80 as the UPSTREAM_HTTP_ADDRESS.

Pure Python / Django deployment

Not recommended: this setup is less safe for production and I have not tested it myself.

If you are not into Docker or Render and just want to run this via regular commands, you will need five processes running:

  • python manage.py collectstatic --noinput && python manage.py migrate && gunicorn ${PROJECT_NAME}.wsgi:application --bind 0.0.0.0:80 --workers 3 --threads 2
  • python manage.py qcluster
  • npm install && npm run start
  • postgres
  • redis

You'd still need to make sure .env has correct values.

Custom Deployment on CapRover

  1. Create 4 apps on CapRover:
     • tuxseo
     • tuxseo-workers
     • tuxseo-postgres
     • tuxseo-redis
  2. Create a new CapRover app token for:
     • tuxseo
     • tuxseo-workers
  3. Add Environment Variables to those same apps from .env.
  4. Create new GitHub Actions secrets with the following:
     • CAPROVER_SERVER
     • CAPROVER_APP_TOKEN
     • WORKERS_APP_TOKEN
     • REGISTRY_TOKEN
  5. Then just push the main branch.
  6. The GitHub workflow in this repo should take care of the rest.

Local Development

All the information on how to run, develop and update your new application can be found in the documentation.

  1. Rename .env.example to .env and update the relevant variables.
  2. Run make serve.
  3. Run make restart-worker just in case; the worker sometimes has trouble connecting to Redis on first start.

CI pre-PR runbook (required before opening a PR)

Run the same pytest command as CI locally:

make test-ci

What this does:

  • Boots local Postgres/Redis services used by tests.
  • Runs pytest with the same strict flags as CI (--strict-config --strict-markers).
  • Pins PYTHONHASHSEED=0 for deterministic hash ordering.
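The hash-seed pin matters because Python randomizes str hashing per process by default, so set/dict iteration order can differ between runs. A standalone illustration (not part of the repo):

```python
import os
import subprocess
import sys

def set_order(seed: str) -> str:
    """Run a child interpreter with the given PYTHONHASHSEED and report
    the iteration order of a small set of strings."""
    env = {**os.environ, "PYTHONHASHSEED": seed}
    out = subprocess.run(
        [sys.executable, "-c", "print(list({'alpha', 'beta', 'gamma'}))"],
        env=env, capture_output=True, text=True, check=True,
    )
    return out.stdout.strip()

# With a pinned seed the order is reproducible across processes,
# which is what CI relies on for deterministic test runs.
assert set_order("0") == set_order("0")
```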

If this fails, fix locally before pushing.

PostHog Logs (structured backend observability)

TuxSEO can ship structured backend logs to PostHog Logs over OTLP HTTP.

  • Web request logs include correlation IDs (request_id, trace_id) via middleware.
  • Background and AI generation jobs bind task_id/job_id so failures can be traced end-to-end.
  • A redaction layer strips common secrets/tokens and emails before shipping.
  • Export is async + batched to avoid adding request latency.

Quick setup:

  1. Set POSTHOG_API_KEY.
  2. Set POSTHOG_LOGS_ENABLED=true in production.
  3. Optionally override POSTHOG_LOGS_ENDPOINT (default: https://us.i.posthog.com/v1/logs).

See docs/posthog-logs.md for full configuration and field-level behavior.

PostHog LLM analytics (PydanticAI flows)

TuxSEO emits $ai_generation events from key PydanticAI generation flows via run_agent_synchronously(...).

  • Includes model, latency, token usage (when available), flow path, and failure context.
  • Designed for PostHog LLM analytics views to compare performance/cost by feature path.

See docs/posthog-llm-analytics.md for event schema and verification steps.

Product analytics event taxonomy (PostHog)

TuxSEO maintains a canonical event taxonomy and coverage matrix for funnel-safe product analytics:

  • docs/event-taxonomy.md
  • docs/posthog-event-coverage-matrix.md

core/analytics/event_taxonomy.json is the source of truth for canonical event names + required properties.

PostHog dashboard pack (ops + funnel + LLM + paid attribution)

The first-pass operational + product + LLM + paid acquisition dashboard set is documented (with live links) in:

  • docs/posthog-dashboards.md

Use scripts/posthog_dashboard_bootstrap.py to create/update all dashboard tiles idempotently.

Paid acquisition attribution foundation (Meta/Google/Reddit/X)

TuxSEO now persists first-touch/latest-touch attribution and enriches server-side conversion events with normalized acquisition fields.

  • canonical schema + guardrails: docs/acquisition-attribution-v1.md
  • source of truth for event names: core/analytics/event_taxonomy.json
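The first-touch/latest-touch distinction can be sketched as follows; this is an illustrative model, not the repo's actual persistence code:

```python
def record_touch(attribution: dict, source: str) -> dict:
    """First-touch is written once and never overwritten;
    latest-touch is updated on every attributed visit."""
    attribution.setdefault("first_touch", source)
    attribution["latest_touch"] = source
    return attribution

user = {}
record_touch(user, "meta")     # first ad click
record_touch(user, "google")   # later visit
assert user == {"first_touch": "meta", "latest_touch": "google"}
```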

Analytics ingestion jobs (GA4, GSC, Plausible)

TuxSEO now ships background ingestion for connected analytics providers:

  • Google Analytics 4 (GA4)
  • Google Search Console (GSC)
  • Plausible

Implementation highlights:

  • Incremental sync cursor per (project, provider, source_account_ref) with a rolling lookback window.
  • Raw source snapshots (AnalyticsSourceSnapshot) + normalized daily facts (AnalyticsFactDaily).
  • Idempotent upsert semantics for safe retries.
  • Provider-aware retry/backoff behavior with observable cursor status and sanitized errors.
  • Missing/disconnected integrations are skipped without failing the whole scheduling run.
  • Project Home includes an Analytics (GA4/GSC/Plausible) section that surfaces:
    • connected-source status badges
    • 30-day KPI rollups (clicks, impressions, sessions, users, conversions)
    • derived rates (CTR, engagement, conversion) and avg GSC position
    • trend deltas (recent 7d vs prior 7d)
    • top low-CTR/high-impression opportunities with actionable on-page SEO suggestions

Scheduled entrypoint:

  • core.scheduled_tasks.schedule_project_analytics_syncs

Worker task entrypoint:

  • core.tasks.sync_project_integration_analytics(project_id, provider)

Periodic sitemap sync for “Your Pages”

TuxSEO now auto-refreshes sitemap-backed project pages on a configurable schedule.

  • batch entrypoint: core.tasks.sync_all_projects_with_sitemaps
  • single-project sync: core.tasks.parse_sitemap_and_save_urls(project_id)
  • manual trigger endpoint (session auth): POST /api/project/{project_id}/sitemap/sync-now/

Behavior:

  • only projects with a valid sitemap URL are eligible
  • per-project lock prevents overlapping syncs
  • supports both urlset sitemap XML and sitemap index files
  • upserts discovered URLs and marks missing sitemap URLs as stale instead of deleting
  • logs per-project counters: discovered, added, updated, stale, failed
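The upsert-and-mark-stale behavior amounts to a set reconciliation per sync. A hypothetical sketch (names and shapes are mine, not the repo's actual code):

```python
def reconcile_sitemap_urls(known: set[str], discovered: set[str]) -> dict:
    """Upsert newly discovered URLs and mark URLs that vanished from the
    sitemap as stale instead of deleting them, then report the counters
    that the sync logs per project."""
    return {
        "discovered": len(discovered),
        "added": sorted(discovered - known),
        "updated": sorted(discovered & known),
        "stale": sorted(known - discovered),
    }

result = reconcile_sitemap_urls(
    {"https://a.example/1", "https://a.example/2"},
    {"https://a.example/2", "https://a.example/3"},
)
assert result["added"] == ["https://a.example/3"]
assert result["stale"] == ["https://a.example/1"]
```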

Key env settings:

  • SITEMAP_SYNC_SCHEDULER_ENABLED (default true)
  • SITEMAP_SYNC_INTERVAL_HOURS (default 6)
  • SITEMAP_SYNC_TIMEOUT_SECONDS, SITEMAP_SYNC_MAX_RETRIES, SITEMAP_SYNC_RETRY_BACKOFF_SECONDS
  • USE_REDIS_CACHE (enable shared Redis cache for cross-worker sync locks)

Custom post types for blog idea generation

Projects can now define reusable custom post types under the Posts sidebar:

  • open Posts → Manage Types (or click the + action next to Posts)
  • create a type with:
    • a unique, clean name (validated per project)
    • prompt guidance (required, max length enforced)
  • each custom type appears directly in Posts navigation
  • selecting a custom type opens a dedicated generation page that applies its guidance automatically to both:
    • title suggestion generation
    • full article generation from those suggestions
  • built-in post types continue to use the default generation behavior unchanged

Deterministic content quality evaluation

Run the deterministic rubric evaluation locally:

make test-content-quality

This executes core/tests/test_content_quality_evaluation.py, scores the good/medium/bad fixtures with fixed rubric weights, and writes artifacts/content-quality-report.json.
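A fixed-weight rubric score is, in essence, a weighted mean over per-criterion scores. An illustrative sketch (the criteria and weights here are made up; the real ones live in the test module):

```python
def rubric_score(scores: dict[str, float], weights: dict[str, float]) -> float:
    """Weighted mean over rubric criteria; deterministic because both
    the weights and the fixture inputs are fixed."""
    total = sum(weights.values())
    return sum(scores[name] * weight for name, weight in weights.items()) / total

weights = {"clarity": 2.0, "depth": 1.0, "seo": 1.0}   # illustrative weights
good = {"clarity": 0.9, "depth": 0.8, "seo": 0.7}      # illustrative fixture
assert abs(rubric_score(good, weights) - 0.825) < 1e-9
```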

Safe baseline update procedure:

  1. Run make test-content-quality and confirm the change is expected.
  2. Review the generated artifacts/content-quality-report.json score deltas.
  3. Re-run with explicit baseline update mode:
UPDATE_CONTENT_QUALITY_BASELINE=1 make test-content-quality
  4. Commit core/tests/fixtures/content_quality_baseline.json together with the scoring logic change.

Internal API (BlogPost CRUD)

Internal blog post management endpoints are available under /api/internal/blog-posts and are protected by the superuser API key query param (?api_key=...).

  • POST /api/blog-posts/submit — create (existing endpoint, kept for compatibility)
  • GET /api/internal/blog-posts — list
  • GET /api/internal/blog-posts/{id} — retrieve
  • PUT /api/internal/blog-posts/{id} — full update
  • PATCH /api/internal/blog-posts/{id} — partial update
  • DELETE /api/internal/blog-posts/{id} — delete

Requests with a missing or non-superuser API key are rejected with an unauthorized response.
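A minimal client sketch for these endpoints; BASE_URL is a placeholder and the helper name is mine, but the paths and the ?api_key= query param follow the list above:

```python
import urllib.parse

BASE_URL = "https://tuxseo.example"  # placeholder: your deployment's URL

def internal_url(path: str, api_key: str) -> str:
    """Build an internal API URL carrying the superuser api_key query param."""
    return f"{BASE_URL}{path}?{urllib.parse.urlencode({'api_key': api_key})}"

# e.g. with requests (not imported here):
#   requests.get(internal_url("/api/internal/blog-posts", API_KEY))
#   requests.delete(internal_url("/api/internal/blog-posts/123", API_KEY))
assert internal_url("/api/internal/blog-posts", "s3cret") == (
    "https://tuxseo.example/api/internal/blog-posts?api_key=s3cret"
)
```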

Star History

Star History Chart
