CI/CD Documentation

GitHub Actions Workflow

This project uses these GitHub Actions workflow files for pull request validation, image publishing, deploy promotion, security scanning, scheduled evals, and dependency automation:

.github/workflows/pr.yml – main application and workspace checks (lint, Fallow audit, type-check, web tests, recommendation evals, Storybook tests, accessibility checks, e2e smoke tests, build, service CI, dependency review)
.github/workflows/recommendation-real-data-evals.yml – scheduled/manual recommendation evals against a seeded database and real catalog retrieval
.github/workflows/container-images.yml – production container image builds for app and service runtimes, published to GitHub Container Registry with commit and PR provenance metadata
.github/workflows/movie-discovery-ci.yml – TypeScript compilation and tests for the services/movie-discovery service, triggered when files under services/movie-discovery/ change or when .github/workflows/movie-discovery-ci.yml itself changes
.github/workflows/codeql.yml – CodeQL analysis for GitHub Actions and JavaScript/TypeScript on pushes, pull requests, and a weekly schedule
.github/workflows/dependabot-auto-merge.yml – enables auto-squash merge for Dependabot minor and patch update PRs after required checks pass

The PR validation workflows run automatically on pull requests targeting the development branch. Jobs run in parallel to minimise feedback time, with each job focused on a single concern. CodeQL also runs on pushes to development and on a weekly schedule.

Jobs

Workflow	Job	Purpose
`pr.yml`	`changes`	Classifies PR as docs-only vs code-changing
`pr.yml`	`lint`	ESLint code quality + Prettier formatting
`pr.yml`	`fallow-audit`	Required new-only Fallow audit for changed TypeScript/JavaScript quality risks
`pr.yml`	`type-check`	TypeScript type safety (`tsc --noEmit`)
`pr.yml`	`server-tests`	Vitest server tests with coverage collection and artifact upload
`pr.yml`	`backoffice-tests`	Shared/UI builds, backoffice quality checks, and backoffice Vitest suite
`pr.yml`	`recommendation-evals`	Deterministic recommendation quality fixtures and scoring
`pr.yml`	`storybook-tests`	Playwright browser install + Storybook component tests
`pr.yml`	`e2e-tests`	Playwright smoke tests against isolated PostgreSQL + Redis services
`pr.yml`	`a11y-tests`	axe accessibility smoke tests against seeded app flows
`pr.yml`	`build`	Next.js production build verification
`pr.yml`	`docs-build`	Documentation type-check and production docs build
`pr.yml`	`services-ci`	Shared package tests plus Turbo test pass for `services/*`
`pr.yml`	`dependency-review`	Blocks PRs introducing vulnerable dependencies
`pr.yml`	`pr-validation`	Stable required check that always runs on every PR
`container-images.yml`	`validate-deploy-request`	Resolves manual deploy target and enforces matching release refs
`container-images.yml`	`build-images`	Builds and publishes GHCR production images with provenance
`container-images.yml`	`deploy-development`	Triggers the development Coolify deploy after development images publish
`container-images.yml`	`deploy-production`	Promotes production tags and triggers the gated production Coolify deploy
`movie-discovery-ci.yml`	`movie-discovery-ci`	TypeScript compilation and tests for `services/movie-discovery`
`codeql.yml`	`analyze`	CodeQL static analysis for Actions and JavaScript/TypeScript
`recommendation-real-data-evals.yml`	`recommendation-real-data-evals`	Manual/scheduled seeded-database catalog retrieval evals
`dependabot-auto-merge.yml`	`auto-merge`	Enables auto-squash merge for Dependabot minor and patch update PRs

Workflow Features

Parallel Jobs

Each concern (lint, type-check, tests, build) runs as an independent job. A failure in one job does not prevent the others from running, giving fast, complete feedback on every PR.

Hard Failure on Playwright Install

The storybook-tests job runs npx playwright install-deps on every CI run and npx playwright install without a fallback when browsers are not already cached. If Playwright installation fails, the job fails visibly rather than silently skipping the tests.

Playwright Browser Caching

The storybook-tests job caches Playwright browser binaries in ~/.cache/ms-playwright using actions/cache@v5, keyed on the OS and package-lock.json hash. On a cache hit, the browser download step is skipped, but npx playwright install-deps still runs because Linux system packages are not part of the Playwright browser cache.

The a11y-tests job intentionally avoids this cache path. It runs inside the pinned official mcr.microsoft.com/playwright:v1.61.0-noble container, matching the playwright package version in package-lock.json, so Chromium and system dependencies are already present in the image. The job uses GitHub service containers for PostgreSQL and Redis, then runs the e2e migration/seed script with E2E_SKIP_DOCKER=1.

E2E Smoke Tests

The e2e-tests job runs npm run test:e2e. That command starts the isolated docker-compose.e2e.yml services, applies the same db/init migrations used by production/local setup, seeds deterministic movie fixtures, starts the web app on port 3100, starts the backoffice app on port 3101, and runs both Playwright suites. The job always runs npm run test:e2e:down afterwards so the disposable database volume is removed.

The e2e coverage proves the real app can read seeded catalog data from PostgreSQL, that /api/health can reach both PostgreSQL and Redis, and that auth/session, catalog empty states, quiz submission, deterministic result rendering, feedback, movie-memory persistence, backoffice catalog repair enqueue/audit visibility, and TMDB review decisions work through the browser. AI-eval coverage is tracked separately so model quality gates can use fixture scoring and optional live-provider runs.

Accessibility Tests

The a11y-tests job runs npm run test:a11y:run after preparing the same deterministic movie fixtures used by e2e. The Playwright config lives at apps/web/playwright.a11y.config.ts, and the tests live under apps/web/a11y/.

The suite uses @axe-core/playwright and blocks critical or serious WCAG 2 A/AA, WCAG 2.1 A/AA, and axe best-practice violations on the public entrypoints, auth forms, catalog page, and an authenticated deterministic results page. Full Playwright output is uploaded as the playwright-a11y-report artifact.

Recommendation Evals

The recommendation-evals job runs npm run eval:recommendations. This is a CI-blocking deterministic suite: it uses fixture prompts, fixture memory constraints, and mocked model outputs, then writes apps/web/test-results/recommendation-evals/report.json. The report is uploaded as the recommendation-eval-report artifact.

Default evals do not require OPENAI_API_KEY, TMDB_API_KEY, Redis, or PostgreSQL. Real-data evals use npm run eval:recommendations:real-data after preparing the seeded e2e database; they validate catalog connectivity, candidate availability, and catalog search retrieval while keeping model output controlled. Operators can also queue mock and real-data evals from the backoffice, where run summaries and per-case results are persisted in PostgreSQL and processed by the recommendation-evals BullMQ worker. Optional live OpenAI evals use npm run eval:recommendations --workspace=apps/web -- --live or the guarded backoffice live form. They persist the provider response for inspection and are intentionally manual because they can spend API credits, depend on provider availability, and may need seeded local data.

Recommendation checks are intentionally layered:

Per-PR CI: deterministic fixtures and mocked model outputs via npm run eval:recommendations.
Scheduled, manual, or backoffice real-data eval: seeded database and real catalog retrieval, but still controlled model output by default. Use this for schema, seed/backfill, retrieval, and candidate-availability changes. The Recommendation Real-Data Evals workflow runs weekly and can be triggered manually; backoffice runs are useful when an operator wants to test the current environment/catalog without leaving the admin surface.
Manual live OpenAI eval: live provider-backed recommendation calls via npm run eval:recommendations -- --live or the guarded backoffice form. This is not a default hard gate because provider responses, rate limits, and API cost can make it noisy.

Agents and reviewers should explicitly consider these layers when a PR changes recommendation prompts, embeddings, OpenAI/TMDB integration, candidate filtering/ranking, feedback signals, or catalog data feeding recommendations.

Code Coverage

Server tests run with --coverage via @vitest/coverage-v8. Coverage reports (HTML, JSON, LCOV) are uploaded as a GitHub Actions artifact named coverage-report and retained for 30 days. Coverage is configured in vitest.config.ts.

Backoffice Tests

The backoffice-tests job runs npm run test:backoffice for code-changing PRs. That root script builds packages/shared, builds packages/ui, runs the backoffice quality gate, and then runs the apps/backoffice Vitest suite.

Docs Build

The docs-build job always runs from pr.yml, including docs-only PRs. It runs npm run type-check:docs and npm run build:docs so documentation routing, MDX compilation, and production build behavior are verified before merge.

Fallow Code Quality Audit

The fallow-audit PR job runs npm run quality:fallow:audit -- --changed-since origin/development --format compact. It is a required new-only gate for code-changing PRs and is included in the required PR Validation aggregate. The job fails when a changeset introduces unused-code, duplication, or complexity findings that Fallow reports against the changed files. The audit runs from the repository root, so findings can come from any npm workspace, not only apps/web.

Google Fonts Build Reliability

The build job sets NEXT_FONT_GOOGLE_DISABLE=1 to prevent flaky failures caused by network access to Google Fonts in restricted CI environments.

Services CI

The services-ci job in pr.yml runs npm run test:services, which first runs packages/shared tests and then delegates to Turbo for root services/* packages that define a test script. Services without a test script are skipped by that command; use npm run build:services locally when you specifically need a compile pass across service workspaces.

Movie Discovery CI

The movie-discovery-ci job lives in its own workflow (movie-discovery-ci.yml) and is triggered when files inside services/movie-discovery/ change or when .github/workflows/movie-discovery-ci.yml itself changes. It installs dependencies, runs tsc (npm run build) to ensure the service always compiles correctly, and runs the service's Vitest test suite.

CodeQL

The CodeQL workflow analyzes GitHub Actions and JavaScript/TypeScript on pushes and pull requests targeting development, plus a weekly scheduled run.

Dependency Review

The dependency-review job uses actions/dependency-review-action to block any PR that introduces a dependency with a known vulnerability. It does this by failing the GitHub Actions check, which can then prevent merging when that check is required. The workflow only grants contents: read for this job; it does not require pull-requests: write.

Prebuilt Container Images

container-images.yml builds production Docker images for:

ghcr.io/<owner>/<repo>/web
ghcr.io/<owner>/<repo>/workers
ghcr.io/<owner>/<repo>/bull-board
ghcr.io/<owner>/<repo>/storybook
ghcr.io/<owner>/<repo>/docs
ghcr.io/<owner>/<repo>/db-migrate
ghcr.io/<owner>/<repo>/movie-discovery
ghcr.io/<owner>/<repo>/movie-backfill

On pull requests from the same repository, the workflow publishes:

sha-<12-char-github-sha> – exact checked commit used by the workflow
pr-<number> – moving PR tag for the latest image built for that PR

On pushes to development, it also publishes development. Manual workflow_dispatch runs can publish development from the development ref or promote production from the main ref, depending on the selected deploy environment.

Each image receives OCI labels for the repository, workflow run, checked commit, source PR branch, source PR head commit, and image role. Published runs also upload a container-image-<role> artifact containing the digest-pinned reference (ghcr.io/.../<role>@sha256:...). Downstream previews and deploys should consume the digest-pinned reference from that artifact or from GHCR rather than rebuilding from the monorepo.

Forked pull requests still build the images for validation, but publishing is disabled because the GitHub token does not have package write permission.

Coolify consumes the published images through coolify.compose.yml. The compose file does not build application images on the server; it pulls every PopChoice runtime from:

APP_IMAGE_PREFIX=ghcr.io/<owner>/<repo>
IMAGE_TAG=development

For simple continuous deployment to the shared development Coolify resource, keep IMAGE_TAG=development, set DEPLOYMENT_ENVIRONMENT=development, and let the development deploy job run after the image matrix succeeds. Treat this as staging, not production promotion. The current development VPS resource uses https://dev.pop-choice.shchilkin.dev as its deploy verification base URL.

For production, create a separate Coolify resource with its own database, Redis, volumes, domains, and secrets. The GitHub workflow supports a gated promotion path: after staging is accepted, merge development into main, manually run Container Images on the main branch, choose deploy_environment=production, approve the GitHub Environment if required, and let the job publish the moving production image tag before triggering the production Coolify webhook. The production resource should use IMAGE_TAG=production and DEPLOYMENT_ENVIRONMENT=production for that path. The current production VPS resource uses https://pop-choice.shchilkin.dev as its deploy verification base URL.

The immutable sha-<12-char-github-sha> tags are still published for audit and rollback. If you need a fully pinned rollback, set the production Coolify resource IMAGE_TAG to the previous known-good sha tag and redeploy manually. Every PopChoice service must use the same IMAGE_TAG; mixing web, workers, bull-board, storybook, docs, and service images from different commits is not a supported deployment shape.

Use GitHub Environments named development and production for deploy secrets. Store COOLIFY_DEPLOY_WEBHOOK and COOLIFY_TOKEN separately in each environment so the same workflow can deploy either resource without sharing webhook URLs. COOLIFY_TOKEN must be a Coolify API token with deploy permission because the deploy webhook is authenticated with Authorization: Bearer <token>. The production GitHub Environment should require manual reviewers. When the webhook secret is absent, images are still published, but deployment is skipped for the development Environment. Production promotions fail when either production deploy secret is missing, because updating the production image tag without triggering the production resource would create an ambiguous release state. When the webhook is present but the token is missing, the deploy job fails to make the misconfiguration visible.

The deploy job also supports deploy-aware observability hooks:

Set GRAFANA_URL and GRAFANA_SERVICE_ACCOUNT_TOKEN to create a short Grafana silence for alerts labeled noise_profile=deploy-sensitive before the Coolify webhook runs.
Set POPCHOICE_DEPLOY_VERIFY_BASE_URL in each GitHub Environment to poll the matching public /api/health and /api/build after the webhook. The job fails if the deployed resource does not recover within the retry budget. POPCHOICE_PRODUCTION_BASE_URL remains supported as a backwards-compatible fallback.
Set POSTDEPLOY_SEED_ENABLED=true, BACKOFFICE_BASE_URL, and BACKOFFICE_AUTOMATION_TOKEN in a GitHub Environment to queue the curated movie seed through backoffice after deploy verification succeeds. The automation token must match BACKOFFICE_AUTOMATION_TOKEN in the deployed backoffice runtime.

If Grafana secrets are absent, deploys continue without creating a temporary silence. If POPCHOICE_DEPLOY_VERIFY_BASE_URL is absent, the deploy webhook can still run, but post-deploy health/build verification is skipped by scripts/verify-production-deploy.sh.

For provenance in /api/build, pass these non-secret runtime variables to the deployed web container when using a prebuilt image:

APP_COMMIT_SHA=<github workflow commit sha>
APP_GIT_BRANCH=<github ref name>
APP_PR_NUMBER=<pull request number, if any>
APP_IMAGE_REPOSITORY=ghcr.io/<owner>/<repo>/web
APP_IMAGE_TAG=sha-<12-char-github-sha>
APP_IMAGE_DIGEST=sha256:<image digest>
SOURCE_COMMIT=<pull request head sha, if any>
SOURCE_BRANCH=<pull request head ref, if any>

APP_IMAGE_DIGEST is only known after the image is pushed, so it is normally a deploy-time environment variable rather than a Docker build argument. If the deployment uses the moving development tag, the image also contains baked fallback metadata from the build workflow so /api/build can still report the source commit.

Runtime Compatibility Rules

Treat the published image set as one release unit:

use one IMAGE_TAG for all PopChoice services in coolify.compose.yml
run database migrations before or during the web startup, then start workers against the same image tag
keep SQL migrations additive and idempotent; use an expand/backfill/switch pattern before removing old columns or changing meanings
version BullMQ job payloads when changing worker contracts, and keep workers tolerant of older queued jobs
keep API changes additive between web, workers, Bull Board, and future backoffice apps unless a coordinated release and data migration is planned

Workflow Triggers and Path Filtering

How Path Filtering Works

GitHub Actions supports two path-related filters on workflow triggers:

paths – the workflow runs only when at least one changed file matches a listed pattern.
paths-ignore – the workflow is skipped when all changed files match the listed patterns.

Both filters accept glob patterns such as src/** or **/*.md.

on:
  push:
    branches: ['main']
    # Only trigger when source files change
    paths:
      - 'src/**'
      - 'package*.json'

  pull_request:
    branches: ['main']
    # Skip when only documentation changes
    paths-ignore:
      - 'docs/**'
      - '*.md'

Path Filtering in This Repository

pr.yml now always triggers on pull requests to development, then classifies changed files in the changes job:

on:
  pull_request:
    branches: ['development']

If the PR is docs-only (docs/** and root-level *.md), code-focused jobs are skipped, docs-build still verifies the documentation app, and the lightweight PR Validation job still reports success. For non-doc PRs, the full CI suite runs and PR Validation verifies all required CI jobs succeeded.

container-images.yml triggers on pull requests to development, pushes to development, and manual dispatches. Manual runs expose a deploy_environment input with development, production, and none choices; Coolify deploys are only allowed from the matching release ref: development deploys run from development, and production deploys run from main. The workflow intentionally does not use docs-only path filtering: image-build changes often span workflow files, Dockerfiles, package manifests, and deployment docs, and the workflow itself is the source of truth for the deployable artifact.

movie-discovery-ci.yml uses paths so that the service CI only runs when the service source actually changes:

on:
  pull_request:
    branches: ['development']
    paths:
      - 'services/movie-discovery/**'
      - '.github/workflows/movie-discovery-ci.yml'

Best Practices

Practice	Details
Prefer always-running required workflows	If branch protection requires a check, keep the workflow trigger unconditional and gate expensive jobs with job-level conditions instead of `paths-ignore` at the workflow level.
Prefer `paths` for isolated services	Use `paths` when a job is only relevant to a single subdirectory (e.g., a microservice).
Do not mix `paths` and `paths-ignore`	GitHub does not allow both filters on the same event.
Account for required status checks	If a workflow with path filtering is a required status check, GitHub will report the check as "skipped" (not "passed") when the workflow does not run. Skipped checks do not satisfy branch protection rules. A safe workaround is to add an always-run "noop" job that reports success when the paths filter is not met — avoid `pull_request_target` for this purpose, as it runs with base-branch permissions and can expose secrets when combined with PR code checkout. See Troubleshooting required status checks.
`schedule` events ignore path filters	Scheduled (`cron`) workflows always run regardless of any `paths` or `paths-ignore` configuration.

Reference Links

Workflow Trigger

The PR validation workflows are triggered on:

Pull request events targeting the development branch (branches: ['development'])
Both opening PRs and pushing new commits to existing PRs targeting development

pr.yml always runs and classifies docs-only PRs inside the workflow so PR Validation is always reported. container-images.yml builds production images on pull requests and on pushes to development; manual runs can also publish development or production promotion tags before triggering the matching Coolify resource. Its validate-deploy-request job enforces that development deploys run from the development ref and production deploys run from main. movie-discovery-ci.yml additionally only runs when at least one file under services/movie-discovery/** changes or when .github/workflows/movie-discovery-ci.yml itself changes. codeql.yml runs on pushes and pull requests targeting development, plus its weekly scheduled scan. dependabot-auto-merge.yml runs on pull requests to development, but its auto-merge job only executes for PRs opened by dependabot[bot].

Dependabot

Dependabot is configured in .github/dependabot.yml to monitor:

npm (root) – workspace dependencies, grouped by framework, AI, database, UI utilities, testing, Storybook, build tools, linting/formatting, and miscellaneous production/development buckets
npm (services/movie-discovery) – movie-discovery service dependencies, grouped by production, development, and security updates
GitHub Actions – workflow action versions

All groups cover minor and patch updates. Major updates still require manual review.

Minor and patch Dependabot PRs are eligible for auto-squash merge through .github/workflows/dependabot-auto-merge.yml. Branch protection still controls when the queued auto-merge can complete because required checks must pass first.

Local Testing

To run the same checks locally before pushing:

# Code quality
npm run lint:check
npm run format:check
npm run type-check

# Server tests with coverage
npx vitest --project=server --run --coverage

# Storybook tests (requires Playwright browsers)
npm run pretest:storybook --workspace=apps/web
npm run test:storybook

# Verify build
NEXT_FONT_GOOGLE_DISABLE=1 npm run build

# Service test pass
npm run test:services

# Focused movie discovery CI reproduction
npx turbo run build test --filter=./services/movie-discovery

Troubleshooting

Common Issues

Playwright Installation Failures
- The storybook-tests job will now fail explicitly — check runner disk space and OS compatibility for Playwright browsers.
Google Fonts Network Timeouts
- Ensure NEXT_FONT_GOOGLE_DISABLE=1 is set in the build step (already configured in CI).
Build Failures
- Verify all required environment variables are present as repository secrets (OPENAI_API_KEY).
Service Type or Test Errors
- Run npm run test:services locally, then narrow with npx turbo run build test --filter=./services/<service-name>.
Movie Discovery Path-Filtered Workflow Errors
- Run npx turbo run build test --filter=./services/movie-discovery locally to reproduce the dedicated workflow.

The workflow helps maintain code quality and functionality across all pull requests.