Architecture Roadmap (Current Workspace, Next Boundary Steps)

Purpose

This roadmap exists to guide architectural refactoring in a practical, low-risk way so PopChoice stays maintainable as it grows. The repository already uses a workspace-based layout, so the focus now is not "move to a monorepo someday" but "make the current workspace boundaries explicit, enforceable, and easier to extract over time."

Current State

The repository is maintainable today and productive for ongoing work.

What is already true:

the repo already uses npm workspaces with apps/*, packages/*, and services/*
the web app and supporting apps already live under apps/
background jobs and sync processes already live under root services/*
a shared package already exists in packages/shared
a shared UI primitives package already exists in packages/ui
recommendation orchestration now lives in reusable feature-owned modules outside route ownership
account, password reset, recommendation feedback, and movie-memory flows exist for signed-in users

The main remaining risks are:

boundary clarity across app/domain/infrastructure concerns
extractability of logic into independently owned modules over time
some API boundaries still retain more orchestration awareness than the target boundary model allows

Status Snapshot

Done Already

Issues Still Present

The current quiz is still a first-generation guided flow. It should evolve toward a signal-based recommendation model with explicit Solo/Duo/Group audience modes, Fast/Normal effort modes, a shorter "tonight" quiz, a swipe-based mode for movie-heavy users, and a TMDB-first catalog strategy. See RECOMMENDATION-ROADMAP.md.
Route-local compatibility re-export files still exist under src/app/api/movie-recommendation; future recommendation changes should continue moving real logic into src/features/recommendation.
Account settings/profile/provider identity remain intentionally thin.

Reference: use BOUNDARIES.md as the current ownership baseline.

Backlog Hygiene

Create a GitHub issue for every actionable roadmap ticket before or alongside adding it to this document.
If a roadmap item is too large for one PR, keep the original issue as an epic/umbrella and create focused child issues for implementation-sized work.
Link roadmap items to concrete issues whenever possible so completed work can be checked off and future agents do not have to rediscover context.
Keep unlinked bullets for direction-setting only; convert them into issues once they become actionable.

Target Direction

PopChoice should continue evolving within the current workspace layout toward clearer ownership and easier extractability, for example:

apps/web
apps/docs
apps/bull-board
apps/backoffice
services/movie-discovery
services/movie-backfill
services/db-migrate
packages/domain
packages/config
packages/clients
packages/shared
packages/ui

This is an extraction direction, not a mandate for large-scale file moves right now. Current work should optimize for clean boundaries inside the existing workspace first.

UI extraction should follow UI Development: one Storybook runner loads stories from app workspaces and packages/ui, while shadcn-derived primitives move into the shared UI package when reuse is real and the primitive stays domain-free.

Guiding Principles

Prefer explicit boundaries over convenience imports.
Keep route handlers thin; move orchestration into reusable modules.
Separate domain logic from infrastructure details (DB, API clients, queues).
Minimize coupling and avoid hidden dependencies through broad utility barrels.
Standardize shared tooling only after boundaries are stable.
Refactor in small, reversible steps with existing tests and CI gates.

Phased Plan

Phase 1: Stabilize boundaries in the current workspace

Define and document ownership for app, domain, infrastructure, and shared helpers.
Reduce cross-layer imports that bypass intended boundaries.
Keep API routes focused on validation, orchestration calls, and response mapping.
Align docs with the real workspace structure before making larger architectural claims.
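
To make the "validation, orchestration calls, and response mapping" split concrete, here is a minimal sketch of a thin handler. All names (RecommendationRequest, CreateRecommendation, handleRecommendationRequest) are hypothetical and chosen for illustration; they are not PopChoice's actual modules or payload shapes.

```typescript
// Hypothetical request/response shapes, for illustration only.
const AUDIENCES = ["solo", "duo", "group"] as const;
type Audience = (typeof AUDIENCES)[number];

type RecommendationRequest = { mood: string; audience: Audience };
type RecommendationResult = { id: string; titles: string[] };
type HttpResponse = { status: number; body: unknown };

// Orchestration is injected, so the handler owns no business logic.
type CreateRecommendation = (input: RecommendationRequest) => RecommendationResult;

function isAudience(value: unknown): value is Audience {
  return typeof value === "string" && (AUDIENCES as readonly string[]).includes(value);
}

// Step 1: validation. Returns null when the payload is unusable.
function parseRequest(payload: unknown): RecommendationRequest | null {
  if (typeof payload !== "object" || payload === null) return null;
  const { mood, audience } = payload as Record<string, unknown>;
  if (typeof mood !== "string" || mood.trim() === "") return null;
  if (!isAudience(audience)) return null;
  return { mood: mood.trim(), audience };
}

function handleRecommendationRequest(
  payload: unknown,
  createRecommendation: CreateRecommendation,
): HttpResponse {
  const input = parseRequest(payload);
  if (input === null) {
    return { status: 400, body: { error: "Invalid request" } };
  }
  // Step 2: a single orchestration call.
  const result = createRecommendation(input);
  // Step 3: response mapping only.
  return { status: 200, body: { id: result.id, titles: result.titles } };
}
```

Because the orchestration function is passed in, the same handler can be exercised in tests with a stub, and the real pipeline can move between modules without touching the route.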

Phase 2: Refactor for extractability

Move reusable domain logic into cohesive modules that can be lifted out later.
Isolate external integrations behind stable client/service interfaces.
Centralize configuration and constants to reduce manual synchronization.
Prefer extracting recommendation flows out of src/app/api into clearer domain-owned modules.
Refactor the quiz submission lifecycle so recommendation creation is modeled explicitly instead of coordinated through route-local useEffect, refs, and navigation timing.
Keep account/movie-memory orchestration behind feature-owned modules as the API surface grows beyond the current service extraction.
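
One way to model the quiz submission lifecycle explicitly is a small pure state machine, sketched below. The state names, event names, and requestId field are assumptions made for illustration; the actual lifecycle may need more states (for example, queued or polling).

```typescript
// Hypothetical submission lifecycle; not the current PopChoice implementation.
type SubmissionState =
  | { status: "idle" }
  | { status: "submitting"; requestId: string }
  | { status: "succeeded"; recommendationId: string }
  | { status: "failed"; message: string };

type SubmissionEvent =
  | { type: "submit"; requestId: string }
  | { type: "resolved"; requestId: string; recommendationId: string }
  | { type: "rejected"; requestId: string; message: string }
  | { type: "reset" };

function transition(state: SubmissionState, event: SubmissionEvent): SubmissionState {
  switch (event.type) {
    case "submit":
      // Ignore duplicate submits while a request is already in flight.
      return state.status === "submitting"
        ? state
        : { status: "submitting", requestId: event.requestId };
    case "resolved":
      // Only the in-flight request may complete; stale responses are dropped.
      return state.status === "submitting" && state.requestId === event.requestId
        ? { status: "succeeded", recommendationId: event.recommendationId }
        : state;
    case "rejected":
      return state.status === "submitting" && state.requestId === event.requestId
        ? { status: "failed", message: event.message }
        : state;
    case "reset":
      return { status: "idle" };
  }
}
```

With transitions expressed as data, navigation can react to the succeeded state instead of depending on effect ordering or ref timing, and the reducer can be unit-tested without rendering a component.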

Phase 3: Extract intentional packages from the existing layout

Use the existing apps/, services/, and packages/ layout more intentionally as boundaries become clearer.
Move modules incrementally to preserve behavior and delivery speed.
Keep migration steps explicit and reversible.

Phase 4: Standardize shared tooling/config/testing

Consolidate shared TypeScript, lint, formatting, and test conventions.
Reuse config packages where duplication exists across app/services.
Align CI checks to the new boundaries and ownership model.
Adopt Fallow code-quality analysis in #684, with PR-local Fallow Audit promoted to a required new-only gate.
Extend Fallow cleanup beyond apps/web through #697 for backoffice/docs and #698 for shared/services.
Drive inherited repo-wide Fallow complexity to zero actionable findings through #717, split into focused worker, recommendation, persistence, eval, backoffice, account/results, auth, catalog, and UI helper cleanup issues.
Improve the repo-wide Fallow health score through #766 and #772. After PR #773, the root health score is 87.9 (A) with zero complexity and dead-code findings.

CI/CD and Deployment Track

Build production container images once in GitHub PR/CI, publish them with commit/PR metadata, and document how preview or downstream deploys run those already-built images instead of rebuilding the monorepo in each deployment environment.
Preserve provenance between a PR check, container digest, deployed preview, and /api/build metadata through GHCR labels, digest artifacts, runtime image metadata, and Docker-baked fallbacks.
Run Coolify from GHCR images via a single IMAGE_TAG release bundle instead of compiling PopChoice services on the VPS.
Add an optional Coolify deploy webhook path after successful development image publishing.
Define the staged local -> development -> production deployment model in #556, including DNS wildcard usage, Coolify service-domain mapping, development vs production resources, immutable production image tags, and preview cleanup/certificate rate-limit guidance.
Harden the preview certificate strategy in #545 by documenting single-label wildcard limits, preview cleanup, generated-domain alternatives, and Let's Encrypt registered-domain rate-limit debugging.
Keep deployment-time work focused on migrations, health checks, runtime configuration validation, and compatibility checks rather than application compilation.
Add a dedicated migration release gate so database migrations are visibly completed before rolling long-running web/worker services.
Add release compatibility checks that verify all running PopChoice services report the same commit/image tag.
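
The release compatibility check in the last item could be as small as the sketch below. It assumes each service reports its image tag (for example via build metadata) and that the expected tag comes from the IMAGE_TAG release bundle; the ServiceBuildInfo shape and function name are hypothetical.

```typescript
// Hypothetical shape of what each running service reports about its build.
type ServiceBuildInfo = { service: string; imageTag: string };

type CompatibilityResult = {
  compatible: boolean;
  mismatched: ServiceBuildInfo[];
};

function checkReleaseCompatibility(
  expectedTag: string,
  reports: ServiceBuildInfo[],
): CompatibilityResult {
  const mismatched = reports.filter((report) => report.imageTag !== expectedTag);
  // An empty report list is treated as incompatible: nothing was verified.
  return { compatible: reports.length > 0 && mismatched.length === 0, mismatched };
}

// Example usage with illustrative service names and tags.
const result = checkReleaseCompatibility("sha-abc123", [
  { service: "web", imageTag: "sha-abc123" },
  { service: "worker", imageTag: "sha-old999" },
]);
if (!result.compatible) {
  const names = result.mismatched.map((r) => `${r.service}@${r.imageTag}`).join(", ");
  console.error(`Release mismatch: ${names}`);
}
```

Treating "no reports" as a failure is a design choice in this sketch: a gate that passes when no service answered would hide exactly the kind of partial rollout it is meant to catch.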

Operational Observability Track

Security and Reliability Track

Add per-call OpenAI timeout handling with cancellation where supported, and map upstream timeout failures to clear 504-style API responses.
Add request body size limits for externally facing routes before expensive parsing, moderation, embedding, or recommendation work begins.
Add retry/backoff and circuit-breaker behavior for expensive external dependencies where retrying is safe.
Sanitize client-facing error responses so internal exception details, upstream payloads, and infrastructure hints stay out of API responses.
Validate required environment variables on application startup for web, workers, and root services so misconfigured deployments fail early. First slice: apps/backoffice and apps/bull-board now use process-specific plain Zod runtime configs from @pop-choice/shared; continue the same pattern for web, workers, and standalone catalog services.
Clarify idempotency and retry behavior for recommendation creation, worker retries, more-picks jobs, and failed queue recovery.
Add a shared operator login model for operational apps in #548. apps/bull-board now supports shared Basic Auth with optional fail-closed behavior, and future backoffice routes should reuse the same operator-auth contract instead of embedding admin access in the user-facing web app.
Add dependency/security scanning, static security checks, and periodic security review expectations to CI or maintenance workflows.
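
As an illustration of the timeout and error-sanitizing items above, the following sketch wraps an upstream call in an AbortController-based timeout and maps failures to client-safe responses. withTimeout, UpstreamTimeoutError, and toClientError are hypothetical helpers, and the response wording is a placeholder; the sketch does not assume any particular OpenAI client API beyond the caller accepting an AbortSignal.

```typescript
// Hypothetical error type used to tag upstream timeouts.
class UpstreamTimeoutError extends Error {
  readonly timeoutMs: number;
  constructor(timeoutMs: number) {
    super(`Upstream call exceeded ${timeoutMs} ms`);
    this.name = "UpstreamTimeoutError";
    this.timeoutMs = timeoutMs;
  }
}

// Runs `operation` with a deadline. The signal lets callers that support
// cancellation (for example fetch) stop the underlying request.
async function withTimeout<T>(
  operation: (signal: AbortSignal) => Promise<T>,
  timeoutMs: number,
): Promise<T> {
  const controller = new AbortController();
  let timer: ReturnType<typeof setTimeout> | undefined;
  const deadline = new Promise<never>((_, reject) => {
    timer = setTimeout(() => {
      controller.abort();
      reject(new UpstreamTimeoutError(timeoutMs));
    }, timeoutMs);
  });
  try {
    return await Promise.race([operation(controller.signal), deadline]);
  } finally {
    clearTimeout(timer);
  }
}

type ApiErrorResponse = { status: number; body: { error: string } };

// Maps internal failures to client-safe responses. Exception messages,
// upstream payloads, and infrastructure hints never reach the body.
function toClientError(error: unknown): ApiErrorResponse {
  if (error instanceof Error && error.name === "UpstreamTimeoutError") {
    return { status: 504, body: { error: "The upstream service timed out. Please try again." } };
  }
  return { status: 500, body: { error: "Something went wrong." } };
}
```

A route would call withTimeout around each upstream request, pass the signal through where the client supports it, and hand any caught error to toClientError while logging the original error server-side.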

Data Quality Track

Backoffice Operator Maturity Track

Continue post-MVP backoffice work under the follow-up epic #660, linked back to the original backoffice/catalog-health epic #493. The original epic established the dedicated app, catalog-health review, safe repair actions, durable repair batches, queue visibility, realtime updates, and duplicate-merge foundations. The maturity track keeps the next operator-console work implementation-sized:

#661: add deterministic e2e coverage for core operator flows such as catalog-health repair enqueueing, repair-batch triage, realtime queue state, TMDB review actions, and duplicate merge preview/submit once the UI exists.
#662: extract shared backoffice operator UI primitives for repeated page headers, toolbars, panels, status badges, data tables, empty states, error states, and action affordances.
#663: standardize backoffice action route contracts for same-origin/auth failures, validation errors, progressive-enhanced redirects, JSON responses, and public operator error messages.
#664: improve bulk repair recovery UX so operators can understand partial, failed, skipped, unavailable, and unresolved work, then retry only the failed or unavailable items.
#665: harden realtime queue resilience with visible live/stale/reconnecting/unavailable states, last successful snapshot timestamps, and manual refresh or polling fallback.
#666: complete TMDB review workflow follow-ups such as next-review navigation, clearer risk summaries, decision history, safe bulk affordances, and richer filters.
#667: add backoffice observability and security guardrails for operator action logs, metrics, same-origin/CSRF coverage, and secret-safe public error responses.
#668: improve backoffice developer experience with reusable fixtures, deterministic realtime/action test helpers, and clearer local validation docs.

Account Platform Track

Add a user profile model for display name, avatar, and account settings metadata.
Add account settings APIs and UI for profile edits, saved recommendation edits, and taste-profile management.
Design a provider identity model before adding magic-link or social login so local credentials and external providers can coexist cleanly.
Add saved-recommendation mutations for rename, annotate, remove, and organize actions without leaving the account page.
Make taste memory inspectable and editable so users can correct watched, liked, not-interested, and wrong-mood signals.
Plan scalable account memory views before the list grows: search, signal filters, pagination or virtualized lists, and compact rows for large watched/liked histories.

Accessibility and UI Quality Track

Add recurring accessibility checks for keyboard navigation, focus states, labels, color contrast, and reduced-motion behavior.
Add focused tests or visual checks for the quiz, loading/results handoff, account pages, and feedback controls.
Keep design-system examples aligned with production components so UI regressions are easier to spot before release.

Testing and Evaluation Track

Product Feedback Track

Expand the explicit movie-memory experience so watched/not-seen setup feels complete for users with large histories.
Expand liked feedback beyond exact candidate boosts into a richer positive taste signal once the canonical signal model exists.
Consider a separate "worth rewatching" angle for watched movies so strong matches can still appear intentionally, with copy that frames them as rewatch candidates instead of new discoveries.
Make the reason for reused titles transparent when feedback history intentionally allows a repeat.
Manual watched-list management, rewatch mode, richer preference editing, and gamified taste history can follow after the core memory behavior is stable.
Continue polishing the dedicated movie-memory experience around exact-title search, empty states, and large-history review now that deck state and batched submission are in place.
Keep manual movie search as a secondary escape hatch for exact titles. Movie memory and available-movies now both expose catalog search; extract a shared catalog-search component if another surface needs the same controls.
Treat posters and localized metadata as first-class data quality requirements for movie memory. Candidate cards should degrade gracefully, but missing poster coverage should be visible in catalog-health reporting.
Add a stable way to avoid recommending movies the user just marked as watched, not-interested, or wrong-mood, while still allowing an intentional "rewatch" recommendation mode later.
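
The exclusion rule in the last item can be sketched as a pure filter over candidates. The signal names follow the list above; the rule that rewatch mode readmits only watched titles (never not-interested or wrong-mood ones) is an assumption of this sketch, as are the Candidate shape and function names.

```typescript
type MemorySignal = "watched" | "liked" | "not-interested" | "wrong-mood";

// Hypothetical candidate shape, keyed by TMDB id.
type Candidate = { tmdbId: number; title: string };

function isExcluded(signal: MemorySignal | undefined, rewatchMode: boolean): boolean {
  // Negative signals always exclude a title.
  if (signal === "not-interested" || signal === "wrong-mood") return true;
  // Watched titles are excluded unless the user asked for rewatch picks.
  if (signal === "watched") return !rewatchMode;
  // Liked or unseen titles stay eligible.
  return false;
}

function filterByMemory(
  candidates: Candidate[],
  memory: ReadonlyMap<number, MemorySignal>,
  rewatchMode = false,
): Candidate[] {
  return candidates.filter(
    (candidate) => !isExcluded(memory.get(candidate.tmdbId), rewatchMode),
  );
}
```

Keeping the rule in one function makes it straightforward to explain to the user why a title was reused, since the caller knows rewatch mode was the reason a watched title passed the filter.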

TMDB and Catalog Expansion Track

Pivot recommendation retrieval toward TMDB-backed catalog coverage instead of relying primarily on the embedded/course-sized movie database.
Keep the local movies table as a cache/index of known titles, embeddings, localized names, poster URLs, and TMDB ids rather than the full source of truth.
Plan #607 before switching defaults from local-vector-first plus TMDB fallback to TMDB-first broad candidate generation plus local cache, enrichment, memory, and reranking.
Backfill TMDB ids for existing local movies using exact title/year matches first, then persist ambiguous or low-confidence matches for manual review.
Add cast, director, genre, and keyword metadata as first-class catalog data before implementing actor/director/genre search.
Move TMDB discovery, backfill, and metadata refresh into shared rate-limited BullMQ catalog workers in #492 before growing catalog volume. The worker enforces one configurable TMDB request budget across catalog-maintenance jobs, honors 429 with backoff, dedupes jobs by stable tmdbId/movieId keys, and exposes queue depth/failures in Bull Board.
Add the metadata v1 quality contract for recommendations: hot movie columns for identity/language/quality/popularity, normalized watch providers for US, FI, and RU, bounded TMDB details enrichment for top direct TMDB candidates, and catalog-health/eval checks for low-quality metadata.
Add a back-office review queue for ambiguous TMDB matches, missing posters, duplicate identities, and metadata conflicts in #493 before applying risky automatic merges.
Prefer TMDB ids for all cross-feature identity checks. Fall back to normalized title plus year only when TMDB identity is unavailable.
Design future discovery flows around dynamic TMDB candidate sets: "I have watched many films" deck mode, quiz-assisted mode, and later a preference/taste-training mode.
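
The identity rule above (TMDB id first, normalized title plus year as fallback) might look like the sketch below. The key format, the normalization steps, and the handling of a missing year are assumptions for illustration, not the catalog's actual contract.

```typescript
// Hypothetical input shape; tmdbId is absent for movies not yet matched.
type MovieIdentityInput = {
  tmdbId?: number | null;
  title: string;
  year?: number | null;
};

// Lowercases, strips diacritics, and collapses punctuation and whitespace.
// Unicode-aware so localized (for example Cyrillic) titles keep their letters.
function normalizeTitle(title: string): string {
  return title
    .normalize("NFKD")
    .replace(/[\u0300-\u036f]/g, "")
    .toLowerCase()
    .replace(/[^\p{L}\p{N}]+/gu, " ")
    .trim();
}

function movieIdentityKey(movie: MovieIdentityInput): string {
  if (typeof movie.tmdbId === "number") {
    return `tmdb:${movie.tmdbId}`;
  }
  // Fallback only when TMDB identity is unavailable.
  const year = movie.year ?? "unknown";
  return `title:${normalizeTitle(movie.title)}:${year}`;
}
```

Prefixing the key with its source keeps the two identity spaces from colliding and makes fallback matches easy to spot in logs or a review queue.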

Account Experience Track

Add magic-link style login options, with rate limits and email delivery observability.
Add social login/provider linking after the provider identity model is stable.
Add profile editing for display name, avatar, and basic preferences.
Add account achievements, taste progress, and gamified memory-building only after the underlying movie-memory signals are reliable.
Add feedback loops that explain how recommendation feedback changes future recommendations.

Recommendation Experience Track

Define Recommendation V2 in #610 around three independent axes: audience context, match depth, and candidate source strategy.
Treat quiz answers, swipe reactions, account memory, and result feedback as inputs into a shared taste-signal model.
Rework the guided quiz around "what do you want tonight?" instead of relying on a favorite movie, broad genre labels, and optional actor input.
Build #609 as the Fast Pick guided flow with minimal intent, hard avoids, and discovery appetite. The current flow separates audience selection from match depth, supports Solo/Duo/Group, and sends experienceMode: fast-pick.
Build #608 as the Normal mode flow with richer positive/negative signals, optional reference movies, and first-class Duo compromise handling. The first slices add Normal-mode hard avoids, carry those negative signals through the existing recommendation payload, and expose Duo as a separate two-person entry/results path.
Add an alternate taste-swipe mode for users who have watched many films and prefer to react to concrete movie cards instead of answering abstract questions.
Start #612 by carrying candidate source provenance through recommendation results, persistence, logs, eval reports, a source-strategy policy, and route/job/pipeline metadata before changing retrieval defaults.
Connect #612 to retrieval behavior in stages: bounded hybrid-fast/compromise-hybrid fallback first, then tmdb-first generation with hard-avoid/discovery-aware TMDB query shaping and source/metadata eval thresholds before making it the Normal quality default.
Add experienceMode as the product-facing selector for this policy layer, defaulting existing traffic to normal-match while letting Fast Pick requests choose fast-pick.
Move toward TMDB-first candidate generation: use TMDB for broad discovery and keep the local database as a cache/enrichment/reranking layer rather than the whole movie universe.
Keep TMDB ids as the preferred movie identity and log ambiguous title/year matches for later admin/back-office review.
See RECOMMENDATION-ROADMAP.md for the staged plan.
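
As a rough illustration of experienceMode acting as the selector for the policy layer, the sketch below defaults unknown or missing values to normal-match and maps each mode to a source strategy. The mode and strategy names come from this roadmap, but the specific mapping and the tmdbFirstEnabled flag are assumptions; the real policy is defined by #612.

```typescript
type ExperienceMode = "fast-pick" | "normal-match";
type SourceStrategy = "hybrid-fast" | "tmdb-first" | "local-vector-first";

// Existing traffic sends no experienceMode, so anything other than an
// explicit "fast-pick" resolves to "normal-match".
function resolveExperienceMode(raw: unknown): ExperienceMode {
  return raw === "fast-pick" ? "fast-pick" : "normal-match";
}

// Assumed mapping, for illustration only. `tmdbFirstEnabled` is a
// hypothetical flag that keeps the retrieval default unchanged until
// eval thresholds allow switching.
function selectSourceStrategy(
  mode: ExperienceMode,
  tmdbFirstEnabled: boolean,
): SourceStrategy {
  if (mode === "fast-pick") return "hybrid-fast";
  return tmdbFirstEnabled ? "tmdb-first" : "local-vector-first";
}
```

Separating mode resolution from strategy selection lets the product-facing selector stay stable while the retrieval default changes behind a flag.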

Group Recommendation Rooms Track

Keep #359 as the umbrella for the group-room milestone instead of treating it as a single implementation PR.
#467: build room persistence, TTL, cleanup, and participant storage first.
#468: add share links, participant join flow, and readiness state.
#469: run the recommendation pipeline from completed room answers and persist a stable shared result.
#470: add QR invite and projector mode only after the core room flow works.
Preserve the current same-device group mode until room-backed group mode is complete enough to replace it intentionally.

Across the current API surface, route handlers validate input, call orchestration, and map responses without owning business logic.
Domain orchestration does not depend on route-local module placement.
Infrastructure concerns stay behind clients, repositories, or queue adapters.
Shared exports remain intentional and do not hide ownership in cross-boundary code paths.

Recommendation Flow

A reusable recommendation pipeline exists.
Recommendation pipeline ownership is no longer tied to src/app/api/movie-recommendation.
Queueing, DB writes, and response mapping are separated cleanly from recommendation decision logic.
Similarity thresholds and related calibration docs point to the same source of truth.
Positive user memory (liked) influences ranking.
Quiz submit handoff no longer depends on route-local reset timing.

Documentation Alignment

README reflects the current apps/web/src structure.
Development docs reflect the current apps/, packages/, and services/ layout.
Service docs and architecture docs use the same terminology for boundaries and ownership.
Agent guidance exists in root AGENTS.md.

Non-Goals (For Now)

No large-scale restructuring just to make the repo look more "monorepo-like".
No large-scale file moves that mix structural and behavioral change.
No broad rewrite of working features.
No premature package splitting before clear ownership boundaries exist.