Compare commits
4 commits: 2e12e93dff...b3813ed6ac
| Author | SHA1 | Date |
|---|---|---|
| | b3813ed6ac | |
| | 8ba375caaa | |
| | 723ee00388 | |
| | c4a97793d5 | |
29 changed files with 2299 additions and 1 deletion
75 .gitignore vendored Normal file

@@ -0,0 +1,75 @@
```
# Secrets — NEVER commit
.env
.env.local
.env.*.local
*.key
*.pem
*.p12
*.pfx
secrets/
credentials/

# Backups and dumps
*.sql
*.sql.gz
*.tar.gz
*.tar.xz
backups/
dumps/

# Runtime data that shouldn't be in version control
data/
volumes/
storage/
acme.json

# Python
__pycache__/
*.py[cod]
*$py.class
*.so
.venv/
venv/
env/
.Python
pip-log.txt
.coverage
.pytest_cache/

# Node
node_modules/
npm-debug.log*
yarn-debug.log*
yarn-error.log*
dist/
build/
.next/
.astro/

# IDE / Editor
.vscode/
.idea/
*.swp
*.swo
*~
.DS_Store
Thumbs.db

# Logs
*.log
logs/

# OS
.DS_Store
.AppleDouble
.LSOverride
Thumbs.db
ehthumbs.db
Desktop.ini

# Temporary
*.tmp
*.bak
*.orig
.#*
#*#
```
99 Agent.md Normal file

@@ -0,0 +1,99 @@
# Agent Briefing — electric-horses-infra

You are an AI agent working on the Infrastructure-as-Code repository of Electric Horses / Richter & Zech.

## What is this repo?

**The versioned description of our Hetzner Cloud infrastructure.**
Everything we run on our servers (docker-compose stacks, configs, scripts) is mirrored here — without secrets. The repo is not a deploy mechanism (we don't pull automatically from Git onto the servers) but the **truth-as-of-yesterday**: a record of what is currently running.

If server A and this repo show different contents, either a commit is still pending OR the AI worked manually on the server without updating the repo — the latter is a bug and must be corrected.

## Before every action

1. **Read `README.md`** — the current rules
2. **Read `Agent.md`** in the affected stack folder (e.g. `stacks/forgejo/Agent.md`), if present
3. **Understand the current state** — `git log --oneline -20` shows the most recent actions

## Conventions

### Commit messages

- In German OR English, consistent within a commit
- Conventional prefix (feat / fix / docs / refactor / chore)
- Reference to OpenProject where applicable: `Refs OP#1234`
- Short and concrete; details go in the commit body

Examples:

```
feat(forgejo): add backup cron script

Refs OP#1119
```

```
fix(eh-search): tighten substring match to 3+ chars

Fixes the "ze" noise issue where 2-char queries matched
in unrelated words. Brand-prefix matches still work.

Refs OP#1112
```
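The 3-char guard mentioned in the second example can be sketched as a minimal illustration; the function and constant names here are hypothetical, the real logic lives in `app/search/fts.py`:

```python
# Hypothetical sketch of the guard described in the commit message above;
# the actual implementation in app/search/fts.py may differ.
MIN_SUBSTRING_LEN = 3

def should_try_substring(query: str) -> bool:
    # Skip the substring fallback for very short queries like "ze",
    # which would otherwise match inside unrelated words.
    return len(query.strip()) >= MIN_SUBSTRING_LEN
```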
### Secrets — NEVER commit

- `.env` is always gitignored
- `.env.example` with placeholders is OK
- If you need real credentials: they live on the server in `/opt/<stack>/.env`
- API tokens, passwords, certificates: **never** in the repo, not even in comments

### Docker Compose files

- Mirror the server state 1:1
- Image tags are pinned (not `:latest`) for reproducibility
- Environment variables always via `env_file` or `environment:` with `${VAR}` syntax — never hardcoded

### Docs

- Architecture descriptions in `docs/architecture/`
- ADRs in `docs/adr/<NNNN>-<slug>.md` with sequential numbering
- Runbooks in `docs/runbooks/<topic>.md`
- Markdown diagrams with Mermaid (renders in the Forgejo web UI)

## Connection to the real infrastructure

| Stack | Server | Path on server | This repo |
|---|---|---|---|
| forgejo | ai-apps (10.0.0.8) | `/opt/ai-apps/forgejo/` | `stacks/forgejo/` |
| eh-search | ai-apps (10.0.0.8) | `/opt/ai-apps/eh-search/` | `stacks/eh-search/` |
| (more will follow in future commits) | | | |

**Workflow for changes:**

1. AI makes the change on the server (adjust `docker-compose.yml`, restart)
2. AI commits the identical change to this repo
3. User reviews the commit in the Forgejo web UI
4. On problems: rollback via `git revert` + manual redeploy on the server

## Server access during work

- `ssh ai-apps` — Hetzner cx22, private 10.0.0.8, public 91.98.226.44
- `ssh pegasus` — Hetzner cx33, private 10.0.0.10 (no public IPv4)
- `ssh webmin-kronos` — Hetzner cx32 with sudo, DNS + Apache proxy
- `ssh authentik-sso` — Hetzner, private 10.0.0.7, SSO

Local docs with credentials: `/Users/benjaminweinlich/Library/Mobile Documents/com~apple~CloudDocs/AI/IT & Infrastruktur/` (not in this repo — the repo is public!)

## This repo is PUBLIC

That means:

- Commit **nothing** that needs protection (no passwords, internal customer data, or vehicle VINs other than publicly documented sample data)
- The README + ADRs may well be "marketing-ready" — they show the outside world how we think and build
- For more sensitive details, use a private repo (possibly `electric-horses-infra-private` in the future)

## OpenProject

- **Project:** www.electric-horses.de Relaunch (ID 33)
- **Relevant phases:**
  - M6: Site Search (completed)
  - M7: Operations & Documentation Foundation
    - M7.1: Forgejo Deployment (#1119) — completed
    - M7.2: First infra repo (this repo!)
    - M7.3+: mirror ADRs/runbooks, mirror to GitHub, etc.

## Related External Docs

Outside this repo, documentation lives in:

- `/Users/benjaminweinlich/Library/Mobile Documents/com~apple~CloudDocs/AI/IT & Infrastruktur/`
  - `Forgejo/` — ADRs + runbooks for the Forgejo stack (to be mirrored here in M7.3)
  - `Authentik/howto-oauth2-provider.md` — template for OIDC integrations
  - `Hetzner/ai-apps-stacks.md` — cloud inventory
53 README.md

@@ -1,3 +1,54 @@
# electric-horses-infra

**Infrastructure as Code** for Electric Horses / Richter & Zech.

This repository contains the versioned Docker Compose stacks, reverse-proxy configs, architecture documentation, and Architecture Decision Records (ADRs) for our Hetzner Cloud infrastructure.

## What lives here?

```
electric-horses-infra/
├── Agent.md            # AI briefing for every session
├── README.md           # this file
├── .gitignore          # what does NOT get committed
│
├── stacks/             # Docker Compose stacks (1:1 what runs on the servers)
│   ├── forgejo/        # code.sdda.eu — this Git hosting
│   └── eh-search/      # electric-horses.de site search
│
└── docs/               # documentation
    └── architecture/   # topology snapshots, stack inventories
```

**Convention:** Every stack folder mirrors the stack on the actual server — but without secrets. The `.env` with real values lives on the server (mode 600); only the `.env.example` template lives here.

## What belongs IN here

- `docker-compose.yml` files
- `Dockerfile`s
- Nginx configs
- Python / shell scripts that are part of the infrastructure
- SQL migrations / schema dumps (not database contents!)
- `.env.example` with placeholders
- Markdown documentation, diagrams (Mermaid), ADRs, runbooks

## What does NOT belong here

- **Never** real `.env` files, passwords, API keys, certificates
- Runtime data (database contents, Git repos inside the Git repo, uploaded files)
- Build artifacts (`node_modules/`, `dist/`, `__pycache__/`)
- Logs
- Backup dumps with user data (GDPR)

When in doubt, leave it out. It is easier to add a missing file later than to remove a secret from the Git history.

## Contributing

**You as a human:** You write wishes and feedback in Claude Code / OpenProject and review commits in the web UI (code.sdda.eu).

**The AI:** Reads `Agent.md` at session start, makes changes to existing stacks, and commits with references to OpenProject work packages (`Refs OP#1234`).

## OpenProject reference

- **Parent phase:** M7 Operations & Documentation Foundation (#1118)
- **Repo setup:** M7.2 (this commit)
- Look it up: https://openproject.sdda.eu/projects/www-dot-electric-horses-dot-de-relaunch/
79 docs/architecture/ai-apps-stacks.md Normal file

@@ -0,0 +1,79 @@
# ai-apps — Service Stack Inventory

**As of:** 2026-04-11
**Host:** Hetzner cx22 (8 GB RAM, 2 shared vCPUs, 75 GB disk)
**Networks:** public 91.98.226.44, private 10.0.0.8
**OS:** Debian / Docker 29.x + Compose v5
**Swap:** 2 GB (/swapfile in fstab)

## Service overview

| Stack | Containers | Exposed | Domain(s) | DB | Memory (peak) |
|---|---|---|---|---|---|
| **shared** | traefik, portainer | 80, 443 | `traefik.sdda.eu`, `containers.sdda.eu` | — | ~200 MB |
| **n8n** | n8n, n8n-db | — (via Traefik) | `n8n.sdda.eu` | Postgres 16 | ~400 MB |
| **locosoft-hilfe-system** | locosoft-backend, locosoft-frontend, locosoft-db, locosoft-redis, chromadb | — | `locosoft.sdda.eu`, `chromadb.sdda.eu` | Postgres 16 + Redis + ChromaDB | ~1.5 GB |
| **eh-search** | eh-search, eh-search-redis | 10.0.0.8:8200 (private) | via Pegasus nginx `/api/search` | Redis | ~250 MB |
| **loco-replica** | loco-replica-db | 10.0.0.8:5433 (private) | — | Postgres 17 | ~300 MB |
| **electric-horses-audits** | eh-audits | — (n8n-triggered) | — | — | ~200 MB |
| **forgejo** ★NEW★ | forgejo, forgejo-db | 222 (SSH), via Traefik | **`code.sdda.eu`** | Postgres 16 | ~800 MB |

**Total estimate:** ~3.5 GB peak; measured average ~1.9 GB + cache.

## Traefik (shared)

- **Version:** 3.6.2
- **ACME resolver:** `letsencrypt` (HTTP challenge, email `info@sdda.eu`)
- **Dynamic config:** `/opt/shared/traefik/dynamic/` (file provider for middlewares: `authentik.yml`, `authentik-outpost.yml`)
- **Docker network:** `traefik-public` (external; all web services attach to it)
- **ACME store:** `/opt/shared/traefik/acme.json`

## Authentik integration

- **Authentik server:** separate host `authentik-sso` (138.199.202.82 / 10.0.0.7), **NOT** on ai-apps
- **ForwardAuth apps** (existing): `locosoft-forward-auth`, `n8n-forward-auth` (both proxy providers, embedded outpost)
- **Native OIDC apps** (new as of 2026-04-11): `forgejo` (OAuth2/OpenID provider, see `../Forgejo/adr/0003-*.md`)

## Stack organisation

Every stack lives in its own directory under `/opt/ai-apps/<stack>/`:

```
/opt/ai-apps/
├── electric-horses-audits/
├── eh-search/
├── forgejo/              ← NEW 2026-04-11
├── locosoft-hilfe-system/
├── loco-replica/
├── mobile-refdata/       ← scripts only
└── n8n/

/opt/shared/              ← Traefik + Portainer
/opt/backups/             ← centralised backup target
```

**Pattern:** Every stack has its own `docker-compose.yml` and `.env` and its own internal networks; the shared `traefik-public` network is used for web exposure.

## Resource assessment (2026-04-11)

- **RAM:** 7.6 GB total, 1.9 GB actually in use, 5.7 GB available (incl. cache), 250 MB free, 2 GB swap active
- **Disk:** 20 GB free of 75 GB (73% used)
- **Load:** ~0.1 (idle)
- **Verdict:** Forgejo still fits, but headroom for future services (e.g. Forgejo Actions, monitoring, semantic search index) is limited. At >85% RAM utilisation: upgrade to cx32 (16 GB, +4 €/month).

## DNS entries under `sdda.eu` pointing at ai-apps (91.98.226.44)

- `ai.sdda.eu` (alias)
- `containers.sdda.eu` (Portainer)
- `traefik.sdda.eu` (Traefik dashboard)
- `chromadb.sdda.eu`
- `locosoft.sdda.eu`
- `n8n.sdda.eu`
- **`code.sdda.eu`** ← Forgejo (NEW)

## Backups (as of 2026-04-11)

- **forgejo:** cron 03:00 → `/opt/backups/forgejo/`, 14-day retention (since M7.1)
- **other stacks:** no central backup infrastructure. Hetzner Cloud snapshots taken manually. Rclone → Nextcloud not set up yet (planned for M7.5).

## Access

- **SSH:** `ssh ai-apps` (user root, key-based)
- **Portainer:** https://containers.sdda.eu (admin)
- **Traefik dashboard:** https://traefik.sdda.eu

## OpenProject reference

- **M7.1** (#1119) — Forgejo Deployment: https://openproject.sdda.eu/work_packages/1119
- **M7** (#1118) — Operations & Documentation Foundation (parent phase)
28 stacks/eh-search/.env.example Normal file

@@ -0,0 +1,28 @@
```
# eh-search — Environment template
# Copy to .env and fill in actual values. NEVER commit .env to Git.

ENVIRONMENT=dev

# Postgres (eh_vehicles via loco-replica-db on host private network)
DB_HOST=10.0.0.8
DB_PORT=5433
DB_NAME=eh_vehicles
DB_USER=search_read
DB_PASSWORD=<see password manager / note in sql/01_init.sql>

# Redis (intra-stack, no password)
REDIS_HOST=eh-search-redis
REDIS_PORT=6379
REDIS_DB=0

# Cache TTLs (seconds)
CACHE_TTL_RESULT=60
CACHE_TTL_SUGGEST=600
CACHE_TTL_EMPTY=300

# Directus (for slug resolution)
DIRECTUS_URL=http://10.0.0.10:8055
DIRECTUS_SLUG_REFRESH_SECONDS=300

# CORS (empty when same-origin via nginx on Pegasus)
CORS_ORIGINS=
```
65 stacks/eh-search/Agent.md Normal file

@@ -0,0 +1,65 @@
# Agent Briefing — eh-search Stack

You are working on the site-search service of electric-horses.de. First read `../../Agent.md` at the repo root for the global conventions.

## What is eh-search?

A FastAPI-based search service for the Electric Horses website. Supports:

- **Exact lookup** via commission number (e.g. `D9094`) and DVN (e.g. `9094`)
- **Full-text search** (Postgres `ts_vector` with a `german_unaccent` dictionary)
- **Fuzzy search** (`pg_trgm` trigram matching)
- **Multi-source federation:** vehicles + blog posts + brands + pages + static pages + tag bridge
- **Cmd+K command palette** in the Astro frontend ("opens on keypress", silent-SSO style)

## Live deployment

- **Server:** ai-apps (Hetzner cx22, private 10.0.0.8)
- **Path:** `/opt/ai-apps/eh-search/`
- **Binding:** ONLY `10.0.0.8:8200` (private), NOT public. Reached through Pegasus nginx via `/api/search`.
- **Frontend integration:** Astro command palette on `electric-horses.de` + `dev.electric-horses.de`
- **Content hosting:** The search index uses the Postgres DB `eh_vehicles` (schema in `sql/`), an in-memory content index (pulled from Directus), and hardcoded static pages + tag bridge.

## Architecture pointers

- **Data source for vehicles:** PostgreSQL `eh_vehicles` on loco-replica-db (private Postgres 17 container)
- **Data source for blog/brands/pages:** Directus REST API on Pegasus (`10.0.0.10:8055`)
- **Cache:** Redis (dedicated within the stack)
- **Intent routing:** regex-based, see `app/intent_router.py`
- **Two commission-number concepts in Loco-Soft:** (a) customer-facing `{type_letter}{dvn}`, e.g. `D9094`; (b) the internal vehicle number. **Only (a) is ever shown to the customer.** See memory `project_eh_search_phase1.md`.
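The regex-based routing described above can be sketched roughly as follows; the patterns and intent names here are illustrative assumptions, the actual rules live in `app/intent_router.py`:

```python
import re

# Hedged sketch of regex-based intent routing. These patterns are
# illustrative assumptions, not the actual implementation.
KOMM_NR = re.compile(r"[A-Z]\d{4}")   # customer-facing, e.g. D9094
DVN = re.compile(r"\d{4}")            # bare DVN, e.g. 9094

def route_intent(query: str) -> str:
    q = query.strip().upper()
    if KOMM_NR.fullmatch(q):
        return "komm_nr"
    if DVN.fullmatch(q):
        return "dvn"
    return "keyword_search"
```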
## Critical files

- `app/main.py` — FastAPI entry + endpoint definitions
- `app/intent_router.py` — query → intent (komm_nr, dvn, keyword_search, ...)
- `app/search/exact.py` — exact lookup (commission number, DVN, VIN)
- `app/search/fts.py` — FTS + fuzzy + substring fallback + hybrid
- `app/search/static_pages.py` — static page registry + tag→page bridge (~60 mappings)
- `app/content_index.py` — in-memory index from Directus (blog, brands, pages)
- `app/schemas.py` — Pydantic response models
- `sql/01_init.sql` — initial Postgres setup (pg_trgm, MView, search_read user)
- `sql/02_komm_nr_fix.sql` — commission-number data model correction

## Ops commands

```bash
ssh ai-apps
cd /opt/ai-apps/eh-search

# Rebuild & restart after a code change
docker compose up -d --build eh-search

# Invalidate the cache
curl -X POST http://10.0.0.8:8200/cache/invalidate

# Health
curl -s http://10.0.0.8:8200/health | jq .

# Logs
docker logs eh-search --tail 30
```

## OpenProject

- **M6** — Site Search Phase 1 (completed)
- **M6.2** — Federated Multi-Source (completed)
- **M6.3** — UX Refinement (completed)
- **M6.4** — Commission-number data model fix (completed)
- **M6.5** — Sync fix for deactivations (completed)

## Related

- Memory: `project_eh_search_phase1.md` — full history and gotchas
- `../../Agent.md` — repo-wide briefing
23 stacks/eh-search/Dockerfile Normal file

@@ -0,0 +1,23 @@
```dockerfile
FROM python:3.12-slim

WORKDIR /app

# System deps (minimal)
RUN apt-get update && apt-get install -y --no-install-recommends \
    curl \
    && rm -rf /var/lib/apt/lists/*

# Python deps
COPY requirements.txt .
RUN pip install --no-cache-dir -r requirements.txt

# App code
COPY app/ ./app/

# Healthcheck
HEALTHCHECK --interval=30s --timeout=5s --start-period=10s --retries=3 \
    CMD curl -fsS http://127.0.0.1:8200/health || exit 1

EXPOSE 8200

CMD ["uvicorn", "app.main:app", "--host", "0.0.0.0", "--port", "8200", "--workers", "2", "--access-log"]
```
63 stacks/eh-search/README.md Normal file

@@ -0,0 +1,63 @@
# eh-search — Site search for electric-horses.de

**Live:** via nginx proxy at `https://www.electric-horses.de/api/search?q=...`
**Internal address:** `http://10.0.0.8:8200` (private, not public!)
**Stack path:** `/opt/ai-apps/eh-search/` on ai-apps

## What does it do?

A FastAPI service that feeds a **Cmd+K command palette** on the Electric Horses website with search results. It searches:

- 75 vehicles (Postgres FTS + fuzzy + commission-number/DVN exact lookup)
- 28 blog posts (with tags + new SEO descriptions)
- 24 brands
- 6 legal pages (Impressum, Datenschutz, ...)
- 8 static pages (Werkstatt, Ersatzteile, Kontakt, ...)
- 60 tag → page bridges for semantic connections

**Killer feature:** Typing a 4-digit commission number (or `D9094`, etc.) → direct redirect to the vehicle detail page. Latency <50 ms. Ideal for staff on the phone.

## Architecture

```
Browser Cmd+K
      ↓
www.electric-horses.de/api/search
      ↓  (nginx proxy on Pegasus)
10.0.0.8:8200 (eh-search on ai-apps, private)
      ↓
      ├─ Postgres eh_vehicles (loco-replica-db container, port 5433)
      ├─ Redis cache (stack-internal)
      ├─ Directus REST API (in-memory content index)
      └─ Static page registry (hardcoded in Python)
```

## Security

- **Binds ONLY to the private network** (10.0.0.8:8200) — not reachable via the public IP
- **Read-only DB user** (`search_read`) — can only SELECT from the materialized view
- **Same-origin via Pegasus nginx** — no CORS needed, no new cert, no new DNS
- **Rate limit** via nginx (30 req/s per IP)

## Files

- `docker-compose.yml` — stack
- `Dockerfile` — Python 3.12 + FastAPI + asyncpg + redis-py
- `requirements.txt` — Python deps
- `.env.example` — template for `.env` (never commit the real `.env`!)
- `app/` — FastAPI application
- `sql/` — Postgres schema and migrations (search_vehicles materialized view, pg_trgm, unaccent)
- `Agent.md` — AI briefing
- `README.md` — this file

## Live-update workflow

When vehicle data changes (vehicle_sync.py runs nightly on Pegasus), the end of the sync automatically refreshes the materialized view AND invalidates the Redis cache. Directus edits to blog/brands/pages fire a webhook at eh-search → immediate cache invalidation and content-index refresh.

## OpenProject phases (all completed)

- **M6** — Postgres FTS + fuzzy + exact + command palette (Phase 1)
- **M6.2** — Federated multi-source (Phase 1.5)
- **M6.3** — UX refinement (sleek, mobile-first, smart group ordering)
- **M6.4** — Commission-number data model fix (correct D9094 instead of the wrong internal number)
- **M6.5** — Sync fix for deactivated vehicles

## Out of scope (possible Phase 2)

- Semantic vector search via Qdrant + bge embeddings
- Voice input via Whisper
- LLM filter extraction via local Mistral/Gemma
- Image search / OCR
- Cross-encoder reranking
0 stacks/eh-search/app/__init__.py Normal file
80 stacks/eh-search/app/cache.py Normal file

@@ -0,0 +1,80 @@
```python
"""Redis cache wrapper with namespaced keys."""
import hashlib
import json
from typing import Any

import redis.asyncio as redis_async

from app.config import settings

_redis: redis_async.Redis | None = None
NAMESPACE = "search:"


async def init_redis() -> None:
    global _redis
    _redis = redis_async.Redis(
        host=settings.redis_host,
        port=settings.redis_port,
        db=settings.redis_db,
        decode_responses=True,
        socket_timeout=2,
    )
    await _redis.ping()


async def close_redis() -> None:
    global _redis
    if _redis is not None:
        await _redis.close()
        _redis = None


def get_redis() -> redis_async.Redis:
    if _redis is None:
        raise RuntimeError("Redis not initialized")
    return _redis


def _hash(value: str) -> str:
    return hashlib.sha1(value.encode("utf-8")).hexdigest()[:16]


def key_query(query: str) -> str:
    return f"{NAMESPACE}q:{_hash(query.lower())}"


def key_suggest(prefix: str) -> str:
    return f"{NAMESPACE}suggest:{_hash(prefix.lower())}"


def key_empty(query: str) -> str:
    return f"{NAMESPACE}empty:{_hash(query.lower())}"


def key_top_brands() -> str:
    return f"{NAMESPACE}top:brands"


async def get_json(key: str) -> Any | None:
    data = await get_redis().get(key)
    if data is None:
        return None
    try:
        return json.loads(data)
    except (json.JSONDecodeError, TypeError):
        return None


async def set_json(key: str, value: Any, ttl: int) -> None:
    await get_redis().set(key, json.dumps(value, default=str), ex=ttl)


async def invalidate_all() -> int:
    """Delete all keys in our namespace. Returns count deleted."""
    r = get_redis()
    count = 0
    async for key in r.scan_iter(match=f"{NAMESPACE}*", count=200):
        await r.delete(key)
        count += 1
    return count
```
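The key helpers in `app/cache.py` are pure string functions, so the scheme can be exercised without a Redis connection; a quick stdlib-only re-derivation of `_hash` and `key_query` shows the keys are deterministic and case-insensitive:

```python
import hashlib

NAMESPACE = "search:"

def _hash(value: str) -> str:
    # Same derivation as in app/cache.py: first 16 hex chars of SHA-1.
    return hashlib.sha1(value.encode("utf-8")).hexdigest()[:16]

def key_query(query: str) -> str:
    return f"{NAMESPACE}q:{_hash(query.lower())}"

# "Tesla" and "tesla" map to the same key, so they share a cache entry.
```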
46 stacks/eh-search/app/config.py Normal file

@@ -0,0 +1,46 @@
```python
"""Configuration via environment variables (Pydantic Settings)."""
from pydantic_settings import BaseSettings, SettingsConfigDict


class Settings(BaseSettings):
    model_config = SettingsConfigDict(env_file=".env", extra="ignore")

    environment: str = "dev"

    # Postgres
    db_host: str
    db_port: int = 5433
    db_name: str = "eh_vehicles"
    db_user: str = "search_read"
    db_password: str

    # Redis
    redis_host: str = "eh-search-redis"
    redis_port: int = 6379
    redis_db: int = 0

    # Cache TTLs
    cache_ttl_result: int = 60
    cache_ttl_suggest: int = 600
    cache_ttl_empty: int = 300

    # Directus
    directus_url: str = "http://10.0.0.10:8055"
    directus_slug_refresh_seconds: int = 300

    # CORS
    cors_origins: str = ""

    @property
    def dsn(self) -> str:
        return (
            f"postgresql://{self.db_user}:{self.db_password}"
            f"@{self.db_host}:{self.db_port}/{self.db_name}"
        )

    @property
    def cors_origin_list(self) -> list[str]:
        return [o.strip() for o in self.cors_origins.split(",") if o.strip()]


settings = Settings()
```
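The `dsn` property assembles a standard libpq-style URL from the individual settings. A stdlib-only stand-in (hypothetical helper; example values taken from the `.env.example` defaults, with a made-up password):

```python
# Stand-in for Settings.dsn; mirrors the f-string in app/config.py.
def build_dsn(user: str, password: str, host: str, port: int, name: str) -> str:
    return f"postgresql://{user}:{password}@{host}:{port}/{name}"

# e.g. build_dsn("search_read", "secret", "10.0.0.8", 5433, "eh_vehicles")
```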
305 stacks/eh-search/app/content_index.py Normal file

@@ -0,0 +1,305 @@
```python
"""In-memory content index for blog posts, brands, legal pages.

Loaded from Directus on startup, refreshed every 5 min in background,
and on POST /cache/invalidate. Total dataset is small (~90 items),
fits comfortably in RAM.
"""
import asyncio
import logging
import time
from dataclasses import dataclass, field
from typing import Any

import httpx

from app.config import settings

log = logging.getLogger(__name__)


@dataclass
class BlogPost:
    id: int
    title: str
    slug: str
    excerpt: str = ""
    seo_description: str = ""
    tags: list[str] = field(default_factory=list)
    category: str = ""
    published_at: str | None = None
    image_id: str | None = None


@dataclass
class Brand:
    id: int
    name: str
    slug: str
    short_description: str = ""
    description: str = ""
    logo_id: str | None = None
    vehicle_count: int = 0


@dataclass
class LegalPage:
    id: int
    title: str
    slug: str
    seo_description: str = ""


# Mutable global state — protected by _lock for refresh atomicity
_blog_posts: list[BlogPost] = []
_brands: list[Brand] = []
_legal_pages: list[LegalPage] = []
_last_refresh: float = 0.0
_lock = asyncio.Lock()

# Slugs of pages that are NOT really top-level static pages but live in
# the Directus pages collection. We treat them as legal/info pages.
LEGAL_PAGE_SLUGS = {
    "impressum", "datenschutz", "agb",
    "barrierefreiheit", "batterie-entsorgung", "widerruf",
}


async def _fetch(
    client: httpx.AsyncClient,
    collection: str,
    fields: str,
    filter_field: str = "status",
    filter_value: str = "published",
) -> list[dict]:
    url = f"{settings.directus_url}/items/{collection}"
    params = {
        "fields": fields,
        "limit": -1,
    }
    if filter_field:
        params[f"filter[{filter_field}][_eq]"] = filter_value
    try:
        resp = await client.get(url, params=params)
        resp.raise_for_status()
        return resp.json().get("data", [])
    except Exception as e:
        log.warning("Failed to fetch %s: %s", collection, e)
        return []


async def refresh() -> dict:
    """Pull all collections from Directus and rebuild the in-memory index."""
    global _blog_posts, _brands, _legal_pages, _last_refresh

    async with httpx.AsyncClient(timeout=15) as client:
        blog_data, brand_data, page_data = await asyncio.gather(
            _fetch(client, "blog_posts",
                   "id,title,slug,excerpt,seo_description,tags,category,published_at,image"),
            _fetch(client, "brands",
                   "id,name,slug,short_description,description,logo",
                   filter_field="is_active", filter_value="true"),
            _fetch(client, "pages",
                   "id,title,slug,seo_description"),
        )

    new_blog = [
        BlogPost(
            id=int(p["id"]),
            title=p.get("title") or "",
            slug=p.get("slug") or "",
            excerpt=p.get("excerpt") or "",
            seo_description=p.get("seo_description") or "",
            tags=[t for t in (p.get("tags") or []) if isinstance(t, str)],
            category=p.get("category") or "",
            published_at=p.get("published_at"),
            image_id=p.get("image"),
        )
        for p in blog_data
        if p.get("slug")
    ]

    new_brands = [
        Brand(
            id=int(b["id"]),
            name=b.get("name") or "",
            slug=b.get("slug") or "",
            short_description=b.get("short_description") or "",
            description=b.get("description") or "",
            logo_id=b.get("logo"),
        )
        for b in brand_data
        if b.get("slug")
    ]

    new_legal = [
        LegalPage(
            id=int(p["id"]),
            title=p.get("title") or "",
            slug=p.get("slug") or "",
            seo_description=p.get("seo_description") or "",
        )
        for p in page_data
        if p.get("slug") and p["slug"] in LEGAL_PAGE_SLUGS
    ]

    async with _lock:
        _blog_posts = new_blog
        _brands = new_brands
        _legal_pages = new_legal
        _last_refresh = time.time()

    counts = {"blog": len(new_blog), "brands": len(new_brands), "legal": len(new_legal)}
```
|
||||||
|
log.info("Content index refreshed: %s", counts)
|
||||||
|
return counts
|
||||||
|
|
||||||
|
|
||||||
|
async def background_refresher() -> None:
|
||||||
|
while True:
|
||||||
|
await asyncio.sleep(settings.directus_slug_refresh_seconds)
|
||||||
|
try:
|
||||||
|
await refresh()
|
||||||
|
except Exception as e:
|
||||||
|
log.warning("Background content refresh error: %s", e)
|
||||||
|
|
||||||
|
|
||||||
|
def _score_blog(post: BlogPost, q: str) -> float:
|
||||||
|
"""Weighted match score for a blog post against query q (lowercased)."""
|
||||||
|
score = 0.0
|
||||||
|
title_l = post.title.lower()
|
||||||
|
excerpt_l = post.excerpt.lower()
|
||||||
|
seo_l = post.seo_description.lower()
|
||||||
|
cat_l = post.category.lower()
|
||||||
|
|
||||||
|
# Title weights highest (substring requires len >= 3 to avoid 2-char noise)
|
||||||
|
if title_l == q:
|
||||||
|
score += 10
|
||||||
|
elif title_l.startswith(q):
|
||||||
|
score += 6
|
||||||
|
elif len(q) >= 3 and q in title_l:
|
||||||
|
score += 4
|
||||||
|
|
||||||
|
# Tags (real curated keywords)
|
||||||
|
for tag in post.tags:
|
||||||
|
tl = tag.lower()
|
||||||
|
if tl == q:
|
||||||
|
score += 5
|
||||||
|
break
|
||||||
|
elif tl.startswith(q) or (len(q) >= 3 and q in tl):
|
||||||
|
score += 2
|
||||||
|
break
|
||||||
|
|
||||||
|
# Category
|
||||||
|
if q in cat_l:
|
||||||
|
score += 1
|
||||||
|
|
||||||
|
# Excerpt + seo_description (lighter)
|
||||||
|
if q in excerpt_l:
|
||||||
|
score += 1
|
||||||
|
if q in seo_l:
|
||||||
|
score += 0.5
|
||||||
|
|
||||||
|
return score
|
||||||
|
|
||||||
|
|
||||||
|
def _score_brand(brand: Brand, q: str) -> float:
|
||||||
|
score = 0.0
|
||||||
|
name_l = brand.name.lower()
|
||||||
|
if name_l == q:
|
||||||
|
score += 15 # Brand exact match — dominant signal
|
||||||
|
elif name_l.startswith(q):
|
||||||
|
score += 10 # Prefix should beat any page substring match
|
||||||
|
elif len(q) >= 3 and q in name_l:
|
||||||
|
score += 5
|
||||||
|
if len(q) >= 3 and q in (brand.short_description or "").lower():
|
||||||
|
score += 1
|
||||||
|
if len(q) >= 3 and q in (brand.description or "").lower():
|
||||||
|
score += 0.5
|
||||||
|
return score
|
||||||
|
|
||||||
|
|
||||||
|
def _score_legal(page: LegalPage, q: str) -> float:
|
||||||
|
score = 0.0
|
||||||
|
title_l = page.title.lower()
|
||||||
|
slug_l = page.slug.lower()
|
||||||
|
if title_l == q or slug_l == q:
|
||||||
|
score += 10
|
||||||
|
elif title_l.startswith(q) or slug_l.startswith(q):
|
||||||
|
score += 7
|
||||||
|
elif len(q) >= 3 and (q in title_l or q in slug_l):
|
||||||
|
score += 4
|
||||||
|
if len(q) >= 3 and q in (page.seo_description or "").lower():
|
||||||
|
score += 1
|
||||||
|
return score
|
||||||
|
|
||||||
|
|
||||||
|
def search_blog(query: str, limit: int = 5) -> list[dict]:
|
||||||
|
q = query.strip().lower()
|
||||||
|
if not q:
|
||||||
|
return []
|
||||||
|
scored = [(p, _score_blog(p, q)) for p in _blog_posts]
|
||||||
|
scored = [(p, s) for p, s in scored if s > 0]
|
||||||
|
scored.sort(key=lambda x: -x[1])
|
||||||
|
return [
|
||||||
|
{
|
||||||
|
"type": "blog",
|
||||||
|
"title": p.title,
|
||||||
|
"slug": f"/blog/{p.slug}",
|
||||||
|
"snippet": (p.seo_description or p.excerpt or "")[:180],
|
||||||
|
"tags": p.tags,
|
||||||
|
"category": p.category,
|
||||||
|
"published_at": p.published_at,
|
||||||
|
"image_id": p.image_id,
|
||||||
|
"score": s,
|
||||||
|
"matched_via": "blog",
|
||||||
|
}
|
||||||
|
for p, s in scored[:limit]
|
||||||
|
]
|
||||||
|
|
||||||
|
|
||||||
|
def search_brands(query: str, limit: int = 5) -> list[dict]:
|
||||||
|
q = query.strip().lower()
|
||||||
|
if not q:
|
||||||
|
return []
|
||||||
|
scored = [(b, _score_brand(b, q)) for b in _brands]
|
||||||
|
scored = [(b, s) for b, s in scored if s > 0]
|
||||||
|
scored.sort(key=lambda x: -x[1])
|
||||||
|
return [
|
||||||
|
{
|
||||||
|
"type": "brand",
|
||||||
|
"title": b.name,
|
||||||
|
"slug": f"/marken/{b.slug}",
|
||||||
|
"snippet": (b.short_description or "")[:180],
|
||||||
|
"logo_id": b.logo_id,
|
||||||
|
"score": s,
|
||||||
|
"matched_via": "brand",
|
||||||
|
}
|
||||||
|
for b, s in scored[:limit]
|
||||||
|
]
|
||||||
|
|
||||||
|
|
||||||
|
def search_legal(query: str, limit: int = 5) -> list[dict]:
|
||||||
|
q = query.strip().lower()
|
||||||
|
if not q:
|
||||||
|
return []
|
||||||
|
scored = [(p, _score_legal(p, q)) for p in _legal_pages]
|
||||||
|
scored = [(p, s) for p, s in scored if s > 0]
|
||||||
|
scored.sort(key=lambda x: -x[1])
|
||||||
|
return [
|
||||||
|
{
|
||||||
|
"type": "page",
|
||||||
|
"title": p.title,
|
||||||
|
"slug": f"/{p.slug}",
|
||||||
|
"snippet": (p.seo_description or "")[:180],
|
||||||
|
"score": s,
|
||||||
|
"matched_via": "legal",
|
||||||
|
}
|
||||||
|
for p, s in scored[:limit]
|
||||||
|
]
|
||||||
|
|
||||||
|
|
||||||
|
def get_blog_posts_with_tag(tag: str) -> list[BlogPost]:
|
||||||
|
tag_l = tag.lower()
|
||||||
|
return [p for p in _blog_posts if any(t.lower() == tag_l for t in p.tags)]
|
||||||
|
|
||||||
|
|
||||||
|
def stats() -> dict:
|
||||||
|
return {
|
||||||
|
"blog": len(_blog_posts),
|
||||||
|
"brands": len(_brands),
|
||||||
|
"legal": len(_legal_pages),
|
||||||
|
"last_refresh_age_s": int(time.time() - _last_refresh) if _last_refresh else None,
|
||||||
|
}
|
||||||
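The weighted scorers above can be exercised in isolation. A minimal sketch (with a hypothetical post, not one from the real index; `Post` and `score` are simplified stand-ins for `BlogPost` and `_score_blog`) shows how title, tag, and excerpt matches add up:

```python
from dataclasses import dataclass, field


@dataclass
class Post:  # hypothetical stand-in for BlogPost
    title: str
    tags: list[str] = field(default_factory=list)
    excerpt: str = ""


def score(post: Post, q: str) -> float:
    """Same weighting idea as _score_blog: exact title 10, prefix 6,
    substring (len >= 3) 4; first matching tag 5 (exact) or 2; excerpt 1."""
    s = 0.0
    t = post.title.lower()
    if t == q:
        s += 10
    elif t.startswith(q):
        s += 6
    elif len(q) >= 3 and q in t:
        s += 4
    for tag in post.tags:
        tl = tag.lower()
        if tl == q:
            s += 5
            break
        elif tl.startswith(q) or (len(q) >= 3 and q in tl):
            s += 2
            break
    if q in post.excerpt.lower():
        s += 1
    return s


p = Post("Wallbox Förderung 2024", tags=["wallbox", "laden"], excerpt="Alles zur Wallbox.")
print(score(p, "wallbox"))  # title prefix (6) + exact tag (5) + excerpt (1) = 12.0
```

The early `break` in the tag loop means only the best single tag contributes, which keeps heavily tagged posts from dominating the ranking.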
43 stacks/eh-search/app/db.py Normal file
@@ -0,0 +1,43 @@
"""Async Postgres pool."""
import asyncpg
from app.config import settings

_pool: asyncpg.Pool | None = None


async def init_pool() -> None:
    global _pool
    _pool = await asyncpg.create_pool(
        dsn=settings.dsn,
        min_size=2,
        max_size=10,
        command_timeout=5,
    )


async def close_pool() -> None:
    global _pool
    if _pool is not None:
        await _pool.close()
        _pool = None


def get_pool() -> asyncpg.Pool:
    if _pool is None:
        raise RuntimeError("DB pool not initialized")
    return _pool


async def fetch(query: str, *args) -> list[asyncpg.Record]:
    async with get_pool().acquire() as conn:
        return await conn.fetch(query, *args)


async def fetchrow(query: str, *args) -> asyncpg.Record | None:
    async with get_pool().acquire() as conn:
        return await conn.fetchrow(query, *args)


async def fetchval(query: str, *args):
    async with get_pool().acquire() as conn:
        return await conn.fetchval(query, *args)
82 stacks/eh-search/app/intent_router.py Normal file
@@ -0,0 +1,82 @@
"""Intent routing — pure regex, no AI, sub-millisecond.

Loco-Soft Komm-Nr format:
    type letter + 4-6 digit number, optionally with a space in between.
    Type letter: N = Neu (new), T = Tageszul., V = Vorfuehr.,
    D = Differenzbest., G = Gebraucht (used), L = Leihgabe (loaner).
    Examples: 'D9094', 'd9094', 'D 9094', 'n8093', 'L9083'

Plain 4-6 digit numbers are treated as a DVN lookup (the DVN is unique).
"""
import re
from dataclasses import dataclass
from typing import Literal


# Komm-Nr: type letter + DVN (with or without a space)
KOMM_NR_RE = re.compile(r"^[NTVDGLntvdgl]\s*\d{4,6}$")
# DVN alone: plain 4-6 digit number
DVN_RE = re.compile(r"^\d{4,6}$")
# VIN: 17 chars, no I/O/Q
VIN_RE = re.compile(r"^[A-HJ-NPR-Z0-9]{17}$")


@dataclass
class Intent:
    type: Literal[
        "komm_nr",
        "dvn",
        "vin",
        "autocomplete_only",
        "keyword_search",
        "empty",
    ]
    direct_redirect: bool = False
    normalized_query: str = ""


def route(raw_query: str) -> Intent:
    q = (raw_query or "").strip()

    if not q:
        return Intent(type="empty", normalized_query="")

    # Rule 1: Komm-Nr (type letter + DVN, e.g. 'D9094', 'D 9094', 'd9094')
    if KOMM_NR_RE.match(q):
        # Normalize: remove spaces, uppercase the type letter
        cleaned = q.replace(" ", "")
        normalized = cleaned[0].upper() + cleaned[1:]
        return Intent(
            type="komm_nr",
            direct_redirect=True,
            normalized_query=normalized,
        )

    # Rule 2: Pure 4-6 digit number = DVN lookup (unique per vehicle)
    if DVN_RE.match(q):
        return Intent(
            type="dvn",
            direct_redirect=True,
            normalized_query=q,
        )

    # Rule 3: 17-char alphanumeric (no IOQ) = VIN
    upper = q.upper().replace(" ", "")
    if VIN_RE.match(upper):
        return Intent(
            type="vin",
            direct_redirect=True,
            normalized_query=upper,
        )

    # Rule 4: Single char only -> autocomplete (FTS is useless on one character)
    if len(q) < 2:
        return Intent(
            type="autocomplete_only",
            normalized_query=q.lower(),
        )

    # Rule 5: Default — keyword + fuzzy search
    return Intent(
        type="keyword_search",
        normalized_query=q.lower(),
    )
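The routing rules can be checked end to end without the service. A condensed sketch using the same regexes (`classify` is a simplified stand-in for `route`, returning only the intent type and normalized query; the example queries are hypothetical):

```python
import re

KOMM_NR_RE = re.compile(r"^[NTVDGLntvdgl]\s*\d{4,6}$")
DVN_RE = re.compile(r"^\d{4,6}$")
VIN_RE = re.compile(r"^[A-HJ-NPR-Z0-9]{17}$")


def classify(q: str) -> tuple[str, str]:
    """Apply rules 1-3 and the keyword default, in the same order as route()."""
    q = q.strip()
    if KOMM_NR_RE.match(q):
        c = q.replace(" ", "")          # 'd 9094' -> 'd9094'
        return "komm_nr", c[0].upper() + c[1:]
    if DVN_RE.match(q):
        return "dvn", q
    upper = q.upper().replace(" ", "")
    if VIN_RE.match(upper):
        return "vin", upper
    return "keyword_search", q.lower()


print(classify("d 9094"))              # ('komm_nr', 'D9094')
print(classify("9094"))                # ('dvn', '9094')
print(classify("WVWZZZ1JZXW000001"))   # ('vin', 'WVWZZZ1JZXW000001')
print(classify("nissan leaf"))         # ('keyword_search', 'nissan leaf')
```

Order matters: the Komm-Nr rule must run before the DVN rule, otherwise a lowercase type letter would never be stripped and uppercased.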
300 stacks/eh-search/app/main.py Normal file
@@ -0,0 +1,300 @@
"""eh-search FastAPI application — Phase 1.5 Federated Multi-Source.

Endpoints:
- GET  /health            -> liveness + dependency checks
- GET  /search?q=         -> federated search across vehicles + content + static pages
- GET  /suggest?q=        -> autocomplete
- POST /cache/invalidate  -> clear all caches + refresh content index
"""
import asyncio
import logging
import time
from contextlib import asynccontextmanager

from fastapi import FastAPI, Query, Request, Response
from fastapi.middleware.cors import CORSMiddleware

from app import cache, content_index, db
from app.config import settings
from app.intent_router import route as route_intent
from app.schemas import (
    HealthResponse,
    InvalidateResponse,
    SearchResponse,
    SearchResultItem,
    SuggestResponse,
)
from app.search import exact, fts, static_pages, suggest
from app import slug_resolver

logging.basicConfig(
    level=logging.INFO,
    format="%(asctime)s %(levelname)s %(name)s: %(message)s",
)
log = logging.getLogger("eh-search")


@asynccontextmanager
async def lifespan(app: FastAPI):
    log.info("Starting eh-search (env=%s) Phase 1.5", settings.environment)
    await db.init_pool()
    await cache.init_redis()
    await slug_resolver.refresh()
    await content_index.refresh()
    refresher_slug = asyncio.create_task(slug_resolver.background_refresher())
    refresher_content = asyncio.create_task(content_index.background_refresher())
    log.info("eh-search ready")
    try:
        yield
    finally:
        refresher_slug.cancel()
        refresher_content.cancel()
        await db.close_pool()
        await cache.close_redis()
        log.info("eh-search stopped")


app = FastAPI(
    title="eh-search",
    description="Federated search service for electric-horses.de",
    version="0.2.0",
    lifespan=lifespan,
)

if settings.cors_origin_list:
    app.add_middleware(
        CORSMiddleware,
        allow_origins=settings.cors_origin_list,
        allow_methods=["GET"],
        allow_headers=["*"],
        max_age=600,
    )


# ---------------------------------------------------------------------------
# Health
# ---------------------------------------------------------------------------
@app.get("/health", response_model=HealthResponse)
async def health() -> HealthResponse:
    db_status = "ok"
    vehicles_count = 0
    try:
        vehicles_count = await db.fetchval("SELECT count(*) FROM search_vehicles")
    except Exception as e:
        db_status = f"error: {e}"

    redis_status = "ok"
    try:
        await cache.get_redis().ping()
    except Exception as e:
        redis_status = f"error: {e}"

    overall = "ok" if db_status == "ok" and redis_status == "ok" else "degraded"
    return HealthResponse(
        status=overall,
        environment=settings.environment,
        db=db_status,
        redis=redis_status,
        vehicles_count=vehicles_count or 0,
        content=content_index.stats(),
    )


# ---------------------------------------------------------------------------
# Federated Search
# ---------------------------------------------------------------------------
async def _vehicle_search(query: str, intent_type: str, normalized: str, limit: int) -> list[SearchResultItem]:
    if intent_type == "komm_nr":
        return await exact.by_komm_nr(normalized)
    if intent_type == "dvn":
        return await exact.by_dvn(normalized)
    if intent_type == "vin":
        return await exact.by_vin(normalized)
    if intent_type == "keyword_search":
        return await fts.hybrid_search(normalized, limit=limit)
    return []


def _content_to_items(raw: list[dict]) -> list[SearchResultItem]:
    """Convert content_index dict results to SearchResultItem."""
    return [SearchResultItem(**r) for r in raw]


async def _enrich_vehicle_slugs(items: list[SearchResultItem]) -> None:
    """Vehicles need their Directus slug resolved."""
    for item in items:
        if item.type == "vehicle" and item.directus_product_id:
            item.slug = await slug_resolver.get_slug(item.directus_product_id)


def _bridge_pages_for_blog(blog_results: list[SearchResultItem]) -> list[SearchResultItem]:
    """If matched blog posts have tags that bridge to a static page,
    surface that page as an additional result.
    """
    found_pages: dict[str, float] = {}
    for blog in blog_results:
        if not blog.tags:
            continue
        slugs = static_pages.get_pages_for_tags(blog.tags)
        for slug in slugs:
            page = static_pages.page_by_slug(slug)
            if page:
                # Score: based on top blog match score, capped
                found_pages[slug] = max(found_pages.get(slug, 0), min(blog.score, 5.0))

    items = []
    for slug, score in found_pages.items():
        page = static_pages.page_by_slug(slug)
        if page is None:
            continue
        items.append(SearchResultItem(
            type="page",
            title=page.title,
            slug=page.slug,
            snippet=page.snippet,
            score=score,
            matched_via="tag_bridge",
        ))
    return items


@app.get("/search", response_model=SearchResponse)
async def search(
    response: Response,
    q: str = Query(..., min_length=1, max_length=200),
    limit: int = Query(10, ge=1, le=50),
) -> SearchResponse:
    started = time.perf_counter()
    intent = route_intent(q)

    cache_key = cache.key_query(f"v2:{intent.normalized_query}:{limit}")
    cached = await cache.get_json(cache_key)
    if cached is not None:
        cached["cache_hit"] = True
        response.headers["X-Cache-Hit"] = "true"
        return SearchResponse(**cached)

    norm = intent.normalized_query

    # Run all sources in parallel
    if intent.type in ("komm_nr", "dvn", "vin"):
        # Exact lookups skip content/static (they should hit a vehicle)
        vehicle_task = _vehicle_search(q, intent.type, norm, limit)
        vehicle_items = await vehicle_task
        content_blog: list[SearchResultItem] = []
        content_brands: list[SearchResultItem] = []
        content_legal: list[SearchResultItem] = []
        page_items: list[SearchResultItem] = []
    elif intent.type == "keyword_search":
        vehicle_task = _vehicle_search(q, intent.type, norm, limit)
        vehicle_items, content_blog_raw, content_brand_raw, content_legal_raw, page_raw = await asyncio.gather(
            vehicle_task,
            asyncio.to_thread(content_index.search_blog, norm, 5),
            asyncio.to_thread(content_index.search_brands, norm, 5),
            asyncio.to_thread(content_index.search_legal, norm, 5),
            asyncio.to_thread(static_pages.search_static_pages, norm, 5),
        )
        content_blog = _content_to_items(content_blog_raw)
        content_brands = _content_to_items(content_brand_raw)
        content_legal = _content_to_items(content_legal_raw)
        page_items = _content_to_items(page_raw)
    else:
        # autocomplete_only / empty
        vehicle_items = []
        content_blog = []
        content_brands = []
        content_legal = []
        page_items = []

    await _enrich_vehicle_slugs(vehicle_items)

    # Tag-bridge: blog matches surface their bridged static pages too
    bridged_pages = _bridge_pages_for_blog(content_blog)

    # Merge static pages from direct match + bridge (dedup by slug, keep higher score)
    page_map: dict[str, SearchResultItem] = {}
    for p in page_items + bridged_pages + content_legal:
        if p.slug and (p.slug not in page_map or page_map[p.slug].score < p.score):
            page_map[p.slug] = p
    pages_final = sorted(page_map.values(), key=lambda x: -x.score)

    # Combined results, ordered by type priority then by score
    type_priority = {"page": 0, "brand": 1, "vehicle": 2, "blog": 3}
    all_results: list[SearchResultItem] = (
        pages_final[:5] + content_brands[:5] + vehicle_items[:limit] + content_blog[:5]
    )

    # Direct redirect logic:
    # - commission_number / vin with single hit -> vehicle slug
    # - exact static page match (score >= 9) when ONLY one hit -> page slug
    direct_redirect = False
    if intent.direct_redirect and len(vehicle_items) == 1 and vehicle_items[0].slug:
        direct_redirect = True
    elif intent.type == "keyword_search" and len(pages_final) == 1 and pages_final[0].score >= 9 and not vehicle_items and not content_brands and not content_blog:
        direct_redirect = True

    took_ms = int((time.perf_counter() - started) * 1000)
    payload = SearchResponse(
        query=q,
        intent=intent.type,
        direct_redirect=direct_redirect,
        total=len(all_results),
        results=all_results,
        took_ms=took_ms,
        cache_hit=False,
    )

    ttl = settings.cache_ttl_empty if not all_results else settings.cache_ttl_result
    await cache.set_json(cache_key, payload.model_dump(), ttl=ttl)
    response.headers["X-Cache-Hit"] = "false"
    return payload


# ---------------------------------------------------------------------------
# Suggest
# ---------------------------------------------------------------------------
@app.get("/suggest", response_model=SuggestResponse)
async def suggest_endpoint(
    response: Response,
    q: str = Query("", max_length=50),
    limit: int = Query(8, ge=1, le=20),
) -> SuggestResponse:
    started = time.perf_counter()
    q_norm = (q or "").strip().lower()

    cache_key = cache.key_suggest(f"{q_norm}:{limit}")
    cached = await cache.get_json(cache_key)
    if cached is not None:
        response.headers["X-Cache-Hit"] = "true"
        return SuggestResponse(**cached)

    if not q_norm:
        items = await suggest.top_brands(limit=limit)
    else:
        items = await suggest.prefix_suggest(q_norm, limit=limit)

    payload = SuggestResponse(
        query=q,
        suggestions=items,
        took_ms=int((time.perf_counter() - started) * 1000),
    )
    await cache.set_json(cache_key, payload.model_dump(), ttl=settings.cache_ttl_suggest)
    response.headers["X-Cache-Hit"] = "false"
    return payload


# ---------------------------------------------------------------------------
# Cache invalidation
# ---------------------------------------------------------------------------
@app.post("/cache/invalidate", response_model=InvalidateResponse)
async def invalidate(request: Request) -> InvalidateResponse:
    cleared = await cache.invalidate_all()
    # Refresh both indexes in the background
    asyncio.create_task(slug_resolver.refresh())
    asyncio.create_task(content_index.refresh())
    log.info(
        "Cache invalidated: %d keys (from %s)",
        cleared,
        request.client.host if request.client else "?",
    )
    return InvalidateResponse(cleared=cleared, scope="all")
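The page-merge step in the search handler (dedup by slug, keep the higher score) is easy to get subtly wrong. A self-contained sketch of just that logic, with hypothetical slugs and scores:

```python
def merge_pages(*sources: list[tuple[str, float]]) -> list[tuple[str, float]]:
    """Merge (slug, score) candidates from several sources; a duplicate slug
    only wins if it carries a strictly higher score. Sorted by score desc."""
    best: dict[str, float] = {}
    for source in sources:
        for slug, score in source:
            if slug and (slug not in best or best[slug] < score):
                best[slug] = score
    return sorted(best.items(), key=lambda x: -x[1])


direct  = [("/foerderung", 9.0)]                        # direct static-page match
bridged = [("/foerderung", 5.0), ("/probefahrt", 4.0)]  # from the tag bridge

print(merge_pages(direct, bridged))
# [('/foerderung', 9.0), ('/probefahrt', 4.0)]
```

The direct match keeps its higher score even though the tag bridge re-surfaces the same slug, which is exactly the behaviour the `page_map` loop implements.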
82 stacks/eh-search/app/schemas.py Normal file
@@ -0,0 +1,82 @@
"""Pydantic response models."""
from pydantic import BaseModel, Field
from typing import Literal, Any


IntentType = Literal[
    "komm_nr",
    "dvn",
    "vin",
    "autocomplete_only",
    "keyword_search",
    "empty",
]

ResultType = Literal["vehicle", "page", "blog", "brand"]


class SearchResultItem(BaseModel):
    """Unified result item — fields are optional depending on type."""
    type: ResultType = "vehicle"
    title: str
    slug: str | None = None
    snippet: str | None = None
    score: float = 0.0
    matched_via: str = ""

    # Vehicle-specific
    vehicle_id: int | None = None
    commission_number: str | None = None
    vin: str | None = None
    brand: str | None = None
    model: str | None = None
    price: float | None = None
    primary_image_id: str | None = None
    directus_product_id: int | None = None

    # Blog-specific
    tags: list[str] | None = None
    category: str | None = None
    published_at: str | None = None
    image_id: str | None = None

    # Brand-specific
    logo_id: str | None = None

    model_config = {"extra": "allow"}


class SearchResponse(BaseModel):
    query: str
    intent: IntentType
    direct_redirect: bool = False
    total: int
    results: list[SearchResultItem]
    took_ms: int
    cache_hit: bool = False


class SuggestItem(BaseModel):
    text: str
    type: Literal["brand", "model", "category"]
    count: int = 0


class SuggestResponse(BaseModel):
    query: str
    suggestions: list[SuggestItem]
    took_ms: int


class HealthResponse(BaseModel):
    status: str
    environment: str
    db: str
    redis: str
    vehicles_count: int
    content: dict[str, Any] = Field(default_factory=dict)


class InvalidateResponse(BaseModel):
    cleared: int
    scope: str = "all"
0 stacks/eh-search/app/search/__init__.py Normal file
78 stacks/eh-search/app/search/exact.py Normal file
@@ -0,0 +1,78 @@
"""Exact lookups: Komm-Nr, DVN, VIN.

Loco-Soft Komm-Nr schema:
    type letter (N/T/V/D/G/L) + DVN number (4-6 digits)
    Examples: 'D9094' (Nissan Leaf), 'N8093' (Askoll XKP45), 'L9083' (Opel)

    N = Neu (new), T = Tageszul., V = Vorfuehr.,
    D = Differenzbest., G = Gebraucht (used), L = Leihgabe (loaner)

The DVN alone is unique per vehicle, so it also works as a search entry point.
"""
import re

from app import db
from app.schemas import SearchResultItem


COLUMNS = """
    vehicle_id, dvn, commission_number, vin, brand, model, title,
    price::float8 as price, primary_image_id::text as primary_image_id,
    directus_product_id, dealer_vehicle_type
"""

KOMM_NR_RE = re.compile(r"^([NTVDGLntvdgl])\s*(\d{4,6})$")


def normalize_komm_nr(raw: str) -> str | None:
    """'D 9094' / 'd9094' / ' D9094 ' -> 'D9094'. Returns None if not a valid pattern."""
    if not raw:
        return None
    m = KOMM_NR_RE.match(raw.strip())
    if not m:
        return None
    return m.group(1).upper() + m.group(2)


async def by_komm_nr(raw: str) -> list[SearchResultItem]:
    """Lookup by full Komm-Nr (type letter + DVN). Unique when present."""
    normalized = normalize_komm_nr(raw)
    if not normalized:
        return []
    sql = f"SELECT {COLUMNS} FROM search_vehicles WHERE commission_number = $1 LIMIT 5"
    rows = await db.fetch(sql, normalized)
    return [_row_to_item(r, matched_via="exact_komm") for r in rows]


async def by_dvn(number: str) -> list[SearchResultItem]:
    """Lookup by DVN alone (4-6 digit number without the type letter).
    The DVN is unique per vehicle, so this always yields 0 or 1 hits."""
    try:
        dvn_int = int(number)
    except (ValueError, TypeError):
        return []
    sql = f"SELECT {COLUMNS} FROM search_vehicles WHERE dvn = $1 LIMIT 5"
    rows = await db.fetch(sql, dvn_int)
    return [_row_to_item(r, matched_via="exact_dvn") for r in rows]


async def by_vin(vin: str) -> list[SearchResultItem]:
    sql = f"SELECT {COLUMNS} FROM search_vehicles WHERE vin = $1 LIMIT 5"
    rows = await db.fetch(sql, vin)
    return [_row_to_item(r, matched_via="exact_vin") for r in rows]


def _row_to_item(row, matched_via: str) -> SearchResultItem:
    return SearchResultItem(
        vehicle_id=row["vehicle_id"],
        commission_number=row["commission_number"],
        vin=row["vin"],
        brand=row["brand"],
        model=row["model"],
        title=row["title"] or "",
        price=row["price"],
        primary_image_id=row["primary_image_id"],
        directus_product_id=row["directus_product_id"],
        score=1.0,
        matched_via=matched_via,
    )
148 stacks/eh-search/app/search/fts.py Normal file
@@ -0,0 +1,148 @@
"""Postgres full-text search (type 2) + pg_trgm fuzzy (type 3) + substring fallback."""
import re

from app import db
from app.schemas import SearchResultItem


COLUMNS = """
    vehicle_id, commission_number, vin, brand, model, title,
    price::float8 as price, primary_image_id::text as primary_image_id,
    directus_product_id
"""

# Sanitize tsquery input — keep word chars, spaces, dots, hyphens
_SAFE_TOKEN = re.compile(r"[^\w\s\.\-]", re.UNICODE)


def _build_prefix_tsquery(query: str) -> str:
    """Convert a free-text query into a safe tsquery with prefix matching.

    'ray'        -> 'ray:*'
    'zero motor' -> 'zero:* & motor:*'
    """
    cleaned = _SAFE_TOKEN.sub(" ", query)
    tokens = [t for t in cleaned.split() if t]
    if not tokens:
        return ""
    return " & ".join(f"{t}:*" for t in tokens)
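As an aside, the sanitizer above can be verified standalone: stripping tsquery metacharacters before appending `:*` is what keeps user input from producing `to_tsquery` syntax errors. The second example input is hypothetical:

```python
import re

_SAFE_TOKEN = re.compile(r"[^\w\s\.\-]", re.UNICODE)


def build_prefix_tsquery(query: str) -> str:
    """Same logic as _build_prefix_tsquery: drop unsafe chars, AND-join prefixes."""
    cleaned = _SAFE_TOKEN.sub(" ", query)
    tokens = [t for t in cleaned.split() if t]
    return " & ".join(f"{t}:*" for t in tokens)


print(build_prefix_tsquery("zero motor"))     # zero:* & motor:*
print(build_prefix_tsquery("50kW; & laden"))  # 50kW:* & laden:*
print(build_prefix_tsquery("&&&"))            # (empty string)
```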
async def fts_search(query: str, limit: int = 10) -> list[SearchResultItem]:
|
||||||
|
"""Full-text search with German + unaccent dictionary, prefix-enabled."""
|
||||||
|
tsquery_str = _build_prefix_tsquery(query)
|
||||||
|
if not tsquery_str:
|
||||||
|
return []
|
||||||
|
sql = f"""
|
||||||
|
SELECT {COLUMNS},
|
||||||
|
ts_rank_cd(search_tsv, q)::float8 AS rank
|
||||||
|
FROM search_vehicles, to_tsquery('german_unaccent', $1) q
|
||||||
|
WHERE search_tsv @@ q
|
||||||
|
ORDER BY rank DESC, brand, model
|
||||||
|
LIMIT $2
|
||||||
|
"""
|
||||||
|
try:
|
||||||
|
rows = await db.fetch(sql, tsquery_str, limit)
|
||||||
|
except Exception:
|
||||||
|
sql_fallback = f"""
|
||||||
|
SELECT {COLUMNS},
|
||||||
|
ts_rank_cd(search_tsv, q)::float8 AS rank
|
||||||
|
FROM search_vehicles, plainto_tsquery('german_unaccent', $1) q
|
||||||
|
WHERE search_tsv @@ q
|
||||||
|
ORDER BY rank DESC, brand, model
|
||||||
|
LIMIT $2
|
||||||
|
"""
|
||||||
|
rows = await db.fetch(sql_fallback, query, limit)
|
||||||
|
return [_to_item(r, "fts", float(r["rank"] or 0)) for r in rows]
|
||||||
|
|
||||||
|
|
||||||
|
async def fuzzy_search(query: str, limit: int = 10) -> list[SearchResultItem]:
|
||||||
|
"""Trigram similarity for typo tolerance."""
|
||||||
|
sql = f"""
|
||||||
|
WITH scored AS (
|
||||||
|
SELECT {COLUMNS},
|
||||||
|
GREATEST(
|
||||||
|
similarity(title, $1),
|
||||||
|
similarity(COALESCE(commission_number, ''), $1)
|
||||||
|
)::float8 AS sim
|
||||||
|
FROM search_vehicles
|
||||||
|
WHERE title % $1
|
||||||
|
OR commission_number % $1
|
||||||
|
)
|
||||||
|
SELECT * FROM scored
|
||||||
|
WHERE sim > 0.25
|
||||||
|
ORDER BY sim DESC
|
||||||
|
LIMIT $2
|
||||||
|
"""
|
||||||
|
rows = await db.fetch(sql, query, limit)
|
||||||
|
return [_to_item(r, "fuzzy", float(r["sim"])) for r in rows]
|
||||||
|
|
||||||
|
|
||||||
|
async def substring_search(query: str, limit: int = 10) -> list[SearchResultItem]:
|
||||||
|
"""Last-resort: ILIKE substring on title/brand/model.
|
||||||
|
|
||||||
|
Catches stop-words (e.g. German 'es') that FTS strips, and very short
|
||||||
|
queries that don't survive stemming. Uses the existing trgm GIN index.
|
||||||
|
"""
|
||||||
|
pattern = f"%{query.lower()}%"
|
||||||
|
sql = f"""
|
||||||
|
SELECT {COLUMNS}, 0.5::float8 AS rank
|
||||||
|
FROM search_vehicles
|
||||||
|
WHERE LOWER(title) LIKE $1
|
||||||
|
OR LOWER(brand) LIKE $1
|
||||||
|
OR LOWER(COALESCE(model, '')) LIKE $1
|
||||||
|
ORDER BY brand, model
|
||||||
|
LIMIT $2
|
||||||
|
"""
|
||||||
|
rows = await db.fetch(sql, pattern, limit)
|
||||||
|
return [_to_item(r, "substring", 0.5) for r in rows]
|
||||||
|
|
||||||
|
|
||||||
|
async def hybrid_search(query: str, limit: int = 10) -> list[SearchResultItem]:
|
||||||
|
"""FTS first; if too few hits, add fuzzy; if still empty, add substring.
|
||||||
|
|
||||||
|
Strategy:
|
||||||
|
1. Always run FTS (cheap, indexed, prefix-enabled)
|
||||||
|
2. If FTS has >= 3 results, return them
|
||||||
|
3. Otherwise also run fuzzy and merge
|
||||||
|
4. If still nothing, run substring fallback (handles stop-words)
|
||||||
|
"""
|
||||||
|
fts_results = await fts_search(query, limit=limit)
|
||||||
|
if len(fts_results) >= 3:
|
||||||
|
return fts_results
|
||||||
|
|
||||||
|
fuzzy_results = await fuzzy_search(query, limit=limit)
|
||||||
|
|
||||||
|
seen: dict[int, SearchResultItem] = {r.vehicle_id: r for r in fts_results}
|
||||||
|
for r in fuzzy_results:
|
||||||
|
if r.vehicle_id not in seen:
|
||||||
|
seen[r.vehicle_id] = r
|
||||||
|
|
||||||
|
if not seen:
|
||||||
|
# Last resort: substring (catches stop-words like 'es')
|
||||||
|
substring_results = await substring_search(query, limit=limit)
|
||||||
|
for r in substring_results:
|
||||||
|
seen[r.vehicle_id] = r
|
||||||
|
|
||||||
|
merged = list(seen.values())
|
||||||
|
merged.sort(key=lambda x: (
|
||||||
|
0 if x.matched_via == "fts" else (1 if x.matched_via == "fuzzy" else 2),
|
||||||
|
-x.score,
|
||||||
|
))
|
||||||
|
return merged[:limit]
|
||||||
|
|
||||||
|
|
||||||
|
def _to_item(r, matched_via: str, score: float) -> SearchResultItem:
|
||||||
|
return SearchResultItem(
|
||||||
|
vehicle_id=r["vehicle_id"],
|
||||||
|
commission_number=r["commission_number"],
|
||||||
|
vin=r["vin"],
|
||||||
|
brand=r["brand"],
|
||||||
|
model=r["model"],
|
||||||
|
title=r["title"] or "",
|
||||||
|
price=r["price"],
|
||||||
|
primary_image_id=r["primary_image_id"],
|
||||||
|
directus_product_id=r["directus_product_id"],
|
||||||
|
score=score,
|
||||||
|
matched_via=matched_via,
|
||||||
|
)
|
||||||
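The prefix-tsquery builder is pure string manipulation, so it can be sanity-checked without a database. A minimal standalone sketch duplicating the regex and tokenization from `fts.py`:

```python
import re

# Same sanitizer as fts.py: keep word chars, spaces, dots, hyphens
_SAFE_TOKEN = re.compile(r"[^\w\s\.\-]", re.UNICODE)

def build_prefix_tsquery(query: str) -> str:
    # Strip anything that could be tsquery syntax, then AND the prefix terms
    cleaned = _SAFE_TOKEN.sub(" ", query)
    tokens = [t for t in cleaned.split() if t]
    return " & ".join(f"{t}:*" for t in tokens)

print(build_prefix_tsquery("ray"))         # ray:*
print(build_prefix_tsquery("zero motor"))  # zero:* & motor:*
print(build_prefix_tsquery("!&|"))         # (empty string -> caller returns [])
```

Because tsquery operators like `!`, `&`, `|` are scrubbed before the string reaches `to_tsquery`, a hostile query degrades to an empty result rather than a syntax error (which the `plainto_tsquery` fallback would otherwise have to absorb).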
194
stacks/eh-search/app/search/static_pages.py
Normal file
@@ -0,0 +1,194 @@
"""Static page registry + tag-to-page bridge.

Hardcoded list of top-level Astro pages that aren't in Directus.
The tag bridge maps blog tags (real, curated by the team) to these pages,
turning the blog content taxonomy into a search vocabulary for the site.
"""
from dataclasses import dataclass, field
from typing import Iterable


@dataclass
class StaticPage:
    slug: str            # URL path, e.g. "/werkstatt"
    title: str           # Display title
    snippet: str         # Description shown in search results
    keywords: list[str] = field(default_factory=list)  # Direct synonyms


# Top-level pages of electric-horses.de that live as Astro routes,
# not in Directus. Order matters for tie-breaking (earlier = preferred).
STATIC_PAGES: list[StaticPage] = [
    StaticPage(
        slug="/werkstatt",
        title="Werkstatt",
        snippet="Spezialwerkstatt fuer E-Roller und E-Motorraeder. Inspektion, Reparatur, Akku-Service.",
        keywords=["werkstatt", "reparatur", "service", "inspektion", "wartung"],
    ),
    StaticPage(
        slug="/ersatzteile",
        title="Ersatzteile",
        snippet="Original-Ersatzteile und Zubehoer fuer E-Roller, E-Motorraeder und E-Scooter.",
        keywords=["ersatzteile", "teile", "zubehoer", "originalteile"],
    ),
    StaticPage(
        slug="/fahrzeuge",
        title="Fahrzeuge",
        snippet="Unser kompletter Bestand an E-Rollern, E-Motorraedern und E-Autos.",
        keywords=["fahrzeuge", "bestand", "angebot", "uebersicht", "alle"],
    ),
    StaticPage(
        slug="/marken",
        title="Marken",
        snippet="Alle Marken im Ueberblick: Askoll, RAY, ZERO, Energica, Nissan und mehr.",
        keywords=["marken", "hersteller", "brands"],
    ),
    StaticPage(
        slug="/vermietung",
        title="Vermietung",
        snippet="E-Motorrad-Vermietung in Wendelstein bei Nuernberg. Tagestouren, Wochenmiete.",
        keywords=["vermietung", "mieten", "leihen", "verleih", "rental"],
    ),
    StaticPage(
        slug="/kontakt",
        title="Kontakt",
        snippet="Kontakt zu Electric Horses: Telefon, E-Mail, Anfahrt, Oeffnungszeiten.",
        keywords=["kontakt", "anfahrt", "adresse", "telefon", "email", "oeffnungszeiten"],
    ),
    StaticPage(
        slug="/ueber-uns",
        title="Ueber uns",
        snippet="Electric Horses ist die E-Mobilitaets-Sparte des Autohaus Richter & Zech.",
        keywords=["ueber uns", "team", "geschichte", "richter zech", "wendelstein"],
    ),
    StaticPage(
        slug="/blog",
        title="Blog",
        snippet="News, Ratgeber und Erfahrungsberichte rund um Elektromobilitaet.",
        keywords=["blog", "news", "artikel", "ratgeber"],
    ),
]


# Tag → Page bridge.
# These tags exist on real blog posts; matching one of them surfaces the
# linked page (in addition to the blog posts that carry the tag).
# Source: actual tag pool extracted from blog_posts.tags.
TAG_TO_PAGE_BRIDGE: dict[str, str] = {
    # ─── Workshop topics (tech, service, tires, battery) ─────────
    "akku": "/werkstatt",
    "akkuladung": "/werkstatt",
    "ladedauer": "/werkstatt",
    "reichweite": "/werkstatt",
    "verbrauch": "/werkstatt",
    "bremsen": "/werkstatt",
    "reifen": "/werkstatt",
    "metzeler": "/werkstatt",
    "pirelli": "/werkstatt",
    "heidenau": "/werkstatt",
    "wartung": "/werkstatt",
    "service": "/werkstatt",
    "installation": "/werkstatt",
    "montage": "/werkstatt",
    "werkzeug": "/werkstatt",
    "trittbrettverbreiterung": "/werkstatt",
    "radnabenmotor": "/werkstatt",
    "propilot": "/werkstatt",
    "effizienz": "/werkstatt",
    "ledersitze": "/werkstatt",

    # ─── Advice & subsidies (Kontakt) ────────────────────────────
    "versicherung": "/kontakt",
    "foerderung": "/kontakt",
    "foerderungen": "/kontakt",
    "praemie": "/kontakt",
    "preisvorteil": "/kontakt",
    "innovationspraemie": "/kontakt",
    "umweltpraemie": "/kontakt",
    "kosten": "/kontakt",
    "vergleich": "/kontakt",
    "vorfuehrer": "/kontakt",
    "preis": "/kontakt",
    "goelectric": "/kontakt",

    # ─── Driving licence & legal (Werkstatt section "Recht") ─────
    "abe": "/werkstatt",
    "ekfv": "/werkstatt",
    "klassea1": "/werkstatt",
    "klasseb": "/werkstatt",
    "schluesselzahl196": "/werkstatt",
    "fuehrerschein": "/werkstatt",
    "strassenzulassung": "/werkstatt",
    "verkehrsregeln": "/werkstatt",
    "verordnung": "/werkstatt",
    "legal": "/werkstatt",
    "ekickroller": "/werkstatt",

    # ─── Rental topics ───────────────────────────────────────────
    "touristik": "/vermietung",
    "freizeit": "/vermietung",

    # ─── Brand indicators (link to /marken) ──────────────────────
    "marknunabhaengig": "/marken",
}


def search_static_pages(query: str, limit: int = 5) -> list[dict]:
    """Search static pages by title, slug, keywords, snippet substring.

    Returns dicts in the same shape as content search results.
    """
    q = (query or "").strip().lower()
    if not q:
        return []
    results = []
    for page in STATIC_PAGES:
        score = 0.0
        # Exact title match -> very high
        if page.title.lower() == q:
            score = 10.0
        # Title prefix
        elif page.title.lower().startswith(q):
            score = 7.0
        # Slug match (without leading slash)
        elif page.slug.lstrip("/").lower() == q:
            score = 9.0
        # Title substring (only for queries >= 3 chars to avoid noise like 'ze' in 'fahrzeuge')
        elif len(q) >= 3 and q in page.title.lower():
            score = 5.0
        # Keyword exact / prefix
        elif any(kw == q for kw in page.keywords):
            score = 6.0
        elif any(kw.startswith(q) for kw in page.keywords):
            score = 4.0
        # Snippet substring (weakest, also requires >= 3 chars)
        elif len(q) >= 3 and q in page.snippet.lower():
            score = 2.0

        if score > 0:
            results.append({
                "type": "page",
                "title": page.title,
                "slug": page.slug,
                "snippet": page.snippet,
                "score": score,
                "matched_via": "static_page",
            })

    results.sort(key=lambda r: -r["score"])
    return results[:limit]


def get_pages_for_tags(tags: Iterable[str]) -> set[str]:
    """Given an iterable of blog tags, return the set of page slugs they map to."""
    if not tags:
        return set()
    return {TAG_TO_PAGE_BRIDGE[t.lower()] for t in tags if t and t.lower() in TAG_TO_PAGE_BRIDGE}


def page_by_slug(slug: str) -> StaticPage | None:
    for p in STATIC_PAGES:
        if p.slug == slug:
            return p
    return None
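The tag bridge itself reduces to a case-insensitive dict lookup. A minimal sketch of the same logic as `get_pages_for_tags`, using a toy two-entry bridge instead of the full table:

```python
from typing import Iterable

# Toy bridge — the real TAG_TO_PAGE_BRIDGE in static_pages.py has ~45 entries
TAG_TO_PAGE_BRIDGE = {"akku": "/werkstatt", "touristik": "/vermietung"}

def get_pages_for_tags(tags: Iterable[str]) -> set[str]:
    # Case-insensitive lookup; tags without a mapping simply drop out
    return {TAG_TO_PAGE_BRIDGE[t.lower()] for t in tags
            if t and t.lower() in TAG_TO_PAGE_BRIDGE}

print(sorted(get_pages_for_tags(["Akku", "unbekannt", "touristik"])))
# ['/vermietung', '/werkstatt'] — the unknown tag is silently ignored
```

Returning a set means a post tagged with several workshop-related tags still surfaces `/werkstatt` only once.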
50
stacks/eh-search/app/search/suggest.py
Normal file
@@ -0,0 +1,50 @@
"""Autocomplete / suggest endpoint (Type 11)."""
from app import db
from app.schemas import SuggestItem


async def top_brands(limit: int = 10) -> list[SuggestItem]:
    rows = await db.fetch(
        "SELECT brand, count(*) AS cnt FROM search_vehicles "
        "WHERE brand IS NOT NULL GROUP BY brand ORDER BY cnt DESC LIMIT $1",
        limit,
    )
    return [
        SuggestItem(text=r["brand"], type="brand", count=r["cnt"])
        for r in rows
    ]


async def prefix_suggest(prefix: str, limit: int = 8) -> list[SuggestItem]:
    """Brand + model prefix matching, case-insensitive."""
    pattern = f"{prefix.lower()}%"

    # Brands first
    brand_rows = await db.fetch(
        "SELECT brand, count(*) AS cnt FROM search_vehicles "
        "WHERE LOWER(brand) LIKE $1 GROUP BY brand ORDER BY cnt DESC LIMIT $2",
        pattern,
        limit,
    )
    items = [
        SuggestItem(text=r["brand"], type="brand", count=r["cnt"])
        for r in brand_rows
    ]

    # Then models if room
    remaining = limit - len(items)
    if remaining > 0:
        model_rows = await db.fetch(
            "SELECT brand || ' ' || model AS text, count(*) AS cnt "
            "FROM search_vehicles "
            "WHERE model IS NOT NULL AND LOWER(model) LIKE $1 "
            "GROUP BY brand, model ORDER BY cnt DESC LIMIT $2",
            pattern,
            remaining,
        )
        items.extend(
            SuggestItem(text=r["text"], type="model", count=r["cnt"])
            for r in model_rows
        )

    return items
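The two-phase "brands first, then models fill the remaining slots" shape of `prefix_suggest` can be sketched without a database. A minimal in-memory sketch with made-up rows (the names below are hypothetical, not real inventory):

```python
# Toy data standing in for the search_vehicles aggregates (hypothetical)
brands = [("RAY", 12), ("Zero", 8)]
models = [("Zero SR/F", 3), ("Zero DSR", 2)]

def prefix_suggest(prefix: str, limit: int = 3) -> list[str]:
    p = prefix.lower()
    # Phase 1: matching brands, up to the full limit
    items = [b for b, _ in brands if b.lower().startswith(p)][:limit]
    # Phase 2: fill whatever slots remain with matching models,
    # mirroring the second SQL query with LIMIT $2 = remaining
    remaining = limit - len(items)
    if remaining > 0:
        items += [m for m, _ in models if m.lower().startswith(p)][:remaining]
    return items

print(prefix_suggest("zero"))  # ['Zero', 'Zero SR/F', 'Zero DSR']
print(prefix_suggest("ray"))   # ['RAY']
```

The ordering guarantee (brands before models) comes purely from the fill order, not from any score, which keeps the suggest endpoint a pair of cheap indexed queries.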
66
stacks/eh-search/app/slug_resolver.py
Normal file
@@ -0,0 +1,66 @@
"""Resolves directus_product_id -> slug, with periodic refresh.

Uses Directus REST API. Slug map is small (~75 entries) so we keep it
fully in-memory and refresh on a schedule (or on cache invalidation).
"""
import asyncio
import logging
import time

import httpx

from app.config import settings

log = logging.getLogger(__name__)

_slug_map: dict[int, str] = {}
_last_refresh: float = 0.0
_lock = asyncio.Lock()


async def refresh() -> int:
    """Pull product_id -> slug from Directus. Returns count."""
    global _slug_map, _last_refresh
    url = f"{settings.directus_url}/items/products"
    params = {
        "fields": "id,slug",
        "limit": -1,
        "filter[status][_eq]": "published",
    }
    try:
        async with httpx.AsyncClient(timeout=10) as client:
            resp = await client.get(url, params=params)
            resp.raise_for_status()
            data = resp.json().get("data", [])
        new_map = {
            int(item["id"]): item["slug"]
            for item in data
            if item.get("id") is not None and item.get("slug")
        }
        async with _lock:
            _slug_map = new_map
            _last_refresh = time.time()
        log.info("Slug map refreshed: %d entries", len(new_map))
        return len(new_map)
    except Exception as e:
        log.warning("Slug refresh failed: %s", e)
        return 0


async def get_slug(directus_product_id: int | None) -> str | None:
    if directus_product_id is None:
        return None
    # Lazy refresh if stale
    if time.time() - _last_refresh > settings.directus_slug_refresh_seconds:
        await refresh()
    return _slug_map.get(int(directus_product_id))


async def background_refresher():
    """Periodic background task — refresh slug map every N seconds."""
    while True:
        await asyncio.sleep(settings.directus_slug_refresh_seconds)
        try:
            await refresh()
        except Exception as e:
            log.warning("Background slug refresh error: %s", e)
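The staleness check in `get_slug` is a generic lazy-TTL pattern: load on first access, reload only once the TTL has expired. A minimal synchronous sketch of the same pattern (loader, TTL value, and the `askoll-es2` slug are all hypothetical):

```python
import time

class TtlCache:
    """Lazy-refresh cache mirroring the slug map's staleness check."""

    def __init__(self, loader, ttl_seconds: float):
        self._loader = loader       # callable returning a fresh dict
        self._ttl = ttl_seconds
        self._data: dict = {}
        self._last = 0.0            # epoch of last successful load

    def get(self, key):
        # Same condition as slug_resolver: reload only when stale
        if time.time() - self._last > self._ttl:
            self._data = self._loader()
            self._last = time.time()
        return self._data.get(key)

calls = []
cache = TtlCache(lambda: (calls.append(1), {1: "askoll-es2"})[1], ttl_seconds=60)
print(cache.get(1))   # askoll-es2 (first access triggers a load)
print(cache.get(1))   # askoll-es2 (within TTL: served from memory)
print(len(calls))     # 1
```

With a background refresher running as well (as in `slug_resolver.py`), the lazy check only fires if the periodic task has fallen behind, so a failed Directus call degrades to slightly stale slugs instead of an error.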
57
stacks/eh-search/docker-compose.yml
Normal file
@@ -0,0 +1,57 @@
# eh-search Stack on ai-apps
# Service is INTERNAL ONLY - bound to private network 10.0.0.8
# Reached by Pegasus via nginx proxy_pass on 10.0.0.10

services:
  eh-search:
    build: .
    image: eh-search:dev
    container_name: eh-search
    restart: unless-stopped
    env_file: .env
    ports:
      # Bind ONLY to private network IP, not 0.0.0.0
      - "10.0.0.8:8200:8200"
    networks:
      - eh-search-internal
    depends_on:
      eh-search-redis:
        condition: service_healthy
    healthcheck:
      test: ["CMD", "curl", "-fsS", "http://127.0.0.1:8200/health"]
      interval: 30s
      timeout: 5s
      retries: 3
      start_period: 10s
    deploy:
      resources:
        limits:
          memory: 512M
          cpus: "0.5"
    logging:
      driver: json-file
      options:
        max-size: "10m"
        max-file: "3"

  eh-search-redis:
    image: redis:7-alpine
    container_name: eh-search-redis
    restart: unless-stopped
    command: redis-server --maxmemory 64mb --maxmemory-policy allkeys-lru --save ""
    networks:
      - eh-search-internal
    healthcheck:
      test: ["CMD", "redis-cli", "ping"]
      interval: 10s
      timeout: 3s
      retries: 5
    deploy:
      resources:
        limits:
          memory: 96M
          cpus: "0.2"

networks:
  eh-search-internal:
    driver: bridge
7
stacks/eh-search/requirements.txt
Normal file
@@ -0,0 +1,7 @@
fastapi==0.115.0
uvicorn[standard]==0.30.6
asyncpg==0.29.0
redis==5.0.8
pydantic==2.9.2
pydantic-settings==2.5.2
httpx==0.27.2
18
stacks/forgejo/.env.example
Normal file
@@ -0,0 +1,18 @@
# Forgejo Stack — environment template
# Copy to .env and fill in actual values. NEVER commit .env to Git.

# Postgres password (generate: openssl rand -hex 32)
DB_PASSWORD=<openssl rand -hex 32>

# Forgejo internal crypto
FORGEJO_SECRET_KEY=<openssl rand -hex 32>
FORGEJO_INTERNAL_TOKEN=<openssl rand -hex 64>

# SMTP via Mailcow
SMTP_USER=electric-horses@sdda.eu
SMTP_PASSWORD=<from Mailcow>

# OIDC (Authentik) — applied via CLI after first boot, reference only here
OIDC_CLIENT_ID=<from Authentik Application 'Forgejo'>
OIDC_CLIENT_SECRET=<from Authentik Application 'Forgejo'>
OIDC_DISCOVERY_URL=https://welcome.sdda.eu/application/o/forgejo/.well-known/openid-configuration
64
stacks/forgejo/Agent.md
Normal file
@@ -0,0 +1,64 @@
# Agent Briefing — Forgejo Stack

You are working on the Forgejo Git hosting. First read `../../Agent.md` at the repo root for global conventions.

## Live at
- **URL:** https://code.sdda.eu
- **SSH:** `ssh://git@code.sdda.eu:222/<user>/<repo>.git`
- **Server:** ai-apps (Hetzner cx22, 10.0.0.8)
- **Path on server:** `/opt/ai-apps/forgejo/`
- **Live since:** 2026-04-11
- **Version:** Forgejo 10 (`codeberg.org/forgejo/forgejo:10`)

## Authentication
- Primary: **native OIDC via Authentik** (not ForwardAuth)
- Application in Authentik: `forgejo` on `welcome.sdda.eu`
- Access control: membership in the group policy `forgejo-users` required
- Launch URL: `https://code.sdda.eu/user/oauth2/authentik` (for silent SSO from the Authentik dashboard)
- Fallback: local `admin-local` user with `prohibit_login=true` (emergency)
- See ADR-0003 (OIDC instead of ForwardAuth), ADR-0006 (silent-SSO launch URL)

## Critical gotchas
1. **Volume mount on `/data`**, NOT `/var/lib/gitea`. Forgejo writes everything to `/data`. See ADR-0005.
2. **SSH port collision:** the Forgejo image runs the system sshd on 22, so Forgejo's own SSH server listens on container port 2222 → host port 222.
3. **The OIDC config lives in the Postgres DB** (table `login_source`), NOT in `app.ini`. To change it: `docker exec -u git forgejo sh -c 'cd / && forgejo admin auth update-oauth --id 1 ...'`
4. **The `forgejo admin` CLI** needs `-u git` and `cd /`: `docker exec -u git forgejo sh -c 'cd / && forgejo admin user list'`
5. **Promoting a user to admin** is cleanest via SQL: `UPDATE "user" SET is_admin = true WHERE lower_name = 'NAME';`

## Ops commands
```bash
ssh ai-apps
cd /opt/ai-apps/forgejo

# Status
docker compose ps
docker logs forgejo --tail 50

# Restart
docker compose restart forgejo

# Update (major versions manually, tag pinned to :10)
docker compose pull
docker compose up -d

# Manual backup
bash backup.sh

# User management
docker exec -u git forgejo sh -c 'cd / && forgejo admin user list'
docker exec forgejo-db psql -U forgejo -d forgejo \
  -c "UPDATE \"user\" SET is_admin = true WHERE lower_name = 'NAME';"
```

## Backup
- Script: `backup.sh` in this folder (mirrors `/opt/ai-apps/forgejo/backup.sh`)
- Cron: `0 3 * * *` on ai-apps → `/opt/backups/forgejo/`
- Retention: 14 days
- Format: `forgejo-db-<ts>.sql.gz` + `forgejo-data-<ts>.tar.gz`
- Offsite: not yet (Tier 2 via rclone → Nextcloud planned for M7.5)
- Restore procedure: see `docs/runbooks/forgejo-backup-restore.md` (added in M7.3)

## Related
- `../../Agent.md` — repo-wide briefing
- `../../docs/architecture/ai-apps-stacks.md` — server context
- ADRs 0001-0006 (in the iCloud folder, mirrored here in M7.3)
63
stacks/forgejo/README.md
Normal file
@@ -0,0 +1,63 @@
# Forgejo — Self-Hosted Git

**Live:** https://code.sdda.eu
**Git SSH:** `ssh://git@code.sdda.eu:222/<user>/<repo>.git`

## What is this?
Our self-hosted Git hosting, based on [Forgejo](https://forgejo.org/) — a non-profit community fork of Gitea governed by Codeberg e.V. This is where code, infrastructure-as-code (like this repo!), documentation and all future internal tools live.

## Quick setup on a new host

```bash
# Prerequisites: Traefik with Let's Encrypt is already running, DNS is set,
# the Authentik OIDC provider is configured (client ID + secret at hand)

cp .env.example .env
# Generate secrets
sed -i '' -e "s|<openssl rand -hex 32>|$(openssl rand -hex 32)|g" .env   # macOS
# OR: sed -i -e "s|...|...|g" .env on Linux
# Enter SMTP + OIDC credentials manually

docker compose up -d
sleep 30

# Create the admin fallback
docker exec -u git forgejo forgejo admin user create \
  --username admin-local \
  --email admin-local@sdda.eu \
  --password "<strong-password>" \
  --admin --must-change-password=false

# Set up OIDC
docker exec -u git forgejo forgejo admin auth add-oauth \
  --name authentik \
  --provider openidConnect \
  --key "<CLIENT_ID>" \
  --secret "<CLIENT_SECRET>" \
  --auto-discover-url "https://welcome.sdda.eu/application/o/forgejo/.well-known/openid-configuration" \
  --scopes "openid profile email" \
  --skip-local-2fa

# Lock admin-local (emergency reserve)
docker exec forgejo-db psql -U forgejo -d forgejo \
  -c "UPDATE \"user\" SET prohibit_login = true WHERE lower_name = 'admin-local';"
```

## Access
- **Normal (recommended):** via Authentik OIDC. `code.sdda.eu/user/login` → "Sign in with authentik" — or directly from the Authentik dashboard (silent SSO).
- **Prerequisite:** membership in the Authentik group `forgejo-users`
- **Emergency fallback:** `admin-local` user (documented in the runbook, not here)

## Files in this folder
- `docker-compose.yml` — stack definition
- `.env.example` — template, **never commit a real `.env`**
- `backup.sh` — nightly backup script (pg_dump + tar)
- `Agent.md` — AI briefing for sessions on this stack
- `README.md` — this file

## Live vs. repo
This repo is the **versioned source of truth** for the stack. If you change something on ai-apps (e.g. adjust `docker-compose.yml`), **please also check it in here**. Otherwise the repo drifts away from the server and loses its value.

## OpenProject
- M7.1 — Forgejo deployment with Authentik SSO (done)
- M7.2 — repo initialized (this commit)
29
stacks/forgejo/backup.sh
Executable file
@@ -0,0 +1,29 @@
#!/bin/bash
# Forgejo Backup — nightly cron
# Runs pg_dump + tar of /data, retention 14 days.
# Schedule: 0 3 * * * /opt/ai-apps/forgejo/backup.sh >> /opt/ai-apps/forgejo/backup.log 2>&1

set -euo pipefail

TS=$(date +%Y%m%d-%H%M%S)
DEST=/opt/backups/forgejo
mkdir -p "$DEST"

echo "[$(date -Iseconds)] backup start"

# Postgres dump
docker exec forgejo-db pg_dump -U forgejo forgejo | gzip > "$DEST/forgejo-db-$TS.sql.gz"
echo "  db dump: $(ls -lh "$DEST/forgejo-db-$TS.sql.gz" | awk '{print $5}')"

# Data volume tar
docker run --rm \
  -v forgejo_forgejo-data:/data:ro \
  -v "$DEST":/backup \
  alpine \
  tar -czf "/backup/forgejo-data-$TS.tar.gz" -C /data .
echo "  data tar: $(ls -lh "$DEST/forgejo-data-$TS.tar.gz" | awk '{print $5}')"

# Retention: 14 days
find "$DEST" -type f -mtime +14 -delete

echo "[$(date -Iseconds)] backup complete"
103
stacks/forgejo/docker-compose.yml
Normal file
@@ -0,0 +1,103 @@
# Forgejo Stack — self-hosted Git hosting with Authentik SSO
# Part of M7.1 (Operations & Documentation Foundation)
# Network: traefik-public (public via Traefik) + forgejo-internal (service ↔ DB)

services:
  forgejo:
    image: codeberg.org/forgejo/forgejo:10
    container_name: forgejo
    restart: unless-stopped
    env_file: .env
    environment:
      USER_UID: "1000"
      USER_GID: "1000"
      FORGEJO__database__DB_TYPE: postgres
      FORGEJO__database__HOST: forgejo-db:5432
      FORGEJO__database__NAME: forgejo
      FORGEJO__database__USER: forgejo
      FORGEJO__database__PASSWD: ${DB_PASSWORD}
      FORGEJO__server__DOMAIN: code.sdda.eu
      FORGEJO__server__ROOT_URL: https://code.sdda.eu/
      FORGEJO__server__SSH_DOMAIN: code.sdda.eu
      FORGEJO__server__SSH_PORT: "222"
      FORGEJO__server__START_SSH_SERVER: "true"
      FORGEJO__server__SSH_LISTEN_PORT: "2222"
      FORGEJO__server__HTTP_PORT: "3000"
      FORGEJO__server__LFS_START_SERVER: "true"
      FORGEJO__security__INSTALL_LOCK: "true"
      FORGEJO__security__SECRET_KEY: ${FORGEJO_SECRET_KEY}
      FORGEJO__security__INTERNAL_TOKEN: ${FORGEJO_INTERNAL_TOKEN}
      FORGEJO__service__DISABLE_REGISTRATION: "true"
      FORGEJO__service__ALLOW_ONLY_EXTERNAL_REGISTRATION: "true"
      FORGEJO__service__SHOW_REGISTRATION_BUTTON: "false"
      FORGEJO__service__ENABLE_NOTIFY_MAIL: "true"
      FORGEJO__service__DEFAULT_KEEP_EMAIL_PRIVATE: "true"
      FORGEJO__openid__ENABLE_OPENID_SIGNIN: "false"
      FORGEJO__openid__ENABLE_OPENID_SIGNUP: "false"
      FORGEJO__oauth2_client__ENABLE_AUTO_REGISTRATION: "true"
      FORGEJO__oauth2_client__USERNAME: email
      FORGEJO__oauth2_client__UPDATE_AVATAR: "true"
      FORGEJO__oauth2_client__ACCOUNT_LINKING: "auto"
      FORGEJO__mailer__ENABLED: "true"
      FORGEJO__mailer__PROTOCOL: smtp
      FORGEJO__mailer__SMTP_ADDR: 10.0.0.2
      FORGEJO__mailer__SMTP_PORT: "587"
      FORGEJO__mailer__FROM: "Forgejo <forgejo@sdda.eu>"
      FORGEJO__mailer__USER: ${SMTP_USER}
      FORGEJO__mailer__PASSWD: ${SMTP_PASSWORD}
      FORGEJO__log__LEVEL: Info
    volumes:
      - forgejo-data:/data
      - /etc/timezone:/etc/timezone:ro
      - /etc/localtime:/etc/localtime:ro
    networks:
      - traefik-public
      - forgejo-internal
    ports:
      - "222:2222"
    depends_on:
      forgejo-db:
        condition: service_healthy
    healthcheck:
      test: ["CMD", "wget", "--spider", "-q", "http://127.0.0.1:3000/api/healthz"]
      interval: 30s
      timeout: 5s
      retries: 5
      start_period: 30s
    labels:
      - "traefik.enable=true"
      - "traefik.docker.network=traefik-public"
      - "traefik.http.services.forgejo.loadbalancer.server.port=3000"
      - "traefik.http.routers.forgejo.rule=Host(`code.sdda.eu`)"
      - "traefik.http.routers.forgejo.entrypoints=websecure"
      - "traefik.http.routers.forgejo.tls=true"
      - "traefik.http.routers.forgejo.tls.certresolver=letsencrypt"
      - "traefik.http.routers.forgejo.service=forgejo"

  forgejo-db:
    image: postgres:16-alpine
    container_name: forgejo-db
    restart: unless-stopped
    environment:
      POSTGRES_DB: forgejo
      POSTGRES_USER: forgejo
      POSTGRES_PASSWORD: ${DB_PASSWORD}
    volumes:
      - forgejo-db-data:/var/lib/postgresql/data
    networks:
      - forgejo-internal
    healthcheck:
      test: ["CMD-SHELL", "pg_isready -U forgejo -d forgejo"]
      interval: 10s
      timeout: 5s
      retries: 5

volumes:
  forgejo-data:
  forgejo-db-data:

networks:
  traefik-public:
    external: true
  forgejo-internal:
    driver: bridge