Commit Graph

28 Commits

Author SHA1 Message Date
Gitea Actions
fbd5e0bb70 docs: auto-update ADR index in README [skip ci] 2026-02-09 23:14:26 +00:00
8e3e2043c3 docs: add ADR-0038/0039 and replace llm-workflows references with decomposed repos
All checks were successful
Update README with ADR Index / update-readme (push) Successful in 6s
- ADR-0038: Infrastructure metrics collection (smartctl, SNMP, blackbox, unpoller)
- ADR-0039: Alerting and notification pipeline (Alertmanager → ntfy → Discord)
- Replace llm-workflows GitHub links with Gitea daviestechlabs org repos
- Update AGENT-ONBOARDING.md: remove llm-workflows from file tree, add missing repos
- Update ADR-0006: fix multi-repo reference
- Update ADR-0009: fix broken llm-workflows link
- Update ADR-0024: mark ray-serve repo as created, update historical context
- Update README: fix ADR-0016 status, add 0038/0039 to table, update badges
2026-02-09 18:12:37 -05:00
Gitea Actions
6682fe271f docs: auto-update ADR index in README [skip ci] 2026-02-09 22:44:42 +00:00
29dc6ba326 docs(adr): mark ADR-0016 as accepted (Option A implemented)
All checks were successful
Update README with ADR Index / update-readme (push) Successful in 6s
2026-02-09 17:44:34 -05:00
Gitea Actions
2c4e7520c6 docs: auto-update ADR index in README [skip ci] 2026-02-09 22:25:52 +00:00
3a46a98be3 docs: add ADR index workflow, standardize all ADR formats
All checks were successful
Update README with ADR Index / update-readme (push) Successful in 6s
- Add Gitea Action to auto-update README badges and ADR table on push
- Standardize 8 ADRs from heading-style to inline metadata format
- Add shields.io badges for ADR counts (total/accepted/proposed)
- Replace static directory listing with linked ADR table in README
- Accept ADR-0030 (MFA/YubiKey strategy)
2026-02-09 17:25:27 -05:00
dd277f6459 tuning up runner improvements. 2026-02-06 07:53:31 -05:00
80fb911e22 updating to match everything in my homelab. 2026-02-05 16:13:53 -05:00
f8787379c5 vaultwarden yubikey updates. 2026-02-04 15:15:14 -05:00
3580ee1223 docs: add ADRs 0029-0030 for Authentik registration and MFA
- 0029: User registration workflow with approval and group-based access
- 0030: MFA and Yubikey strategy for Authentik and Vaultwarden
2026-02-04 09:11:29 -05:00
b43c80153c docs: add ADRs 0025-0028 for infrastructure patterns
- 0025: Observability stack (Prometheus + ClickStack + OTEL)
- 0026: Tiered storage strategy (Longhorn + NFS)
- 0027: Database strategy (CloudNativePG for PostgreSQL)
- 0028: Authentik SSO strategy (OIDC/SAML identity provider)
2026-02-04 08:55:15 -05:00
a128c265e4 docs: Add ADRs for secrets management and security policy
- 0017: Secrets Management Strategy (SOPS + Vault + External Secrets)
- 0018: Security Policy Enforcement (Gatekeeper + Trivy)
2026-02-04 08:45:47 -05:00
8f4df84657 chore: Consolidate ADRs into decisions/ directory
- Added ADR-0016: Affine email verification strategy
- Moved ADRs 0019-0024 from docs/adr/ to decisions/
- Renamed to consistent format (removed ADR- prefix)
2026-02-04 08:28:12 -05:00
85b1a9019b feat: adr-0024 2026-02-03 06:46:50 -05:00
5e9f589311 docs(adr): update ADR-0022 to use Go with hot reload
- Switch from Python to Go for smaller images (~10MB vs ~150MB)
- Add fsnotify for hot reload of secrets without pod restart
- Update status from Proposed to Accepted
- Add Prometheus metrics endpoint
- Update resource limits (32Mi vs 128Mi)
- Mark repository creation as complete
2026-02-02 17:49:34 -05:00
add1b5b71e docs(adr): ADR-0023 Valkey for ML inference caching
Document decision to consolidate mlcache into single valkey instance.
Includes future guidance for vLLM KV-cache offloading integration.
2026-02-02 17:30:08 -05:00
e85deaa642 docs(adr): finalize ADR-0021 and add ADR-0022
ADR-0021 (Accepted):
- ntfy as central notification hub
- Alertmanager integration for critical/warning alerts
- Service readiness notifications via Flux notification-controller
- Standardized topic naming

ADR-0022 (Proposed):
- ntfy-discord-bridge Python service design
- SSE subscription with reconnection logic
- Message transformation to Discord embeds
- Priority/tag to color/emoji mapping
- Kubernetes deployment with ExternalSecret for webhook
2026-02-02 11:58:52 -05:00
7b77d6c29f docs(adr): ADR-0021 notification architecture
- Define ntfy as central notification hub
- Standardize topic naming (gitea-ci, alertmanager-alerts, etc.)
- Document Alertmanager integration
- Design ntfy-to-Discord bridge architecture (future work)
2026-02-02 11:54:08 -05:00
09cc796417 docs: ADR-0020 internal registry URLs for CI/CD
Document decision to use registry.lab.daviestechlabs.io for
artifact uploads to bypass Cloudflare 100MB limit.
2026-02-02 11:19:16 -05:00
37b18dad08 accept: ADR-0019 handler deployment strategy 2026-02-02 09:14:33 -05:00
d790aa31eb docs: add ruff linting/formatting conventions for Python repos 2026-02-02 08:45:33 -05:00
a2a426eb3b docs: add ADR-0015 for CI notifications and semantic versioning 2026-02-02 08:00:18 -05:00
730ab32b5d docs: add ADR-0014 for Docker build best practices
Documents standardized Docker patterns:
- BuildKit syntax 1.7 with cache mounts
- uv for Python package installation (10-100x faster)
- OCI Image Spec labels
- HEALTHCHECK directives
- Non-root execution
- Version pinning with ranges

Complements ADR-0012 (uv) and ADR-0013 (CI/CD)
2026-02-02 07:26:43 -05:00
5ba8138719 docs: add ADR-0013 for Gitea Actions CI/CD 2026-02-02 07:15:48 -05:00
598875c5a9 docs: add ADR-0011 (KubeRay), ADR-0012 (uv), update architecture docs 2026-02-02 07:10:47 -05:00
b6f7605fab docs: Update for decomposed repo structure
- AGENT-ONBOARDING: New repo map with daviestechlabs Gitea repos
- TECH-STACK: Reference handler-base instead of llm-workflows
- CODING-CONVENTIONS: Update project structure for new repos
- ADR 0006: Update GitRepository examples for Gitea repos

llm-workflows has been split into:
- handler-base, chat-handler, voice-assistant
- kuberay-images, argo, kubeflow, mlflow, gradio-ui
2026-02-02 05:58:35 -05:00
832cda34bd feat: add comprehensive architecture documentation
- Add AGENT-ONBOARDING.md for AI agents
- Add ARCHITECTURE.md with full system overview
- Add TECH-STACK.md with complete technology inventory
- Add DOMAIN-MODEL.md with entities and bounded contexts
- Add CODING-CONVENTIONS.md with patterns and practices
- Add GLOSSARY.md with terminology reference
- Add C4 diagrams (Context and Container levels)
- Add 10 ADRs documenting key decisions:
  - Talos Linux, NATS, MessagePack, Multi-GPU strategy
  - GitOps with Flux, KServe, Milvus, Dual workflow engines
  - Envoy Gateway
- Add specs directory with JetStream configuration
- Add diagrams for GPU allocation and data flows

Based on analysis of homelab-k8s2 and llm-workflows repositories
and kubectl cluster-info dump data.
2026-02-01 14:30:05 -05:00
4d4f6f464c Initial commit 2026-02-01 19:16:18 +00:00