🏠 DaviesTechLabs Homelab Architecture
Production-grade AI/ML platform running on bare-metal Kubernetes
📖 Quick Navigation
| Document | Purpose |
|---|---|
| AGENT-ONBOARDING.md | Start here if you're an AI agent |
| ARCHITECTURE.md | High-level system overview |
| TECH-STACK.md | Complete technology stack |
| DOMAIN-MODEL.md | Core entities and bounded contexts |
| GLOSSARY.md | Terminology reference |
| decisions/ | Architecture Decision Records (ADRs) |
🎯 What This Is
Architecture documentation for the DaviesTechLabs homelab Kubernetes cluster. The cluster features:
- AI/ML Platform: KServe inference services, RAG pipelines, voice assistants
- Multi-GPU Support: AMD ROCm (RDNA3/Strix Halo), NVIDIA CUDA, Intel Arc
- GitOps: Flux CD with SOPS encryption
- Event-Driven: NATS JetStream for real-time messaging
- ML Workflows: Kubeflow Pipelines + Argo Workflows
🖥️ Cluster Overview
| Node | Role | Hardware | GPU |
|---|---|---|---|
| storm | Control Plane | Intel 13th Gen | Integrated |
| bruenor | Control Plane | Intel 13th Gen | Integrated |
| catti | Control Plane | Intel 13th Gen | Integrated |
| elminster | Worker | NVIDIA RTX 2070 | 8GB CUDA |
| khelben | Worker (vLLM) | AMD Strix Halo | 64GB Unified |
| drizzt | Worker | AMD Radeon 680M | 12GB RDNA2 |
| danilo | Worker | Intel Core Ultra 9 | Intel Arc |
🚀 Quick Start
View Current Cluster State
# Get node status
kubectl get nodes -o wide
# View AI/ML workloads
kubectl get pods -n ai-ml
# Check KServe inference services
kubectl get inferenceservices -n ai-ml
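The node listing can be narrowed to show only nodes that need attention. This is a minimal sketch, assuming the default column layout of `kubectl get nodes --no-headers` (name first, status second). `not_ready_nodes` is a hypothetical helper name and is not part of this repository.

```shell
# Hypothetical helper: print the names of nodes whose STATUS column is not exactly "Ready".
# Reads `kubectl get nodes --no-headers` output on stdin (NAME is column 1, STATUS is column 2).
# Cordoned nodes report "Ready,SchedulingDisabled" and are therefore listed as well.
not_ready_nodes() {
  awk '$2 != "Ready" { print $1 }'
}

# Usage (requires cluster access):
#   kubectl get nodes --no-headers | not_ready_nodes
```

Reading from stdin keeps the filter separate from cluster access, so it can be tried against saved output before it is pointed at the live cluster.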
Key Endpoints
| Service | URL | Purpose |
|---|---|---|
| Kubeflow | kubeflow.lab.daviestechlabs.io | ML Pipeline UI |
| Companions | companions-chat.lab.daviestechlabs.io | AI Chat Interface |
| Voice | voice.lab.daviestechlabs.io | Voice Assistant |
| Gitea | git.daviestechlabs.io | Self-hosted Git |
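A quick reachability check for the endpoints listed above could look like the following. It is a sketch under assumptions: the table gives hostnames only, so HTTPS and the root path `/` are assumed, and `endpoint_status` and `check_endpoints` are hypothetical helper names.

```shell
# Hypothetical helper: print the HTTP status code returned by one endpoint.
# Assumes HTTPS and the root path; the endpoint table lists hostnames only.
endpoint_status() {
  # -s: silent, -o /dev/null: discard the body, -w: print only the status code,
  # --max-time 10: give up after ten seconds
  curl -s -o /dev/null -w '%{http_code}' --max-time 10 "https://$1/"
}

# Hypothetical helper: print "<host> <status>" for each hostname passed as an argument.
check_endpoints() {
  for host in "$@"; do
    printf '%s %s\n' "$host" "$(endpoint_status "$host")"
  done
}

# Usage (requires network access to the lab):
#   check_endpoints kubeflow.lab.daviestechlabs.io companions-chat.lab.daviestechlabs.io \
#     voice.lab.daviestechlabs.io git.daviestechlabs.io
```

A redirect or an authentication challenge (for example a 302 or 401) still shows that the service is answering, so any status code other than `000` can be read as "reachable".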
📂 Repository Structure
homelab-design/
├── README.md # This file
├── AGENT-ONBOARDING.md # AI agent quick-start
├── ARCHITECTURE.md # High-level system overview
├── CONTEXT-DIAGRAM.mmd # C4 Level 1 (Mermaid)
├── CONTAINER-DIAGRAM.mmd # C4 Level 2
├── TECH-STACK.md # Complete tech stack
├── DOMAIN-MODEL.md # Core entities
├── CODING-CONVENTIONS.md # Patterns & practices
├── GLOSSARY.md # Terminology
├── decisions/ # ADRs
│ ├── 0000-template.md
│ ├── 0001-record-architecture-decisions.md
│ ├── 0002-use-talos-linux.md
│ └── ...
├── specs/ # Feature specifications
└── diagrams/ # Additional diagrams
🔗 Related Repositories
| Repository | Purpose |
|---|---|
| homelab-k8s2 | Kubernetes manifests, Flux GitOps |
| llm-workflows | NATS handlers, Argo/KFP workflows |
| companions-frontend | Go web server, HTMX frontend |
📝 Contributing
- For architecture changes, create an ADR in decisions/
- Update relevant documentation
- Submit a PR with context
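Creating a new ADR from the template can be sketched as follows. It assumes the `NNNN-title.md` naming visible in the repository tree (`0000-template.md`, `0001-...`, `0002-...`) and that new records take the next free number. `next_adr_number` and `new_adr` are hypothetical helpers, not scripts shipped in this repository.

```shell
# Hypothetical helper: print the next free four-digit ADR number in a directory.
# Assumes files are named NNNN-title.md, as in the decisions/ directory.
next_adr_number() {
  dir=${1:-decisions}
  # Extract the numeric prefixes and keep the highest one
  last=$(ls "$dir" | sed -n 's/^\([0-9][0-9][0-9][0-9]\)-.*\.md$/\1/p' | sort | tail -n 1)
  # Strip leading zeros so the shell does not parse the number as octal
  last=$(printf '%s' "${last:-0000}" | sed 's/^0*//')
  printf '%04d\n' $(( ${last:-0} + 1 ))
}

# Hypothetical helper: copy the template to a new numbered ADR file and print its path.
# $1 is the title slug, $2 the ADR directory (defaults to decisions).
new_adr() {
  dir=${2:-decisions}
  target="$dir/$(next_adr_number "$dir")-$1.md"
  cp "$dir/0000-template.md" "$target" && printf '%s\n' "$target"
}

# Usage, run from the repository root (the slug is a placeholder):
#   new_adr example-decision
```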
Last updated: 2026-02-01