🏠 DaviesTechLabs Homelab Architecture
Production-grade AI/ML platform running on bare-metal Kubernetes
📖 Quick Navigation
| Document | Purpose |
|---|---|
| AGENT-ONBOARDING.md | Start here if you're an AI agent |
| ARCHITECTURE.md | High-level system overview |
| TECH-STACK.md | Complete technology stack |
| DOMAIN-MODEL.md | Core entities and bounded contexts |
| GLOSSARY.md | Terminology reference |
| decisions/ | Architecture Decision Records (ADRs) |
🎯 What This Is
A comprehensive architecture documentation repository for the DaviesTechLabs homelab Kubernetes cluster, featuring:
- AI/ML Platform: KServe inference services, RAG pipelines, voice assistants
- Multi-GPU Support: AMD ROCm (RDNA3/Strix Halo), NVIDIA CUDA, Intel Arc
- GitOps: Flux CD with SOPS encryption
- Event-Driven: NATS JetStream for real-time messaging
- ML Workflows: Kubeflow Pipelines + Argo Workflows
🖥️ Cluster Overview
| Node | Role | Hardware | GPU |
|---|---|---|---|
| storm | Control Plane | Intel 13th Gen | Integrated |
| bruenor | Control Plane | Intel 13th Gen | Integrated |
| catti | Control Plane | Intel 13th Gen | Integrated |
| elminster | Worker | NVIDIA RTX 2070 | 8GB CUDA |
| khelben | Worker (vLLM) | AMD Strix Halo | 64GB Unified |
| drizzt | Worker | AMD Radeon 680M | 12GB RDNA2 |
| danilo | Worker | Intel Core Ultra 9 | Intel Arc |
🚀 Quick Start
View Current Cluster State
# Get node status
kubectl get nodes -o wide
# View AI/ML workloads
kubectl get pods -n ai-ml
# Check KServe inference services
kubectl get inferenceservices -n ai-ml
Key Endpoints
| Service | URL | Purpose |
|---|---|---|
| Kubeflow | kubeflow.lab.daviestechlabs.io | ML Pipeline UI |
| Companions | companions-chat.lab.daviestechlabs.io | AI Chat Interface |
| Voice | voice.lab.daviestechlabs.io | Voice Assistant |
| Gitea | git.daviestechlabs.io | Self-hosted Git |
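The endpoints above can be probed from a machine that can resolve the lab hostnames. The following is a minimal sketch, not part of this repository: it assumes each service is served over HTTPS at the root path (the table lists hostnames only), and the function names `endpoint_url`, `check_endpoint`, and `check_all` are hypothetical helpers introduced for illustration.

```shell
# Sketch: report the HTTP status code of each endpoint listed in the table.
# Assumption: services answer over HTTPS at "/"; adjust if your setup differs.
ENDPOINTS="kubeflow.lab.daviestechlabs.io companions-chat.lab.daviestechlabs.io voice.lab.daviestechlabs.io git.daviestechlabs.io"

# Build the URL to probe for a given hostname (hypothetical helper).
endpoint_url() {
  printf 'https://%s/\n' "$1"
}

# Print "<host> <http_code>"; curl reports 000 when it cannot connect.
check_endpoint() {
  local host="$1"
  local code
  code=$(curl -s -o /dev/null -w '%{http_code}' --max-time 5 "$(endpoint_url "$host")") || true
  printf '%s %s\n' "$host" "${code:-000}"
}

# Probe every endpoint in the list, one line of output per host.
check_all() {
  local host
  for host in $ENDPOINTS; do
    check_endpoint "$host"
  done
}
```

After sourcing the snippet, run `check_all` to print one line per host. A `000` code usually means the hostname did not resolve or the connection timed out, which is expected outside the lab network.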
📂 Repository Structure
homelab-design/
├── README.md # This file
├── AGENT-ONBOARDING.md # AI agent quick-start
├── ARCHITECTURE.md # High-level system overview
├── CONTEXT-DIAGRAM.mmd # C4 Level 1 (Mermaid)
├── CONTAINER-DIAGRAM.mmd # C4 Level 2
├── TECH-STACK.md # Complete tech stack
├── DOMAIN-MODEL.md # Core entities
├── CODING-CONVENTIONS.md # Patterns & practices
├── GLOSSARY.md # Terminology
├── decisions/ # Architecture Decision Records
├── specs/ # Feature specifications
└── diagrams/ # Additional diagrams
Architecture Decision Records
🔗 Related Repositories
| Repository | Purpose |
|---|---|
| homelab-k8s2 | Kubernetes manifests, Flux GitOps |
| companions-frontend | Go web server, HTMX frontend |
AI/ML Repos (git.daviestechlabs.io/daviestechlabs)
The former monolithic llm-workflows repo has been archived and decomposed into:
| Repository | Purpose |
|---|---|
| handler-base | Shared Python library for NATS handlers |
| chat-handler | Text chat with RAG pipeline |
| voice-assistant | Voice pipeline (STT → RAG → LLM → TTS) |
| pipeline-bridge | Bridge between pipelines and services |
| stt-module | Speech-to-text service |
| tts-module | Text-to-speech service |
| ray-serve | Ray Serve inference services |
| kuberay-images | GPU-specific Ray worker Docker images |
| argo | Argo Workflows (training, batch inference) |
| kubeflow | Kubeflow Pipeline definitions |
| mlflow | MLflow integration utilities |
| gradio-ui | Gradio demo apps (embeddings, STT, TTS) |
| ntfy-discord | ntfy → Discord notification bridge |
📝 Contributing
- For architecture changes, create an ADR in decisions/
- Update relevant documentation
- Submit a PR with context
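The first step above can be scripted. The following is a minimal sketch under stated assumptions, not this repository's actual tooling: it assumes ADR files are named `NNNN-slug.md` and use Status / Context / Decision / Consequences headings, neither of which is confirmed here. The `new_adr` function is a hypothetical helper; check the existing files in decisions/ and adjust the naming and template to match.

```shell
# Sketch: scaffold a new ADR file and print its path.
# Assumptions (not confirmed by this repo): NNNN-slug.md file names and a
# Status / Context / Decision / Consequences template.
new_adr() {
  local title="$1"
  local dir="${2:-decisions}"
  mkdir -p "$dir"

  # Next number = count of existing numbered ADRs + 1.
  # Note: this can collide if numbers were skipped or files were deleted.
  local count=0
  local f
  for f in "$dir"/[0-9][0-9][0-9][0-9]-*.md; do
    if [ -e "$f" ]; then
      count=$((count + 1))
    fi
  done
  local num
  num=$(printf '%04d' $((count + 1)))

  # Slug: lowercase the title and collapse non-alphanumerics into hyphens.
  local slug
  slug=$(printf '%s' "$title" | tr '[:upper:]' '[:lower:]' \
    | sed -E 's/[^a-z0-9]+/-/g; s/^-+//; s/-+$//')

  local path="$dir/$num-$slug.md"
  cat > "$path" <<EOF
# $num. $title

## Status

Proposed

## Context

## Decision

## Consequences
EOF
  printf '%s\n' "$path"
}
```

For example, running `new_adr "Use NATS JetStream"` in an empty decisions/ directory would create `decisions/0001-use-nats-jetstream.md` under the naming assumed above. Fill in the sections, then submit the PR.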
Last updated: 2026-02-21