updates to adrs and fixing to reflect go refactor.

2026-02-23 06:14:23 -05:00
parent f19fa3e969
commit 100ba21eba
7 changed files with 181 additions and 129 deletions
--- a/TECH-STACK.md
+++ b/TECH-STACK.md
@@ -117,9 +117,14 @@ All AI inference runs on a unified Ray Serve endpoint with fractional GPU alloca

 | Application | Language | Framework | Purpose |
 |-------------|----------|-----------|---------|
-| Companions | Go | net/http + HTMX | AI chat interface |
-| Voice WebApp | Python | Gradio | Voice assistant UI |
-| Various handlers | Python | asyncio + nats.py | NATS event handlers |
+| Companions | Go | net/http + HTMX | AI chat interface (SSR) |
+| Chat Handler | Go | handler-base | RAG + LLM text pipeline |
+| Voice Assistant | Go | handler-base | STT → RAG → LLM → TTS pipeline |
+| Pipeline Bridge | Go | handler-base | Kubeflow/Argo workflow triggers |
+| STT Module | Go | handler-base | Speech-to-text bridge |
+| TTS Module | Go | handler-base | Text-to-speech bridge |
+| Voice WebApp | Python | Gradio | Voice assistant UI (dev/testing) |
+| Ray Serve | Python | Ray Serve | GPU inference endpoints |

 ### Frontend

@@ -242,27 +247,41 @@ All AI inference runs on a unified Ray Serve endpoint with fractional GPU alloca

 ---

-## Python Dependencies (handler-base)
+## Go Dependencies (handler-base)

-Core library for all NATS handlers: [handler-base](https://git.daviestechlabs.io/daviestechlabs/handler-base)
+Shared Go module for all NATS handler services: [handler-base](https://git.daviestechlabs.io/daviestechlabs/handler-base)
+
+```go
+// go.mod (handler-base v1.0.0)
+require (
+    github.com/nats-io/nats.go          // NATS client
+    google.golang.org/protobuf           // Protocol Buffers encoding
+    github.com/zitadel/oidc/v3           // OIDC client
+    go.opentelemetry.io/otel             // OpenTelemetry traces + metrics
+    github.com/milvus-io/milvus-sdk-go   // Milvus vector search
+)
+```
+
+See [ADR-0061](decisions/0061-go-handler-refactor.md) for the full refactoring rationale.
+
+## Python Dependencies (ML/AI only)
+
+Python is retained for ML inference, pipeline orchestration, and dev tools:

 ```toml
-# Core
-nats-py>=2.7.0          # NATS client
-msgpack>=1.0.0          # Binary serialization
-httpx>=0.27.0           # HTTP client
+# ray-serve (GPU inference)
+ray[serve]>=2.53.0
+vllm>=0.8.0
+faster-whisper>=1.0.0
+TTS>=0.22.0
+sentence-transformers>=3.0.0

-# ML/AI
-pymilvus>=2.4.0         # Milvus client
-openai>=1.0.0           # vLLM OpenAI API
+# kubeflow (pipeline definitions)
+kfp>=2.12.1

-# Observability
-opentelemetry-api>=1.20.0
-opentelemetry-sdk>=1.20.0
-mlflow>=2.10.0          # Experiment tracking
-
-# Kubeflow (kubeflow repo)
-kfp>=2.12.1             # Pipeline SDK
+# mlflow (experiment tracking)
+mlflow>=3.7.0
+pymilvus>=2.4.0
 ```

 ---