updates to adrs and fixing to reflect go refactor.
All checks were successful
Update README with ADR Index / update-readme (push) Successful in 1m2s
@@ -117,9 +117,14 @@ All AI inference runs on a unified Ray Serve endpoint with fractional GPU alloca
 
 | Application | Language | Framework | Purpose |
 |-------------|----------|-----------|---------|
-| Companions | Go | net/http + HTMX | AI chat interface |
-| Voice WebApp | Python | Gradio | Voice assistant UI |
-| Various handlers | Python | asyncio + nats.py | NATS event handlers |
+| Companions | Go | net/http + HTMX | AI chat interface (SSR) |
+| Chat Handler | Go | handler-base | RAG + LLM text pipeline |
+| Voice Assistant | Go | handler-base | STT → RAG → LLM → TTS pipeline |
+| Pipeline Bridge | Go | handler-base | Kubeflow/Argo workflow triggers |
+| STT Module | Go | handler-base | Speech-to-text bridge |
+| TTS Module | Go | handler-base | Text-to-speech bridge |
+| Voice WebApp | Python | Gradio | Voice assistant UI (dev/testing) |
+| Ray Serve | Python | Ray Serve | GPU inference endpoints |
 
 ### Frontend
 
@@ -242,27 +247,41 @@ All AI inference runs on a unified Ray Serve endpoint with fractional GPU alloca
 
 ---
 
-## Python Dependencies (handler-base)
+## Go Dependencies (handler-base)
 
-Core library for all NATS handlers: [handler-base](https://git.daviestechlabs.io/daviestechlabs/handler-base)
+Shared Go module for all NATS handler services: [handler-base](https://git.daviestechlabs.io/daviestechlabs/handler-base)
 
+```go
+// go.mod (handler-base v1.0.0)
+require (
+github.com/nats-io/nats.go // NATS client
+google.golang.org/protobuf // Protocol Buffers encoding
+github.com/zitadel/oidc/v3 // OIDC client
+go.opentelemetry.io/otel // OpenTelemetry traces + metrics
+github.com/milvus-io/milvus-sdk-go // Milvus vector search
+)
+```
+
+See [ADR-0061](decisions/0061-go-handler-refactor.md) for the full refactoring rationale.
+
+## Python Dependencies (ML/AI only)
+
+Python is retained for ML inference, pipeline orchestration, and dev tools:
+
 ```toml
-# Core
-nats-py>=2.7.0 # NATS client
-msgpack>=1.0.0 # Binary serialization
-httpx>=0.27.0 # HTTP client
+# ray-serve (GPU inference)
+ray[serve]>=2.53.0
+vllm>=0.8.0
+faster-whisper>=1.0.0
+TTS>=0.22.0
+sentence-transformers>=3.0.0
 
-# ML/AI
-pymilvus>=2.4.0 # Milvus client
-openai>=1.0.0 # vLLM OpenAI API
+# kubeflow (pipeline definitions)
+kfp>=2.12.1
 
-# Observability
-opentelemetry-api>=1.20.0
-opentelemetry-sdk>=1.20.0
-mlflow>=2.10.0 # Experiment tracking
-
-# Kubeflow (kubeflow repo)
-kfp>=2.12.1 # Pipeline SDK
+# mlflow (experiment tracking)
+mlflow>=3.7.0
+pymilvus>=2.4.0
 ```
 
 ---
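
The Voice Assistant row in the application table describes an STT → RAG → LLM → TTS pipeline, and the Chat Handler row describes the RAG + LLM subset. The sketch below shows one way such a staged pipeline could be chained in Go using only the standard library. It is an illustration under stated assumptions, not the handler-base API: `Stage`, `RunPipeline`, and `VoiceStages` are hypothetical names, and each stage is a stub that tags its input instead of calling a real model or Milvus.

```go
package main

import (
	"errors"
	"fmt"
)

// Stage is a hypothetical pipeline step (not a handler-base type):
// it receives the previous step's output and returns its own.
type Stage struct {
	Name string
	Run  func(in string) (string, error)
}

// RunPipeline feeds the input through each stage in order and stops at
// the first error, wrapping it with the name of the stage that failed.
func RunPipeline(stages []Stage, in string) (string, error) {
	out := in
	for _, s := range stages {
		next, err := s.Run(out)
		if err != nil {
			return "", fmt.Errorf("stage %s: %w", s.Name, err)
		}
		out = next
	}
	return out, nil
}

// VoiceStages returns stub stages in the STT -> RAG -> LLM -> TTS order
// from the table. Real stages would call the inference endpoints; these
// only wrap the input so the ordering is visible in the result.
func VoiceStages() []Stage {
	return []Stage{
		{Name: "stt", Run: func(in string) (string, error) {
			if in == "" {
				return "", errors.New("empty audio")
			}
			return "transcript(" + in + ")", nil
		}},
		{Name: "rag", Run: func(in string) (string, error) { return "context+" + in, nil }},
		{Name: "llm", Run: func(in string) (string, error) { return "answer[" + in + "]", nil }},
		{Name: "tts", Run: func(in string) (string, error) { return "audio<" + in + ">", nil }},
	}
}

func main() {
	out, err := RunPipeline(VoiceStages(), "hello.wav")
	if err != nil {
		fmt.Println("pipeline failed:", err)
		return
	}
	fmt.Println(out)
}
```

Under the same assumptions, a text-only pipeline like the Chat Handler's could reuse `RunPipeline` with just the `rag` and `llm` stages, which is the main reason to keep stages as data instead of hard-coding the call sequence.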