Commit Graph

12 Commits

Author SHA1 Message Date
4069647495 fixing llm readiness check. 2026-02-18 07:31:23 -05:00
53afea9352 chore: add Renovate config for automated dependency updates
Ref: ADR-0057
2026-02-13 15:34:26 -05:00
58319b66ee chore: bump image to v2-202602130804 (MLflow tracking) 2026-02-13 08:05:27 -05:00
1c5dc7f751 feat: add MLflow experiment tracking to all 4 Gradio UIs
Each UI now logs per-request metrics to MLflow:
- llm.py: latency, tokens/sec, prompt/completion tokens (gradio-llm-tuning)
- embeddings.py: latency, text length, batch size (gradio-embeddings-tuning)
- stt.py: latency, audio duration, real-time factor (gradio-stt-tuning)
- tts.py: latency, text length, audio duration (gradio-tts-tuning)

Uses try/except guarded imports so UIs still work if MLflow is
unreachable. Persistent run per Gradio instance, batched metric logging
via MlflowClient.log_batch().
2026-02-13 07:54:06 -05:00
b2d2252342 more bug fixes. 2026-02-12 05:36:15 -05:00
72681217ef updating images. 2026-02-12 05:30:15 -05:00
af67984737 fixing up themes, adding in an endpoint for llm. 2026-02-12 05:27:13 -05:00
861f11e22b fixing up internal envoy gatewya route. 2026-02-12 05:16:48 -05:00
ea02905b6c fix: correct embeddings.py filename in deployment command 2026-02-02 09:14:14 -05:00
ad31e878cb updates. 2026-02-02 07:11:47 -05:00
1f833e0124 feat: Add Gradio UI apps for AI services
- embeddings.py: BGE embeddings demo with similarity
- stt.py: Whisper speech-to-text demo
- tts.py: XTTS text-to-speech demo
- theme.py: Shared DaviesTechLabs Gradio theme
- K8s deployments for each app
2026-02-01 20:45:10 -05:00
8f5de96130 Initial commit 2026-02-02 01:43:56 +00:00