Commit Graph

3 Commits

Author SHA1 Message Date
bc4b230dd9 feat: add vLLM tuning pipeline + recompile voice pipelines with MLflow
New:
- vllm_tuning_pipeline.py: A/B benchmark different vLLM configs,
  logs latency/TPS/TTFT to MLflow (vllm-tuning experiment)
- vllm_tuning_pipeline.yaml: compiled KFP YAML

Updated:
- voice_pipeline.py: per-step NamedTuple outputs with latency tracking,
  new log_pipeline_metrics MLflow component
- voice_pipeline.yaml, tts_pipeline.yaml, rag_pipeline.yaml: recompiled
2026-02-13 08:24:11 -05:00
bd8c8616d0 updates. 2026-02-02 07:12:05 -05:00
c26e4e5ef0 feat: Add Kubeflow Pipeline definitions
- voice_pipeline: STT → RAG → LLM → TTS
- document_ingestion_pipeline: Extract → Chunk → Embed → Milvus
- document_ingestion_mlflow_pipeline: With MLflow tracking
- evaluation_pipeline: Model benchmarking
- kfp-sync-job: K8s job to sync pipelines
2026-02-01 20:41:13 -05:00