Pipeline	Description	Output
`voice_pipeline.py`	STT → RAG → LLM → TTS	voice_pipeline.yaml
`document_ingestion_pipeline.py`	Ingest docs → chunk → embed → Milvus	document_ingestion.yaml
`document_ingestion_mlflow_pipeline.py`	Same with MLflow tracking	document_ingestion_mlflow.yaml
`evaluation_pipeline.py`	Evaluate models against benchmarks	evaluation.yaml

Usage

Compile Pipelines

pip install kfp==2.12.1

# Compile all
python voice_pipeline.py
python document_ingestion_pipeline.py
python evaluation_pipeline.py

# Or compile individually
python -c "from kfp import compiler; from voice_pipeline import voice_assistant_pipeline; compiler.Compiler().compile(voice_assistant_pipeline, 'voice.yaml')"

Upload to Kubeflow

Upload the generated .yaml files to Kubeflow Pipelines UI or use the SDK:

import kfp
client = kfp.Client(host='http://kubeflow.ai-ml.svc.cluster.local/pipeline')
client.upload_pipeline('voice_pipeline.yaml', pipeline_name='Voice Assistant')

Pipeline Details

voice_pipeline

Full voice assistant with RAG:

Transcribe audio (Whisper)
Generate embeddings (BGE)
Search Milvus
Rerank documents (BGE Reranker)
Generate response (vLLM)
Synthesize speech (XTTS)

document_ingestion_pipeline

Ingest documents into vector DB:

Extract text (PDF, DOCX, HTML, TXT)
Chunk with overlap (tiktoken)
Generate embeddings
Store in Milvus collection

evaluation_pipeline

Benchmark model quality:

Load eval dataset (MMLU, etc.)
Run inference
Calculate metrics (accuracy, F1)
Log to MLflow

Integration

kfp-sync-job.yaml

Kubernetes Job to sync compiled pipelines to Kubeflow:

kubectl apply -f kfp-sync-job.yaml

From Argo Workflows

Pipelines can be triggered from Argo via the kfp-integration workflow in the argo repo.

Service Endpoints

Service	Endpoint
Whisper STT	`http://whisper-predictor.ai-ml.svc.cluster.local`
Embeddings	`http://embeddings-predictor.ai-ml.svc.cluster.local`
Reranker	`http://reranker-predictor.ai-ml.svc.cluster.local`
vLLM	`http://llm-draft.ai-ml.svc.cluster.local:8000`
TTS	`http://tts-predictor.ai-ml.svc.cluster.local`
Milvus	`milvus.ai-ml.svc.cluster.local:19530`

argo - Argo Workflows for training
voice-assistant - Real-time voice handler
homelab-design - Architecture docs

README.md

Kubeflow Pipelines

Pipelines