Files
ray-serve/ray_serve/serve_embeddings.py
Billy D. 7ec2107e0c
All checks were successful
Build and Publish ray-serve-apps / build-and-publish (push) Successful in 16s
feat: add MLflow inference logging to all Ray Serve apps
- Add mlflow_logger.py: lightweight REST-based MLflow logger (no mlflow dep)
- Instrument serve_llm.py with latency, token counts, tokens/sec metrics
- Instrument serve_embeddings.py with latency, batch_size, total_tokens
- Instrument serve_whisper.py with latency, audio_duration, realtime_factor
- Instrument serve_tts.py with latency, audio_duration, text_chars
- Instrument serve_reranker.py with latency, num_pairs, top_k
2026-02-12 06:14:30 -05:00

3.2 KiB