feat: Add GPU-specific Ray worker images with CI/CD
Some checks failed
Build and Push Images / build-nvidia (push) Failing after 1s
Build and Push Images / build-rdna2 (push) Failing after 1s
Build and Push Images / build-strixhalo (push) Failing after 1s
Build and Push Images / build-intel (push) Failing after 1s

- Add Dockerfiles for nvidia, rdna2, strixhalo, and intel GPU targets
- Add ray-serve modules (embeddings, whisper, tts, llm, reranker)
- Add Gitea Actions workflow for automated builds
- Add Makefile for local development
- Update README with comprehensive documentation
This commit is contained in:
2026-02-01 15:04:31 -05:00
parent e68d5c1f0e
commit a16ffff73f
16 changed files with 1311 additions and 2 deletions

View File

@@ -0,0 +1,24 @@
# Ray Serve dependencies
ray[serve]==2.53.0
# LLM inference
vllm
# Embeddings and reranking
sentence-transformers
# Speech-to-text
faster-whisper
# Text-to-speech
TTS
# HTTP client
httpx
# Numerical computing
numpy
scipy
# Optional: Intel GPU support (for danilo node)
# intel-extension-for-pytorch