kuberay-images/dockerfiles/Dockerfile.ray-worker-strixhalo at 2e3fbb8c60da855fe1ff8a98a265beb9d40ae699

daviestechlabs/kuberay-images

Fork 0

Files

Billy D. 2e3fbb8c60

Build and Push Images / determine-version (push) Successful in 7s

Details

Build and Push Images / build (Dockerfile.ray-worker-intel, intel) (push) Has been cancelled

Details

Build and Push Images / build (Dockerfile.ray-worker-nvidia, nvidia) (push) Has been cancelled

Details

Build and Push Images / build (Dockerfile.ray-worker-rdna2, rdna2) (push) Has been cancelled

Details

Build and Push Images / build (Dockerfile.ray-worker-strixhalo, strixhalo) (push) Has been cancelled

Details

Build and Push Images / Release (push) Has been cancelled

Details

Build and Push Images / Notify (push) Has been cancelled

Details

feat(strixhalo): full source build of vLLM for gfx1151 (v1.0.20)

- Build vLLM v0.15.1 from source against vendor torch 2.9.1
- Preserve AMD's vendor PyTorch from rocm/pytorch:rocm7.0.2 base
- use_existing_torch.py --prefix to strip torch from build-requires
- Compile C++/HIP extensions for gfx1100 (mapped from gfx1151)
- Install triton/flash-attn from wheels.vllm.ai/rocm with --no-deps
- Add torch vendor verification step to catch accidental overwrites
- Fix GPU_RESOURCE default to match cluster (gpu_strixhalo)
- Remove unsupported expandable_segments from PYTORCH_ALLOC_CONF
- AITER is gfx9-only; gfx11 uses TRITON_ATTN backend by default

2026-02-09 15:46:25 -05:00

9.7 KiB

Raw Blame History

View Raw

9.7 KiB Raw Blame History

9.7 KiB

Raw Blame History