https://github.com/vllm-project/vllm/issues/5449 choose `Runpod pytorch 2.8` <img width="721" alt="Image" src="https://github.com/user-attachments/assets/42434606-4d30-44e9-b3ab-ef273e8a389a" /> export 3 more ports for the model servers we're about to launch <img width="841" alt="Image" src="https://github.com/user-attachments/assets/a8c37c64-a35c-4648-a2c9-55f494cba308" /> reference: https://medium.com/@kimdoil1211/scalable-multi-model-llm-serving-with-vllm-and-nginx-f586912e17da