P-EAGLE: Faster LLM inference with Parallel Speculative Decoding in vLLM | Amazon Web Services
EAGLE is the state-of-the-art method for speculative decoding in large language model (LLM) inference, but its autoregressive drafting creates a…
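The bottleneck mentioned here comes from the draft stage being sequential: each draft token depends on the one before it, while the target model can verify all draft tokens in a single pass. The toy Python sketch below is only an illustration of that generic draft-then-verify pattern, not the EAGLE or P-EAGLE implementation; `draft_next_token` and `target_accepts` are hypothetical stand-ins for the draft and target models.

```python
import random

random.seed(0)
VOCAB = list(range(100))

def draft_next_token(context):
    # Stand-in for the draft model's next-token prediction (one sequential step).
    return random.choice(VOCAB)

def target_accepts(context, token):
    # Stand-in for the target model's acceptance check; in real speculative
    # decoding all draft tokens are scored by the target model in parallel.
    return random.random() < 0.7

def speculative_step(context, num_draft_tokens=4):
    # 1) Draft phase: autoregressive, one token at a time -- the bottleneck
    #    the article is about, since each token must wait for the previous one.
    drafts = []
    ctx = list(context)
    for _ in range(num_draft_tokens):
        tok = draft_next_token(ctx)
        drafts.append(tok)
        ctx.append(tok)

    # 2) Verify phase: keep the longest prefix of drafts the target accepts.
    accepted = []
    for tok in drafts:
        if target_accepts(context + accepted, tok):
            accepted.append(tok)
        else:
            break
    return accepted

print(speculative_step([1, 2, 3]))
```

Parallelizing or restructuring the draft phase, as the article's approach does, targets exactly the sequential loop in step 1.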
Organizations and individuals running multiple custom AI models, especially recent Mixture of Experts (MoE) model families, can face the challenge…
Red Hat has launched version 4.20 of OpenShift. The release gains new AI tooling, post-quantum encryption, and extra in…
During KubeCon, Microsoft announced that it supports Retrieval Augmented Generation (RAG) in KAITO on Azure Kubernetes Service (AKS) clusters. In…
NVIDIA’s Triton Inference Server is an open-source inference serving framework designed to facilitate the rapid development of AI/ML inference applications.…