Don't see what you need? Contact our team.
Minimal kserve-rest-proxy image.
Minimal kserve-router image.
Minimal kserve-storage-initializer image.
LiteLLM is a unified interface to call 100+ LLMs using the OpenAI format, providing a proxy server for multiple LLM providers.
NVIDIA NeMo Framework is an end-to-end, cloud-native framework to build, customize, and deploy generative AI models anywhere.
The NVIDIA Container Toolkit allows users to build and run GPU accelerated containers.
32 images