The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
Updated Apr 6, 2025 - Python
☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!