Skip to content

Popular repositories Loading

  1. rmbg-1.4 rmbg-1.4 Public template

    State-of-the-art background removal model, designed to effectively separate foreground from background. <metadata> gpu: T4 | collections: ["HF Transformers"] </metadata>

    Python 20 11

  2. triton-co-pilot triton-co-pilot Public

    Generate Glue Code in seconds to simplify your Nvidia Triton Inference Server Deployments

    Python 19 3

  3. Smaug-72B Smaug-72B Public

    Smaug-72B - which topped the Hugging Face LLM leaderboard and it’s the first model with an average score of 80, making it the world’s best open-source foundation model.

    Python 17 5

  4. whisper-large-v3 whisper-large-v3 Public

    State‑of‑the‑art speech recognition model for English, delivering transcription accuracy across diverse audio scenarios. <metadata> gpu: T4 | collections: ["CTranslate2"] </metadata>

    Python 16 13

  5. qwq-32b-preview qwq-32b-preview Public template

    A 32B experimental reasoning model for advanced text generation and robust instruction following. <metadata> gpu: A100 | collections: ["vLLM"] </metadata>

    Python 16 6

  6. deepseek-r1-distill-qwen-32b deepseek-r1-distill-qwen-32b Public template

    A distilled DeepSeek-R1 variant built on Qwen2.5-32B, fine-tuned with curated data for enhanced performance and efficiency. <metadata> gpu: A100 | collections: ["vLLM"] </metadata>

    Python 16 22

Repositories

Showing 10 of 160 repositories
  • mistral-7b-instruct-v0.2 Public

    An 7B model with a 32k token context window and optimized attention mechanisms for superior dialogue and reasoning. <metadata> gpu: A100 | collections: ["vLLM"] </metadata>

    inferless/mistral-7b-instruct-v0.2’s past year of commit activity
    Python 0 0 0 0 Updated Apr 8, 2025
  • phi4-vllm-gguf Public Forked from rbgo404/phi4-vllm-gguf

    A 14B model optimized in GGUF format for efficient inference, designed to excel in complex reasoning tasks. <metadata> gpu: A100 | collections: ["vLLM","GGUF"] </metadata>

    inferless/phi4-vllm-gguf’s past year of commit activity
    Python 0 7 0 0 Updated Apr 8, 2025
  • melo-tts Public

    A high-quality text-to-speech model by MyShell.ai that supports multiple English accents and real-time inference.

    inferless/melo-tts’s past year of commit activity
    Python 0 1 0 0 Updated Apr 7, 2025
  • donut-doc-vqa Public

    An OCR-free document understanding model that uses a Swin Transformer encoder and BART decoder, fine-tuned on the DocVQA dataset.

    inferless/donut-doc-vqa’s past year of commit activity
    Python 0 1 0 0 Updated Apr 7, 2025
  • stable-diffusion-xl-turbo Public template

    A distilled and cost-effective variant of SDXL that delivers high-quality text-to-image generation with accelerated inference speed. <metadata> gpu: T4 | collections: ["Diffusers"] </metadata>

    inferless/stable-diffusion-xl-turbo’s past year of commit activity
    Python 3 10 0 0 Updated Apr 7, 2025
  • stable-diffusion-v1-5 Public template

    A text-to-image model by Stability AI, renowned for generating high-quality, diverse images from text prompts. <metadata> gpu: T4 | collections: ["Diffusers"] </metadata>

    inferless/stable-diffusion-v1-5’s past year of commit activity
    Python 0 1 0 0 Updated Apr 7, 2025
  • whisper-large-v3 Public

    State‑of‑the‑art speech recognition model for English, delivering transcription accuracy across diverse audio scenarios. <metadata> gpu: T4 | collections: ["CTranslate2"] </metadata>

    inferless/whisper-large-v3’s past year of commit activity
    Python 16 13 0 0 Updated Apr 7, 2025
  • inferless/Customer-Service-Voicebot’s past year of commit activity
    Python 2 2 0 0 Updated Apr 6, 2025
  • mistral-small-3.1-24b-instruct Public template

    Advanced multimodal language model developed by Mistral AI with enhanced text performance, robust vision capabilities, and an expanded context window of up to 128,000 tokens. <metadata> gpu: A100 | collections: ["HF Transformers"] </metadata>

    inferless/mistral-small-3.1-24b-instruct’s past year of commit activity
    Python 1 7 0 0 Updated Apr 3, 2025
  • spatiallm-llama-1b Public template

    A 3D large language model that processes point cloud data to produce structured 3D scene representations. <metadata> gpu: A100 | collections: ["HF Transformers"] </metadata>

    inferless/spatiallm-llama-1b’s past year of commit activity
    Python 1 3 0 0 Updated Apr 1, 2025

Top languages

Loading…

Most used topics

Loading…