Skip to content

Conversation

zishanahmed08
Copy link

@zishanahmed08 zishanahmed08 commented Apr 14, 2025

Summary
This PR introduces three new model deployment templates for the Kimi VL family, optimized for serving via vLLM with OpenAI-compatible APIs.

Kimi-VL, an efficient open-source Mixture-of-Experts (MoE) vision-language model (VLM) that offers advanced multimodal reasoning, long-context understanding, and strong agent capabilities—all while activating only 2.8B parameters in its language decoder (Kimi-VL-A3B).

Added Template

  1. Kimi VL A3B Instruct model
    Lightweight and fast.
    Exposed Port: 9000

Twitter Post:
https://x.com/Zishanahmed08/status/1911919153008509280

This contribution is part of the Nosana Builders Challenge.
Looking forward to feedback and improvements!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants