Make Kimi-VL available in ComfyUI.
Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities.
- Make sure you have ComfyUI installed.
- Clone this repository into your ComfyUI's `custom_nodes` directory:

```
cd ComfyUI/custom_nodes
git clone https://github.com/Yuan-ManX/ComfyUI-Kimi-VL.git
```

- Install dependencies:

```
cd ComfyUI-Kimi-VL
pip install -r requirements.txt
```
🤗 For general multimodal perception and understanding, OCR, long video and long document understanding, video perception, and agent use cases, we recommend Kimi-VL-A3B-Instruct for efficient inference; for advanced text and multimodal reasoning (e.g., math), please consider using Kimi-VL-A3B-Thinking.
| Model | #Total Params | #Activated Params | Context Length | Download Link |
|---|---|---|---|---|
| Kimi-VL-A3B-Instruct | 16B | 3B | 128K | 🤗 Hugging Face |
| Kimi-VL-A3B-Thinking | 16B | 3B | 128K | 🤗 Hugging Face |
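As a quick sanity check outside ComfyUI, a minimal loading sketch with 🤗 Transformers might look like the following. The model ID, dtype, and device placement here are assumptions; adjust them to your setup.

```python
# Minimal sketch: load Kimi-VL-A3B-Instruct with 🤗 Transformers.
# "moonshotai/Kimi-VL-A3B-Instruct" is an assumed Hugging Face repo ID;
# trust_remote_code=True is typically required for custom VLM architectures.
import torch
from transformers import AutoModelForCausalLM, AutoProcessor

model_id = "moonshotai/Kimi-VL-A3B-Instruct"
processor = AutoProcessor.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # half precision; only ~3B params are active per token
    device_map="auto",           # spread across available GPUs
    trust_remote_code=True,
)
```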
Note: Recommended parameter settings:
- For Thinking models, it is recommended to use `Temperature = 0.6`.
- For Instruct models, it is recommended to use `Temperature = 0.2`.
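In plain Transformers terms, these settings correspond to sampled generation with the temperature passed to `generate()`. A hedged sketch, building on the `model` and `processor` loaded above; the chat-message structure and image path follow common VLM processor conventions and may differ slightly for Kimi-VL:

```python
# Sketch: generation with the recommended temperature for an Instruct model.
from PIL import Image

image = Image.open("demo.png")  # hypothetical local image
messages = [{
    "role": "user",
    "content": [
        {"type": "image"},
        {"type": "text", "text": "Describe this image."},
    ],
}]
prompt = processor.apply_chat_template(
    messages, add_generation_prompt=True, tokenize=False
)
inputs = processor(images=image, text=prompt, return_tensors="pt").to(model.device)
output = model.generate(
    **inputs,
    max_new_tokens=512,
    do_sample=True,    # sampling must be enabled for temperature to take effect
    temperature=0.2,   # recommended for Instruct models; use 0.6 for Thinking
)
print(processor.decode(output[0], skip_special_tokens=True))
```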