MiniCPM‑V 4.5: Latest Image Examples and a High-Refresh-Rate Multimodal Model for Reverse Prompting
Description
We’re excited to announce the public release of ComfyUI-MiniCPM, a powerful custom node for ComfyUI that integrates MiniCPM-V vision-language models (both Transformers and GGUF) into your local workflows.
What it can do
ComfyUI-MiniCPM enables high-quality visual instruction-following for tasks such as:
Image and video captioning
Object and scene identification
Visual analysis and explanation
Key Features
Support for both Transformers and GGUF models
Automatic model downloading (CPU & GPU variants)
Prompt types: Describe, Analyze, Identify, Explain, Caption, etc.
Speed/memory optimization options for efficient local usage
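To make the prompt-type feature concrete, here is a minimal sketch of how a preset name could be turned into an instruction string for a vision-language model. It is not the node's actual implementation: the `PROMPT_PRESETS` table, the instruction wording, and the `build_prompt` helper are all hypothetical and used for illustration only.

```python
# Hypothetical preset table: the keys mirror the prompt types listed above,
# but the instruction wording is an assumption, not taken from the node.
PROMPT_PRESETS = {
    "Describe": "Describe this image in detail.",
    "Analyze": "Analyze the composition, lighting, and style of this image.",
    "Identify": "Identify the main objects and the scene in this image.",
    "Explain": "Explain what is happening in this image.",
    "Caption": "Write a one-sentence caption for this image.",
}


def build_prompt(prompt_type: str, extra: str = "") -> str:
    """Return the instruction for a preset, optionally extended with user text."""
    try:
        base = PROMPT_PRESETS[prompt_type]
    except KeyError:
        raise ValueError(f"Unknown prompt type: {prompt_type!r}") from None
    extra = extra.strip()
    # Append the user's extra instruction only when one was given.
    return f"{base} {extra}" if extra else base


if __name__ == "__main__":
    print(build_prompt("Caption"))
    print(build_prompt("Describe", "Focus on the background."))
```

Keeping presets in a plain dictionary means a new prompt type is a one-line addition, and an unknown name fails loudly instead of silently sending an empty instruction to the model.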
https://github.com/1038lab/ComfyUI-MiniCPM
If this node is helpful to you, or if you like our work, please give it a ⭐ on GitHub. It would be the greatest encouragement for our efforts!
Versions (1)
- latest (a month ago)
Node Details
Primitive Nodes (6)
AILab_LoadImage (1)
AILab_MiniCPM_4_V (1)
AILab_MiniCPM_4_V_Advanced (1)
PreviewAny (3)
Custom Nodes (0)
Model Details
Checkpoints (0)
LoRAs (0)