2D to 3D
5.0
0 reviewsDescription
This is a workflow to compare prompt word inference effects, comparing the image recognition capabilities of gemini, clipinterrogator and image2prompt. By the way, I also compared the depth maps of miragold, depth anything and zoe depth.
https://github.com/ZHO-ZHO-ZHO/ComfyUI-Gemini
free gemini api key: https://makersuite.google.com/app/apikey
Add your Gemini_API_Key to the custom_nodes/ComfyUI-Gemini/config.json file.
https://github.com/zhongpei/Comfyui_image2prompt
Discussion
(No comments yet)
Loading...
Reviews
No reviews yet
Versions (1)
- latest (a year ago)
Node Details
Primitive Nodes (8)
Anything Everywhere
ClipInterrogator
Image scale to side
Custom Nodes (52)
ComfyUI
- LoadImage
- CLIPTextEncode
- PreviewImage
- ControlNetLoader
- ControlNetApply
- ConditioningConcat
- VAEDecode
- CheckpointLoaderSimple
- KSampler
- SaveImage
- VAEEncode
- Scribble_XDoG_Preprocessor
- Zoe_DepthAnythingPreprocessor
- Zoe-DepthMapPreprocessor
- Image2Text
- LoadImage2TextModel
- DisplayText_Zho
- Gemini_API_S_Zho
- MarigoldDepthEstimation
Model Details
Checkpoints (1)
samaritan3dCartoon_v40SDXL.safetensors
LoRAs (0)