2D to 3D

5.0

0 reviews

10.2K

3.0K

Description

This is a workflow to compare prompt word inference effects, comparing the image recognition capabilities of gemini, clipinterrogator and image2prompt. By the way, I also compared the depth maps of miragold, depth anything and zoe depth.

https://github.com/ZHO-ZHO-ZHO/ComfyUI-Gemini

free gemini api key: https://makersuite.google.com/app/apikey

Add your Gemini_API_Key to the custom_nodes/ComfyUI-Gemini/config.json file.

https://github.com/zhongpei/Comfyui_image2prompt

Discussion

(No comments yet)

Loading...

Author

Datou

213

521.0K

9.2K

1.8M

Reviews

No reviews yet

Versions (1)

- latest (a year ago)

Node Details

Primitive Nodes (8)

Anything Everywhere (6)

ClipInterrogator (1)

Image scale to side (1)

Custom Nodes (52)

ComfyUI

- LoadImage (1)
- CLIPTextEncode (7)
- PreviewImage (6)
- ControlNetLoader (6)
- ControlNetApply (6)
- ConditioningConcat (3)
- VAEDecode (3)
- CheckpointLoaderSimple (1)
- KSampler (3)
- SaveImage (3)
- VAEEncode (1)

ComfyUI Nodes for Inference.Core

- Scribble_XDoG_Preprocessor (3)
- Zoe_DepthAnythingPreprocessor (1)
- Zoe-DepthMapPreprocessor (1)

Comfyui_image2prompt

- Image2Text (1)
- LoadImage2TextModel (1)

ComfyUI-Gemini

- DisplayText_Zho (3)
- Gemini_API_S_Zho (1)

Marigold depth estimation in ComfyUI

- MarigoldDepthEstimation (1)

Model Details

Checkpoints (1)

samaritan3dCartoon_v40SDXL.safetensors

LoRAs (0)

OpenArt

Workflows

Active Sessions

2D to 3D

Description

Discussion

Loading...