Adventures in SDXL (14,000x10,000 zoomable collage)

wgty@lemmy.world · edit-2 1 year ago

Segmentation controlnet with an image of the cover template with different parts being different colors as the input. The segmentation controlnet makes Stablediffusion treats different colors as different things, so it keeps each part distinct. If you use colors that aren’t in the actual color reference chart, it just sorta improvises, so I used black and white. On one I used the color that corresponds to painting,picture for the area inside the “frame” where the actual art is. Since I was doing it all in one generation with one prompt I just sorta had to accept what it gave me for the colors and textures of the text and borders.

wgty@lemmy.world · 1 year ago

I had fun trying to do this “in camera” without any editing after generation. I ended up using segmentation but after playing around with a lot of different options for what colors to use (trying stone, book, window, painting), I gave up and just made it black and white and still used segmentation.

wgty@lemmy.world · edit-2 1 year ago

Adventures in SDXL (14,000x10,000 zoomable collage)

wgty@lemmy.world · 1 year ago

Adventures in SDXL (Zoomable 14,000 x 10,000)

wgty@lemmy.world · 1 year ago

Anachronistic photographs.

wgty@lemmy.world · 1 year ago

It definitely depends on the model. But if you want better results you’re going to want to use adetailer.

I threw your prompt into dreamshaper 6.31 (because it’s the latest version I had on my server). Here’s with no adetailer:

Here’s with adetailer but using the exact same prompt as the main image (leaving both fields blank but turning adetailer on):

Here’s with adetailer but putting only the expression part in the positive prompt. Negative prompt still the same as the main image.
(confused concerned upset questioning (funny fun amused (cringe (raised eyebrow) shy blush ashamed bruh face), <lora:add_detail:1.2>

Now, probably a good idea to have redhead woman in there or something, but my point is how rapidly you can improve things with very minimal effort. If you’re willing to mess around with adetailer prompts and denoising strength, you can do a lot with faces.

wgty@lemmy.world · 1 year ago

deleted by creator

wgty@lemmy.world · 1 year ago

Extremely dank. What checkpoint?

wgty@lemmy.world · edit-2 1 year ago

These are very large (3000x2000), so here’s an album link: https://imgur.com/a/LOZcR39

Love at first sight.

Obligatory gas masks and fire.

What the fuck does anything on this vehicle do?

wgty@lemmy.world · 1 year ago

Are you using any hand-specific negative embeddings? adetailer also can do a decent job of fixing hands with the hand segmentation.

wgty@lemmy.world · 1 year ago

Reminds me of the bastion cinematic. To the point where I’m guessing either a still from that, or fanart of that cinematic was in the training set.

wgty@lemmy.world · edit-2 1 year ago

I just noticed it doesn’t put the literal prompt in the png info. The “dagger (weapon)” needs to be entered as “dagger \(weapon\)” so that the parenthesis around weapon get sent to CLIP

wgty@lemmy.world · edit-2 1 year ago

Here, have some 4x clipdrop upscaling:
https://i.ibb.co/7RFyRFn/00224-1127511940-clipdrop-enhance.png

wgty@lemmy.world · edit-2 1 year ago

Couple suggestions:

Try using A-Zovya RPG Artist Tools
Use the adetailer extension to improve faces. Just turning it on for faces will have a huge improvement. Make sure to turn off the normal “restore faces”.
When you’re happy with a final image, re-generate it with upscaling.
Try using negative embeddings like fastnegative, deepnegative, BadDream, EasyNegative, and maybe negativehand. Don’t have to use all of them, and some work better for some checkpoints and styles than others. Usually I use 2 or 3 together.
Use quality tags in the positive and negative prompt.

EDIT:
My quick attempt at making something with my own advice: https://i.imgur.com/xjhc2tA.png

(masterpiece, best quality, high quality, highres, ultra-detailed), elf rogue, stealth, fantasy, (dagger (weapon):1), black cloak, ((masculine)), (long black hair), feather collar Negative prompt: (ng_deepnegative_v1_75t, FastNegativeV2, verybadimagenegative_v1.1-6400), (worst quality, low quality, normal quality), (woman, female, feminine), (archer, bow) Steps: 35, Sampler: DPM++ SDE Karras, CFG scale: 7, Seed: 607032892, Size: 512x768, Model hash: 8c4042921a, Model: aZovyaRPGArtistTools_v3VAE, Denoising strength: 0.35, Clip skip: 2, ADetailer model: face_yolov8n.pt, ADetailer confidence: 0.3, ADetailer dilate/erode: 4, ADetailer mask blur: 4, ADetailer denoising strength: 0.4, ADetailer inpaint only masked: True, ADetailer inpaint padding: 32, ADetailer version: 23.6.2, Hires upscale: 2, Hires steps: 14, Hires upscaler: R-ESRGAN 4x+, Version: v1.3.2

The vampire vibe in your original image could definitely be brought back, but I assume it was more of an unintended side-effect of your prompt than intentional.

wgty@lemmy.world · 1 year ago

Relative Humidity

wgty@lemmy.world · 1 year ago

(masterpiece, best quality), landscape, impressionism, river flowing through wildflowers, swirling marbled clouds, multicolored ink, mossy stone bridge, lora:add_detail:1 Negative prompt: (worst quality, low quality), house, building, castle, BadDream, railing, power lines, utility pole, symmetry Steps: 35, Sampler: DPM++ SDE Karras, CFG scale: 7, Seed: 1127511940, Size: 768x512, Model hash: 86fca15160, Model: toonyou_beta5Unstable, Denoising strength: 0.35, Clip skip: 2, Hires upscale: 1.5, Hires steps: 14, Hires upscaler: R-ESRGAN 4x+, Lora hashes: “add_detail: 7c6bad76eb54”, Version: v1.3.2

wgty@lemmy.world · 1 year ago

Bridge and clouds

wgty@lemmy.world · 1 year ago

I made quite a few, but most weren’t great. Here’s the whole set though: https://imgur.com/a/6ZwsXst

wgty@lemmy.world · 1 year ago

Photo from anthropologist Monty Pytorch, the first person to ever live among the Neanderthals.