keeping features the same but adjusting elements within the image

MindBlown! 🌬️🤯 · 1 year ago

keeping features the same but adjusting elements within the image

VioneT · 1 year ago

I would generate one image, say background first. Write the generator to have quite specific conditions I.e. depth of view, positioning, scale, size etc etc (I’m totally clueless as to the limitations here so again please correct if not possible). Result, one image. Backdrop.

Images can be fine-tuned with tags (albeit the AI tries to), so you can change the conditions of the images.

Then generate the character image. Maybe whole separate generator. Again, very specific with scale or pose, positioning etc and without background.

You can also do the character in the same generator, no need to code another one, you just need to change the prompts.

My thoughts were the locked-layer-combination!? To combine the two images 🤷🏻‍♂️ and voila.

The layering part is quite easy to implement (see this example generator I made a while back) but you would need to edit the character image to be cropped so it can overlay successfully on top of the image. If you were to use ‘inpainting’ you could just ‘paint-over’ the current image without needing to layer and position multiple images.

The next step would be to then adjust parameters within each generator but revolving around the same seed (might be talking nonsense here as again, I’m just learning) to adjust in the desired way but retaining the features I.e. colours, style etc for continuity.

Using the same seed is a good idea to have the ‘same’ essence, or replicable composition (see the prompting guide that I linked earlier). But yeah, a local setup with ‘inpainting’ would be better, there are also other ‘plugins’ for the local setups to instruct the AI area by area. Also note that you might need a powerful computer (and graphics card) to run the local models quickly.

MindBlown! 🌬️🤯 · edit-2 1 year ago

ahhh I see, tags will be involved for sure then. I’ll take a look at your generator too and try and get my head around whats going on. I wonder; you mentioned ‘cropped image’ in your reply…well it just got me thinking about how images are ‘recognised’ and if there is a possibility in the AI to differentiate the character from background (if background is given transparency) and create the crop itself? As in, a lasso plug in type thing that can auto detect? Just an abstract thought so don’t know if its completely ludicrous or not!? 🤷‍♂️

Yes, GPU…might be my achilies heal with that at present. Im just running off of a laptop, 8GB RAM kind of crap! Apparently could run Stable Diffusion from it but not SDxL without it being slooooow! Would having a local Stable Diffusion and not xL be pointless?

VioneT · 1 year ago

I don’t think there are any AI that can auto-crop an image, at least that is currently available and can be integrated on the site. I think v1.5 models are the creme of the crop currently since there are a lot of models for it (also it having not much censorship compared to the XL, I think). I haven’t made a local setup myself so I can’t really point you to a direction. I would suggest looking into the StableDiffusion subreddit ( or at the community here on Lemmy), to start.

MindBlown! 🌬️🤯 · 1 year ago

Ive been looking in to the image cropping and finding some jscript plugin subreddits so will delve a little deeper to see what people have been coming up with. I’m also going to try SDXL on my laptop to see what happens…if it takes an age to do anything will resort to SD…Thanks for all the pointers 👍