Hi all,

First off, I’d just like to say how blown away I am by the potential of Perchance. I bet this is baby stuff to most of you, but for me this is my first step into this world and it’s just incredible.

So I have a question… more about whether it’s possible at all; then I’ll properly wrap my head around the coding of it.

I’m looking to create a short sequence of scenes, like a still animation. Sometimes the background will stay the same but the character will change pose. Or maybe the background (say a kitchen scene, as an example) will change camera angle/view and the character will change position/pose. I’m not looking to create frame-by-frame stuff, just scene changes that retain features throughout. I can totally see it being possible to do; I was just hoping to hear some advice from people who have much more experience than I do.

If any of that doesn’t make sense (most probably!) please just ask and I’ll try to better explain.

TIA

Sam

P.S. Oh, I should probably state that I plan to use text-to-image (t2i) to create the scenes, then overlay/combine the character and adjust accordingly.

  • VioneTM
    7 months ago

    > I would generate one image, say the background first. Write the generator to have quite specific conditions, i.e. depth of view, positioning, scale, size, etc. (I’m totally clueless as to the limitations here, so again please correct me if this isn’t possible). Result: one image, the backdrop.

    Images can be fine-tuned with tags (although the AI only tries to follow them), so you can change the conditions of the images.

    > Then generate the character image, maybe in a whole separate generator. Again, be very specific about scale, pose, positioning, etc., and generate it without a background.

    You can also do the character in the same generator. There’s no need to code another one; you just need to change the prompts.

    > My thought was the locked-layer combination!? To combine the two images 🤷🏻‍♂️ and voilà.

    The layering part is quite easy to implement (see this example generator I made a while back), but you would need to crop the character image so that it overlays cleanly on top of the background. If you were to use ‘inpainting’, you could just ‘paint over’ the current image without needing to layer and position multiple images.
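
    To make the layering idea concrete, here is a rough sketch in plain Python (not Perchance code) of pasting a cut-out character onto a backdrop. The function names are made up for illustration, and the images are just nested lists of (r, g, b, a) tuples standing in for real image data; it also assumes the backdrop is fully opaque.

```python
# Hypothetical helpers for illustration only. An "image" here is a list of
# rows, each row a list of (r, g, b, a) tuples with channels in 0-255.

def blend_pixel(bottom, top):
    """Blend one RGBA pixel over an opaque background pixel."""
    alpha = top[3] / 255
    rgb = tuple(round(top[i] * alpha + bottom[i] * (1 - alpha)) for i in range(3))
    return rgb + (255,)


def overlay(background, character, offset_x, offset_y):
    """Return a copy of background with character pasted at (offset_x, offset_y)."""
    result = [list(row) for row in background]
    for y, row in enumerate(character):
        for x, pixel in enumerate(row):
            by, bx = offset_y + y, offset_x + x
            # Skip character pixels that fall outside the backdrop.
            if 0 <= by < len(result) and 0 <= bx < len(result[by]):
                result[by][bx] = blend_pixel(result[by][bx], pixel)
    return result
```

    Fully transparent character pixels leave the backdrop untouched, which is why the character needs a clean cut-out (or transparent surround) before it is layered.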

    > The next step would be to adjust parameters within each generator while keeping the same seed (I might be talking nonsense here as, again, I’m just learning), changing things in the desired way but retaining the features, i.e. colours, style, etc., for continuity.

    Using the same seed is a good idea for keeping the ‘same’ essence, or a replicable composition (see the prompting guide that I linked earlier). But yeah, a local setup with ‘inpainting’ would be better. There are also other ‘plugins’ for local setups that let you instruct the AI area by area. Also note that you might need a powerful computer (and graphics card) to run the local models quickly.
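
    As a loose illustration of the same-seed idea, the sketch below builds one settings entry per scene, where only the scene description changes and the seed and style text stay fixed. It is plain Python, not Perchance syntax; `build_scenes` and the `prompt`/`seed` field names are assumptions for the example, and how much continuity a shared seed actually gives depends on the model.

```python
def build_scenes(base_style, seed, variations):
    """Return one settings dict per scene, all sharing the same seed and style."""
    scenes = []
    for variation in variations:
        scenes.append({
            # Only the scene description differs between entries.
            "prompt": f"{variation}, {base_style}",
            "seed": seed,
        })
    return scenes


scenes = build_scenes(
    base_style="flat colours, soft lighting, storybook illustration",
    seed=12345,
    variations=[
        "a kitchen, wide shot, a woman standing at the counter",
        "a kitchen, close-up, a woman pouring tea",
    ],
)
```

    Keeping the style text and seed in one place makes it harder to change them by accident from scene to scene.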

    • MindBlown! 🌬️🤯OP
      7 months ago

      Ahhh, I see, tags will be involved for sure then. I’ll take a look at your generator too and try to get my head around what’s going on. I wonder: you mentioned ‘cropped image’ in your reply… well, it just got me thinking about how images are ‘recognised’ and whether the AI could differentiate the character from the background (if the background is given transparency) and create the crop itself? As in, a lasso plug-in type thing that can auto-detect? Just an abstract thought, so I don’t know if it’s completely ludicrous or not!? 🤷‍♂️
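
      For the narrow case where the background really is already transparent, the crop can be found without any AI by looking for the box that contains every non-transparent pixel. A rough sketch in plain Python, with made-up function names and images as nested lists of (r, g, b, a) tuples; it does not separate a character from a painted background.

```python
def opaque_bounds(image, threshold=0):
    """Return (left, top, right, bottom) around pixels with alpha above threshold.

    right and bottom are exclusive. Returns None if nothing is visible.
    """
    xs, ys = [], []
    for y, row in enumerate(image):
        for x, pixel in enumerate(row):
            if pixel[3] > threshold:
                xs.append(x)
                ys.append(y)
    if not xs:
        return None
    return min(xs), min(ys), max(xs) + 1, max(ys) + 1


def crop_to_content(image, threshold=0):
    """Trim fully transparent rows and columns from the edges of the image."""
    bounds = opaque_bounds(image, threshold)
    if bounds is None:
        return []
    left, top, right, bottom = bounds
    return [row[left:right] for row in image[top:bottom]]
```

      Raising `threshold` would ignore faint, nearly transparent fringe pixels when working out the box.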

      Yes, GPU… that might be my Achilles heel at present. I’m just running off a laptop, 8GB RAM kind of crap! Apparently it could run Stable Diffusion, but not SDXL without it being slooooow! Would having a local Stable Diffusion and not XL be pointless?

      • VioneTM
        7 months ago

        I don’t think there is any AI that can auto-crop an image, at least none that is currently available and can be integrated on the site. I think v1.5 models are the cream of the crop currently, since there are a lot of models for it (it also has less censorship than XL, I think). I haven’t made a local setup myself, so I can’t really point you in a direction. I would suggest looking into the StableDiffusion subreddit (or the community here on Lemmy) to start.

        • MindBlown! 🌬️🤯OP
          7 months ago

          I’ve been looking into the image cropping and finding some jscript plugin subreddits, so I’ll delve a little deeper to see what people have been coming up with. I’m also going to try SDXL on my laptop to see what happens… if it takes an age to do anything, I’ll resort to SD… Thanks for all the pointers 👍