Under the hood of image generator

Grth@lemmy.world · 1 year ago

perchance@lemmy.world · edit-2 1 year ago

Yep, SD 1.5, and you should be able to replicate the text-to-image-plugin’s results locally by just following vanilla tutorials on /r/stablediffusion or YouTube with pretty much any of the top models on Civitai - I’m not doing anything special. Your local results will actually end up being better than the plugin’s, because I have a stupid amount of regex and stuff trying (and somewhat failing) to prevent the model from creating oversexualised stuff for benign prompts, and that almost always comes at a cost to quality/coherence. I’m not the best person to ask about troubleshooting local setups, but I’d just advise that you follow a tutorial/guide exactly to start with, and then once you’ve replicated what they’ve shown, start exploring your own prompts, tweaking parameters, etc.
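
To give a rough idea of what that kind of regex filtering could look like, here’s a minimal sketch. It is an assumption for illustration only, not the plugin’s actual code: the word lists, `EXPLICIT_REQUEST`, `SAFETY_NEGATIVES` and `build_negative_prompt` are all made-up names. The idea is that a benign prompt gets extra terms added to its negative prompt, and those extra terms are the sort of thing that can cost some quality/coherence.

```python
import re

# Hypothetical pattern: prompts that explicitly ask for adult content.
# The real plugin's patterns are not given anywhere in this thread.
EXPLICIT_REQUEST = re.compile(r"\b(nsfw|nude|nudity|explicit)\b", re.IGNORECASE)

# Hypothetical terms added to the negative prompt for benign requests.
SAFETY_NEGATIVES = ["nsfw", "nudity", "suggestive"]


def build_negative_prompt(prompt: str, user_negative: str = "") -> str:
    """Return the negative prompt to send alongside `prompt`.

    Benign prompts get the safety terms appended (without duplicates);
    prompts matching EXPLICIT_REQUEST keep the user's negative prompt as-is.
    """
    # Split the user's comma-separated negative prompt into clean terms.
    parts = [p.strip() for p in user_negative.split(",") if p.strip()]
    if not EXPLICIT_REQUEST.search(prompt):
        existing = {p.lower() for p in parts}
        for term in SAFETY_NEGATIVES:
            if term not in existing:
                parts.append(term)
    return ", ".join(parts)


if __name__ == "__main__":
    print(build_negative_prompt("a castle at sunset", "blurry"))
```

Running SD locally without any step like this means the negative prompt contains only what you typed, which is one reason local results can come out cleaner.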

Grth@lemmy.world · 1 year ago

Thanks for getting back to me so quickly, even if it’s taken me so long to respond. Whatever non-special things you’re doing seem to work really well! It works a lot faster than my local instance (suboptimal video card to blame there) and I get really good, consistent base results, which I can then pull into my own instance to do more fun stuff with inpainting, upscaling and the like. So hopefully that ad revenue is making it worth your while. :D

perchance@lemmy.world · 1 year ago

Yeah, it’s definitely worth investing in a fast graphics card if you’re getting deep into AI stuff, but they’re pricey. Inpainting and image-to-image should be possible on Perchance within the next month or so if all goes well. Ad revenue doesn’t cover all the server costs yet, so I pay for a portion of it out of my own pocket, but it’ll eventually be self-sustaining and it’s not ‘breaking the bank’ for me. It’s much closer to self-sustaining than it was 12 months ago when I made the plugin - the research community has made SD inference a lot more efficient.

Ashenthorn@lemmy.world · 1 year ago

@perchance@lemmy.world Is there a roadmap or existing discussion anywhere about your experiments with the t2i plugin or where you might be going with it? Or for user questions/feedback/requests?

I’m currently getting some fantastic results with it “as is”… but additional options are always appreciated. =)

perchance@lemmy.world · 1 year ago

Best place is probably here on the Lemmy community. I’ll post updates here when there are new features available (e.g. inpainting, image-to-image).

Also, @VioneT@lemmy.world has some notes and interesting experiments (with linked generators to play with) here: https://perchance.org/learn-perchance-plugins-text-to-image

tomanivolley@lemmy.world · edit-2 1 year ago

Sorry to bug you about this, but do you have a specific model you’d recommend? I’ve tried ~10 different models on Civitai and I’m having a hard time replicating the results I get from your 2D Disney option without using a seed for an image I’ve created on Perchance. This is even when I copy/paste your prompts.

You’re not wrong that some of the results I’ve gotten are just as good in some ways, but it’s really bugging me that I can’t seem to replicate that exact style.