Show HN: InvokeAI, an open source Stable Diffusion toolkit and WebUI

414 pointsby sophrocyneover 2 years ago

Hey everyone!Excited to be able to share the release of `InvokeAI 2.0 - A Stable Diffusion Toolkit`, an open source project that aims to provide both enthusiasts and professionals a suite of robust image creation tools. Optimized for efficiency, InvokeAI needs only ~3.5GB of VRAM to generate a 512x768 image (and less for smaller images), and is compatible with Windows/Linux/Mac (M1 & M2).InvokeAI was one of the earliest forks off of the core CompVis repo (formerly lstein/stable-diffusion), and recently evolved into a full-fledged community driven and open source stable diffusion toolkit titled InvokeAI. The new version of the tool introduces an entirely new WebUI Front-end with a Desktop mode, and an optimized back-end server that can be interacted with via CLI or extended with your own fork.This version of the app improves in-app workflows leveraging GFPGAN and Codeformer for face restoration, and RealESRGAN upscaling - Additionally, the CLI also supports a large variety of features: - Inpainting - Outpainting - Prompt Unconditioning - Textual Inversion - Improved Quality for Hi-Resolution Images (Embiggen, Hi-res Fixes, etc.) - And more...Future updates planned included UI driven outpainting/inpainting, robust Cross Attention support, and an advanced node workflow for automating and sharing your workflows with the community.We're excited by the release, and about the future of democratizing the ability to create. Check out the repo (<a href="https://github.com/invoke-ai/InvokeAI" rel="nofollow">https://github.com/invoke-ai/InvokeAI</a>) to get started, and join us on Discord (<a href="https://discord.gg/ZmtBAhwWhy" rel="nofollow">https://discord.gg/ZmtBAhwWhy</a>)!

19 comments

cercatrovaover 2 years ago

Speaking of SD, I wonder if 1.4 will be the last truly open release as Emad said 1.5 would release a while ago but it's been held up for "compliance" reasons. Maybe they got legal threats due to using artists' works and stock images. If so, that would be sad to see it.In a way it reminds me of people who make unofficial remakes of games but get cease and desists if they show gameplay while in development. The correct move is to fully develop the game and release it, then if you get C&Ds, too late, the game is already available to download.

评论 #33160477 未加载

评论 #33156330 未加载

评论 #33155690 未加载

评论 #33155835 未加载

swyxover 2 years ago

[OT] its been hard for me to trace the universe of stable diffusion forks so ive been maintaining a list here: <a href="https://github.com/sw-yx/prompt-eng#sd-major-forks" rel="nofollow">https://github.com/sw-yx/prompt-eng#sd-major-forks</a>please let me know/send PRs if i missed anything, its been a couple months so i'm overdue for a round of cleanup/reorganizing

评论 #33157303 未加载

评论 #33156107 未加载

评论 #33157310 未加载

评论 #33156607 未加载

评论 #33157565 未加载

评论 #33156037 未加载

评论 #33157721 未加载

评论 #33159667 未加载

评论 #33155669 未加载

lawikover 2 years ago

Oh, I used the dreeam.py script to back a Telegram bot. It later ended up in my demo for my talk Chat Bots as User Interfaces (with Elixir): <a href="https://www.youtube.com/watch?v=DFGHaER6_j4" rel="nofollow">https://www.youtube.com/watch?v=DFGHaER6_j4</a>I primarily used the InvokeAI release because I found it was easy to get going with on Linux and then it was simple enough to hack around with.Also the first tool I've ever used where I've rode on the ragged edge of what my 3070 is okay with. I've had graphical glitches due to occupying all the video memory (KDE doesn't like it). I've had to quit apps to make it work.Thanks for making a useful thing of all this Stable Diffusion stuff. I've enjoyed it.

tehsauceover 2 years ago

A Shameless plug, if anyone is interested in building apps using stable diffusion and wants to keep things as cheap as possible, I built a very user-friendly API that is 1/4 the cost of the official stable diffusion API. There is also a free demo.You can try it out:<a href="https://computerender.com" rel="nofollow">https://computerender.com</a>.

评论 #33157283 未加载

KaoruAoiShihoover 2 years ago

Is there anything new here that might interest an existing user of auti's gui to switch?

评论 #33155448 未加载

评论 #33156489 未加载

评论 #33156475 未加载

评论 #33159672 未加载

nohatover 2 years ago

I've been using a modified version of lsteins fork since almost the beginning. Recommended! It does lack some of the features of eg automatic1111, but it has good cli, and actually has a license, which is pretty important (as novelai has learned).

Timwiover 2 years ago

Sounds awesome! Unfortunately, it says that it requires a GPU. Please consider making it accessible to people without a GPU, for example using OpenVino like this (command line only) project does:<a href="https://github.com/bes-dev/stable_diffusion.openvino" rel="nofollow">https://github.com/bes-dev/stable_diffusion.openvino</a>Thanks!

评论 #33157091 未加载

评论 #33157137 未加载

cmxchover 2 years ago

How hard of a requirement is the NVidia graphics chip? Polaris era AMD chips do work decently at the 4gb level (although a bit finicky) and Navi/Big Navi AMD cards work reasonably well with modern ROCm.

评论 #33155864 未加载

评论 #33156564 未加载

iFireover 2 years ago

Can you make the ui InvokeAI as easy to install as running a Windows 11 command line script?I couldn't get it to work following <a href="https://invoke-ai.github.io/InvokeAI/installation/INSTALL_WINDOWS/" rel="nofollow">https://invoke-ai.github.io/InvokeAI/installation/INSTALL_WI...</a>Similar to <a href="https://github.com/cmdr2/stable-diffusion-ui/releases/tag/v2.16" rel="nofollow">https://github.com/cmdr2/stable-diffusion-ui/releases/tag/v2...</a>

评论 #33156219 未加载

neilvover 2 years ago

Nice! lstein is the SD fork that I ended up using, and I'm delighted to see it evolve into InvokeAI and keep getting better.

Ukeover 2 years ago

How good are solutions like stable diffusion at inpainting nowadays? What about the watermarks of getty et at that have been part of some of dall-e 2.0 images. Could one feasably remove such watermarks or stuff like a white grid array with these solutions?So how convincing are these solutions in the worst case is what i am asking.

lucasfcostaover 2 years ago

This is much needed. Even for a software engineer like me, it was quite cumbersome to use Stable Diffusion locally without such an UI.I feel like there's just so much to improve though. Maybe SD is the definitive proof that one single feature can trickle down into many others just by adding good UI on top of it.

评论 #33162935 未加载

hda2over 2 years ago

What about safety filters? All the safety filters in the SD interfaces/services I used so far are too false-positive happy. Can these filters be disabled or at least toned down in InvokeAI? If so, how easily?

评论 #33160382 未加载

cmsjover 2 years ago

Yay! I built an IRC bot for SD using lstein's repo because it was the first one that I could get to work reliably on M1, so I'm really glad to see the process continue really well with InvokeAI!

paulirishover 2 years ago

PSA: You can email support@github to ask them to "detach my repo as a fork", in case the repo has matured so much it shouldn't have the "forked from …" treatment.

评论 #33156721 未加载

pdntspaover 2 years ago

Min requirements say 12gb, I take it this doesn't have the optimizations that automatic1111 has for <8gb cards?

评论 #33157767 未加载

评论 #33157331 未加载

ionwakeover 2 years ago

I was unable to get this to run on the Mac M1 over the last week - has anyone here had any success?

评论 #33160333 未加载

pdntspaover 2 years ago

I am super stoked to see all these Stable Diffusion forks floating around, and I don't want to shit on the authors and their work that hard, but I swear the installation and packaging of these things is INSANE.* Every single one of these seems to be a web UI, when this is desktop software that needs a desktop computer or workstation to run. Have we all collectively forgotten how to program PyGTK?* Model files always go in the code repo. Have we forgotten how home folders work or what their purpose is? At the very least this one instructs you to make a shortcut/symlink if you don't want to copy the ckpt file yet again* On that note, everything is autodownloaded to wherever the hell the programmer wants (once again, usually in the code repo itself). I must have four or five different copies of ESRGAN, and I spent a bunch of time monkeying around with automatic1111's fork trying to get it to correctly see everything when I ripped out the models folder and symlinked one in from a different place on my hard drive.To the authors: can you all please get together and standardize some of this stuff? Models should go in user's homefolders, or at a customizable location, and NOT within the scope of stuff that can be touched by git pull. (Doing so causes git to freak out in many circumstances)The breakneck pace of innovation here is awesome, but it feels like all gas no brakes on the usability front.In the Bad Old Days(tm) you ran an install script which generates a desktop icon and you click that to run it. Meanwhile with this, on Windows, one has to open an anaconda prompt, activate the anaconda venv (or whatever it is), then manually invoke the whole thing with 'python scripts/invoke.py --web'. And if there's a one-click install script included (which invoke doesn't, but I am not knocking it for this!), half the time they seem to try and pull down the entire world all over again (a la sd-webui).Like I get this need to make it easy to use, but it's like c'mon, there's is existing convention for all these things. Folks, please follow it!If I had a wishlist, or the wherewithal to fork my own version, it would have:* an actual GUI made with an actual windowing toolkit. I don't know why the hell everyone is so afraid of GTK, but I would use that. pyGTK is pretty simple IME, you can even read the C++ docs and it all maps over really nice to python. It doesn't need to be pretty!* configurable model locations, preferably in an agreed-upon standardized hierarchy* a standardized way of embedding prompt data into the PNG, a la automatic1111* an uncomplicated but not overly optimistic setup process. An install.py and run.py, both with sensible defaults so that you don't need any command-line switches to run it except for special circumstances, and if it wants to autodownload updates then CHECK WITH ME FIRST! And preferably one that doesn't try to move my entire world (heres looking at you, sd-webui). And it will load the venv/conda environment for me.And yes, for all the "put your money where your mouth is", I've been thinking about forking. But I don't know if I have the time or energy to keep up with all the developments in this space. But hey you never know...

评论 #33158599 未加载

评论 #33162967 未加载

评论 #33159003 未加载

评论 #33161352 未加载

gernbover 2 years ago

This is great but it requires lots of "geek" (installing dependencies, borking your system with brew, etc...)Vs DiffusionBee which just works<a href="https://diffusionbee.com/" rel="nofollow">https://diffusionbee.com/</a>Maybe the two projects can merge?

19 comments

cercatrovaover 2 years ago

评论 #33160477 未加载

评论 #33156330 未加载

评论 #33155690 未加载

评论 #33155835 未加载

swyxover 2 years ago

评论 #33157303 未加载

评论 #33156107 未加载

评论 #33157310 未加载

评论 #33156607 未加载

评论 #33157565 未加载

评论 #33156037 未加载

评论 #33157721 未加载

评论 #33159667 未加载

评论 #33155669 未加载

lawikover 2 years ago

tehsauceover 2 years ago

评论 #33157283 未加载

KaoruAoiShihoover 2 years ago

Is there anything new here that might interest an existing user of auti's gui to switch?

评论 #33155448 未加载

评论 #33156489 未加载

评论 #33156475 未加载

评论 #33159672 未加载

nohatover 2 years ago

Timwiover 2 years ago

评论 #33157091 未加载

评论 #33157137 未加载

cmxchover 2 years ago

评论 #33155864 未加载

评论 #33156564 未加载

iFireover 2 years ago

评论 #33156219 未加载

neilvover 2 years ago

Nice! lstein is the SD fork that I ended up using, and I'm delighted to see it evolve into InvokeAI and keep getting better.

Ukeover 2 years ago

lucasfcostaover 2 years ago

评论 #33162935 未加载

hda2over 2 years ago

评论 #33160382 未加载

cmsjover 2 years ago

Yay! I built an IRC bot for SD using lstein's repo because it was the first one that I could get to work reliably on M1, so I'm really glad to see the process continue really well with InvokeAI!

paulirishover 2 years ago

PSA: You can email support@github to ask them to "detach my repo as a fork", in case the repo has matured so much it shouldn't have the "forked from …" treatment.

评论 #33156721 未加载

pdntspaover 2 years ago

Min requirements say 12gb, I take it this doesn't have the optimizations that automatic1111 has for <8gb cards?

评论 #33157767 未加载

评论 #33157331 未加载

ionwakeover 2 years ago

I was unable to get this to run on the Mac M1 over the last week - has anyone here had any success?