
StreamDiffusion: A pipeline-level solution for real-time interactive generation

365 points · by Flux159 · over 1 year ago

11 comments

Flux159, over 1 year ago
Arxiv paper here: https://arxiv.org/abs/2312.12491

I think it's possible to get faster than their default timings for a 4090 (I have been able to get 10 fps without optimizations with SDXL Turbo and 1 iteration step), but their other improvements, like using a Stochastic Similarity Filter to prevent unnecessary generations, are good for getting fast results without having to pin your GPU at 100% all the time.
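The Stochastic Similarity Filter mentioned above skips generation when consecutive inputs barely change. A minimal sketch of the idea, where the function name and the similarity-to-probability mapping are illustrative assumptions rather than the paper's exact formulation:

```python
import numpy as np

def should_generate(prev_frame, curr_frame, rng=np.random.default_rng()):
    """Decide whether to run the diffusion step for the current frame.

    Sketch of the Stochastic Similarity Filter idea: when consecutive
    frames are highly similar, generation is usually skipped (saving GPU
    work), but the stochastic skip still lets occasional updates through.
    """
    a = prev_frame.astype(np.float32).ravel()
    b = curr_frame.astype(np.float32).ravel()
    cos_sim = float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-8))
    # Skip probability grows with similarity; frames that differ
    # substantially are always generated. (Crude mapping; the paper
    # defines its own schedule.)
    skip_prob = max(0.0, cos_sim)
    return rng.random() >= skip_prob
```

In a real pipeline this check sits in front of the denoising call, so a mostly static webcam feed stops burning GPU cycles on near-duplicate outputs.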
acheong08, over 1 year ago
This feels unreal. It feels like a decade passed within a year.
smusamashah, over 1 year ago
I just tried the realtime-txt2img demo (it uses npm for the frontend, which I think is overkill for this). I modified it to produce only 1 image instead of 16. It works well on a laptop with an RTX 3080, at roughly 2 images/sec.

EDIT: The `examples\screen` demo feels almost realtime. It says 4 fps in the window, but I don't know what that represents.

EDIT: Denoising in img2img is very low, though, which means the returned image is only slightly different from the base image.
modeless, over 1 year ago
Does 100 fps mean I can provide a new input every 10 ms and get a new output every 10 ms? Or do inputs need to be batched together to get that average throughput?
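One way throughput and latency can diverge here, assuming a Stream Batch-style design where frames at different denoising steps share one batched U-Net call (the numbers below are illustrative, not measurements from the paper):

```python
# Toy latency/throughput model for a pipelined (stream-batched) denoiser.
# Assumed numbers for illustration only.
steps = 4             # denoising steps each frame must pass through
batch_time_ms = 10.0  # wall time of one batched U-Net call

# One new frame enters and one finished frame exits per batched call,
# so throughput is one frame per call...
throughput_fps = 1000.0 / batch_time_ms  # 100 fps

# ...but each frame rides through `steps` calls before it is finished,
# so per-input latency is larger than 1 / throughput.
latency_ms = steps * batch_time_ms  # 40 ms

print(f"throughput: {throughput_fps:.0f} fps, per-frame latency: {latency_ms:.0f} ms")
```

Under these assumptions you can indeed feed a new input every 10 ms, but any given input's output appears only after the full pipeline depth has elapsed.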
kristopolous, over 1 year ago
This more or less just worked as documented. Most of these demos tend to blow up and give really wonky deep errors.

Good job. Give it a try. Look into the server.py of realtime-txt2img to change the model if you want to generate something other than anime. Pointing it at, say, https://huggingface.co/runwayml/stable-diffusion-v1-5 works fine.

The results are genuinely fast. Not great, but fast. If you change to SDXL via LCM-LoRA (https://huggingface.co/latent-consistency) you may get better output, but that's when it's going to get difficult and you'll start to run into those mysterious crashes I mentioned that require, you know, actual work.

My setup: 4090 / 3990X / CUDA 12.2 / Debian sid. YMMV.
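The model swap described above generally comes down to changing the checkpoint ID the server passes to its diffusers pipeline. A minimal sketch, assuming a diffusers-based loader; the actual structure of the repo's server.py is not reproduced here:

```python
# Sketch: the realtime-txt2img server builds a diffusers pipeline from a
# checkpoint ID; swapping the model is usually just changing that ID.
# (Loader structure is an assumption, not copied from the repo.)
MODEL_ID = "runwayml/stable-diffusion-v1-5"  # checkpoint suggested in the comment

def build_pipeline(model_id: str = MODEL_ID):
    """Load a text-to-image pipeline for `model_id` (requires a CUDA GPU)."""
    import torch  # imported lazily so this module loads without a GPU stack
    from diffusers import AutoPipelineForText2Image

    return AutoPipelineForText2Image.from_pretrained(
        model_id, torch_dtype=torch.float16
    ).to("cuda")

# Usage (on a machine with a GPU):
#   pipe = build_pipeline()
#   image = pipe("a watercolor fox", num_inference_steps=20).images[0]
#   image.save("out.png")
```

Any checkpoint with a compatible architecture can be dropped in the same way; moving to SDXL-class models, as the comment notes, tends to need further pipeline changes.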
ilaksh, over 1 year ago
How does the demo with the girl moving in and out of frame work? Is it ControlNet?
_joel, over 1 year ago
Maybe we're all living in a simulation^H^H^H^H^H pipeline-level solution for real-time interactive generation.
brcmthrowaway, over 1 year ago
What is the fps on Apple Silicon?
timexironman, over 1 year ago
Is there a video of it I can view anywhere?
badloginagain, over 1 year ago
Yo, I just heard about MidJourney this year.

And this appears to be a local-runtime Stable Diffusion streaming library?

Bruh.
programjames, over 1 year ago
This paper is horribly written. It's like the authors are trying to sell me on themselves as researchers instead of helping me understand their research (y'know, the entire reason journals got started?). An entire section for "stream batching" was just too much, and none of their ideas were innovative or unique. It was incredibly dense simply because it's obfuscated, which makes me believe the authors themselves don't really understand what they're doing.

The results aren't even very good. They claim a 60x speedup, but compared to what? HuggingFace's Diffusers AutoPipeline... a company notorious for buggy code and inefficient pipelines. And that's for *naively* running the pipeline on every image. Give me a break.