Ubershaders: A Ridiculous Solution to an Impossible Problem (2017)

222 点作者 Grognak大约 1 年前

11 条评论

phire大约 1 年前

Has it really been 9 years since I started working on Ubershaders?I'm a little surprised no better solution has come along. Vulkan didn't even exist back then (and DirectX 12 had only just released) but instead of making things better, it digs it's feet even deeper into the assumption that all shaders will be known ahead of time (resulting in long "shader recompilation" dialogs on startup on many games).I've been tempted to build my own fast shader compiler into Dolphin for many common GPU architectures. Hell, it wouldn't even be a proper compiler, more of a templated emitter as all shaders fit a pattern. Register allocation and scheduling could all be pre-calculated.But that would be even more insane than ubershaders, as it would be one backend per gpu arch. And some drivers (like Nvidia) don't provide a way to inject pre-compiled shader binaries.On the positive side, ubershaders do solve the problem, and modern GPU drivers do a much better job at accepting ubershaders than they did 9 years ago. Though that's primarily because (as far as I'm aware) examples of Dolphin's ubershader have made their way into every single shader compiler test suite.

评论 #40400943 未加载

评论 #40395067 未加载

评论 #40400908 未加载

评论 #40395632 未加载

sfink大约 1 年前

It's interesting to see the parallels between this and an engine for a dynamic programming language. The one I'm most familiar with is JavaScript.When you first need to run something, you run it on the interpreter (JS) / ubershader (Dolphin). But once you know it's going to be run repeatedly (rarely for JS, almost always for Dolphin), you kick off an async compilation to produce JIT code (JS) / a specialized shader (Dolphin). You continue running in the expensive mode (interpreter / ubershader) until the compilation is complete, then you switch over seamlessly.

评论 #40400264 未加载

评论 #40397780 未加载

GaggiX大约 1 年前

The shader compilation stutter reminds me of a video I recently saw where a developer solved the problem by running a large portion of his game during its first loading: <a href="https://youtu.be/oG-H-IfXUqI" rel="nofollow">https://youtu.be/oG-H-IfXUqI</a>The developer register himself playing the game and during the first loading of the game, the entire gameplay is replayed at high speed in the background on the machine.

corysama大约 1 年前

The pixel shading of the GameCube were slower than that of the OG Xbox. But, it was quite a bit more flexible. Specifically, the GameCube could load a couple textures, do a bit of math, then use that math to load some more texels. The Xbox could only load textures as the starting instructions before doing math and tried to make up for that with a few "do very specific math and load textures in a single instruction" ops.But, still... Both GPUs were pretty well suited for this ubershader approach because they had a small, fixed limit on the number of instructions they could run. And, very strictly defined functionality for each instruction. They weren't really "shaders" as much as highly flexible fixed function stages that you could reasonably wedge in a text shader compiler as a front end and only get a moderate to high amount of complaints about how strict and limited the rules were for the assembly. I recall that both shading units could reasonably be fully specified as C structs that you manually packed into the GPU registers instead of using a shader compiler at all.

评论 #40394584 未加载

评论 #40394523 未加载

dang大约 1 年前

Discussed at the time:Ubershaders: A Ridiculous Solution to an Impossible Problem - <a href="https://news.ycombinator.com/item?id=14884992">https://news.ycombinator.com/item?id=14884992</a> - July 2017 (88 comments)

popcar2大约 1 年前

This is a really neat article because the Godot engine is adding Ubershaders as well to fix shader compilation stuttering: <a href="https://github.com/godotengine/godot/pull/90400">https://github.com/godotengine/godot/pull/90400</a>

评论 #40399495 未加载

doophus大约 1 年前

What was the missing piece for "shader sharing"?Would it be possible to build a web-hosted database of encountered shader configs against a game id, and have Dolphin fetch that list when a game launches and start doing async compilation?When Dolphin encounters a new shader that wasn't in the db, it phones home to request it to be added it to the list.I feel an automated sharing solution would build up coverage pretty quickly, and finding a stutter would eventually be considered an achievement - "no-one's been here before!"

评论 #40397119 未加载

conorpo大约 1 年前

Does anyone know why this isn't an issue for modern games on PC? I assume it's because more uniforms are used, and the amount of shaders that actually need to be compiled at runtime is minimized, not to mention that the Graphics API is optimized to compile the shaders in the format they are provided. So is the issue with Dolphin that GameCube games would compile new shaders for lots of different configurations of effects / stages? Would some sort of preprocessor that converts shader compilations to some mini-ubershader with uniforms that can handle a lot of the different effects be feasible? And then depending on how many completely different shaders there are you would have many different mini-ubershaders?

评论 #40396806 未加载

评论 #40395575 未加载

评论 #40395882 未加载

评论 #40397727 未加载

评论 #40395769 未加载

评论 #40395498 未加载

nightowl_games大约 1 年前

I've thought about writing a GPU side interpreter for SDF definitions for a while. I made a SDF shader generator that dumps out shaders with hard coded values, but doing it with bytecode would be cool. I'm sure this has been done before..

smallstepforman大约 1 年前

I’m suprised to see that Ubershaders still exist, most game engines have settled on a set of fit-for-purpose custom shaders with almost no conditionals (for performance reasons), which is the opposite of UberShaders.

评论 #40398555 未加载

评论 #40400667 未加载

DrNosferatu大约 1 年前

Why not just cache the shader compilation output and save it do disk? Only stutters or object pops on the 1st run.