
Weep for Graphics Programming

367 points by samps about 9 years ago

28 comments

heavenlyhash about 9 years ago
Upvoted simply because -- notwithstanding all of the entirely valid complaints -- this is actually the short, sweet introduction to GL that I would've loved when I first picked it up.

All of these things took me some serious time to learn. Partly because of my sheer incredulity at the stringliness of it all. Finding good GL tutorials is hard, after all: "surely", I thought, "these are just bad examples, and I should keep looking for a better way".

A quick, frank throwdown of "this is the way it is" like this would've gotten me over that hump much faster. Even if it's crushing, I'm filing this link away for "must read" when someone asks me where to get started in GL programming.

kvark about 9 years ago
In the Rust ecosystem, we've been experimenting with different ways to make CPU-GPU interaction safer when it comes to graphics abstractions.

In gfx-rs [1] one defines a Pipeline State Object with a macro, and then all the input/output data and resources are just fields in a regular Rust struct [2]. So assignment pretty much works as one expects. All the compatibility checks are done at run (init) time, when the shader programs are linked/reflected.

In vulkano [3] the Rust structures are generated at compile time by analyzing your SPIR-V code.

[1] https://github.com/gfx-rs/gfx
[2] https://github.com/gfx-rs/gfx/blob/2c00b52568e5e7da3df227d415eab9f55feba5a9/examples/shadow/main.rs#L314
[3] https://github.com/tomaka/vulkano

Athas about 9 years ago
I'm not sure why bytecode is supposed to be significantly better than storing the shaders as strings. Sure, you get rid of a complex parser and can more easily use it as a compiler target, but the core problem remains: the compiler still cannot reason across the device boundary. The CPU compiler does not understand what happens on the GPU, and the GPU compiler has no idea what the CPU is going to do to it.

Although I'm not a big fan of CUDA, it does have an advantage in that it combines both worlds in a single language. This *does* permit optimisations that cross device boundaries, but I have no idea whether the CUDA compiler does any of this.

Apologies for tooting my own horn, but I have been working on a language that can optimise on both sides of the CPU-GPU boundary: http://futhark-lang.org/ (Although it is more high-level and less graphics-oriented than what I think the author is looking for.)

zamalek about 9 years ago
I spent a lot of time making toy engines with XNA and carried many of its lessons over to my toy engines in C++.

*If your asset can be built, then your asset should be built.* Content pipelines.

> Shaders are Strings

If you're using Vulkan, compile your shaders to SPIR-V alongside your project. If you're using OpenGL you're mostly out of luck - but there's still no reason to inline the shaders even with the most rudimentary content manager. Possibly do a syntax pass while building your project.

> Stringly Typed Binding Boilerplate

If you build your assets first then you can generate the boilerplate code with strong bindings (grabbing positions etc. in the ctor, publicly exposed getters/setters). I prototyped this in XNA but never really made a full solution.

Graphics APIs are zero-assumption APIs - they don't care how you retrieve assets. Either write a framework or use an opinionated off-the-shelf engine. Keeping it this abstract allows for any sort of imaginable scenario. For example: in my XNA prototype, the effects (shaders) were deeply aware of my camera system. In the presence of strong bindings I wouldn't have been able to do that.

Changing the way that the APIs work (to something not heterogeneous) would require Khronos, Microsoft and Apple to define the interaction interface for *every single language that can currently use those APIs*.

It's loosely defined like this for a reason - these are low-level APIs. It's up to you to make a stronger binding for your language.
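
The kind of generated strong bindings described above might look roughly like this in C++ over desktop OpenGL; the class name, uniform names, and GL loader are hypothetical, and a real generator would emit this from the shader source at build time:

```cpp
// Hypothetical output of a build-time binding generator for a "basic" effect.
// Uniform names come from the shader source, so a typo becomes a C++ compile
// error instead of a silent -1 location at run time.
#include <GL/glew.h>      // any GL loader works; GLEW is just an assumption
#include <glm/glm.hpp>

class BasicEffect {
public:
    explicit BasicEffect(GLuint program) : program_(program) {
        // All string-based lookups happen once, in the constructor.
        u_model_view_ = glGetUniformLocation(program_, "u_ModelView");
        u_tint_       = glGetUniformLocation(program_, "u_Tint");
    }

    // Strongly typed setters; the stringly-typed part never leaks out.
    // glProgramUniform* needs GL 4.1; use glUseProgram + glUniform* on older GL.
    void setModelView(const glm::mat4& m) {
        glProgramUniformMatrix4fv(program_, u_model_view_, 1, GL_FALSE, &m[0][0]);
    }
    void setTint(const glm::vec4& c) {
        glProgramUniform4fv(program_, u_tint_, 1, &c[0]);
    }

private:
    GLuint program_ = 0;
    GLint  u_model_view_ = -1;
    GLint  u_tint_ = -1;
};
```

The string lookups still exist, but they are generated and confined to one place rather than scattered through the renderer.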

junke about 9 years ago
Young fool, only now, at the end, do you understand... code is data.

CEPL demos: https://www.youtube.com/watch?v=2Z4GfOUWEuA&list=PL2VAYZE_4wRKKr5pJzfYD1w4tKCXARs5y

I get it, there are many good reasons to stick to C++ if you want to ship a game today.

rjmunro about 9 years ago
I don't think the representation of shaders as strings vs bytecode is a problem at the low level. Strings are slightly inefficient, but you could just pretend that they are bytecode, where the bytes are all restricted to the ASCII range.

What is required is a language where the compiler can analyse the code and decide what to do on the CPU and what to do on the GPU as part of its optimisation. It could even emit different versions for different CPU/GPU combinations, or JIT compile on the target machine once it knows what the actual capabilities are (maybe in some cases it makes sense to do more on the CPU if the GPU is less powerful). You could possibly also run more or even all code on the CPU to enable easier debugging.

The language could be defined at a high enough level that it can be statically analysed and verified, which I think would answer the criticisms in the article.

haxiomic about 9 years ago
The Haxe-language project 'hxsl' takes a pretty good stab at improving the situation. With hxsl you write your GPU and CPU code in the same language (Haxe) and it generates shader strings (in GLSL or AGAL) at compile time along with CPU code for whatever platform you're targeting.

There's not a lot of documentation around it at the moment, but the author explains a little about it in this video: https://youtu.be/-WeGME_T9Ew?t=31m49s

Source code on GitHub: https://github.com/ncannasse/heaps/tree/master/hxsl

Negative1 about 9 years ago
I think his conclusion is valid; compilers can now handle register assignment and other boilerplate pretty well (Metal does this superbly and SPIR-V seems very good).

My only qualm is that he treats these aspects that he deems outdated as unnecessary, which is just not true. For a long time there was no other alternative. Introducing high-level languages for programmable shading was a huge deal and it actually decreased a lot of complexity. In reality it simplified a lot of stuff that was quite difficult before GLSL/HLSL came along.

He seems to be making some kind of rallying call for change, but the next generation is already here. We'll have to keep supporting the old approach for a while longer, but the problem really has been solved (to some extent).

Also, old person rant: "Back in my day, we only had 8 register combiners and 4 blend states. And we liked it!"

kirillkh about 9 years ago
When writing my first OpenGL code, I was stunned by the amount of boilerplate required. It is so bad it reminds me of COBOL. No one in their right mind would tolerate that monstrosity in a normal CPU program (except, possibly, some hardcore C++ fans).

I think in order to solve this, we need three things:
1) Intermediate representation for compiled GPU code. The bytecode mentioned in the article sounds like it.
2) Cross-platform GPU programming support in our programming languages' standard libraries.
3) Compiler plugins and DSLs that output the IR and link to the library.
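
For readers who haven't written this yet, a condensed sketch of the boilerplate in question - plain desktop OpenGL calls in C++, with error handling mostly omitted:

```cpp
#include <GL/glew.h>
#include <cstdio>

// The shaders travel as string literals inside the host program; nothing
// checks them until the driver compiles them at run time.
static const char* kVertSrc =
    "#version 330 core\n"
    "layout(location = 0) in vec4 a_Position;\n"
    "void main() { gl_Position = a_Position; }\n";

static const char* kFragSrc =
    "#version 330 core\n"
    "uniform vec4 u_Tint;\n"
    "out vec4 fragColor;\n"
    "void main() { fragColor = u_Tint; }\n";

GLuint buildProgram(const char* vertSrc, const char* fragSrc) {
    GLuint vs = glCreateShader(GL_VERTEX_SHADER);
    glShaderSource(vs, 1, &vertSrc, nullptr);
    glCompileShader(vs);                  // syntax errors surface here, at run time

    GLuint fs = glCreateShader(GL_FRAGMENT_SHADER);
    glShaderSource(fs, 1, &fragSrc, nullptr);
    glCompileShader(fs);

    GLuint prog = glCreateProgram();
    glAttachShader(prog, vs);
    glAttachShader(prog, fs);
    glLinkProgram(prog);

    GLint ok = GL_FALSE;
    glGetProgramiv(prog, GL_LINK_STATUS, &ok);
    if (!ok) {
        char log[1024];
        glGetProgramInfoLog(prog, sizeof log, nullptr, log);
        std::fprintf(stderr, "link failed: %s\n", log);
    }
    glDeleteShader(vs);
    glDeleteShader(fs);
    return prog;
}

// Every uniform is then addressed by yet another string:
//   GLint loc = glGetUniformLocation(prog, "u_Tint");
//   glUniform4f(loc, 1.0f, 0.5f, 0.0f, 1.0f);
```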

kelvin0 about 9 years ago
It gets much worse than that, for anyone having shipped AAA games on multiple consoles:
1) Ubershader program files (#ifdefs peppered generously everywhere)
2) Each console has its own quirks and gotchas that you try to abstract
3) The CPU/GPU interface is a nightmare and debugging is as painful as it gets.

Arnsaste about 9 years ago
He listed some of the downsides of the OpenGL approach to shaders, but he forgot to address the (IMHO) really big advantage: you can use OpenGL from every programming language that is able to call C functions, which is basically every programming language.

Somebody in this thread mentioned CUDA, which is great, but it also has the downside that you practically have to use C++ on the CPU side. You can also use e.g. Python, but when you do you are back to compiling CUDA code at runtime.

Sure, you could argue that we can simply create a new programming language for applications that use the graphics card (or use C++ like CUDA). The problem with this is that few people would use it just because it's a bit easier to do CPU-GPU communication. And there is a second, much larger problem: different graphics API vendors would create different programming languages, which makes it much harder for an application to support multiple graphics APIs, as a lot of games do today with DirectX and OpenGL.

Perhaps there is another, better solution, but I can't see it right now.

chongli about 9 years ago
What we really need is a project to do for GPUs what RISC-V [0] is about to do for CPUs. It's high time we as a society broke away from the stranglehold proprietary companies such as Nvidia and Intel have over us. It's time for a completely open computing platform for all of us to use to the fullest advantage of society.

[0] https://en.m.wikipedia.org/wiki/RISC-V

exDM69 about 9 years ago
The issues pointed out by OP are mostly gone in Vulkan, the new graphics API from Khronos. Shaders are shipped in SPIR-V bytecode binary format, and the binding of resources to shader inputs is more memory-oriented.

Time to go learn a new API :)
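
For contrast with the GLSL-as-strings flow, a minimal sketch of how Vulkan consumes a shader: the SPIR-V is produced offline (e.g. by glslangValidator) and the application hands the binary words to the driver. The file name and error handling are illustrative only:

```cpp
#include <vulkan/vulkan.h>
#include <cstddef>
#include <cstdint>
#include <fstream>
#include <stdexcept>
#include <vector>

// Load a SPIR-V binary produced at build time (e.g. "shader.frag.spv").
std::vector<uint32_t> loadSpirv(const char* path) {
    std::ifstream file(path, std::ios::binary | std::ios::ate);
    if (!file) throw std::runtime_error("missing SPIR-V file");
    std::size_t bytes = static_cast<std::size_t>(file.tellg());  // SPIR-V size is a multiple of 4
    std::vector<uint32_t> words(bytes / sizeof(uint32_t));
    file.seekg(0);
    file.read(reinterpret_cast<char*>(words.data()), bytes);
    return words;
}

VkShaderModule createShaderModule(VkDevice device, const std::vector<uint32_t>& code) {
    VkShaderModuleCreateInfo info{};
    info.sType    = VK_STRUCTURE_TYPE_SHADER_MODULE_CREATE_INFO;
    info.codeSize = code.size() * sizeof(uint32_t);   // size in bytes
    info.pCode    = code.data();                      // pre-compiled SPIR-V words

    VkShaderModule module = VK_NULL_HANDLE;
    if (vkCreateShaderModule(device, &info, nullptr, &module) != VK_SUCCESS)
        throw std::runtime_error("vkCreateShaderModule failed");
    return module;
}
```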

joeld42 about 9 years ago
CUDA sounds like the kind of "heterogeneous" model the author is suggesting. Unfortunately it has its own set of problems.

I'm not really sure what kind of API the author wants. Sure, it's annoying that data in GPU-land and CPU-land are difficult to get to work together. But that's not the API's fault; it's because they are physically very far removed from each other. They don't share memory (and if they did you'd still have to lock and manage it). You could make an API that made them seem more transparent to the programmer, but then you're back to the GL 1.0 mess where you have zero control and the driver does dumb things at random times because it doesn't know anything about the program that is executing.

csabahruska about 9 years ago
In lambdacube 3d you can program the CPU/GPU (OpenGL+GLSL) in one typed language. (http://lambdacube3d.com/)

_yosefk about 9 years ago
OpenCL was moving to LLVM IR away from programs-as-source-code-strings at some point. Still, you ought to have a JIT step if you want to be able to ship the same binary to multiple systems with GPUs not compatible at the binary level. And BTW, shipping LLVM IR on the CPU would make things better in terms of supporting CPUs with incompatible ISAs... whether that ever overtakes native binaries is a question (it did to an extent, of course, with the JVM and what-not). Of course you could move the JIT off the client device completely and do it at the App Store - prepare a bunch of binaries for all the devices out there - but still, the developers will have shipped IR.

Once you get to shipping IR instead of strings, which is nice I guess in that it should make initialization somewhat faster and the GPU drivers somewhat smaller, I'm not sure what you're going to get from being able to treat a CPU/GPU program as a single whole. Typically the stuff running on the GPU or any other sort of accelerator does not call back into the CPU - these are "leaf functions", optimized as a separate program, pretty much. I guess it'd be nice to be able to optimize a given call to such a function automatically ("here the CPU passes a constant so let's constant-fold all this stuff over here"). The same effect can be achieved today by creating GPU wrapper code passing the constants and having the CPU call that, so avoiding this doesn't sound like a huge deal. Other than that, what big improvement opportunities am I missing due to the CPU compiler not caring what the GPU compiler is doing and what code the GPU is running?

(Not a rhetorical question, I expect to be missing something; I work on accelerator design but I haven't ever thought very deeply about an integrated CPU+accelerator compiler, except for a high-level language producing code for both the CPU and some accelerators - but this is a different situation, and there you don't care what, say, OpenGL or OpenCL do: you generate both, you're the compiler, and you use whatever opportunities for program analysis your higher-level language gives you. Here I think the point was that we miss opportunities due to not having a single compiler analyzing our C/C++ CPU code and our shader/OpenCL/... code as a single whole - it's this area where I don't see a lot of missed opportunities, and I'm asking where I'm wrong.)
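
A concrete, if hypothetical, illustration of the "wrapper passing the constants" idea: with string shaders you can get the same constant-folding effect today by splicing the known value into the source before handing it to the driver, so the GPU compiler sees a literal instead of a uniform. The %RADIUS% placeholder is just a convention invented for this sketch:

```cpp
#include <cstddef>
#include <string>

// Specialize a blur shader for a kernel radius the CPU side knows is fixed,
// so the GPU compiler sees a literal it can unroll and fold, rather than a
// uniform it must assume is dynamic.
std::string specializeBlurShader(std::string src, int radius) {
    const std::string token = "%RADIUS%";
    for (std::size_t pos = src.find(token); pos != std::string::npos; pos = src.find(token, pos))
        src.replace(pos, token.size(), std::to_string(radius));
    return src;   // feed the result to glShaderSource as usual
}
```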

xigency about 9 years ago
It seems unlikely that any of this will change, because control is in the hands of a relatively few powerful organizations who honestly have no interest in making life easier for software engineers working on graphics programming; the engineers who do have to put up with it have long since given up or chalked it up to experience; and those developers would probably resist any changes now simply because change requires more work and learning how to do things all over again.

The hardware developers (AMD/nVidia/Intel) have an interest in not changing their device drivers, the software vendors (OpenGL and DirectX) have little interest in redeveloping technology, and the software developers with the most capital, game engine developers, have already found workarounds and hire enough engineers to plug leaks in their lifeboats. The state of tools for game developers is so shoddy as well that trying to retrofit language compilers and shader compilers to work together seems like a drawn-out task.

It's sort of a David and Goliath situation if you think you can change the graphics programming landscape on your own. Plus, we all know how poorly these standards are developed over time.

vanderZwan about 9 years ago
Would Halide [0] be what he's looking for? It's a DSL, sure, but one that is embedded in C++, so you can at least program your whole pipeline in place, giving you a good grasp of the flow of the program(s).

[0] http://halide-lang.org/
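
A minimal sketch of what the embedding looks like, loosely following Halide's introductory tutorial; exact scheduling calls and realize() signatures vary between Halide releases, so treat the details as illustrative:

```cpp
#include "Halide.h"
using namespace Halide;

int main() {
    Var x("x"), y("y");
    Func gradient("gradient");

    // The algorithm: what each pixel is, written once in ordinary C++.
    gradient(x, y) = x + y;

    // The schedule: how and where it runs is a separate decision; the same
    // definition can be retargeted at CPU vectorization or a GPU backend.
    gradient.vectorize(x, 8).parallel(y);

    Buffer<int32_t> out = gradient.realize({800, 600});
    return 0;
}
```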

ixtli about 9 years ago
Very well written. I get the sense that the reason we're all here now is that historically the people who use these APIs as client programmers have been somewhat bad at explaining what they want.

IvanK_net about 9 years ago
I did not find anything interesting or useful in this article.

The author just says that OpenGL and "shaders as strings" are bad, without explaining why. Then the author speaks about some mysterious "heterogeneity" without specifying what it means.

"We need programming models that let us write one program that spans multiple execution contexts" - what does that mean? How should that model look, and how should it be different? It is like saying "cars with 4 wheels are bad, cars with a different number of wheels would be better".

DonHopkins about 9 years ago
DreemGL [1] [2] addresses these problems by compiling JavaScript code into shaders.

"DreemGL is an open-source multi-screen prototyping framework for mediated environments, with a visual editor and shader styling for webGL and DALi runtimes written in JavaScript." [3]

[1] https://github.com/dreemproject/dreemgl
[2] http://docs.dreemproject.org/docs/api/index.html#!/guide/dreem_in_10_part1
[3] https://dreemproject.org/

rawnlq about 9 years ago
One interesting library for building shaders is: http://acko.net/blog/shadergraph-2/ or https://github.com/unconed/shadergraph

Made by the guy behind MathBox.

robbies about 9 years ago
I think the author has good intentions (don't we all), but I don't think he understands enough about graphics programming to make some of the proclamations/requests in the article. Not that I blame him... it's hard to get an understanding of GPU programming outside the scope of a game dev or IHV.

> To define an object's appearance in a 3D scene, real-time graphics applications use shaders...

Eh, the shaders are just a part of the GPU pipeline that transforms your vertices, textures, and shaders into something interesting on the screen. This is already oversimplifying what GPUs are trying to do for the base case of graphics.

> the interface between the CPU and GPU code is needlessly dynamic, so you can't reason statically about the whole, heterogeneous program.

Ok, so what is the proposed solution here? You have a variety of IHVs (NV, AMD, Intel, ImgTec, ARM, Samsung, Qualcomm, etc). Each vendor has a set of active architectures that each have their own ISA. And even then, there are sub-archs that likely require different accommodations in ISA generation depending on the sub-rev.

So in the author's view of just the shader code, you already have the large problem of unifying the varieties of ISA under some... homogeneous ISA, like an x86. That's a non-trivial problem. What's the motivation here? How will you get vendors to comply?

I think right now, SPIR-V, OpenCL, and CUDA aren't doing a _bad_ job in trying to create a common programming model where you can target multiple hardware revs with some intermediate representation, but until all the vendors team up and agree on an ISA, I don't see how to fix this.

*On top of that*, that isn't even really the only important bit of programming that happens on GPUs. GPUs primarily operate on command buffers, of which there is nary a mention in the article. So even if we address the shader cores inside the GPU, what about a common model for programming command buffers directly? Good luck getting vendors to unify on that. Vulkan/DX12/Metal are good (even great) efforts in exposing the command buffer model. You couldn't even _see_ this stuff in OpenGL and pre-DX12 (though there were display lists and deferred contexts, which kinda exposed the command buffer programming model).

> To use those parameters, the host program's first step is to look up location handles for each variable...

Ok, I don't blame the author for complaining about this model, but this is an introductory complaint. You can bind shader inputs to 'registers', which map to API slots. So with some planning, you don't need to query location handles if you specify them in the shader in advance. I think this functionality existed in Shader Model 1.0, though I can't find any old example code for it (2001?).

That being said, I certainly don't blame the author for not knowing this, as I think this is a common mistake made by introductory graphics programmers, because the educational resources are poor. I don't think I ever learned it in school... only in the industry was this exposed to me, to my great joy. Though I am certain many smarter engineers figured it out unprompted.

> OpenGL's programming model espouses the simplistic view that heterogeneous software should comprise multiple, loosely coupled, independent programs.

Eh, I don't think I want a common programming model across CPUs and GPUs. They are fundamentally different machines, and I don't think it makes sense to try to lump them together. I don't think we can just assume that we can use the same programming methodologies for a single- vs. multi-threaded program. I know that plenty tried, but I thought the best method of addressing the differences was education and tools. I'd advocate that the most effective way for GPU programming to become more accessible will be education and tools. I have hope that the current architecture of the modern 'explicit' APIs will facilitate that movement.
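
The "bind to registers/slots in advance" approach mentioned above looks roughly like this in modern desktop GLSL: layout qualifiers pin the uniform locations in the shader text, so the host side can use the same constants without any glGetUniformLocation queries. The locations and names here are made up for the sketch, and explicit uniform locations need GL 4.3 or ARB_explicit_uniform_location:

```cpp
#include <GL/glew.h>

// GLSL side: uniform locations are pinned in the shader text itself.
static const char* kFragSrc = R"(#version 430 core
layout(location = 0) uniform vec4  u_Tint;
layout(location = 1) uniform float u_Exposure;
out vec4 fragColor;
void main() { fragColor = u_Tint * u_Exposure; }
)";

// Host side agrees on the same numbers by convention - no name lookups needed.
enum UniformLocation : GLint { kTintLoc = 0, kExposureLoc = 1 };

void setFrameUniforms(float exposure) {
    // Assumes the program built from kFragSrc is currently bound.
    glUniform4f(kTintLoc, 1.0f, 1.0f, 1.0f, 1.0f);
    glUniform1f(kExposureLoc, exposure);
}
```

HLSL's register() syntax and Vulkan's layout(set, binding) qualifiers express the same contract in their respective APIs.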

onetimePete about 9 years ago
Coding for a GPU is basically embedded programming - and the JIT step is needed to avoid platform adaptation costs.

scrumper about 9 years ago
Slightly OT, but the literate presentation was nice. Any particular tool or technique you used for that?

Thanks

cosmicexplorer about 9 years ago
I think this is what OpenACC (https://en.wikipedia.org/wiki/OpenACC) was created to do; its support is pretty iffy right now, though.
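
For the curious, OpenACC takes the directive route: ordinary C/C++ loops are annotated with pragmas and an OpenACC-capable compiler (PGI/NVHPC, recent GCC) decides what to offload. A minimal, illustrative sketch:

```cpp
#include <cstddef>

// SAXPY with the loop offloaded to an accelerator when one is available;
// without OpenACC support the pragma is ignored and the loop runs on the CPU.
void saxpy(std::size_t n, float a, const float* x, float* y) {
    #pragma acc parallel loop copyin(x[0:n]) copy(y[0:n])
    for (std::size_t i = 0; i < n; ++i)
        y[i] = a * x[i] + y[i];
}
```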

tdsamardzhiev about 9 years ago
Can somebody with Direct3D experience give an opinion on whether it's any better?

sklogic about 9 years ago
Source code or IR or whatever else would always be needed, I'm afraid, with a late (and, likely, unpredictable) compilation. Passing an image of a different format may trigger a dynamic shader (or OpenCL kernel) recompilation on some GPUs, for example.