Literate programming is much more than just commenting code

173 点作者 increscent大约 3 年前

30 条评论

falcolas大约 3 年前

My favorite literate program still has to be the book "Physically Based Rendering". An optimized, feature rich ray tracer in the form of a textbook.That said, I wouldn't personally want to try and collaborate on such a program with more than one other person. It would make for a great single-contributer OSS library though. Rubber duck debugging built right into the prose.

评论 #30762442 未加载

评论 #30762217 未加载

svat大约 3 年前

I would go further: literate programming is not just "much more than" commenting code, because you can do LP without commenting much. The main thing in LP is the idea/orientation of writing as if you're writing something for a human reader. This does often lead to more comments, but even something like "here's the code" followed by lots of code can be LP, if you deem it sufficient for your intended audience. (Earlier comment of mine about target audience and not over-commenting: <a href="https://news.ycombinator.com/item?id=29871047" rel="nofollow">https://news.ycombinator.com/item?id=29871047</a>)This works well for people who are writers by nature (like Knuth who's always making edits and improvements to his books <a href="https://news.ycombinator.com/item?id=30149221" rel="nofollow">https://news.ycombinator.com/item?id=30149221</a>). One problem though (and there are several) is that because this is so personal, nearly everyone who seriously tries LP ends up writing their own LP tool (including the author of this post!).

评论 #30770841 未加载

antirez大约 3 年前

I think likewise. When I had to write the radix tree implementation for Redis I faced two problems:- I needed a stable implemention as soon as possible, I had a performance issued that needed to be solved by range queries.- The radix tree was full of corner cases.So I resorted to literate programming, which is in general very near to my usual programming style. You can find it in the rax.c file inside the Redis source code, as you can see as the algorithm is enunciated, the corresponding code is inplenented.Other than that I wrote a very extensive fuzzer for the implementation. Result: after the initial development I don't think it was never targeted by serious bugs, and now the implementation is very easy to modify if needed.

评论 #30767862 未加载

评论 #30773302 未加载

BeetleB大约 3 年前

The problems one will run into with literate programming:1. Lack of tooling.2. Refactoring becomes nontrivial3. How one would write a program in literate style will vary widely from person to person. If you write your code in literate style, it may be easy for you to follow it years later and modify it, but it likely will not be the case for a coworker. If they have to modify the code, the cognitive load will not be too different from that of just dealing with well written code.Disclaimer: I've written two nontrivial programs literate style that I continue to rely on and occasionally modify years after writing them. It works as advertised.

评论 #30768695 未加载

评论 #30770869 未加载

评论 #30762129 未加载

yumiris大约 3 年前

Literate programming has been particularly useful for my "dotfile" configurations, such as .emacs, .vimrc, .zshrc and even the .gitconfig file.I use one .org file to declare all of my configurations, and tangle them together into the aforementioned files. This keeps things pretty portable, and makes up for the unintuitive readability of many dotfiles.It can also work for rudimentary shell scripts and other single-file goodies; however, scaling it to proper multi-file programs proves to be difficult, especially when multiple developers are involved.

评论 #30762339 未加载

sritchie大约 3 年前

Literate programming is going to feel far more powerful when we expand the definition to include:- Smalltalk-ish things like writing suites of custom viewers for various types, - demos and examples in-line inside of a library - multiple stories about the same piece of code, but all with the ability to IMPORT the story as a libraryI've been writing sicmutils[0] as a "literate library"; see the automatic differentiation implementation as an example[1].A talk I gave yesterday at ELS[2] demos a much more powerful host that uses Nextjournal's Clerk[3] to power physics animations, TeX rendering etc, but all derived from a piece of Clojure source that you can pull in as a library, ignoring all of these presentation effects.Code should perform itself, and it would be great if when people thought "LP" they imagined the full range of media through which that performance could happen.[0] sicmutils: <a href="https://github.com/sicmutils/sicmutils" rel="nofollow">https://github.com/sicmutils/sicmutils</a>[1] autodiff namespace: <a href="https://github.com/sicmutils/sicmutils/blob/main/src/sicmutils/differential.cljc#L21" rel="nofollow">https://github.com/sicmutils/sicmutils/blob/main/src/sicmuti...</a>[2] Talk code: <a href="https://github.com/sritchie/programming-2022" rel="nofollow">https://github.com/sritchie/programming-2022</a>[3] Clerk: <a href="https://github.com/nextjournal/clerk" rel="nofollow">https://github.com/nextjournal/clerk</a>

评论 #30770775 未加载

QuikAccount大约 3 年前

I like literate programming in theory but the most common response I see to it is that writing self documenting code is better because as you are working on a code base with many people, it is unlikely they will keep your prose up to date as the code is changed.

评论 #30762422 未加载

评论 #30761630 未加载

评论 #30761753 未加载

评论 #30763166 未加载

评论 #30762287 未加载

lf-non大约 3 年前

I am not a big fan of the complex literate programming style involving code-generation which this article talks about.But I recently discovered that Google's zx [1] scripting utility supports executing scripts in markdown documents and I combined it with httpie [2] and usql [3] for a bit of quick and dirty automation testing and api verification code and it worked out pretty well.I imagine for most people nowadays jupyter or vscode notebooks are the closest it comes to practical literate programming.[1] <a href="https://github.com/google/zx#markdown-scripts" rel="nofollow">https://github.com/google/zx#markdown-scripts</a>[2] <a href="https://github.com/httpie/httpie" rel="nofollow">https://github.com/httpie/httpie</a>[3] <a href="https://github.com/xo/usql" rel="nofollow">https://github.com/xo/usql</a>

andrewshadura大约 3 年前

ifupdown, the Debian tool to manage network interfaces, used to be written in literate C using noweb. When I took it over from the original author, I struggled to understand how it worked. I had to print out the weaved version of it, and read it making notes on the paper. I eventually managed to make sense of it, but making any change was very difficult, so I ended up converting it to plain C, adding some comments from the original literate source and reindenting.

foxdeploy大约 3 年前

This talked about writing code for humans then immediately jumped into some arcane mathematic scrawl like the stuff when Sephiroth casts supernova

评论 #30761999 未加载

vim-guru大约 3 年前

I've written a fair share of literate code.It works well for personal stuff where you would like to leave some bits of information for yourself (typically, configuration files).It works well for small libraries where good documentation is important.It works well for visualisation-work, where you may combine multiple languages and data-formats without writing API's for each.In larger scale apps though and with collaboration; you run into problems with tooling on multiple levels. I am working on tackling scale, but collaboration is tricky. Mostly because you need structure to collaborate and then you will likely end up with an outline that's pretty close to a directory-tree and then you've lost one of the good bits of literate code in my opinion.

atweiden大约 3 年前

I’d like to see a literate programming version of GitHub where the community standardizes around an eminently-readable Markdown-like syntax. srcweave [1] looks like a great start.[1]: <a href="https://github.com/justinmeiners/srcweave" rel="nofollow">https://github.com/justinmeiners/srcweave</a>

评论 #30762247 未加载

评论 #30763734 未加载

rektide大约 3 年前

Building a Habitable Computing Environment[1] was a recent blush i had with a "literate" computing project, this time less about programming specifically & about system setup/config.I confess I'd rather forgotten what literate specifically meant beyond code comments describing the flow, but i did find it to be a remarkably comprehensive & understandable document, a prime example of how we might teach & understand computing. Even if it did leave me puzzling out what a number of the many many many scripts were for!Certainly the overall project of computing needs a lot of help, ways to explain itself. Ive seen tons and tons and tons of "dotfiles" projects, but none have gotten anywhere near to as comprehensible as this literate programming project, from what I've seen.[1] <a href="https://tess.oconnor.cx/config/hobercfg.html" rel="nofollow">https://tess.oconnor.cx/config/hobercfg.html</a> <a href="https://news.ycombinator.com/item?id=30748033" rel="nofollow">https://news.ycombinator.com/item?id=30748033</a> (19 points, 1d ago, 0 comments)

mci大约 3 年前

IMHO, attempts to show literate programming on screen are doomed to meet with mediocre success. DEK invented literate programming with printed books in mind. I dare say that the only successful literate programs are books printed on paper.First, in a printed book, it is easier to find a previous page and compare a fragment on it with the current fragment. Second, a printed book has no links tempting you with the words "CLICK ME" to disrupt the flow so you can read it from cover to cover with fewer distractions. Third, anecdotally, I can see flaws much easier on a printout than on screen, both in programs and in texts.

评论 #30763684 未加载

评论 #30764824 未加载

nonrandomstring大约 3 年前

This is great stuff. It's how all code and data research should be presented, where the document is the program and you can reproduce it as easily as you can read it. After years of using Pure Data (a visual dataflow) whose unofficial motto was "The diagram is the program" I got this philosophy stuck deep in my brain. Today I use Org-Mode (In Emacs) for tangling (with something called Babel) that can run source code from many languages as part of an active document.

评论 #30761238 未加载

dwohnitmok大约 3 年前

Are there any large (> 5 people teams) projects written with literate programming?Also are there any IDE plugins or error stack trace/debuggers for literate programming?I haven't really paid attention to literate programming in a long long time and I'm curious if the field has advanced.(Also I don't understand this: "A typical literate file produces many source files." Why? Why would you care about having multiple source files? Isn't the literate file the source at this point?)

评论 #30762188 未加载

评论 #30762301 未加载

评论 #30761302 未加载

评论 #30761566 未加载

hzhou321大约 3 年前

What are the key differences between a human audience and a machine interpreter? It is not the language or prose. It is the structure and order. For machines, details comes first. You declare all the actors and types with every non-forgiving annotations first. You may tuck the details into a header, but it still needs be ordered according to compilers and structured in the way that machines gets the details first. On the other hand, for human, it is top-down context oriented. The details are important, but not after we establish the right context.So for literate programming, if you just think it is how you write the code (e.g. self-documenting or not), or you think it is the amount of commenting (e.g. doc string or not), if you are not first and constantly thinking about how to structure your code and establish context, you are not getting literate programming.Now, once you understand your ends, the means (tangle or weave), will come along. It is easy to invent one if you don't have one. On the other hand, getting your coworkers to agree and work together, that's hard. It is easy to get machines to work together and it is easy for human to cope.

评论 #30761836 未加载

评论 #30762467 未加载

copperx大约 3 年前

It would be great if IDEs supported literate programming; the tangle/weave commands, simple as they are, create many possible points for navigation. An IDE would be ideal to go back and forth from the prose to the code.

评论 #30761418 未加载

评论 #30763200 未加载

ggm大约 3 年前

I have always felt a literate program is probably for many of us, a future deliverable on the hack we've implemented up front.Very very few people can start from the abstraction and get TO a literate outcome without a lot of false steps along the way.Or, as an alternative, the LOC of a literate program has to include the 100x cost of exploring how to carve it out of the block of mud we start from, including making our own tools.

评论 #30762102 未加载

评论 #30763108 未加载

derangedHorse大约 3 年前

Maybe I'm missing something, but I didn't find his way of programming to be all that more useful than just having well written code. The small code snippets are labeled and shown where they are referenced but this seems to mimic the functionality of functions which, when using an IDE like Visual Studio, can have where it's referenced identified through tooling.

评论 #30765361 未加载

silcoon大约 3 年前

Another solution has been implemented in Marginalia. Notes near complete source code.source code: <a href="https://github.com/gdeer81/marginalia" rel="nofollow">https://github.com/gdeer81/marginalia</a> example: <a href="http://gdeer81.github.io/marginalia/" rel="nofollow">http://gdeer81.github.io/marginalia/</a>

kkfx大约 3 年前

IMVHO literate programming is "describing an algorithm" like writing a book, witch is absolutely good, but demand much more time than directly writing code. That means: or we change actual "quick" development model to a new/old "slow" one, perhaps additive and coherent like classic systems (SmallTalk and LispM systems, the OS as a single application easy to change at runtime, anything available as a function/method anywhere) to keep the overall development speed useful enough or there is no room for literate programming.Now, seen actual overall software quality (far less hacky than the past, but also unable to innovate, bloated, with gazillions of deps) we need to change back to days of the real innovation BUT that means we need to completely erase actual economical model centered on giants, witch can be "a little bit" difficult since they are giants and they do not like the idea to be thrown out of the window...

fanf2大约 3 年前

It seems to me that literate programming was partly invented to escape from the rigid structure of Pascal programs; if Knuth had been using a language that allows use before declaration, literate programming would be just comments. Like literate Haskell.

eterevsky大约 3 年前

I tried reading some literate code, and I have troubles understanding it, compared to well structured normal code with moderate amount of comments.Sometimes when you are writing an article it may make sense to write LP-like snippets of code like<pre><code> int my_function() { // Initialize variables return 0; } </code></pre> but you don't really need to invent the whole "literate programming" concept to do this and you don't need to write all of your code like that.

shp0ngle大约 3 年前

All bigger programs I have seen that used literate programming were unreadable and I always wished they used something else when reading the source code.Maybe I saw bad examples though.

nesarkvechnep大约 3 年前

I think Literate Programming is fantastic when used to teach computational thinking.On a side note, it seems a lot of the other commenters miss one of the best "features" of LP - minimizing repetition. Chunks of code can be reused and so patterns can become clearly evident. Also, chunks can be defined "out-of-order".

mtm大约 3 年前

One of my favorite examples of a literate-style program is "cl-6502, A Readable CPU Emulator" by Brit Butler

0des大约 3 年前

OK, I'll bite.I stopped using 1 letter variables, abbreviations, and non descriptive function names. If a function or block can't be read and followed like a story, and without comments, it probably can be simplified.

krick大约 3 年前

Ok, that will be unpopular."Literate programming" is a non-invention by somebody (Knuth), who is very much revered by many programmers (many of whom never even actually read him), but who was — let's admit it — just terrible at writing readable code. I'm very much not a fan of the "Clean Code" by Martin, but he had a very nice example of refactoring some of Knuth's code to show you what I mean (although, it's kind of evident that writing clearly wasn't in Knuth's DNA just by reading his famous books). Today, this is an attempt to solve a problem, which you created yourself by avoiding using tools that already exist to solve that problem. Then you invent all sorts of tooling and mental tricks to make solving this problem your way more comfortable. But if you would just use these already existing tools, there would be no need in making up a new name for what you do. It wouldn't be some "literate programming", it would be just programming, the sane way.First off, what tools I'm talking about: well, that's everything PL developers invented over the decades, and it obviously depends on which PL you are going to use. If this is some pseudo-assembly language like what Knuth uses in his TAOCP, then, well, there aren't many such tools, so creating your own template-preprocessor (which is, in a sense, making a new PL with additional features on top of your pseudo-assembly) perhaps would be an okay-ish idea. But if you use something that people actually use for programming, then you surely have functions, some kind of advanced data structures, perhaps classes and inheritance, perhaps some templating features as well (like… traits?).Going back to the example at hand (the code author "simplifies"): all that "simplifying" consists of a top-down description of what he's going to do. Really, the code he ends up with (in "transpiled" form) isn't that much harder to read and understand than his "LP" version of it. Inline some comments to explain what he explains in the "LP" version, and you and up with the same thing, but much more concise (so, faster to read — and easier to edit!). If it was a bit more complicated: you do the same thing that he did with his "templating", but simply by doing what programmers actually do in such cases — extract complicated fragments of a function into smaller functions, and give them proper names. Maybe add some comments — yes, they are a part of your PL for a reason.Moreover, the most complicated thing in his example isn't how the algorithm is written down, but the very algorithm itself. It is ok as long as you never actually run this code, but if you actually use it in some useful program, where it can cause problems, a programmer coming across this thing would need to stop to wrap his head around what this is doing, if it's actually all subsets and how fast the call stack may grow (as it so often turns out when you use recursion to write down "an elegant" solution). I mean, I'm only suggesting, but wouldn't this be a little bit more straight-forward?<pre><code> function subsets(elements) { results = [] // All subsets of a set of 5 elements are basically binary numbers // from 00000 to 11111, which is from 0 to 2⁵-1 for (i in range(0, 2^(len(elements)) - 1) { results.add(get_subset_by_binary_number(elements, i))) } return results } // Blah-blah // Given [1, 2, 3, 4] and a number with binary representation 0101 // will return [2, 4] function get_subset_by_binary_number(set, number) { ... } </code></pre> This isn't my main point, though. My main point is, that people write code for a reason. There can be number of reasons, but usually it fits into a range from "doing some enerprisy-boilerplaity stuff I'll need to redo over again next week" and "writing a book, which has code, because it's about programming, and code describes programming better than english". In the first case it probably won't need a lot of "LP-kind of explanations", and where it needs to go over "why the fuck did I do it like that" a bit more extensively, you'll just link Jira issue in a comment. In the second case it might look a bit moe like LP, but it's just called "writing a book".In all of the cases in between you'll add some amount of comments, always trying to minimize overall amount of stuff other people will have to read (and, well, you to write), which is expressing as much as you can with words you cannot avoid to write (i.e., code that actually does things, explaining them both to humans and to a computer) and minimizing what you can avoid (i.e. English). (Closer to "a book" on this spectrum it will also include some Jupyter Notebooks.)

评论 #30766859 未加载

DeathArrow大约 3 年前

>Code should be written for humans not machines.Unfortunately, machines have a different way of understanding code than humans.