Nice article. Really easy-to-follow introduction.<p>I only discovered process substitution a few months ago, but it's already become a frequently used tool in my kit.<p>One thing I find a little annoying about unix commands is how hard they can be to google for. '<()', nope. "command as file argument to other command unix," nope. The first couple of times I tried to use it, I knew it existed but struggled to find any documentation. "Damnit, I know it's something like that, how does it work again?"<p>Unless you know to look for "Process Substitution", it can be hard to find information on these things. And that's once you even know they exist.<p>Anyone know a good resource I should be using when I find myself in a situation like that?
Once you discover <() it's hard not to (ab)use it everywhere :-)<p><pre><code> # avoid temporary files when some program needs two inputs:
join -e0 -o0,1.1,2.1 -a1 -a2 -j2 -t$'\t' \
<(sort -k2,2 -t$'\t' freq/forms.${lang}) \
<(sort -k2,2 -t$'\t' freq/lms.${lang})
# gawk doesn't care if it's given a regular file or the output fd of some process:
gawk -v dict=<(munge_dict) -f compound_translate.awk <in.txt
# prepend a header:
cat <(echo -e "${word}\t% ${lang}\tsum" | tr '[:lower:]' '[:upper:]') \
<(coverage ${lang})</code></pre>
Pipes are probably the original instantiation of dataflow processing (dating back to the 1960s). I gave a tech talk on some of the frameworks:
<a href="https://www.youtube.com/watch?v=3oaelUXh7sE" rel="nofollow">https://www.youtube.com/watch?v=3oaelUXh7sE</a><p>And my company creates a cool dataflow platform - <a href="https://composableanalytics.com" rel="nofollow">https://composableanalytics.com</a>
Vince Buffalo is the author of the best book on bioinformatics: Bioinformatics Data Skills (O'Reilly). It's worth a read for learning unix/bash-style data science of any flavour.<p>Even if you think you already know unix/bash and data, there are new and unexpected snippets every few pages that surprise you.
In zsh, =(cmd) will create a temporary file, <(cmd) will create a named pipe, and $(cmd) runs cmd in a subshell and substitutes its output (ordinary command substitution). There are also fancy options that use MULTIOS. For example:<p><pre><code> paste <(cut -f1 file1) <(cut -f3 file2) | tee >(process1) >(process2) >/dev/null
</code></pre>
can be re-written as:<p><pre><code> paste <(cut -f1 file1) <(cut -f3 file2) > >(process1) > >(process2)
</code></pre>
<a href="http://zsh.sourceforge.net/Doc/Release/Expansion.html#Process-Substitution" rel="nofollow">http://zsh.sourceforge.net/Doc/Release/Expansion.html#Proces...</a><p><a href="http://zsh.sourceforge.net/Doc/Release/Redirection.html#Redirection" rel="nofollow">http://zsh.sourceforge.net/Doc/Release/Redirection.html#Redi...</a>
If you like pipes, then you will love lazy evaluation. It is unfortunate, though, that Unix doesn't quite support it: a writer only blocks once the pipe buffer is full, not as soon as nobody is reading, so producers run eagerly rather than on demand.
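A minimal bash sketch of that "almost lazy" behaviour:<p><pre><code> # yes writes an infinite stream, yet the pipeline terminates:
 # backpressure (the full pipe buffer) throttles yes until head
 # exits after five lines, and the next write then kills yes with
 # SIGPIPE -- close to laziness, but never actually demand-driven.
 yes | head -n 5
</code></pre>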
AFAIK process substitution is a bash-ism (not part of the POSIX spec for /bin/sh, though ksh and zsh support it too). I recently had to fall back on the slightly less wieldy named pipes in a dash environment, and put the pipe setup, command execution and teardown in a script.
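For reference, a minimal sketch of that dance in plain sh (assuming mktemp is available; a.txt and b.txt are placeholder inputs):<p><pre><code> tmp=$(mktemp -u)        # pick an unused path for the FIFO
 mkfifo "$tmp"           # setup: create the named pipe
 sort a.txt > "$tmp" &   # feed it in the background
 diff "$tmp" b.txt       # consume it as if it were a file
 rm "$tmp"               # teardown
</code></pre>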
In fish shell the canonical example is this:<p><pre><code> diff (sort a.txt|psub) (sort b.txt|psub)
</code></pre>
The psub command performs the process substitution.
How does the >(...) process substitution differ from simply piping the output with | ?<p>For example (from Wikipedia):<p><pre><code> tee >(wc -l >&2) < bigfile | gzip > bigfile.gz
</code></pre>
vs<p><pre><code> tee < bigfile | wc -l | gzip > bigfile.gz
</code></pre>
Anybody know of a way to increase the buffer size of pipes? I've experienced cases where piping a really fast program into a slow one made both go slower, because the OS suspends the first program's writes whenever the pipe buffer is full. That seemed to ruin the first program's caching, so both ended up slower, even though pipes are normally faster since you're not touching disk.
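One workaround sketch, assuming the pv utility is installed (fast_producer and slow_consumer are hypothetical stand-ins): interpose a user-space buffer between the two. On Linux the kernel pipe capacity itself can also be grown with fcntl(F_SETPIPE_SZ).<p><pre><code> # pv -q stays quiet; -B 100m gives it a 100 MB internal buffer,
 # letting the fast writer run far ahead of the slow reader
 fast_producer | pv -qB 100m | slow_consumer
</code></pre>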
Is this guy a bioinformatician? I think he's a bioinformatician.<p>Can't be sure if he is a bioinformatician because he never really mentions that he is a bioinformatician.
moreutils [1] has some really cool programs for pipe handling.<p>pee: tee standard input to pipes
sponge: soak up standard input and write to a file
ts: timestamp standard input
vipe: insert a text editor into a pipe<p>[1] <a href="https://joeyh.name/code/moreutils/" rel="nofollow">https://joeyh.name/code/moreutils/</a>
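For a quick taste (filenames are hypothetical), sponge solves the classic "can't redirect a file onto itself" problem and ts is handy on long-running pipelines:<p><pre><code> # rewrite a file in place: sponge soaks up all input before writing
 sort -u names.txt | sponge names.txt
 # prefix each arriving line with a timestamp
 tail -f server.log | ts
</code></pre>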
Pipes are very cool and useful, but it's hard for me to understand this common <i>worship</i> of something like that. Yes, it's useful and elegant, but is it really the best thing since Jesus Christ?