The Cult of DD

269 pointsby eklitzkeabout 8 years ago

38 comments

cat199about 8 years ago

"This is a strange program of obscure provenance that somehow, still manages to survive in the 21st century."-> links to wikipedia page with direct discription of lineage back to 5th ed research unix"That weird bs=4M argument in the dd version isn’t actually doing anything special—all it’s doing is instructing the dd command to use a 4 MB buffer size while copying. But who cares? Why not just let the command figure out the right buffer size automatically?"Um -a) it is 'doing the special thing' of changing the block size (not buffer size)b) Because the command probably doesn't figure out the right size automatically, much like your 'cat' example above which also doesn'tc) And this can mean massive performance differences between invocations> Another reason to prefer the cat variant is that it lets you actually string together a normal shell pipeline. For instance, if you want progress information with cat you can combine it with the pv commandUmm:<pre><code> dd if=file bs=some-optimal-block-size | rest-of-pipeline </code></pre> that was hard.>If you want to create a file of a certain size, you can do so using other standard programs like head. For instance, here are two ways to create a 100 MB file containing all zeroes:<pre><code> $ uname -sr OpenBSD 6.0 $ head -c 10MB /dev/zero head: unknown option -- c usage: head [-count | -n count] [file ...] </code></pre> well.. guess that wasn't so 'standard' after all.. I must be using some nonstandard version...<pre><code> $ man head |sed -ne 47,51p HISTORY The head utility first appeared in 1BSD. AUTHORS Bill Joy, August 24, 1977. $ sed -ne 4p /usr/src/usr.bin/head/head.c * Copyright (c) 1980, 1987 Regents of the University of California. </code></pre> Hmm..> So if you find yourself doing that a lot, I won’t blame you for reaching for dd. But otherwise, try to stick to more standard Unix tools.Like 'pv'?edit: added formatting, sector size note, head manpage/head.c stuffs.. apologies.

评论 #13900618 未加载

评论 #13900349 未加载

评论 #13900484 未加载

评论 #13900274 未加载

viraptorabout 8 years ago

There's one good (?) reason to use dd with devices: it specifies target in the same command. For devices, writing to them usually requires root privileges, so it's easy to:<pre><code> sudo dd .... of=/dev/... </code></pre> But there's no trivial cat equivalent:<pre><code> sudo cat ... > target </code></pre> Will open target as your current user anyway. You can play around with tee and redirection of course. But that's getting more complicated than the original.

评论 #13899452 未加载

评论 #13899490 未加载

评论 #13899230 未加载

评论 #13900033 未加载

评论 #13908136 未加载

评论 #13900362 未加载

colemannugentabout 8 years ago

One thing I'll often use dd for is recovering data from a failing drive. Can head ignore read errors? dd can.As far as I'm concerned, dd is lower-level than most of the other utilities and provides more control over what's happening.The author does have a point that the syntax is strange though.

评论 #13899232 未加载

评论 #13898918 未加载

评论 #13899003 未加载

评论 #13993719 未加载

评论 #13900777 未加载

wwalexanderabout 8 years ago

This article is full of Useless Uses of Cat[1] that could just use redirection operators. For instance,<pre><code> cat image.iso | pv >/dev/sdb </code></pre> could be rewritten as<pre><code> pv < image.iso > /dev/sdb </code></pre> A related mistake is the Useless Use of Echo, since any command of the form<pre><code> echo "foo" | bar </code></pre> can be written using here strings as<pre><code> bar <<< "foo" </code></pre> or even<pre><code> bar <<WORD foo WORD </code></pre> [1] <a href="http://porkmail.org/era/unix/award.html" rel="nofollow">http://porkmail.org/era/unix/award.html</a>

评论 #13901056 未加载

评论 #13900997 未加载

评论 #13901001 未加载

评论 #13900984 未加载

hvsabout 8 years ago

For those of you that are blissfully unaware of what the JCL DD command looks like, here's a example (with only the DD section of the JCL shown):<pre><code> //SYSPRINT DD SYSOUT=* //SYSLIN DD DSN=&&OBJAPBND, // DISP=(NEW,PASS),SPACE=(TRK,(3,3)), // DCB=(RECFM=FB,LRECL=80,BLKSIZE=3200), // UNIT=&SAMPUNIT //SYSLIB DD DSN=SYS1.MACLIB,DISP=SHR //SYSIN DD DSN=&SAMPLIB(IEWAPBND),DISP=SHR</code></pre>

评论 #13900269 未加载

tambourine_manabout 8 years ago

But who cares? Why not just let the command figure out the right buffer size automatically?Because it can be a lot slower. dd is low level, hence powerful and dangerous.And, if we are going down that rabbit hole, you don't need cat[1]“The purpose of cat is to concatenate (or "catenate") files. If it's only one file, concatenating it with nothing at all is a waste of time, and costs you a process.”[1]<a href="http://porkmail.org/era/unix/award.html#cat" rel="nofollow">http://porkmail.org/era/unix/award.html#cat</a>

评论 #13899546 未加载

gensabout 8 years ago

The Ignorance Of Err Ignorant Peopledd is a tool. dd can do a lot more then cat. dd can count, seek, skip (seek/drop input), and do basic-ish data conversion. dd is standard, even more standard then cat (the GNU breed). I even used it to flip a byte in a binary, a couple of times.New-ish gnu dd even adds a nice progress display option (standard is sending it sigusr1, since dd is made to be scripted where only the exit code matters).> Actually, using dd is almost never necessary, and due to its highly nonstandard syntax is usually just an easy way to mess things up.Personally I never messed it up, nor was confused about it. This sentence also sets the tone of the whole article, a rather subjective tone that is.edit: Some dd usage examples: <a href="http://www.linuxquestions.org/questions/linux-newbie-8/learn-the-dd-command-362506/" rel="nofollow">http://www.linuxquestions.org/questions/linux-newbie-8/learn...</a>

评论 #13903391 未加载

electrumabout 8 years ago

Don't cat a file and pipe it into pv. Use "pv file" as a replacement for "cat file" and it will show you the progress as a percentage. When it's in the middle of a pipeline, it doesn't know the total size (unless you tell it with -s), so it can only show the throughput.

评论 #13899924 未加载

gunnihinnabout 8 years ago

A counterpoint: dd survives not because it's good or makes sense, but explicitly because it doesn't.You wanna format a usb key? Google this, copy/paste these dd instructions, it works, move on with your life.You wanna format a usb key using something related to cat you once saw and didn't fully understand? Have fun.Both approaches have their weak points, but in any OS the answer to "How do I format a usb key" should not start with "Oh boy, let's have a Socratic dialog over 10 years on how to do that."

评论 #13899187 未加载

评论 #13900696 未加载

knz42about 8 years ago

What about the `seek` argument which skips over some blocks at the beginning but still allocates them (unix "holes")?Also note that there are still unix systems out there which do not support byte-level granularity of access to block devices. On those devices you must actually use a buffer of exactly the size of the blocks on the device. Heck, linux was like this until at least v2.

评论 #13898786 未加载

chrisfosterelliabout 8 years ago

I think dd is primarily so popular because it is used in mostly dangerous operations. Sure, using cat makes logicial sense, but if we are talking about writing directly to disk devices here I'll trust the command I read from the manual and not explore commands I think would work.dd's "highly nonstandard syntax" comes from the JCL programming language, but it's really just another tool to read and write files. At the end of the day it's not more complex or incompatible than other unix tools. For example, you can also use tools like `pv` with dd no problem to get progress statements.

评论 #13901227 未加载

评论 #13900011 未加载

donaldihunterabout 8 years ago

Cult of pv. It looks to have more command-line complexity than dd. <a href="https://linux.die.net/man/1/pv" rel="nofollow">https://linux.die.net/man/1/pv</a>

评论 #13899163 未加载

angry_octetabout 8 years ago

This is a great example of why downvoting submissions should be a thing. Or at least showing the up/down tuple. I would say every upvote represents someone misled and likely to further propagate this nonsense.

评论 #13900291 未加载

merlincoreyabout 8 years ago

The author doesn't even give correct invocations of dd (on BSD, at least, for their last example with head).I certainly agree the syntax of the arguments is strange, due to its age, but I don't agree that learning it is difficult or a waste of time.All I've learned is that the author doesn't like dd well enough to learn it.

betabyabout 8 years ago

Author is wrong bs IS useful, try to dd one hard drive to another without reasonable bs (1-8M) with and without and you will see a difference.

评论 #13899174 未加载

评论 #13900538 未加载

评论 #13899385 未加载

snickerbockersabout 8 years ago

OP, your alternatives to DD are more complicated, not less complicated. I shouldn't need to pipeline two commands together just to cut off the first 100MB of a file.

评论 #13906133 未加载

ocschwarabout 8 years ago

Dude's missing an important point:If you mess up the syntax on a dd invocation, a nice thing happens: nothing.Use a shell command and pipes, and your command better be perfect before you hit return.

评论 #13899957 未加载

评论 #13899569 未加载

sndeanabout 8 years ago

Somewhat related short story: Earlier this week my friend said that he dd'd away just over 50 bitcoins, back when they were worth ~$3 each."One of the biggest regrets of my life."

评论 #13899096 未加载

AdamJacobMullerabout 8 years ago

I'll point out that dd also allows you to control lots of other filesystem and OS-related things that other tools do not. See: fsync/fdatasync. I'm not aware of any shell tools that allow you to write data like that.

gravypodabout 8 years ago

An even easier solution: don't make people fall into the command line to format a USB reliably.The command line should be reserved for times where you need the fine grain control to do something that DD is meant to do. A GUI should implement everything else in a reliable way that doesn't break half the time or crash on unexprected input.

评论 #13899465 未加载

评论 #13899562 未加载

评论 #13899438 未加载

评论 #13899514 未加载

评论 #13899733 未加载

评论 #13993754 未加载

kev009about 8 years ago

Ignorance on the blocksize arg.Also, I only need to remember one progress command for my entire operating system: control+t. I also get a kernel wait channel from that which is phenomenally pertinent to rapidly understanding and diagnosing what the heck a command is doing or why it is stuck.I hate what Linux has done to systems software culture.

emmelaichabout 8 years ago

Specifiying a large block size used to help a LOT with performance. From memory shell redirection used a tiny blocksize. On Solaris at least.And if you use dd then you probably should specify a bigger block size than the default of 512 bytes.But yeah, most usage is obsolete.

评论 #13899274 未加载

gabrielblackabout 8 years ago

I think this article is full of "alternative computer science" and reminds me other article, published here as well, about the obsolescence of Unix. The only good thing is this discussion thread.

ori_babout 8 years ago

To be fair, dd was mostly a toungue in cheek reference to the overly baroque JCL command for IBM mainframes.

jsd1982about 8 years ago

Interesting assertion. Can you show me a shell invocation without using dd that cuts off the first 16 bytes of a binary file, for example? This is a common reason I use dd.

评论 #13898508 未加载

tardo99about 8 years ago

One of the charms of dd is its hilarious syntax. And, used properly, it's a bit of a swiss army knife for a few different disk operations.

noir_lordabout 8 years ago

not sure status=progress is that obscure a command, it was added relatively recently as well (in terms of dd).

评论 #13900510 未加载

kazinatorabout 8 years ago

dd precisely controls the sizes of read, write and lseek system calls. This doesn't matter on buffered block devices; there is no "reblocking" benefit.Some kinds of devices are structured such that each write produces a discrete block, with a maximum size (such that any bytes in excess are discarded) and each read reads only from one block, advancing to the next one (such that any unread bytes in the current block due to the buffer being too small are discarded). This is very reminiscent of datagram sockets in the IPC/networking arena. dd was developed as an invaluable tool for "reblocking" data for these kinds of devices.One point that the blog author doesn't realize (or neglects to comment upon) is that "head -c 100MB" relies on an extension, whereas "dd if=/dev/zero of=image.iso bs=4MB count=25" is ... almost POSIX: there is no MB suffix documented by POSIX, only "b" and "k" (lower case). The operator "x" is in POSIX: bs=4x1024x1024.Here is a non-useless use of dd to request exactly one byte of input from a TTY in raw mode:file:///usr/share/doc/bash-doc/examples/scripts/line-input.bashWrote that myself, back in 1996; was surprised years later to find it in the Bash distribution.

paulddraperabout 8 years ago

My most common use of dd is warming up AWS EBS volumes. <a href="http://docs.aws.amazon.com/AWSEC2/latest/UserGuide/ebs-initialize.html" rel="nofollow">http://docs.aws.amazon.com/AWSEC2/latest/UserGuide/ebs-initi...</a>Though fio is better because it can work in parallel.

diegorbaqueroabout 8 years ago

Question: Will cat do a bit-to-bit copy between disks?

dom0about 8 years ago

dd is for handling blocked data, while cat, redirection and pipelines are completely useless for that, since they are not meant to manipulate blocks of data, but streams. They do not compare (apart from really simple cases where either will do, like copying a file into some other file); this blog posts mainly highlights that neither the author nor many tutorial writers now the difference.

nwah1about 8 years ago

Someone should write a wiki bot to crawl through the wikis for Arch, Debian, and so forth to help rewrite all these bad instructions.

评论 #13898630 未加载

评论 #13898482 未加载

rurbanabout 8 years ago

Instead of<pre><code> cat image.iso | pv >/dev/sdb </code></pre> just do<pre><code> pv image.iso >/dev/sdb</code></pre>

bigbugbagabout 8 years ago

A self submitted opinion blog post pretty much entirely wrong ending up on HN front page. What gives ?

gbinabout 8 years ago

Instead of `cat file | pv > dev` why not `pv file > dev` ?

jeffdavisabout 8 years ago

What about writing a block into the middle of a file?

number6about 8 years ago

This is cat abuse

badatusernamesabout 8 years ago

TLDR This has nothing to do with dunkin donuts