linuxdd

Why using conv=notrunc when cloning a disk with dd?


If you look up how to clone an entire disk to another one on the web, you will find something like that:

dd if=/dev/sda of=/dev/sdb conv=notrunc,noerror

While I understand the noerror, I am getting a hard time understanding why people think that notrunc is required for "data integrity" (as ArchLinux's Wiki states, for instance).

Indeed, I do agree on that if you are copying a partition to another partition on another disk, and you do not want to overwrite the entire disk, just one partition. In thise case notrunc, according to dd's manual page, is what you want.

But if you're cloning an entire disk, what does notrunc change for you? Just time optimization?


Solution

  • TL;DR version:

    notrunc is only important to prevent truncation when writing into a file. This has no effect on a block device such as sda or sdb.

    Educational version

    I looked into the coreutils source code which contains dd.c to see how notrunc is processed.

    Here's the segment of code that I'm looking at:

    int opts = (output_flags
                | (conversions_mask & C_NOCREAT ? 0 : O_CREAT)
                | (conversions_mask & C_EXCL ? O_EXCL : 0)
                | (seek_records || (conversions_mask & C_NOTRUNC) ? 0 : O_TRUNC));
    
    /* Open the output file with *read* access only if we might
    need to read to satisfy a `seek=' request.  If we can't read
    the file, go ahead with write-only access; it might work.  */
    if ((! seek_records
        || fd_reopen (STDOUT_FILENO, output_file, O_RDWR | opts, perms) < 0)
        && (fd_reopen (STDOUT_FILENO, output_file, O_WRONLY | opts, perms) < 0))
            error (EXIT_FAILURE, errno, _("opening %s"), quote (output_file));
    

    We can see here that if notrunc is not specified, then the output file will be opened with O_TRUNC. Looking below at how O_TRUNC is treated, we can see that a normal file will get truncated if written into.

    O_TRUNC

    If the file already exists and is a regular file and the open mode allows writing (i.e., is O_RDWR or O_WRONLY) it will be truncated to length 0. If the file is a FIFO or terminal device file, the O_TRUNC flag is ignored. Otherwise the effect of O_TRUNC is unspecified.

    Effects of notrunc / O_TRUNC I

    In the following example, we start out by creating junk.txt of size 1024 bytes. Next, we write 512 bytes to the beginning of it with conv=notrunc. We can see that the size stays the same at 1024 bytes. Finally, we try it without the notrunc option and we can see that the new file size is 512. This is because it was opened with O_TRUNC.

    $ dd if=/dev/urandom of=junk.txt bs=1024 count=1
    $ ls -l junk.txt
    -rw-rw-r-- 1 akyserr akyserr 1024 Dec 11 17:08 junk.txt
    
    $ dd if=/dev/urandom of=junk.txt bs=512 count=1 conv=notrunc
    $ ls -l junk.txt
    -rw-rw-r-- 1 akyserr akyserr 1024 Dec 11 17:10 junk.txt
    
    $ dd if=/dev/urandom of=junk.txt bs=512 count=1
    $ ls -l junk.txt
    -rw-rw-r-- 1 akyserr akyserr 512 Dec 11 17:10 junk.txt
    

    Effects of notrunc / O_TRUNC II

    I still haven't answered your original question of why when doing a disk-to-disk clone, why conv=notrunc is important. According to the above definition, O_TRUNC seems to be ignored when opening certain special files, and I would expect this to be true for block device nodes too. However, I don't want to assume anything and will attempt to prove it here.

    openclose.c

    I've written a simple C program here which opens and closes a file given as an argument with the O_TRUNC flag.

    #include <stdio.h>
    #include <sys/stat.h>
    #include <fcntl.h>
    #include <time.h>
    
    int main(int argc, char * argv[])
    {
        if (argc < 2)
        {
            fprintf(stderr, "Not enough arguments...\n");
            return (1);
        }
    
        int f = open(argv[1], O_RDWR | O_TRUNC);
    
        if (f >= 0)
        {
            fprintf(stderr, "%s was opened\n", argv[1]);
            close(f);
            fprintf(stderr, "%s was closed\n", argv[1]);
        } else {
            perror("Opening device node");
        }
    
        return (0);
    }
    

    Normal File Test

    We can see below that the simple act of opening and closing a file with O_TRUNC will cause it to lose anything that was already there.

    $ dd if=/dev/urandom of=junk.txt bs=1024 count=1^C
    $ ls -l junk.txt 
    -rw-rw-r-- 1 akyserr akyserr 1024 Dec 11 17:26 junk.txt
    
    $ ./openclose junk.txt
    junk.txt was opened
    junk.txt was closed
    
    $ ls -l junk.txt
    -rw-rw-r-- 1 akyserr akyserr 0 Dec 11 17:27 junk.txt
    

    Block Device File Test

    Let's try a similar test on a USB flash drive. We can see that we start out with a single partition on the USB flash drive. If it get's 'truncated', perhaps the partition will go away (considering it's defined in the first 512 bytes of the disk)?

    $ ls -l /dev/sdc*
    brw-rw---- 1 root disk 8, 32 Dec 11 17:22 /dev/sdc
    brw-rw---- 1 root disk 8, 33 Dec 11 17:22 /dev/sdc1
    
    $ sudo ./openclose /dev/sdc
    /dev/sdc was opened
    /dev/sdc was closed
    
    $ sudo ./openclose /dev/sdc1
    /dev/sdc1 was opened
    /dev/sdc1 was closed
    
    $ ls -l /dev/sdc*
    brw-rw---- 1 root disk 8, 32 Dec 11 17:31 /dev/sdc
    brw-rw---- 1 root disk 8, 33 Dec 11 17:31 /dev/sdc1
    

    It looks like it has no effect whatsoever to open either the disk or the disk's partition 1 with the O_TRUNC option. From what I can tell, the filesystem is still mountable and the files are accessible and intact.

    Effects of notrunc / O_TRUNC III

    Okay, for my final test I will use dd on my flash drive directly. I will start by writing 512 bytes of random data, then writing 256 bytes of zeros at the beginning. For the final test, we will verify that the last 256 bytes remained unchanged.

    $ sudo dd if=/dev/urandom of=/dev/sdc bs=256 count=2
    $ sudo hexdump -n 512 /dev/sdc
    0000000 3fb6 d17f 8824 a24d 40a5 2db3 2319 ac5b
    0000010 c659 5780 2d04 3c4e f985 053c 4b3d 3eba
    0000020 0be9 8105 cec4 d6fb 5825 a8e5 ec58 a38e
    0000030 d736 3d47 d8d3 9067 8db8 25fb 44da af0f
    0000040 add7 c0f2 fc11 d734 8e26 00c6 cfbb b725
    0000050 8ff7 3e79 af97 2676 b9af 1c0d fc34 5eb1
    0000060 6ede 318c 6f9f 1fea d200 39fe 4591 2ffb
    0000070 0464 9637 ccc5 dfcc 3b0f 5432 cdc3 5d3c
    0000080 01a9 7408 a10a c3c4 caba 270c 60d0 d2f7
    0000090 2f8d a402 f91a a261 587b 5609 1260 a2fc
    00000a0 4205 0076 f08b b41b 4738 aa12 8008 053f
    00000b0 26f0 2e08 865e 0e6a c87e fc1c 7ef6 94c6
    00000c0 9ced 37cf b2e7 e7ef 1f26 0872 cd72 54a4
    00000d0 3e56 e0e1 bd88 f85b 9002 c269 bfaa 64f7
    00000e0 08b9 5957 aad6 a76c 5e37 7e8a f5fc d066
    00000f0 8f51 e0a1 2d69 0a8e 08a9 0ecf cee5 880c
    0000100 3835 ef79 0998 323d 3d4f d76b 8434 6f20
    0000110 534c a847 e1e2 778c 776b 19d4 c5f1 28ab
    0000120 a7dc 75ea 8a8b 032a c9d4 fa08 268f 95e8
    0000130 7ff3 3cd7 0c12 4943 fd23 33f9 fe5a 98d9
    0000140 aa6d 3d89 c8b4 abec 187f 5985 8e0f 58d1
    0000150 8439 b539 9a45 1c13 68c2 a43c 48d2 3d1e
    0000160 02ec 24a5 e016 4c2d 27be 23ee 8eee 958e
    0000170 dd48 b5a1 10f1 bf8e 1391 9355 1b61 6ffa
    0000180 fd37 7718 aa80 20ff 6634 9213 0be1 f85e
    0000190 a77f 4238 e04d 9b64 d231 aee8 90b6 5c7f
    00001a0 5088 2a3e 0201 7108 8623 b98a e962 0860
    00001b0 c0eb 21b7 53c6 31de f042 ac80 20ee 94dd
    00001c0 b86c f50d 55bc 32db 9920 fd74 a21e 911a
    00001d0 f7db 82c2 4d16 3786 3e18 2c0f 47c2 ebb0
    00001e0 75af 6a8c 2e80 c5b6 e4ea a9bc a494 7d47
    00001f0 f493 8b58 0765 44c5 ff01 42a3 b153 d395
    
    $ sudo dd if=/dev/zero of=/dev/sdc bs=256 count=1
    $ sudo hexdump -n 512 /dev/sdc
    0000000 0000 0000 0000 0000 0000 0000 0000 0000
    *
    0000100 3835 ef79 0998 323d 3d4f d76b 8434 6f20
    0000110 534c a847 e1e2 778c 776b 19d4 c5f1 28ab
    0000120 a7dc 75ea 8a8b 032a c9d4 fa08 268f 95e8
    0000130 7ff3 3cd7 0c12 4943 fd23 33f9 fe5a 98d9
    0000140 aa6d 3d89 c8b4 abec 187f 5985 8e0f 58d1
    0000150 8439 b539 9a45 1c13 68c2 a43c 48d2 3d1e
    0000160 02ec 24a5 e016 4c2d 27be 23ee 8eee 958e
    0000170 dd48 b5a1 10f1 bf8e 1391 9355 1b61 6ffa
    0000180 fd37 7718 aa80 20ff 6634 9213 0be1 f85e
    0000190 a77f 4238 e04d 9b64 d231 aee8 90b6 5c7f
    00001a0 5088 2a3e 0201 7108 8623 b98a e962 0860
    00001b0 c0eb 21b7 53c6 31de f042 ac80 20ee 94dd
    00001c0 b86c f50d 55bc 32db 9920 fd74 a21e 911a
    00001d0 f7db 82c2 4d16 3786 3e18 2c0f 47c2 ebb0
    00001e0 75af 6a8c 2e80 c5b6 e4ea a9bc a494 7d47
    00001f0 f493 8b58 0765 44c5 ff01 42a3 b153 d395
    

    Summary

    Throughout the experiment it seems that notrunc is only important when you want to write to a regular file, but don't want to truncate it. It seems to have no effect on a block device such as sda or sdb.