ZFS: How could my file got permanently corrupted?












0














I'm trying to understand what could have gone wrong.



for context:

I have a mirrored set with 3 drives and non-ECC memory. Not sure what else to share.



I was under assumption that having a 3-way mirror would keep me relatively secure from incidental corruptions, like failing drives, or that memory corruption would be recoverable (from one of 2 remaining disks).



But I'm probably misunderstanding.

Can anyone explain what could have gone wrong so I can secure myself for the future? I'm not too experienced with zfs.



Thanks!



$ sudo zpool status -v
pool: dozer
state: ONLINE
status: One or more devices has experienced an error resulting in data
corruption. Applications may be affected.
action: Restore the file in question if possible. Otherwise restore the
entire pool from backup.
see: http://zfsonlinux.org/msg/ZFS-8000-8A
scan: scrub repaired 0 in 8h36m with 1 errors on Sun Jan 6 02:12:32 2019
config:

NAME STATE READ WRITE CKSUM
dozer ONLINE 0 0 1
mirror-0 ONLINE 0 0 2
ata-WDC_WD40EFRX-68N32N0_WD-WCC7K1ZKZLYK ONLINE 0 0 2
ata-WDC_WD40EFRX-68N32N0_WD-WCC7K6VCAZXL ONLINE 0 0 2
ata-ST4000DM000-1F2168_S301LW48 ONLINE 0 0 2

errors: Permanent errors have been detected in the following files:

/dozer/path/to/my/file









share|improve this question







New contributor




Bartek Chlebek is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.

























    0














    I'm trying to understand what could have gone wrong.



    for context:

    I have a mirrored set with 3 drives and non-ECC memory. Not sure what else to share.



    I was under assumption that having a 3-way mirror would keep me relatively secure from incidental corruptions, like failing drives, or that memory corruption would be recoverable (from one of 2 remaining disks).



    But I'm probably misunderstanding.

    Can anyone explain what could have gone wrong so I can secure myself for the future? I'm not too experienced with zfs.



    Thanks!



    $ sudo zpool status -v
    pool: dozer
    state: ONLINE
    status: One or more devices has experienced an error resulting in data
    corruption. Applications may be affected.
    action: Restore the file in question if possible. Otherwise restore the
    entire pool from backup.
    see: http://zfsonlinux.org/msg/ZFS-8000-8A
    scan: scrub repaired 0 in 8h36m with 1 errors on Sun Jan 6 02:12:32 2019
    config:

    NAME STATE READ WRITE CKSUM
    dozer ONLINE 0 0 1
    mirror-0 ONLINE 0 0 2
    ata-WDC_WD40EFRX-68N32N0_WD-WCC7K1ZKZLYK ONLINE 0 0 2
    ata-WDC_WD40EFRX-68N32N0_WD-WCC7K6VCAZXL ONLINE 0 0 2
    ata-ST4000DM000-1F2168_S301LW48 ONLINE 0 0 2

    errors: Permanent errors have been detected in the following files:

    /dozer/path/to/my/file









    share|improve this question







    New contributor




    Bartek Chlebek is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
    Check out our Code of Conduct.























      0












      0








      0







      I'm trying to understand what could have gone wrong.



      for context:

      I have a mirrored set with 3 drives and non-ECC memory. Not sure what else to share.



      I was under assumption that having a 3-way mirror would keep me relatively secure from incidental corruptions, like failing drives, or that memory corruption would be recoverable (from one of 2 remaining disks).



      But I'm probably misunderstanding.

      Can anyone explain what could have gone wrong so I can secure myself for the future? I'm not too experienced with zfs.



      Thanks!



      $ sudo zpool status -v
      pool: dozer
      state: ONLINE
      status: One or more devices has experienced an error resulting in data
      corruption. Applications may be affected.
      action: Restore the file in question if possible. Otherwise restore the
      entire pool from backup.
      see: http://zfsonlinux.org/msg/ZFS-8000-8A
      scan: scrub repaired 0 in 8h36m with 1 errors on Sun Jan 6 02:12:32 2019
      config:

      NAME STATE READ WRITE CKSUM
      dozer ONLINE 0 0 1
      mirror-0 ONLINE 0 0 2
      ata-WDC_WD40EFRX-68N32N0_WD-WCC7K1ZKZLYK ONLINE 0 0 2
      ata-WDC_WD40EFRX-68N32N0_WD-WCC7K6VCAZXL ONLINE 0 0 2
      ata-ST4000DM000-1F2168_S301LW48 ONLINE 0 0 2

      errors: Permanent errors have been detected in the following files:

      /dozer/path/to/my/file









      share|improve this question







      New contributor




      Bartek Chlebek is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.











      I'm trying to understand what could have gone wrong.



      for context:

      I have a mirrored set with 3 drives and non-ECC memory. Not sure what else to share.



      I was under assumption that having a 3-way mirror would keep me relatively secure from incidental corruptions, like failing drives, or that memory corruption would be recoverable (from one of 2 remaining disks).



      But I'm probably misunderstanding.

      Can anyone explain what could have gone wrong so I can secure myself for the future? I'm not too experienced with zfs.



      Thanks!



      $ sudo zpool status -v
      pool: dozer
      state: ONLINE
      status: One or more devices has experienced an error resulting in data
      corruption. Applications may be affected.
      action: Restore the file in question if possible. Otherwise restore the
      entire pool from backup.
      see: http://zfsonlinux.org/msg/ZFS-8000-8A
      scan: scrub repaired 0 in 8h36m with 1 errors on Sun Jan 6 02:12:32 2019
      config:

      NAME STATE READ WRITE CKSUM
      dozer ONLINE 0 0 1
      mirror-0 ONLINE 0 0 2
      ata-WDC_WD40EFRX-68N32N0_WD-WCC7K1ZKZLYK ONLINE 0 0 2
      ata-WDC_WD40EFRX-68N32N0_WD-WCC7K6VCAZXL ONLINE 0 0 2
      ata-ST4000DM000-1F2168_S301LW48 ONLINE 0 0 2

      errors: Permanent errors have been detected in the following files:

      /dozer/path/to/my/file






      ubuntu zfs






      share|improve this question







      New contributor




      Bartek Chlebek is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.











      share|improve this question







      New contributor




      Bartek Chlebek is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.









      share|improve this question




      share|improve this question






      New contributor




      Bartek Chlebek is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.









      asked 22 mins ago









      Bartek Chlebek

      1012




      1012




      New contributor




      Bartek Chlebek is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.





      New contributor





      Bartek Chlebek is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.






      Bartek Chlebek is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.






















          1 Answer
          1






          active

          oldest

          votes


















          0














          You do understand that if any single non-RAIDed drive has a failure, you lose data and the same is true if one of the non-ECC memory chips has a failure: you lose data.



          And even if you have:




          • dual processors

          • dual NICs

          • dual hard drives

          • ECC memory

          • High-availability fail-over servers in a different data centre on a different continent


          ...any kind of uncorrectable error (Bug, human, Electro-Magnetic Pulse, .. ) can still lead to data loss.



          And that's why, with all this nifty technology, we still have multiple off-line backups today.



          Note: On-line backups / Data replication to a secondary Data Centre can be corrupted too in this way






          share|improve this answer























            Your Answer








            StackExchange.ready(function() {
            var channelOptions = {
            tags: "".split(" "),
            id: "106"
            };
            initTagRenderer("".split(" "), "".split(" "), channelOptions);

            StackExchange.using("externalEditor", function() {
            // Have to fire editor after snippets, if snippets enabled
            if (StackExchange.settings.snippets.snippetsEnabled) {
            StackExchange.using("snippets", function() {
            createEditor();
            });
            }
            else {
            createEditor();
            }
            });

            function createEditor() {
            StackExchange.prepareEditor({
            heartbeatType: 'answer',
            autoActivateHeartbeat: false,
            convertImagesToLinks: false,
            noModals: true,
            showLowRepImageUploadWarning: true,
            reputationToPostImages: null,
            bindNavPrevention: true,
            postfix: "",
            imageUploader: {
            brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
            contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
            allowUrls: true
            },
            onDemand: true,
            discardSelector: ".discard-answer"
            ,immediatelyShowMarkdownHelp:true
            });


            }
            });






            Bartek Chlebek is a new contributor. Be nice, and check out our Code of Conduct.










            draft saved

            draft discarded


















            StackExchange.ready(
            function () {
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2funix.stackexchange.com%2fquestions%2f492802%2fzfs-how-could-my-file-got-permanently-corrupted%23new-answer', 'question_page');
            }
            );

            Post as a guest















            Required, but never shown

























            1 Answer
            1






            active

            oldest

            votes








            1 Answer
            1






            active

            oldest

            votes









            active

            oldest

            votes






            active

            oldest

            votes









            0














            You do understand that if any single non-RAIDed drive has a failure, you lose data and the same is true if one of the non-ECC memory chips has a failure: you lose data.



            And even if you have:




            • dual processors

            • dual NICs

            • dual hard drives

            • ECC memory

            • High-availability fail-over servers in a different data centre on a different continent


            ...any kind of uncorrectable error (Bug, human, Electro-Magnetic Pulse, .. ) can still lead to data loss.



            And that's why, with all this nifty technology, we still have multiple off-line backups today.



            Note: On-line backups / Data replication to a secondary Data Centre can be corrupted too in this way






            share|improve this answer




























              0














              You do understand that if any single non-RAIDed drive has a failure, you lose data and the same is true if one of the non-ECC memory chips has a failure: you lose data.



              And even if you have:




              • dual processors

              • dual NICs

              • dual hard drives

              • ECC memory

              • High-availability fail-over servers in a different data centre on a different continent


              ...any kind of uncorrectable error (Bug, human, Electro-Magnetic Pulse, .. ) can still lead to data loss.



              And that's why, with all this nifty technology, we still have multiple off-line backups today.



              Note: On-line backups / Data replication to a secondary Data Centre can be corrupted too in this way






              share|improve this answer


























                0












                0








                0






                You do understand that if any single non-RAIDed drive has a failure, you lose data and the same is true if one of the non-ECC memory chips has a failure: you lose data.



                And even if you have:




                • dual processors

                • dual NICs

                • dual hard drives

                • ECC memory

                • High-availability fail-over servers in a different data centre on a different continent


                ...any kind of uncorrectable error (Bug, human, Electro-Magnetic Pulse, .. ) can still lead to data loss.



                And that's why, with all this nifty technology, we still have multiple off-line backups today.



                Note: On-line backups / Data replication to a secondary Data Centre can be corrupted too in this way






                share|improve this answer














                You do understand that if any single non-RAIDed drive has a failure, you lose data and the same is true if one of the non-ECC memory chips has a failure: you lose data.



                And even if you have:




                • dual processors

                • dual NICs

                • dual hard drives

                • ECC memory

                • High-availability fail-over servers in a different data centre on a different continent


                ...any kind of uncorrectable error (Bug, human, Electro-Magnetic Pulse, .. ) can still lead to data loss.



                And that's why, with all this nifty technology, we still have multiple off-line backups today.



                Note: On-line backups / Data replication to a secondary Data Centre can be corrupted too in this way







                share|improve this answer














                share|improve this answer



                share|improve this answer








                edited 4 mins ago

























                answered 10 mins ago









                Fabby

                3,68511228




                3,68511228






















                    Bartek Chlebek is a new contributor. Be nice, and check out our Code of Conduct.










                    draft saved

                    draft discarded


















                    Bartek Chlebek is a new contributor. Be nice, and check out our Code of Conduct.













                    Bartek Chlebek is a new contributor. Be nice, and check out our Code of Conduct.












                    Bartek Chlebek is a new contributor. Be nice, and check out our Code of Conduct.
















                    Thanks for contributing an answer to Unix & Linux Stack Exchange!


                    • Please be sure to answer the question. Provide details and share your research!

                    But avoid



                    • Asking for help, clarification, or responding to other answers.

                    • Making statements based on opinion; back them up with references or personal experience.


                    To learn more, see our tips on writing great answers.





                    Some of your past answers have not been well-received, and you're in danger of being blocked from answering.


                    Please pay close attention to the following guidance:


                    • Please be sure to answer the question. Provide details and share your research!

                    But avoid



                    • Asking for help, clarification, or responding to other answers.

                    • Making statements based on opinion; back them up with references or personal experience.


                    To learn more, see our tips on writing great answers.




                    draft saved


                    draft discarded














                    StackExchange.ready(
                    function () {
                    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2funix.stackexchange.com%2fquestions%2f492802%2fzfs-how-could-my-file-got-permanently-corrupted%23new-answer', 'question_page');
                    }
                    );

                    Post as a guest















                    Required, but never shown





















































                    Required, but never shown














                    Required, but never shown












                    Required, but never shown







                    Required, but never shown

































                    Required, but never shown














                    Required, but never shown












                    Required, but never shown







                    Required, but never shown







                    Popular posts from this blog

                    CARDNET

                    Boot-repair Failure: Unable to locate package grub-common:i386

                    濃尾地震