From: Rich Freeman <rich0@gentoo.org>
To: gentoo-user@lists.gentoo.org
Subject: Re: [gentoo-user] OT: btrfs raid 5/6
Date: Sat, 9 Dec 2017 18:36:45 -0500 [thread overview]
Message-ID: <CAGfcS_nAaDVkT6v=_3cmhi_o90UFxF_YXg3O6GEDwLT-7QGozg@mail.gmail.com> (raw)
In-Reply-To: <5A2C2B44.6060802@youngman.org.uk>
On Sat, Dec 9, 2017 at 1:28 PM, Wols Lists <antlists@youngman.org.uk> wrote:
> On 09/12/17 16:58, J. Roeleveld wrote:
>> On Friday, December 8, 2017 12:48:45 AM CET Wols Lists wrote:
>>> On 07/12/17 22:35, Frank Steinmetzger wrote:
>>>>> (Oh - and md raid-5/6 also mix data and parity, so the same holds true
>>>>>
>>>>>> there.)
>>>>
>>>> Ok, wasn’t aware of that. I thought I read in a ZFS article that this were
>>>> a special thing.
>>>
>>> Say you've got a four-drive raid-6, it'll be something like
>>>
>>> data1 data2 parity1 parity2
>>> data3 parity3 parity4 data4
>>> parity5 parity6 data5 data6
>>>
>>> The only thing to watch out for (and zfs is likely the same) if a file
>>> fits inside a single chunk it will be recoverable from a single drive.
>>> And I think chunks can be anything up to 64MB.
>>
>> Except that ZFS doesn't have fixed on-disk-chunk-sizes. (especially if you use
>> compression)
>>
>> See:
>> https://www.delphix.com/blog/delphix-engineering/zfs-raidz-stripe-width-or-how-i-learned-stop-worrying-and-love-raidz
>>
> Which explains nothing, sorry ... :-(
>
> It goes on about 4K or 8K database blocks (and I'm talking about 64 MEG
> chunk sizes). And the OP was talking about files being recoverable from
> a disk that was removed from an array. Are you telling me that a *small*
> file has bits of it scattered across multiple drives? That would be *crazy*.
I'm not sure why it would be "crazy." Granted, most parity RAID
systems seem to operate just as you describe, but I don't see why with
Reed Solomon you couldn't store ONLY parity data on all the drives.
All that matters is that you generate enough to recover the data - the
original data contains no more information than an equivalent number
of Reed-Solomon sets. Of course, with the original data I imagine you
need to do less computation assuming you aren't bothering to check its
integrity against the parity data.
In case my point is clear a RAID would work perfectly fine if you had
5 drives with the capacity to store 4 drives wort of data, but instead
of storing the original data across 4 drives and having 1 of parity,
you instead compute 5 sets of parity so that now you have 9 sets of
data that can tolerate the loss of any 5, then throw away the sets
containing the original 4 sets of data and store the remaining 5 sets
of parity data across the 5 drives. You can still tolerate the loss
of one more set, but all 4 of the original sets of data have been
tossed already.
--
Rich
next prev parent reply other threads:[~2017-12-09 23:36 UTC|newest]
Thread overview: 35+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-11-27 22:30 [gentoo-user] OT: btrfs raid 5/6 Bill Kenworthy
2017-12-01 15:59 ` J. Roeleveld
2017-12-01 16:58 ` Wols Lists
2017-12-01 17:14 ` Rich Freeman
2017-12-01 17:24 ` Wols Lists
2017-12-06 23:28 ` Frank Steinmetzger
2017-12-06 23:35 ` Rich Freeman
2017-12-07 0:13 ` Frank Steinmetzger
2017-12-07 0:29 ` Rich Freeman
2017-12-07 21:37 ` Frank Steinmetzger
2017-12-07 21:49 ` Wols Lists
2017-12-07 22:35 ` Frank Steinmetzger
2017-12-07 23:48 ` Wols Lists
2017-12-09 16:58 ` J. Roeleveld
2017-12-09 18:28 ` Wols Lists
2017-12-09 23:36 ` Rich Freeman [this message]
2017-12-10 9:45 ` Wols Lists
2017-12-10 15:07 ` Rich Freeman
2017-12-10 21:00 ` Wols Lists
2017-12-11 1:33 ` Rich Freeman
2017-12-11 23:20 ` Frank Steinmetzger
2017-12-12 10:15 ` Neil Bothwick
2017-12-12 12:18 ` Wols Lists
2017-12-12 13:24 ` Neil Bothwick
2017-12-07 7:54 ` Richard Bradfield
2017-12-07 9:28 ` Frank Steinmetzger
2017-12-07 9:52 ` Richard Bradfield
2017-12-07 14:53 ` Frank Steinmetzger
2017-12-07 15:26 ` Rich Freeman
2017-12-07 16:04 ` Frank Steinmetzger
2017-12-07 23:09 ` Rich Freeman
2017-12-07 20:02 ` Wols Lists
2017-12-07 18:35 ` Wols Lists
2017-12-07 20:17 ` Richard Bradfield
2017-12-07 20:39 ` Wols Lists
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to='CAGfcS_nAaDVkT6v=_3cmhi_o90UFxF_YXg3O6GEDwLT-7QGozg@mail.gmail.com' \
--to=rich0@gentoo.org \
--cc=gentoo-user@lists.gentoo.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox