[plug] 'best fit' data archiving tools

Richard Meyer meyerri at westnet.com.au
Mon Sep 19 20:03:15 WST 2005


On Mon, 2005-09-19 at 19:16 +0800, Craig Ringer wrote:
> On Mon, 2005-09-19 at 14:54 +0800, Ryan wrote:
> > I'm not sure how to name what I want  ...
> > 
> > I want to break a large directory of data (600GB+) into smaller chunks
> > of a specified size limit (~16GB in this case).  In this particular case
> > the finest granularity will be a 2nd level directory, everything below
> > that must stay together with it's 2nd level parent, so essentially a
> > 'file' from a size perspective is the contents of each 2nd level
> > directory.
> 
> I spent some time looking for one a while ago, with nearly identical
> needs to you. I wasn't successful. In my case, rather than buying one, I
> solved it by getting some giant hard disks, sharing out the directory
> across them so that I'd have two complete copies of the data, and then
> storing them off-site.
> 
> This may not suit your needs.
> 
> Writing a tool to do this is significantly complicated by compression.
> If you're compressing, you don't know what each directory will compress
> down to until you've finished archiving it. If the compression is
> archive wide rather than per-directory it's even worse, since each
> directory will compress differently depending on what else is compressed
> with it.

You have just described a perfect use for DAR at
http://dar.linux.free.fr/

Quote form site:
>         Slices
>                 Dar stands for Disk ARchive. From the beginning, it
>                 was designed to be able to split an archive over
>                 several pieces of removable media -- no matter how
>                 many or what size. Thus, dar is able to save over
>                 floppy disks, CD-Rs, DVD-Rs, CD-RWs, DVD-RWs, Zip
>                 disks, Jazz disks, etc. Dar is not concerned by
>                 un/mounting a removable medium; instead it operates
>                 independently of the hardware. Given the size, it will
>                 split the archive in several files (called slices),
>                 pausing before creating each new slice. This allows
>                 the user to un/mount a medium, burn the slice to a
>                 CD-R, send it by email (if your mail system does not
>                 allow huge file in emails, dar can help you here
>                 also). By default, (no size specified), dar will make
>                 only one slice. If a slice size is specified and dar
>                 creates multiple slices, the size of the first slice
>                 can be specified separately. This is useful if, for
>                 example, you want to fill up a partially filled disk
>                 before starting use of an empty one. At restoration
>                 time, dar will look for the slices it needs, asking
>                 for a slice only if it is missing and required.

-- 
Richard Meyer <meyerri at westnet.com.au>
Cutting the space budget really restores my faith in humanity.  It
eliminates dreams, goals, and ideals and lets us get straight to the
business of hate, debauchery, and self-annihilation.
                -- Johnny Hart




More information about the plug mailing list