Skip to content
/ remix Public

remix extracts all the stuff from the SAILDART archive Time-Capsule file named *flat_DART_data8* which must the md5 hash value 3adbff17fd7f9f6eb9107755594ae0b9. remix writes a Smörgåsbord of 21st century formats for historical study.

Notifications You must be signed in to change notification settings

saildart/remix

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

saildart/remix/README.md

foo

to build remix

To compile, just execute the source file remix.c to make a binary at path /usr/local/bin/remix or type the bash command 'source remix.c'. A simple 'makefile' exists, so just type 'make'.

title REMIX

    remix - read SAILDART time-capsule
          - read four curated feedback files, so remix can be done in one pass
          - write time-capsule data into several formats 

SYNOPSIS

    remix extracts stuff 
    from the SAILDART archive Time-Capsule file named 'flat_DART_data8',   
    which should have the md5 hash value 3adbff17fd7f9f6eb9107755594ae0b9.  
    remix writes a Smörgåsbord of 21st century formats for historical study. 

INPUT files

Place a symbolic link named ./INPUT pointing at the directory containing the two large time capsule files are named flat_DART_data8 and sn-hash8-accession-by-sn

md5 hash path
3adbff17fd7f9f6eb9107755594ae0b9 large/flat_DART_data8
9c763af70e2c41ca1f646668df13c7df feedback/perm-darn-byte
b9663bbd331646607452485b6e7d8f52 feedback/sn-hash8-accession-by-sn
8a1d360dd7a7d34476c82d373c375a87 feedback/tape-dumpset-reel.csv
c61314d68b7f24337e70309cc8a4118e feedback/tape-fln-order

PROCESS the five D's

D description
de-frag concatenate DART-7track-record data-payloads into SAIL-blobs
de-dup hash digest SAIL-blob-data8 to get serial numbered unique blob content
de-damage Mark files with Previous-Media-Error or defective headings
de-flate omit excessive record padding, redundancy, and in-band DART tables. (aka de-pad)
de-tox omit ephemera, legal hazards, dipolmatic faux pas [foh pahz] and Phở Soup.

OUTPUT files

Plase a symbolic link named ./OUTPUT pointing at 100GB of empty space. For development I build loop mounted containers so the delete is rapid.

    reset-OUTPUT-container.bash

    data base tables into csv formated files /OUTPUT/csv/
            address
            attr
            dates
            ribs
            tags
            tapedex
            
    content blobs into file system as
            DATA8 binary raw
            UTF-8 text   raw
            UTF-8 text cooked

The unified 'C' program for after 2021 will be one large file named

    remix.c

It is now broken up into 12 pieces

in the hope of re-factoring towards improved understanding

    remix.c      
    main_bigtop.c
    structures.c  
    filename_smith.c
    datetime_smith.c

    dart_decoding.c
    load_feedback.c
    tree_farming.c

    md5.c
    blob_taxidermy.c  
    main_command_options.c
    main_usage.c

auxillary Read Only source files

    md5.c        md5.h
    utf8.c       utf8.h

post remix SQL programs

The SQL scripts have been moved out of remix. The saildart data base processing will require several new repositories.

R E L I N K

symbolic-link convenient pathnames to subsets of the /data8/ binary files

t1;awk '{printf("ln -s /data8/sn/%s /data8/nickname/%s\n",$1,$2)}' \
        data/Disassemble./sn_fn|bash;t2

Initialize some SD chips ( say 128 GB )

for offsite, cold-copy, time-capsule distribution

For example, I plugged in three USD SD chips which then appeared at device locations /dev/sde /dev/sdf /dev/sdg so

        sudo bash
        mkdir -p /e1 /f1 /g1            \# make some mount points
for x in e f g;do echo;
        sgdisk -Z /dev/sd$x;
        sgdisk -N0 /dev/sd$x;
        mkfs.ext4 -F /dev/sd${x}1;
        mount /dev/sd${x}1 /${x}1;
done
        chown bgb:bgb /e1 /f1 /g1       \# user access for copy commands

Practical copy of time-capsule on SD chip

The chip labeled flat_DART_data8 has one GNU/Linux ext4 partition with the 85 Gigabyte input file for remix.

Optional excessive purity of the SAILDART archive byte vector

   Avoid PARTITION table
   Avoid FILE SYSTEM       
   Copy **flat_DART_data8** to SD without file system or partition table
sudo bash
time dd if=/d/large/flat_DART_data8 of=/dev/sdd oflag=direct status=progress bs=64M

90314177512 bytes (90 GB, 84 GiB) copied, 5963 s, 15.1 MB/s 1345+1 records in 1345+1 records out 90314177512 bytes (90 GB, 84 GiB) copied, 5966.4 s, 15.1 MB/s #real 99m26.408s

About

remix extracts all the stuff from the SAILDART archive Time-Capsule file named *flat_DART_data8* which must the md5 hash value 3adbff17fd7f9f6eb9107755594ae0b9. remix writes a Smörgåsbord of 21st century formats for historical study.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages