Gaffaweb >
Love & Anger >
1993-20 >
[ Date Index |
Thread Index ]
[Date Prev] [Date Next] [Thread Prev] [Thread Next]
From: uli@zoodle.robin.de (Ulrich Grepel)
Date: Thu, 10 Jun 93 01:33 MET DST
Subject: half a gigabyte
To: love-hounds@uunet.UU.NET
Ok, here's the promised summary of our ideas so far: Contents: - Complete love-hounds archive. Organized as mbox files. Superfluous headers removed. Bogus headers from news systems sending posted articles to love-hounds@uunet.uu.net removed. Messages are grouped in 100 message groups or into a separate file for each month. Months are difficult, since they overlap quite a lot. At the moment there are 207 such 100-message-files in the archives and 20-21 are missing there by now. That makes a total of about 22700 messages. An index is added to the files that contains sorted lists of "Subject:" and "From:" lines. "Date:" is not needed, since that's the natural order. Subindices aren't needed because of the nature of threads being short and not too many threads going on at one point in time. Full text search would be nice to have. Some systems (e.g. NeXT) already have some sort of full text search (e.g. Digital Librarian/Indexing Kit) that might be connected to the mail reader software used for reading the archive. Specially written or adapted readers would be able to make use of indices. Any better idea of searching in the archives is WELCOME. But think of the time needed to do such a thing manually. 10 seconds per article result in about 75 hours of work, and 10 seconds is WAY TOO OPTIMISTIC. Only way of doing this is sharing the work. One month for any volunteering person? - pictures. Specially scanned pictures plus all pictures availlable at the moment, sorted into categories like - A scan of every album front and back. Includes all singles and maybe boots? - A scan of EXACTLY WHERE each of the hidden KT's are!! - A "Family Album" section with Kate and friends through the years - A "Scrapbook" section with lots of different KT shots. - A KateCon section! - A NetFaces section (I suggest 64*64 2 to 24 bit TIFFs (as used by NeXTmail)) - remaining pics The file format of these pictures should probably be GIF. If we really don't know how to fill up 600 MB we can add other formats, preferably TIFF and JPEG The pictures should confirm to a naming scheme so that we have not that many problems describing/adding future pictures. If there's any room left I suggest thinking about MPEGs (I'm a Rocket Man...). - special text files. - Cloudbusting - Deeper Understanding - The Garden - Lyrics (with and without annotations. I once tried to translate the lyrics into German. I probably was quite unsuccessful. Anyone interested?) - Kate's poems - Discography (basic, extended, fully extended) - Extended FAQ - Intro to the Net and to rec.music.gaffa / Love-Hounds - All the reviews and newsbits from magazines that can be found in the archives - All the rest of the archives. The last two should go in ASCII only, but all the others should be availlable in more than one format to be printable in a good-looking way. These formats don't have to be searchable, there's always the ASCII-version to search in. Formats mentioned include: - ASCII (always neccessary as a base, universally useable) - rtf (NeXT, MS-Windows, MS-Word (also on Macintosh), includes NeXT and MS-Windows help files) - LaTeX - info - WordPerfect - MS-Word - PostScript - AmigaGuide - World Wide Web distributed hypertext system (distributed? on a CD?) - Whatever we want. Some of these formats are editable, searchable, indexable, hypertextable, looking nice, and more, some aren't. Most should be able to be generated from the others, maybe with some step inbetween. As we are talking about 10 to 15 megabytes of text here it should not be a problem to include as many formats of text as we want. If it really gets too much we might start a) compressing stuff and/or b) using diff to remove much of the text. Unfortunately this will inhibit working directly on the CD with the formatted texts. - sound files Ideas include important snippets from the work of this woman ( ;-) ). Sound format should be decided after looking at the amount of sounds that should find a place. I suggest something between mono 8bit 8kHz mulaw and stereo 16 bit 44.1 kHz. Data format will need converters, since it's not good to store sound in several different ways. Converters (incl. sampling frequency adaptions) are easy to write and/or in the PD. It's possible to add a normal audio track to a data CD-ROM. Most audio CD players are able to play them as track 2-n. Track 1 is data. Maybe we could persuade Kate to let us use "Suspended In GAFFA". (How could we achieve this?) - software Any software that helps reading/watching/listening_to/digesting/working_with the rest of the stuff. Preferably in source and binary form(s). Including mail readers, picture viewers/converters, sound converters. - fanzines Peter D.F.M.: Would you like to offer the Homeground Archives? That would be a very nice addition to the CD, since most of the early mags are difficult to find. True colleKTors would still want to have the originals, so that the market for old issues won't die down. This plea holds for all other fanzines too. - other archives Since these are spin-offs from gaffa I suggest to include the archives of the two mailing lists Ecto and Really-Deep-Thoughts. At least if there's room. We might even add the pictures and any other file from them. Of course this depends on the availlable space. - CD data format ISO-9660 with 8+3 character upper case file names (just to please EVERYONE). To please the rest it would be nice to have some way to have longer and mixed case file names. One idea would be to create a tar file that contains a huge amount of soft links into the CD and that of course can have longer file names. - design KT-logo on the disc, some suggestion for the cover? - cost anything between 50 $ (for single copies) and 5 $ (for higher quantities) - needed - person with big hard disk that has enough free space to hold the stuff - person with access to CD mastering facilities. Preferably as near to the first person as possible... - person(s) with good scanners - person(s) with good OCR software (the fanzines...) - a query about the number of people wanting such a CD-ROM - volunteers - further suggestions Bye, Uli