Gemini Archiving and WARC
Tom
tgrom.automail at nuegia.net
Wed Sep 2 06:08:35 BST 2020
On Wed, 02 Sep 2020 01:23:22 +0000
acdw <acdw at acdw.net> wrote:
> On 2020-09-01 (Tuesday) at 23:43, Charles E. Lehner
> <cel at celehner.com> wrote:
>
> > Hi Gemini List,
> >
> > Has anyone thought about, or implemented, archiving of Gemini
> > content/traffic?
> >
> > WARC (Web ARChive)¹ is a standard format used for web archiving. It
> > uses text headers for metadata like in HTTP and email. It looks to
> > me like WARC could be adapted for Gemini. The WARC spec supports
> > multiple URI schemes, although it doesn't specify any other than
> > http/https, ftp, and dns². Bespoke formats could also be used, of
> > course, or just downloading files wget-style, but using a standard
> > format could allow for interop with "the WARC ecosystem"³.
> >
> > Archive Team⁴ has also worked on archiving non-HTTP protocols like
> > FTP⁵ and Gopher⁶.
> >
> > I think there is an opportunity for people to maintain high-quality
> > archives of Gemini content, like what the Internet Archive⁷ and
> > archive.today⁸ do for the HTTP(S) Web. Now is a good time to start,
> > while many of the original Gemini hosts⁹ are still online.
> >
> > Regards,
> > Charles E. Lehner
> >
> > ¹ https://en.wikipedia.org/wiki/Web_ARChive
> > ²
> > https://iipc.github.io/warc-specifications/specifications/warc-format/warc-1.1/#ftp-scheme
> > ³ https://www.archiveteam.org/index.php?title=The_WARC_Ecosystem
> > ⁴ https://www.archiveteam.org/
> > https://en.wikipedia.org/wiki/Archive_Team
> > ⁵ https://www.archiveteam.org/index.php?title=FTP
> > ⁶ https://www.archiveteam.org/index.php?title=Gopher
> > ⁷ https://en.wikipedia.org/wiki/Internet_Archive
> > https://archive.org/
> > ⁸ https://archive.today
> > https://en.wikipedia.org/wiki/Archive.today
> > ⁹ gemini://gemini.circumlunar.space/servers/
> >
>
> I personally think this is a great idea, but I know some might not be
> so on-board with it. I'm thinking of solderpunk's post (in their
> gopherhole, actually):
> gopher://zaibatsu.circumlunar.space:70/0/~solderpunk/phlog/the-individual-archivist-and-ghosts-of-gophers-past.txt
>
> So is there a way to opt-out of archiving for publishers? Some in the
> community might want to know about it, though I personally am of the
> opinion that if you've published it, it's now the property of the
> commons.
>
Ounce you publish something to the internet there is no retracting it.
This is one of the first things I was taught the first time I used the
net. Alongside never using your real name on the net unless your
publishing something.
--
_______________________________________
/ Concentrate on th'cute, li'l CARTOON \
| GUYS! Remember the SERIAL NUMBERS!! |
| Follow the WHIPPLE AVE. EXIT!! Have a |
| FREE PEPSI!! Turn LEFT at th'HOLIDAY |
| INN!! JOIN the CREDIT WORLD!! MAKE me |
\ an OFFER!!! /
---------------------------------------
\
\
/\ /\
//\\_//\\ ____
\_ _/ / /
/ * * \ /^^^]
\_\O/_/ [ ]
/ \_ [ /
\ \_ / /
[ [ / \/ _/
_[ [ \ /_/
More information about the Gemini
mailing list