[ANN] A Gemini crawler, for statistics about the geminispace
Sean Conner
sean at conman.org
Thu Dec 24 02:01:13 GMT 2020
It was thus said that the Great Philip Linde once stated:
> On Wed, 16 Dec 2020 16:05:50 +0100
> Stephane Bortzmeyer <stephane at sources.org> wrote:
>
> > I'm running a Gemini crawler, which gathers metadata about the
> > geminispace. The goal is not to make a search engine but to survey the
> > geminispace.
>
> That's interesting, Stephane. Could you add statistics about character
> encodings used for text/gemini responses specifically? I'd like to know
> if there are currently text/gemini responses in any other encoding than
> UTF-8 (or US ASCII). That would be an interesting topic in the IRI+IDN
> discussion.
There's a chart on the GUS stats page:
https://portal.mozz.us/gemini/gus.guru/statistics
It seesm it's a 54/46 split between UTF-8/US-ASCII (and 7 (seven) pages
out of 84,400 that are NOT UTF-8 nor US-ASCII).
-spc
More information about the Gemini
mailing list