[ANN] A Gemini crawler, for statistics about the geminispace
Sean Conner
sean at conman.org
Fri Dec 18 22:45:33 GMT 2020
It was thus said that the Great Stephane Bortzmeyer once stated:
> On Wed, Dec 16, 2020 at 06:16:53PM -0500,
> Sean Conner <sean at conman.org> wrote
> a message of 27 lines which said:
>
> > > You can find the current results (the crawler did not crawl the entire
> > > space yet):
> > >
> > > gemini://gemini.bortzmeyer.org/software/lupa/stats.gmi
>
> > One stat I haven't seen yet (yours or from GUS) is a breakdown of
> > langauge. How many pages had a lang parameter, then a breakdown by
> > language, how many multiple languages per parameters (for example,
> > "lang=en,fr").
>
> Just ask :-) Now done:
Thanks.
> gemini://gemini.bortzmeyer.org/software/lupa/stats.gmi
>
> I note:
>
> * French is the second language after english. Cocorico, as we say in
> France.
>
> * There is one page in finnish.
And I see one page that is both English and Japanese.
> * There are more HTML than Markdown pages on the geminispace, which I
> find suprising.
Not really, as I've come across one Gemini site that only serves up HTML.
> * There is one page in EBCDIC and one in CP-437 :-)
Now *THAT* is surprising.
-spc
More information about the Gemini
mailing list