Statement of intent regarding document encodings

John Cowan cowan at ccil.org
Wed Nov 11 15:15:12 GMT 2020


+1.  Even Google does not distinguish them when it spiders the web, which
means that more than 95% of all pages are UTF-8 (by actual inspection, not
by Content-Type: declaration).

On Tue, Nov 10, 2020 at 4:24 PM Sean Conner <sean at conman.org> wrote:

> It was thus said that the Great Drew DeVault once stated:
> > I am stopping by to clarify another intepretation of the Gemini spec
> > made by my implementations.
> >
> > https://git.sr.ht/~sircmpwn/gmni
> > https://git.sr.ht/~sircmpwn/kineto
> >
> > With respect to the charset parameter of the document mimetype for
> > text/gemini documents, it is our intention to ONLY support UTF-8, and to
> > raise an error if any other content encoding is specified.
>
>   You may want to revisit that decision and allow US-ASCII as well.  It's a
> strict subset of UTF-8, and about half the text pages return that encoding:
>
>         https://portal.mozz.us/gemini/gus.guru/statistics
>         (bottom of page, by charset)
>
>   -spc
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.orbitalfox.eu/archives/gemini/attachments/20201111/1b7f8732/attachment-0001.htm>


More information about the Gemini mailing list