Statement of intent regarding document encodings
John Cowan
cowan at ccil.org
Wed Nov 11 15:15:12 GMT 2020
+1. Even Google does not distinguish US-ASCII from UTF-8 when it spiders
the web, which is how it arrives at more than 95% of all pages being UTF-8
(by actual inspection, not by the declared Content-Type: charset).
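
To illustrate what classifying by inspection could look like, here is a
minimal Python sketch. The name classify_bytes and its three labels are
hypothetical, and this is not a description of how any real crawler works.
It only shows that pure-ASCII input always passes a strict UTF-8 decode,
so the two cannot be told apart by content alone.

```python
def classify_bytes(data: bytes) -> str:
    """Classify raw bytes by content, ignoring any declared charset.

    Hypothetical helper: returns "us-ascii", "utf-8", or "other".
    """
    if all(b < 0x80 for b in data):
        # Every byte is 7-bit, so the data is valid US-ASCII and,
        # because ASCII is a subset of UTF-8, valid UTF-8 as well.
        return "us-ascii"
    try:
        # Strict decoding rejects malformed or truncated sequences.
        data.decode("utf-8", errors="strict")
    except UnicodeDecodeError:
        return "other"
    return "utf-8"
```

A page labelled "us-ascii" here decodes to the same text under either
codec, which is why lumping the two together loses nothing.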
On Tue, Nov 10, 2020 at 4:24 PM Sean Conner <sean at conman.org> wrote:
> It was thus said that the Great Drew DeVault once stated:
> > I am stopping by to clarify another interpretation of the Gemini spec
> > made by my implementations.
> >
> > https://git.sr.ht/~sircmpwn/gmni
> > https://git.sr.ht/~sircmpwn/kineto
> >
> > With respect to the charset parameter of the document MIME type for
> > text/gemini documents, it is our intention to ONLY support UTF-8, and to
> > raise an error if any other character encoding is specified.
>
> You may want to revisit that decision and allow US-ASCII as well. It's a
> strict subset of UTF-8, and about half the text pages return that encoding:
>
> https://portal.mozz.us/gemini/gus.guru/statistics
> (bottom of page, by charset)
>
> -spc
>
>
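
For what it's worth, the policy Sean suggests could be sketched roughly as
follows in Python. This is not the code of gmni or kineto; the names
ACCEPTED, charset_of and decode_body are made up for illustration, and the
fallback to UTF-8 when no charset parameter is present is an assumption
about how a client would treat a bare text/gemini type.

```python
# Charsets this hypothetical client accepts. US-ASCII is a strict subset
# of UTF-8, so both can be handled by one UTF-8 decoder.
ACCEPTED = {"utf-8", "us-ascii"}


def charset_of(meta: str) -> str:
    """Extract the charset parameter from a MIME type string.

    Assumption: a missing charset is treated as UTF-8.
    """
    _mime, *params = meta.split(";")
    for param in params:
        name, _, value = param.partition("=")
        if name.strip().lower() == "charset":
            return value.strip().strip('"').lower()
    return "utf-8"


def decode_body(meta: str, body: bytes) -> str:
    """Decode a response body, rejecting charsets outside ACCEPTED."""
    charset = charset_of(meta)
    if charset not in ACCEPTED:
        raise ValueError(f"unsupported charset: {charset}")
    # Lenient choice: a body declared us-ascii is decoded as UTF-8, so
    # stray non-ASCII bytes are accepted as long as they are valid UTF-8.
    return body.decode("utf-8")
```

With this, "text/gemini; charset=US-ASCII" is accepted, while something
like "text/gemini; charset=iso-8859-1" still raises an error.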