Three month spec freeze
Sean Conner
sean at conman.org
Thu Jun 4 23:02:15 BST 2020
It was thus said that the Great Natalie Pendragon once stated:
> On Wed, Jun 03, 2020 at 12:32:23PM -0400, Sean Conner wrote:
> > Since the following query isn't supported, could you check to see how many
> > text/gemini documents have a charset? And if they do, what charset they
> > use?
First off, thank you for this.
> For all content types:
>
> 5529 - none
> 110 - utf-8
> 38 - us-ascii
> 2 - US-ASCII
> 1 - UTF-8
Hmm ... because I know of *one* page that is text/gemini and is *not*
UTF-8 encoded:
gemini://gemini.conman.org/test/torture/0013
I'm guessing you don't crawl the Gemini Client Torture Test then?
> For text/gemini specifically:
>
> 3256 - none
> 14 - utf-8
> 2 - US-ASCII
> 1 - UTF-8
What this tells me is that the fears of non-UTF-8 text/gemini pages are
probably unfouned. For now.
I'd recommend keeping the charset on text/gemini. For the exceeding rare
case of a non-UTF-8 text/gemini page, a very simplistic client that doesn't
want to convert the text can just report an error to the user, or at least
offer an option to display and/or save the content.
-spc
More information about the Gemini
mailing list