Three month spec freeze

Sean Conner sean at conman.org
Thu Jun 4 23:02:15 BST 2020


It was thus said that the Great Natalie Pendragon once stated:
> On Wed, Jun 03, 2020 at 12:32:23PM -0400, Sean Conner wrote:
> >   Since the following query isn't supported, could you check to see how many
> > text/gemini documents have a charset?  And if they do, what charset they
> > use?

  First off, thank you for this.  

> For all content types:
> 
>  5529 - none
>   110 - utf-8
>    38 - us-ascii
>     2 - US-ASCII
>     1 - UTF-8

  Hmm ... because I know of *one* page that is text/gemini and is *not*
UTF-8 encoded:

	gemini://gemini.conman.org/test/torture/0013

  I'm guessing you don't crawl the Gemini Client Torture Test then?

> For text/gemini specifically:
> 
>  3256 - none
>    14 - utf-8
>     2 - US-ASCII
>     1 - UTF-8

  What this tells me is that the fears of non-UTF-8 text/gemini pages are
probably unfouned.  For now.  

  I'd recommend keeping the charset on text/gemini.  For the exceeding rare
case of a non-UTF-8 text/gemini page, a very simplistic client that doesn't
want to convert the text can just report an error to the user, or at least
offer an option to display and/or save the content.

  -spc



More information about the Gemini mailing list