Three possible uses for IRIs

Philip Linde linde.philip at gmail.com
Tue Dec 8 09:29:32 GMT 2020


On Mon, 7 Dec 2020 23:00:01 -0500
John Cowan <cowan at ccil.org> wrote:
 
> Agreed.  But if you have a Punycode encoder, then the following steps will
> convert an IRI reference to a URI reference, without regard to whether it
> is an IRI or a relative reference:
> 
> 1) Look in the IRI reference for a "//" and a following "/"; if they exist,
> pass the characters in between through your encoder and substitute the
> result into the IRI reference.
> 
> 2) Start over from the beginning.  If a character is ASCII, leave it
> unchanged.  Otherwise, take the character, convert it to UTF-8 bytes (easy)
> and each byte to hex digits (trivial), decorate it with leading %
> (trivial), and move on.  When you come to the end, stop.

There's a "drawl the owl" step somewhere here regarding Unicode
normalization. Does the server like your ä:s fully composed or
decomposed, or should the server itself be responsible for
normalization?

-- 
Philip
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 488 bytes
Desc: not available
URL: <https://lists.orbitalfox.eu/archives/gemini/attachments/20201208/cf4dc147/attachment.sig>


More information about the Gemini mailing list