Proposed minor spec changes, for comment.
solderpunk
solderpunk at SDF.ORG
Tue May 19 08:20:16 BST 2020
On Mon, May 18, 2020 at 08:07:53PM -0400, Sean Conner wrote:
> Sure, detecting Greek is
> easy since they have their own alphabet, but what about Spanish, French and
> German? They use the same alphabet.
I don't think it's viable for interactive user clients (especially light
and simple ones) to attempt this, but in the context of, say, a search
engine which really wants to categorise everything (which is not to say
that GUS necessarily has to shoulder this burden!), even distinguishing
languages with the same alphabet is possible by looking at bigram and
trigram frequencies if there's enough text. German text will have many
more occurences of "lich" and "heit" than French or Spanish, etc.
> Nice idea, but there are some tough issues to address.
Yeah, this language proposal may have been poorly categorised as "quick
and easy" compared to the others.
Cheers,
Solderpunk
More information about the Gemini
mailing list