Crawlers on Gemini and best practices
Stephane Bortzmeyer
stephane at sources.org
Fri Dec 11 10:16:05 GMT 2020
On Thu, Dec 10, 2020 at 11:37:34PM +0530,
Sudipto Mallick <smallick.dev at gmail.com> wrote
a message of 40 lines which said:
> - ask for /bots.txt
Speaking of this, I suggest it could be better to have a /.well-known
(or equivalent) to put all these "meta" files. The Web does it (RFC
5785) and it's cool since it avoids colliding with "real"
resources. (Also, crawling the geminispace shows strange robots.txt
which are probably "wildcards" or "catchall", created by a program
which replies for every possible path. Having a /.well-known would
allow to define an exception.)
It requires no change in clients (except bots) or servers, it is just
a convention.
=> gemini://gemini.bortzmeyer.org/rfc-mirror/rfc5785.txt RFC 5785 "Defining Well-Known URIs"
Meta-remark: is there a place with all the "Gemini good practices" or
"Gemini conventions", which do not change the protocol or the format
but are useful?
More information about the Gemini
mailing list