Here is a quick way to restrict #LLM training on your content or, conversely, to allow it under specific conditions (e.g. attribution).
- Create /license.xml
- Add the line License: https://krvtz.net/license.xml to your /robots.txt
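To check that the robots.txt step took effect, a small script can look for the License line. The following is only a minimal sketch: the function name `find_license_urls` is made up for illustration, and matching the field name case-insensitively and stripping `#` comments are assumptions borrowed from general robots.txt conventions, not something stated in this post.

```python
def find_license_urls(robots_txt: str) -> list[str]:
    """Return the values of all 'License:' lines found in a robots.txt body."""
    urls = []
    for raw in robots_txt.splitlines():
        # Assumption: '#' starts a comment, as in ordinary robots.txt files.
        line = raw.split("#", 1)[0].strip()
        # Split on the first colon only, so the colon in "https://" is kept.
        key, sep, value = line.partition(":")
        if sep and key.strip().lower() == "license":
            urls.append(value.strip())
    return urls


sample = """\
User-agent: *
Allow: /
License: https://krvtz.net/license.xml
"""

print(find_license_urls(sample))
```

In practice you would feed it the text fetched from your own /robots.txt rather than the inline sample.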
Sample license.xml banning any LLM learning:
<rsl xmlns="https://rslstandard.org/rsl">
  <content url="/">
    <license>
      <prohibits type="usage">ai-train ai-input</prohibits>
    </license>
  </content>
</rsl>
Sample license.xml allowing LLM learning on a CC-BY attribution basis:
<rsl xmlns="https://rslstandard.org/rsl">
  <content url="/">
    <license>
      <permits type="usage">all</permits>
      <payment type="attribution">
        <standard>https://creativecommons.org/licenses/by/4.0/</standard>
      </payment>
    </license>
  </content>
</rsl>
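To sanity-check a license.xml before publishing it, you can parse it and list what it permits or prohibits. This is a rough sketch using only Python's standard library; it handles just the elements that appear in the two samples above, the helper name `summarize_license` is hypothetical, and it is not a validator for the full standard.

```python
import xml.etree.ElementTree as ET

# Namespace taken from the samples above.
RSL_NS = {"rsl": "https://rslstandard.org/rsl"}


def summarize_license(xml_text: str) -> list[dict]:
    """Summarize each <content>/<license> pair as a plain dict."""
    root = ET.fromstring(xml_text)
    rules = []
    for content in root.findall("rsl:content", RSL_NS):
        for lic in content.findall("rsl:license", RSL_NS):
            rule = {"url": content.get("url"), "permits": [],
                    "prohibits": [], "payment": None}
            for kind in ("permits", "prohibits"):
                for el in lic.findall(f"rsl:{kind}", RSL_NS):
                    if el.get("type") == "usage":
                        # Usage tokens are space-separated in the samples.
                        rule[kind].extend((el.text or "").split())
            payment = lic.find("rsl:payment", RSL_NS)
            if payment is not None:
                rule["payment"] = payment.get("type")
            rules.append(rule)
    return rules


ban_sample = """\
<rsl xmlns="https://rslstandard.org/rsl">
  <content url="/">
    <license>
      <prohibits type="usage">ai-train ai-input</prohibits>
    </license>
  </content>
</rsl>
"""

print(summarize_license(ban_sample))
```

A malformed file makes `ET.fromstring` raise `ParseError`, which is a cheap way to catch a broken tag before crawlers see it.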
Live example: https://krvtz.net/robots.txt
Full standard: https://rslstandard.org/guide/getting-started