Scroll down to the FAQ for the self-hosting section.
I’m sorry, but how does this even work?
Open-weight models (e.g., Mistral 7B) are Apache 2.0 licensed for research/individual use, while commercial deployments require a Mistral license with separate terms for derivatives and production use.
Apache 2.0 explicitly allows you to sell stuff licensed under it; if there are restrictions on commercial deployments, then it’s not Apache 2.0.
To the actual question: most of the lines I’ve seen drawn between ethical and non-ethical AI/machine learning stuff have focused on consent in obtaining the training data. E.g., ChatGPT or Sora aren’t ethical, as they didn’t get consent for their training data, while stuff like the voice banks Eclipsed Sounds produces are, as they obtain consent from the people they train their models on.
I highly doubt Mistral is getting permission for everything they’re training their models on.
To some degree, ethical depends entirely on the operator and user.
I self-host a half dozen different LLMs and other AI models, and generally use them for different tasks depending on what they’re best suited for.
By far my favorite thing is just knowing that my data doesn’t leave my computer and go to the cloud, which also means that my kids’ data doesn’t leave my computer and go to the cloud.



