Tokens are the chunks the model breaks input and output into (whole words, pieces of words, punctuation), and it works by predicting which token should come next, one at a time. So basically, when you use one of these models online, the provider limits how many tokens you can send and receive in a given time, because the model is costly to run and people would abuse it otherwise. If you're running it on your own machine, nobody is rate-limiting you, so you can use your machine to the max if you want; the only limits are your hardware and the model's own context size.
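If it helps to see the idea in code, here is a rough Python sketch of a tokens-per-time-window limit like the one described above. Everything in it is an assumption for illustration: `rough_token_count` is a crude stand-in (real tokenizers split text into subword pieces, so actual counts will differ), and `TokenBudget` is a made-up class, not any provider's actual API or policy.

```python
import re
import time


def rough_token_count(text):
    # Crude stand-in for a real tokenizer (assumption): every word and every
    # punctuation mark counts as one token.
    return len(re.findall(r"\w+|[^\w\s]", text))


class TokenBudget:
    """Hypothetical limiter: at most `limit` tokens per `window` seconds."""

    def __init__(self, limit, window=60.0, clock=time.monotonic):
        self.limit = limit
        self.window = window
        self.clock = clock
        self.events = []  # list of (timestamp, tokens) for accepted requests

    def allow(self, tokens):
        now = self.clock()
        # Forget requests that have aged out of the window.
        self.events = [(t, n) for t, n in self.events if now - t < self.window]
        used = sum(n for _, n in self.events)
        if used + tokens > self.limit:
            return False  # over budget: the request would be rejected
        self.events.append((now, tokens))
        return True


budget = TokenBudget(limit=10)
prompt = "Hello, world!"
print(rough_token_count(prompt))                 # 4
print(budget.allow(rough_token_count(prompt)))   # True, 4 of 10 tokens used
```

On your own machine you would simply not put anything like `TokenBudget` in front of the model.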