That's not a huge problem when you're working with a limited corpus. The LLM can produce an answer, which you then tokenize and vectorize, and compare against the same information in the small dataset it's supposed to be talking about. If the LLM wanders off into la-la land, have it try again.
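A minimal sketch of that check-and-retry loop, using plain bag-of-words cosine similarity as a stand-in for real embeddings (the `ask_llm` callable, the corpus, and the 0.3 threshold are all made-up placeholders, not anyone's actual API):

```python
import math
import re
from collections import Counter

def vectorize(text):
    # crude stand-in for an embedding: bag-of-words term counts
    return Counter(re.findall(r"[a-z']+", text.lower()))

def cosine(a, b):
    # cosine similarity between two sparse count vectors
    dot = sum(a[t] * b[t] for t in set(a) & set(b))
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0

def grounded_answer(ask_llm, corpus, threshold=0.3, max_tries=3):
    # keep asking until the answer overlaps enough with the corpus
    corpus_vec = vectorize(" ".join(corpus))
    answer = ""
    for _ in range(max_tries):
        answer = ask_llm()
        if cosine(vectorize(answer), corpus_vec) >= threshold:
            return answer
    return answer  # best effort after max_tries

# toy stand-in for an LLM: first reply is la-la land, second is on-topic
replies = iter(["unicorns dance on the moon",
                "the warranty covers parts and labor for two years"])
corpus = ["warranty covers parts", "labor is covered for two years"]
print(grounded_answer(lambda: next(replies), corpus))
```

The first reply shares no vocabulary with the corpus, so it scores 0 and gets rejected; the second overlaps heavily and passes. With real embeddings the comparison is semantic rather than lexical, but the loop is the same shape.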