I have gone through all the content I could find on the internet. All of it explains what they do, but can someone help me understand what they really are?<p>For example, an LLM is a "next token predictor", and it is transformer-based.<p>From what I understand, it's just an LLM + a state machine.
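<p>To make the "LLM + a state machine" picture concrete, here is a rough sketch of what I have in mind. Everything in it is made up for illustration: `stub_llm` is a hypothetical stand-in for a real model call, `add_tool` is a toy tool, and the `call:` / `final:` / `tool:` message prefixes are my own convention, not any real API. It may well be wrong about how real systems are built.

```python
from enum import Enum, auto
from typing import Callable


class State(Enum):
    ASK_MODEL = auto()  # hand the transcript to the model and read its reply
    RUN_TOOL = auto()   # execute the tool call the model asked for
    DONE = auto()       # the model produced a final answer


def stub_llm(transcript: list[str]) -> str:
    """Hypothetical stand-in for a real LLM: maps a transcript to the next message."""
    if not any(line.startswith("tool:") for line in transcript):
        return "call:add 2 3"  # no tool result yet, so request one
    return "final:" + transcript[-1].removeprefix("tool:")


def add_tool(args: str) -> str:
    """Toy tool (assumption): sums whitespace-separated integers."""
    return str(sum(int(x) for x in args.split()))


def run(task: str, llm: Callable[[list[str]], str] = stub_llm, max_steps: int = 10) -> list[str]:
    """Drive the model with a small state machine until it finishes or steps run out."""
    transcript = ["user:" + task]
    state = State.ASK_MODEL
    pending_args = ""
    for _ in range(max_steps):
        if state is State.ASK_MODEL:
            reply = llm(transcript)
            transcript.append("model:" + reply)
            if reply.startswith("call:add "):
                pending_args = reply.removeprefix("call:add ")
                state = State.RUN_TOOL
            else:
                state = State.DONE
        elif state is State.RUN_TOOL:
            transcript.append("tool:" + add_tool(pending_args))
            state = State.ASK_MODEL
        else:  # State.DONE
            break
    return transcript


if __name__ == "__main__":
    for line in run("what is 2+3"):
        print(line)
```

In this sketch the model only ever maps text to text; the loop around it decides when to call it again, when to run a tool, and when to stop. Whether that is all there is to it is exactly what I am asking.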