To create a LLM based Retrieval Augmented Generation, you need to create a retriever par which is basically an internal search on document excerpts from your knowledge base. The snippets to search on should be meaningful, not too large an not too small: thus choosing a correct splitting method is crucial. I've created a tool that lets you visualize how your choices of splitting criteria determine the chunks you get!