I imagine coprocessors that don't have separate memory or instructions... they are effectively huge arrays of lookup tables, where the tables themselves are the program and the data simply flows through them. We're at the stage where this is possible for all but the biggest of LLMs.<p>A side effect of doing this mapping, even without the hardware, is that it makes a given task inherently parallel, and much, much easier to spread across low-cost CPUs. I think of it as a universal solvent for computation.
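<p>To make the mapping concrete, here is a rough sketch in Python of what I mean, using a one-bit full adder as the task. Everything here (`make_lut`, `eval_cell`, `eval_layer`, the wire numbering) is made up for illustration and is not any real hardware's or library's API. The thread pool only stands in for independent workers; it is not a claim about actual speedup.

```python
from concurrent.futures import ThreadPoolExecutor
from itertools import product


def make_lut(fn, n_inputs):
    """Precompute fn over every n-bit input. The table replaces the instructions."""
    return {bits: fn(*bits) for bits in product((0, 1), repeat=n_inputs)}


# Hypothetical cells for a full adder: each is just a table, no code runs at lookup time.
SUM_LUT = make_lut(lambda a, b, c: a ^ b ^ c, 3)
CARRY_LUT = make_lut(lambda a, b, c: (a & b) | (a & c) | (b & c), 3)


def eval_cell(cell, wires):
    """A cell is (table, indices of the wires it reads)."""
    table, inputs = cell
    return table[tuple(wires[i] for i in inputs)]


def eval_layer(layer, wires, pool):
    """Cells in a layer read only already-computed wires, so they can run in any order."""
    return list(pool.map(lambda cell: eval_cell(cell, wires), layer))


def full_adder(a, b, cin):
    wires = [a, b, cin]
    layer = [(SUM_LUT, (0, 1, 2)), (CARRY_LUT, (0, 1, 2))]
    with ThreadPoolExecutor() as pool:
        s, cout = eval_layer(layer, wires, pool)
    return s, cout


if __name__ == "__main__":
    print(full_adder(1, 1, 0))  # (0, 1)
```

<p>The point is that once the task is expressed as layers of tables, no cell in a layer depends on any other cell in that layer, so each one could be handed to a different cheap core (or a different chip) without changing the result.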