"At the very least, it’s clear that this is a question for which the models do not achieve consensus. Every single option was chosen at least once and I believe the models chose poorly in almost every case. Additional written training material and reinforcement learning may allow LLMs to effectively mimic us on this test in the near future. In the longer run, I speculate that humanoid robots will be equipped with sensors and interpretation functions that will allow them to record data that forms a representation of our qualia. This will be another dimension in multi-modal models that will allow them to supplement their current reliance on written words when predicting human behavior. In the meantime, we should be able to think of other tests that get at the heart of the human experience. Pain, pleasure, repulsion, joy, fear, love, annoyance, comfort, and anger are written about at length but clearly something is lost in translation."