I found this interesting link in paper, so this can be used as ready code for RMT Transformer<p><a href="https://github.com/booydar/t5-experiments/tree/scaling-report/rmt_utils">https://github.com/booydar/t5-experiments/tree/scaling-repor...</a>