Looks cool. A couple of questions:
1. Does it support fine tuning with different losses? For example, where you don't need to provide negatives and it uses the other examples in the batch as negatives
2. Can you share inference speed info? I know that Colbert should be slow since it creates many embeddings per passage