This is consistent with my own experience with topic models, although I'm left wondering how far these observations generalize, and why. I tried to find more details in previous posts about the models used, etc., but couldn't find much.<p>There's a lot of interest in overfitting in ML, but it tends to focus on supervised methods. I think unsupervised methods need more attention, with regard to overfitting in particular but also more broadly.