I have been waiting a long time for Indian universities, especially the IITs, to invest in building and publishing such corpora.
As the founder of an AI/ML startup, I am surprised at the appalling lack of datasets available for working on Indian problems. Contrast this with Chinese universities, which have built some world-class datasets for building NLP solutions in Mandarin.
Our sentiment analysis works in 8 different languages, but none of them is an Indian language, despite our being in India!