科技回声

India's AI Vaporware

3 points by profsummergig 3 months ago
I did a search for Indian AI LLMs.

I found articles with headlines containing phrases like "10 Indian LLMs".

In such articles, almost every link to an Indian LLM was either a 404 or a landing page with a lot of claims but nothing that could be tried out, nothing open source, no GitHub.

I researched press releases. The loudest claims of breakthrough progress were from an entity called "MahindraTech". When I went to their website, a message said it was not accessible from outside India (I was trying to access it from outside India).

This reminds me of something from before the April 2024 Indian general elections. There was a huge hoopla about how Modi's speeches were going to be simultaneously translated into multiple Indian languages in real time using AI. I was very excited about it. Nothing ever came of it.

Right now, if you look into the AI scene, there are startups that are excited about offering Indian-hosted DeepSeek. There are media articles lauding how DeepSeek (China's DeepSeek!) is being used as a pre-trained base model for fine-tuning. I even saw a government minister talking about how DeepSeek would be hosted on cheap government-owned infrastructure for Indian startups. They seem to have given up on pre-training an LLM.

The only serious thing I came across was a quote by Narayan Murthy (founder of InfoSys) saying that Indian AI entrepreneurs should come clean and admit that there isn't enough data to train AIs for Indian language translation.

The rest was just hype and vapor.

It's really quite pathetic.

(Note: I haven't posted links because HN rules don't allow that in a post)

1 comment

sieve 3 months ago
> Narayan Murthy (founder of InfoSys) saying that Indian AI entrepreneurs should come clean and admit that there isn't enough data to train AIs for Indian language translation.

What does he know about this, exactly? I managed to translate a hundred English classics into Sanskrit using DeepSeek. The translation quality is quite good, close to Claude-level.

Gemini 2 Flash, which I am not yet satisfied with as far as Sanskrit is concerned, gave me a brilliant English translation of the Hanuman Chalisa. Even managed to give me an iambic pentameter version of it when prompted.

The data exists. The only question is whether training models from scratch is really necessary or not.