TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Grok3 Launch [video]

632 pointsby travelhead3 months ago

54 comments

CSMastermind3 months ago
Karpathy gave his initial impression: <a href="https:&#x2F;&#x2F;x.com&#x2F;karpathy&#x2F;status&#x2F;1891720635363254772" rel="nofollow">https:&#x2F;&#x2F;x.com&#x2F;karpathy&#x2F;status&#x2F;1891720635363254772</a><p>The pull quote is: The impression overall I got here is that this is somewhere around (OpenAI) o1-pro capability
评论 #43088748 未加载
评论 #43089020 未加载
评论 #43093357 未加载
评论 #43090262 未加载
评论 #43095712 未加载
rendang3 months ago
Grok has gotten to the top of one benchmark:<p><a href="https:&#x2F;&#x2F;x.com&#x2F;lmarena_ai&#x2F;status&#x2F;1891706264800936307" rel="nofollow">https:&#x2F;&#x2F;x.com&#x2F;lmarena_ai&#x2F;status&#x2F;1891706264800936307</a><p>It&#x27;s been said before but it is great news for consumers that there&#x27;s so much competition in the LLM space. If it&#x27;s hard for any one player to get daylight between them &amp; the 2nd best alternative, hopefully that means one monopolistic firm isn&#x27;t going to be sucking up all the value created by these things
评论 #43086792 未加载
评论 #43087593 未加载
评论 #43086372 未加载
评论 #43087301 未加载
评论 #43089053 未加载
评论 #43088984 未加载
评论 #43088742 未加载
评论 #43086777 未加载
评论 #43087897 未加载
评论 #43087243 未加载
msuvakov3 months ago
To put it this way: after seeing examples of how a LLM with similar capabilities to state-of-the-art ones can be built with 20 times less money, we now have proof that the same can be done with 20 times more money as well!
评论 #43094647 未加载
评论 #43092439 未加载
ilaksh3 months ago
If what they say is true, then you have to give them credit for catching up incredibly fast. And slightly pulling ahead. Not only with the models, but also products.
评论 #43086653 未加载
评论 #43086779 未加载
评论 #43086373 未加载
评论 #43087876 未加载
shekhargulati3 months ago
I don&#x27;t know, but I found the recording uninspiring. There was nothing new for me. We&#x27;ve all seen reasoning models by now—we know they work well for certain use cases. We&#x27;ve also seen &quot;Deep Researchers,&quot; so nothing new there either.<p>No matter what people say, they&#x27;re all just copying OpenAI. I&#x27;m not a huge fan of OpenAI, but I think they&#x27;re still the ones showing what can be done. Yes, xAI might have taken less time because of their huge cluster, but it’s not inspiring to me. Also, the dark room setup was depressing.
评论 #43088447 未加载
评论 #43095054 未加载
tw19843 months ago
Karpathy believes that this is at o1-pro level[1].<p>This again proves that OpenAI simply has no tech moat whatsoever. Elon&#x27;s $97 billion offer for OpenAI last week was reasonable given that xAI already have something just a few months behind - it would probably be faster for xAI to catch up with o3 than going through all those paperworks and lawyer talks required for such an acquisition.<p>Elon also has some huge up-hand here -<p>Elon and his mum are extremely popular in China, it would be easier for him to acquire Chinese AI engineers. He can offer xAI&#x2F;XSpace&#x2F;Neurallink shares to those best AI engineers who&#x27;d prefer some kind of almost guaranteed 8 figure return in long run.<p>Good luck to OpenAI investors who still believe that OpenAI worth anything more than $100 billion.<p>[1] <a href="https:&#x2F;&#x2F;x.com&#x2F;karpathy&#x2F;status&#x2F;1891720635363254772" rel="nofollow">https:&#x2F;&#x2F;x.com&#x2F;karpathy&#x2F;status&#x2F;1891720635363254772</a>
评论 #43101021 未加载
评论 #43098163 未加载
Rover2223 months ago
Grok 3 at the top of Chatboat Arena with 1400, and the model will continue to improve as it trains more.
评论 #43086946 未加载
评论 #43086409 未加载
评论 #43086351 未加载
评论 #43086295 未加载
评论 #43089064 未加载
xnx3 months ago
A very impressive debut. No doubt they benefited from all the research and discoveries that have preceded it.<p>Maybe the best outcome of a competitive Grok is breaking the mindshare stranglehold that ChatGPT has on the public at large and with HN. There are many good frontier models that are all very close in capabilities.
评论 #43093049 未加载
评论 #43094487 未加载
pveierland3 months ago
TLDW. Will this be open weights?<p>This commit seems to indicate so, but neither HF or GH has public data yet:<p><a href="https:&#x2F;&#x2F;huggingface.co&#x2F;xai-org&#x2F;grok-1&#x2F;commit&#x2F;91d3a51143e7fc2ce362f35b41ffeaca709c6a9c" rel="nofollow">https:&#x2F;&#x2F;huggingface.co&#x2F;xai-org&#x2F;grok-1&#x2F;commit&#x2F;91d3a51143e7fc2...</a><p>Edit: Answer from Elon in video is that they plan to make Grok 2 weights open once Grok 3 is stable.
评论 #43089521 未加载
zone4113 months ago
Apparently the API will only be available in a few weeks, so I can&#x27;t run my independent benchmarks yet.
评论 #43086513 未加载
评论 #43087934 未加载
评论 #43090697 未加载
sebzim45003 months ago
Controversial opinion but I think the AI game studio idea is a very good one. Not because I think they will make any money off the games, but dogfooding will lead to so much more improvement than relying on feedback from external customers.
评论 #43099758 未加载
评论 #43089754 未加载
评论 #43091383 未加载
评论 #43089981 未加载
jbryu3 months ago
Looks like they recently updated their ToS as well: <a href="https:&#x2F;&#x2F;www.diffchecker.com&#x2F;w4dbxWwt&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.diffchecker.com&#x2F;w4dbxWwt&#x2F;</a>
mrbonner3 months ago
Have you thought of a future where LLM will be fined tune to target advertisment to you? I mean look at search: first iterations of search were pretty simple in term of ads. Then personalized ads came. I wouldn&#x27;t help but envision the distopia where the LLM will insert personalized ads based on what you are asking for help.
评论 #43090688 未加载
评论 #43090423 未加载
评论 #43094374 未加载
评论 #43098177 未加载
I_am_tiberius3 months ago
Does it already include the datasets Musk received from the government or do I have to wait for Grok4?
Alifatisk3 months ago
Do we have any details on how large the context window is? Or how many input tokens it can handle?
评论 #43113414 未加载
behnamoh3 months ago
Will he do what he promised and open source Grok 2 now?
评论 #43086394 未加载
评论 #43086368 未加载
评论 #43088830 未加载
评论 #43087036 未加载
harisec3 months ago
Anybody can try Grok3 on Chatbot Arena (even if you are in Europe). Select Direct Chat and select the model early-grok-3. <a href="https:&#x2F;&#x2F;lmarena.ai&#x2F;" rel="nofollow">https:&#x2F;&#x2F;lmarena.ai&#x2F;</a>
pkkkzip3 months ago
Am I the only one who isn&#x27;t impressed by this? Grok3 is failing basic OCR, react&#x2F;sql coding excercises that Sonnet and Gemini completes successfully.<p>I&#x27;m also skeptical of lmarena as there is a large number of Elon Musk zealots trying to pass off Grok as a proxy for Tesla shares.
评论 #43103460 未加载
pred_3 months ago
&gt; Currently, Grok Web is not accessible in the United Kingdom or the countries of the European Union. We are diligently working to extend our services to these regions, prioritizing compliance with local data protection and privacy laws to ensure your information remains safely secure.<p>I suppose you can take that to mean that people who do have access to the service should not expect much in terms of data protection.
评论 #43088787 未加载
评论 #43088821 未加载
评论 #43088981 未加载
评论 #43088734 未加载
ddxv3 months ago
I think they put the new model behind a $40 paywall so less people use it. The model seems only marginally better than open source models, based on xAI&#x27;s own internal tests, and they spend $$$ money for it to run. Elon talked in the second half about making one of the largest GPU data centers just to get this running. I guess the next iteration they&#x27;ll be trying to reduce the costs.<p>Also, they will be open sourcing Grok 2, which is probably pretty behind at this point, but will still be interesting for people to check out.
adamhartenz3 months ago
They should have asked Grok3 how to create a good announcement stream before going live. That was a mess
ensocode3 months ago
What are your first impressions using it? (Not available in Europe currently). Is it a game-changer?
评论 #43087725 未加载
评论 #43093633 未加载
modeless3 months ago
I am excited for the voice mode promised in &quot;a week&quot; or so. ChatGPT Advanced Voice has been a big disappointment for me. It can&#x27;t do some of the things they demoed at the announcement. It&#x27;s a lot dumber than text mode. I find the voice recognition unreliable. I couldn&#x27;t get it to act as a translator last time I tried. But most of all I find I don&#x27;t have much to talk to it about. If Grok 3 voice mode can discuss current events from the X timeline then it should be much more interesting to talk to.
评论 #43087245 未加载
designov3 months ago
Very impressive work given the timeline
podobo3 months ago
Say what you will about the guy, he kept the training running on time.
评论 #43096549 未加载
评论 #43094366 未加载
评论 #43098158 未加载
mirekrusin3 months ago
Love low budget on marketing side, just few guys talking about essence - job done, tons of money saved if you ask me.
keepamovin3 months ago
The most fascinating part of the video for me was how they built the hardware to do this: <a href="https:&#x2F;&#x2F;youtu.be&#x2F;AUAJ82H12qs?si=sHz3ddZnz2-HU3UL&amp;t=2192" rel="nofollow">https:&#x2F;&#x2F;youtu.be&#x2F;AUAJ82H12qs?si=sHz3ddZnz2-HU3UL&amp;t=2192</a>
geor9e3 months ago
Launched where? <a href="https:&#x2F;&#x2F;x.com&#x2F;i&#x2F;grok" rel="nofollow">https:&#x2F;&#x2F;x.com&#x2F;i&#x2F;grok</a> just loads Grok 2. I assume it&#x27;s only accessible from iOS right now?
评论 #43090918 未加载
zb33 months ago
I&#x27;m a freeloader and it appears that unfortunately Elon is not stupid enough to just give it to me for free.. There&#x27;s no fair price either since I see no pay-per-use pricing, so.. unavailable for me for now.
评论 #43100072 未加载
92834092323 months ago
I wonder if people will attempt at jailbreaking this model to see if they can find evidence of federal data being used to train it.
mnewme3 months ago
Musk already has too much power, won’t trust him with my AI conversations
评论 #43087329 未加载
greatgib3 months ago
Billions spent, one of the most powerful AI developed, and still no one competent enough to trim the 15 mins of waiting time filler at the beginning of the announcement video...
评论 #43089016 未加载
评论 #43097472 未加载
lngnmn23 months ago
Anyone else noticed anything?<p><a href="https:&#x2F;&#x2F;lngnmn2.github.io&#x2F;articles&#x2F;grok3&#x2F;" rel="nofollow">https:&#x2F;&#x2F;lngnmn2.github.io&#x2F;articles&#x2F;grok3&#x2F;</a>
sunaookami3 months ago
They will open-source Grok 2 when Grok 3 comes out. Also it seems like it will be paywalled - disappointing considering DeepSeek-R1 is free and open source.
评论 #43086391 未加载
srid3 months ago
For some ouroborus fun, I attached this whole HN discussion and asked Grok 3 to summarize (with specific focus on the members attitude towards Elon Musk). Here&#x27;s what it came up with:<p><a href="https:&#x2F;&#x2F;x.com&#x2F;i&#x2F;grok&#x2F;share&#x2F;CTDC0WOi7RCbEDrm11AJ3PtLM" rel="nofollow">https:&#x2F;&#x2F;x.com&#x2F;i&#x2F;grok&#x2F;share&#x2F;CTDC0WOi7RCbEDrm11AJ3PtLM</a>
评论 #43091125 未加载
arj3 months ago
Still no post on their official blog. How disappointing.
phtrivier3 months ago
Off topic, but just in case: is there a good reference on how people actually use LLMs on a daily basis ? All my attempts so far have been pretty underwhelming:<p>* when I use chatbots as search engines, I&#x27;m very quickly disappointed by obvious hallucinations<p>* I ended up disabling github copilot because it was just &quot;auto-complete on steroids&quot; at best, and &quot;auto-complete on mushrooms&quot; at worst<p>* I rarely have use cases where I have to &quot;generate a plausible page of text that statistically looks like the internet&quot; - usually, when I have to write about something, it&#x27;s to put information that&#x27;s in my head into other people head<p>* I&#x27;d love to have something that reads all my codebase and draws graphs, explain how things work, etc... But I tried aider&#x2F;ollama, etc.. and nothing even starts making sense (is that an avenue to persevere in, though ?)<p>* At once, I tried to write in plain english a situation where a team has to do X tasks, in Y weeks, and I needed a table of who should be working on what for each week. I was impressed that LLMs were able to produce a table - the slight problem was that, of course, the table was completely wrong. Again, is it just bad prompting ?<p>It&#x27;s an interesting problem when you don&#x27;t know if you&#x27;re just having a solution in search of a problem, or if you&#x27;re missing something obvious about how to use a tool.<p>Also, all introductory texts about LLMs go into many details about how they&#x27;re made (NNs and transformers and large corpuses and lots of electricity etc...) but &quot;what you can do with it&quot; looks like toy examples &#x2F; simply not what I do.&quot;<p>So, what is the &quot;start from here&quot; about what it can really do ?
评论 #43087597 未加载
评论 #43087470 未加载
评论 #43087463 未加载
评论 #43087331 未加载
评论 #43089937 未加载
评论 #43087685 未加载
评论 #43087404 未加载
评论 #43089183 未加载
评论 #43088352 未加载
评论 #43087858 未加载
评论 #43087440 未加载
评论 #43087425 未加载
评论 #43088182 未加载
评论 #43087375 未加载
评论 #43087424 未加载
评论 #43091068 未加载
评论 #43087586 未加载
评论 #43087495 未加载
评论 #43089091 未加载
评论 #43087453 未加载
评论 #43087360 未加载
评论 #43094681 未加载
评论 #43087366 未加载
评论 #43091671 未加载
评论 #43095914 未加载
评论 #43087545 未加载
评论 #43090583 未加载
评论 #43087557 未加载
评论 #43088080 未加载
评论 #43087781 未加载
评论 #43088354 未加载
评论 #43087551 未加载
评论 #43090435 未加载
评论 #43089286 未加载
评论 #43087556 未加载
评论 #43089275 未加载
评论 #43087397 未加载
评论 #43088347 未加载
评论 #43090524 未加载
JTyQZSnP3cQGa8B3 months ago
Companies have hijacked the open source concept to mean downloadable blob and we follow them as I see in the comments. It’s a real shame.
评论 #43086944 未加载
评论 #43087051 未加载
评论 #43086974 未加载
评论 #43086984 未加载
评论 #43087492 未加载
评论 #43088162 未加载
评论 #43087307 未加载
评论 #43086942 未加载
评论 #43087135 未加载
评论 #43087090 未加载
评论 #43087294 未加载
angusturner3 months ago
Credit to the engineers that built this, but it fills me with rage that Elon has this sort of unchecked power.<p>How long before this starts getting deployed in safety critical applications or government decision making processes?<p>With no oversight because Elon seems to have the power to dismiss the people responsible for investigating him.<p>Anyone not scared by this concentration of power needs to pick up a book.
评论 #43090274 未加载
评论 #43090397 未加载
评论 #43089906 未加载
评论 #43090638 未加载
评论 #43090003 未加载
评论 #43092916 未加载
评论 #43090020 未加载
评论 #43092515 未加载
评论 #43090321 未加载
评论 #43090760 未加载
评论 #43090647 未加载
评论 #43091841 未加载
评论 #43090439 未加载
评论 #43090243 未加载
评论 #43090443 未加载
评论 #43090172 未加载
评论 #43090170 未加载
评论 #43090818 未加载
LorenDB3 months ago
Elon just said they are launching an AI game studio. Does this mean they will be building games that are mostly built with AI, or will they make AI tooling available for anyone to build games easily? Probably the former, but it would be nice if they would make it fully available to everyone.
评论 #43086927 未加载
评论 #43087094 未加载
评论 #43086217 未加载
评论 #43095965 未加载
评论 #43087598 未加载
评论 #43086323 未加载
ccorcos3 months ago
The politics in the comments here are really toxic. What’s happening to HN?<p>This is the largest computer cluster the world has ever seen.<p>Can someone please post interesting comments about things I can learn?
评论 #43092856 未加载
评论 #43093114 未加载
评论 #43094043 未加载
评论 #43092641 未加载
评论 #43092596 未加载
评论 #43092594 未加载
ban-evader3 months ago
The story about how they made this happen in such a short period is impressive to say the least. Elon’s strength seems to be making things happen.<p>Getting the largest computer cluster in the world up and running in a matter of months? Unbelievable.
评论 #43092431 未加载
评论 #43090979 未加载
评论 #43089299 未加载
评论 #43090867 未加载
albertzeyer3 months ago
<a href="https:&#x2F;&#x2F;garymarcus.substack.com&#x2F;p&#x2F;elon-musks-terrifying-vision-for" rel="nofollow">https:&#x2F;&#x2F;garymarcus.substack.com&#x2F;p&#x2F;elon-musks-terrifying-visi...</a><p>I&#x27;m not sure if this was a very bad joke by Elon, or if Grok 3 is really biased like that.
评论 #43087275 未加载
评论 #43087248 未加载
评论 #43087262 未加载
评论 #43087276 未加载
评论 #43087796 未加载
评论 #43087823 未加载
评论 #43087195 未加载
评论 #43087160 未加载
xqcgrek23 months ago
Looks impressive. OpenAI and Sam Altman might be cooked if its as capable as advertised.
评论 #43086955 未加载
评论 #43086151 未加载
评论 #43086815 未加载
评论 #43086365 未加载
评论 #43086288 未加载
评论 #43086127 未加载
评论 #43086130 未加载
gmerc3 months ago
[flagged]
评论 #43091424 未加载
评论 #43092259 未加载
评论 #43091481 未加载
评论 #43091872 未加载
评论 #43092795 未加载
评论 #43091926 未加载
评论 #43091115 未加载
评论 #43091661 未加载
评论 #43091039 未加载
评论 #43092585 未加载
评论 #43092644 未加载
评论 #43092715 未加载
评论 #43091699 未加载
评论 #43092793 未加载
评论 #43092794 未加载
评论 #43091563 未加载
lionkor3 months ago
I don&#x27;t understand how and why Grok would be related to &quot;understanding the nature of the universe&quot;, as Musk puts it. Please correct me if I&#x27;m wrong, but they basically just burned more cash than any human should have to buy Nvidia GPUs and make them predict natural language, right? So, they are somewhat on-par with all the other companies that did the same.<p>This is not innovation, this is baseless hype over a mediocre technology. I use AI every day, so it&#x27;s not like I don&#x27;t see its uses, it&#x27;s just not <i>that</i> big of a deal.
评论 #43087363 未加载
评论 #43087384 未加载
评论 #43087330 未加载
评论 #43087465 未加载
评论 #43087373 未加载
评论 #43087374 未加载
评论 #43087349 未加载
评论 #43089579 未加载
评论 #43087441 未加载
评论 #43087514 未加载
评论 #43088663 未加载
评论 #43087394 未加载
quyleanh3 months ago
[flagged]
评论 #43086228 未加载
评论 #43086966 未加载
评论 #43093099 未加载
dazzaji3 months ago
[flagged]
评论 #43087473 未加载
评论 #43093711 未加载
评论 #43088129 未加载
评论 #43089608 未加载
ramesh313 months ago
Can&#x27;t stand Elon but happy to see this. We badly need a frontier model that is not so obsessed with &quot;safety&quot;. That nonsense has held things back significantly, and leads to really stupid fake constraints.
JoelJacobson3 months ago
<a href="https:&#x2F;&#x2F;grok.com&#x2F;" rel="nofollow">https:&#x2F;&#x2F;grok.com&#x2F;</a><p>500 Internal Server Error<p>nginx&#x2F;1.27.4
behnamoh3 months ago
We know RLHF and alignment degrades model quality. could it be that Grok, due to its less restrictive training guidelines (and the fact that its creators aren&#x27;t afraid of getting sued), can achieve higher performance partly due to this simple factor?
评论 #43086405 未加载
1970-01-013 months ago
It blows my mind that Musk hasn&#x27;t integrated Grok as an app inside their vehicles. A literal AI copilot is a completely novel and killer app that cannot be pulled off by any other vehicle manufacturer.
评论 #43090770 未加载
评论 #43090766 未加载
评论 #43091019 未加载
concordDance3 months ago
Interesting thing about this is that because of all the Musk-related overhyping that&#x27;s gone on and because the launch is a video, the thread that marks the entry of another company into the select group of serious AI companies will go off the front page with possibly only 200 points!
评论 #43110354 未加载
kernal3 months ago
[flagged]
评论 #43093014 未加载