I work at Alphabet and I recently went to an internal tech talk about deploying large language models like this at Google. As a disclaimer I'll first note that this is not my area of expertise; I just attended the tech talk because it sounded interesting.<p>Large language models like GPT are one of the biggest areas of active ML research at Google, and there are a ton of pretty obvious applications for how they can be used to answer queries, index information, etc. There is a huge budget at Google for staffing people to work on these kinds of models and do the actual training, which is very expensive because it takes a ton of compute capacity to train these super huge language models. However, what I gathered from the talk is that the economics of actually using these kinds of language models in the biggest Google products (e.g. search, Gmail) aren't quite there yet. It's one thing to put up a demo that interested nerds can play with, but it's quite another to integrate it deeply into a system that serves billions of requests a day when you take into account serving costs, added latency, and the fact that the average revenue on something like a Google search is already close to infinitesimal. I think I remember the presenter saying something like they'd want to reduce the costs by at least 10x before it would be feasible to integrate models like this into products like search. A 10x or even 100x improvement seems like an attainable target, so I think technology like this is coming in the next few years.
It's great, until people realize GPT-3 will generate answers that are demonstrably wrong. (And to make matters worse, can't show/link the source of the incorrect information!)
These are two very different concerns framed as a single one. Google is first and foremost a search engine - it searches the web for answers, the key point being that the answers need to exist on the web. The other is a machine learning model tasked with deriving answers, and sometimes - if not often - answers will be provided in an authoritative tone whilst being completely and utterly incorrect.<p>Google is working on the latter with LaMDA[1], which is arguably more impressive and extensive than GPT-3, but for the reasons discussed above can't just be rolled out to the public. (edit: as others have noted, the code snippets themselves are wrong, but the Twitter poster didn't verify this because they're not interested in the answer, just the lack of one from Google.)<p>It's certainly an interesting discussion. Mathematics (homework) help is already being built into search, and one day code snippets will surely be embedded in search results too. However, at Google's scale, and given the amount of scrutiny it receives, spitting out machine-learning-based results without any curation or substantiation is dangerous. Legally it is much safer to delegate to websites, deflecting any blame onto the host site.<p>1: <a href="https://en.wikipedia.org/wiki/LaMDA" rel="nofollow">https://en.wikipedia.org/wiki/LaMDA</a>
These examples are terrific, but the framing is ridiculous.<p>- GPT-3 answers can be incorrect, and don't carry enough context with them for the reader to engage critically.<p>- Text is often an inefficient presentation of an answer, and Google's knowledge card results can do more and more (while adopting the risk above).<p>- LLMs are a ways from being scalable at this quality to even a fraction of the throughput of Google queries.<p>- Search increasingly benefits from user-specific context, which is even harder to integrate at a reasonable expense into queries at massive throughput.<p>- Google is also regularly putting forward LLM breakthroughs, which will of course impact productized search.<p>As an NLP practitioner who depends on LLMs, I'm as excited as anyone about this progress. But I think some folks are jumping to the conclusion that generative AIs will be standalone products, when I think they'll be much more powerful integrated into structured product flows.
Another person who doesn’t realise AI language models are just making shit up. Google results are quite often full of wrong information, but at least it has mechanisms for surfacing better content: inbound links, domain authority, and other signals. It doesn’t guarantee correctness, but it’s better than the pseudo-authoritative fiction GPT-3 and friends come up with.
Here's an example of how misleading ChatGPT can be:<p>Prompt:<p>Can you explain what happens when i enter "ping 16843009" in a linux shell?<p>Answer:<p><i>When you enter the command ping 16843009 in a Linux shell, you are instructing the ping utility to send an Internet Control Message Protocol (ICMP) echo request to the IP address 168.43.9.0. [...]</i><p>The correct answer is that it will ping 1.1.1.1<p>ChatGPT also missed the fact that 16843009 is bigger than 255, so it can't be a single octet, and failed to explain what actually happens and why.
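For anyone wondering where 1.1.1.1 comes from: ping's address parsing (inet_aton and friends) accepts a bare 32-bit integer and splits it into four octets. A minimal sketch in Python (standard library only; my own illustration, not from the ChatGPT output) makes the mapping explicit:

    import ipaddress

    n = 16843009                       # the number passed to ping
    print(ipaddress.ip_address(n))     # -> 1.1.1.1

    # Same thing by hand: take the four bytes of the 32-bit value.
    octets = [(n >> shift) & 0xFF for shift in (24, 16, 8, 0)]
    print(".".join(map(str, octets)))  # -> 1.1.1.1, since 16843009 == 0x01010101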
This is the very definition of clickbait. Not the Tweeter's fault, but it's a gray area when sharing Tweets on HN, since Tweets do not have a "title" per se.<p>From the HN Guidelines:<p>> Otherwise please use the original title, unless it is misleading or linkbait; don't editorialize.
I don’t think so. Google is still a search engine first and a question answering machine second. And for the question answering I will still prefer links over a blob of text that can’t be inspected or verified.
Google is only "done" if you never actually use it to find sites on the web. For nearly all of these examples I was thinking "WHY DON'T YOU JUST TRY CLICKING THE TOP LINK?" E.g. for the first one, writing differential equations in LaTeX, I thought the first result, <a href="https://priyankacool10.wordpress.com/2013/10/15/writing-differential-equations-in-latex/" rel="nofollow">https://priyankacool10.wordpress.com/2013/10/15/writing-diff...</a>, provided excellent, helpful examples.<p>That is, if anything, I'd be quite satisfied with Google getting back to being a <i>search engine</i> and not just trying to bypass all the results that actually come back.
In the replies someone asks a basic physics question.<p>"In a vacuum do heavier objects fall faster?"<p>The response from GPT is completely wrong and so confident, it is like an imposter trying to save face.
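For the record, the reasoning the model botched fits in one line of Newtonian gravity: the gravitational force on a falling object is proportional to its mass, so the mass cancels out of the acceleration,

    a = \frac{F}{m} = \frac{1}{m}\cdot\frac{GMm}{r^2} = \frac{GM}{r^2}

which is independent of m. In a vacuum (no air resistance), heavier objects do not fall faster.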
What terrifies me is the idea of someone building a GPT-based bot specifically targeting Wikipedia. If one were to train a model on the existing wiki dataset, it could generate and submit plausible-looking but completely factually false edits and articles, with plausible-looking citations. Given the history of long-lasting hoaxes, it shouldn’t be hard to achieve enough throughput to completely overwhelm the capacity of human editors to do any fact checking.
The AI community needs to get real. All this talk about "large language model based AIs" is just smoke and mirrors. The technology is nowhere near advanced enough to convince the majority of people that it can provide genuine value in our lives. Stop pretending like these systems are capable of doing anything more than parroting back pre-programmed responses. The hype is out of control.<p>(The above comment was generated using ChatGPT)
Since we are posting ChatGPT Twitter links, I like this one:<p><a href="https://twitter.com/goodside/status/1598129631609380864" rel="nofollow">https://twitter.com/goodside/status/1598129631609380864</a><p>> explain the worst-case time complexity of the bubble sort algorithm, with Python code examples, in the style of a fast-talkin' wise guy from a 1940's gangster movie
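(Shtick aside, the underlying claim checks out: a reverse-sorted input forces a swap on every comparison, about n(n-1)/2 operations, i.e. O(n^2). A plain, non-gangster sketch in Python:)

    def bubble_sort(items):
        """Repeatedly swap adjacent out-of-order pairs until the list is sorted."""
        n = len(items)
        for i in range(n - 1):
            for j in range(n - 1 - i):       # each pass bubbles the largest remaining item right
                if items[j] > items[j + 1]:
                    items[j], items[j + 1] = items[j + 1], items[j]
        return items

    # Worst case: [5, 4, 3, 2, 1] triggers a swap on every comparison.
    print(bubble_sort([5, 4, 3, 2, 1]))      # -> [1, 2, 3, 4, 5]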
I'm actually really interested in an AI that gives the wrong answers. It is a great way to generate filler when building out flashcards, alongside the correct answer you find when studying something. Does anyone know of a good open-source GPT-3/ChatGPT equivalent (available as a Docker image) that I can use for that?
<a href="https://twitter.com/jdjkelly/status/1598143982630219776/photo/1" rel="nofollow">https://twitter.com/jdjkelly/status/1598143982630219776/phot...</a><p>I went and checked out the Borges fable mentioned here: <a href="https://kwarc.info/teaching/TDM/Borges.pdf" rel="nofollow">https://kwarc.info/teaching/TDM/Borges.pdf</a><p>Looks like the ChatGPT summary is completely wrong? In the fable the map gets discarded, rather than rendering the territory obsolete.
Google’s PaLM is current SOTA, way better than GPT-3 (non-tuned). I’m sure Google has many “tuned” internal-only PaLM variants in prod or testing today.
AI is often over-hyped, especially in recent months.<p>But I think that we've all noticed the progressive degradation of search engines, including Google.<p>It is often more efficient to search on Reddit or Wikipedia or even YouTube.<p>But a good interactive LLM-based chat agent could be a game changer.<p>I've used the demo and it is very useful for quickly getting structured answers in plain English or French, with well-written code examples when needed.<p>It is not 100% there yet; the agent should be connected to a search engine backend, and maybe keep some long-lasting state for each user.<p>This is promising.
I seriously don't get this argument. Google can implement this themselves! It's not like they can't train a large language model akin to GPT-3 (they already have) or deploy it. And as others pointed out, language models are seriously not reliable right now in terms of producing true information.
What are the engineering considerations for serving this sort of model to billions of queries a day? Do the economics of gpt-as-a-search-engine work?
Generative models will surely change the shape of the web. If a major effect of freely sharing something is to enable a big AI company to ingest it and show it to their users without attribution, people are going to share things less freely. Which will then mean that these models won’t be able to generate new things as well.<p>I don’t know exactly how that will manifest, but something of that shape seems to be on the way.
I've never seen Solidity before, but it sure looks like `onlyOwner` is an arbitrary modifier name, and you could use _any_ modifier that contains a require(msg.sender == owner) assertion to restrict the caller. So shouldn't the answer be "...you can add a modifier to the function..." rather than "...you can add the onlyOwner modifier to the function..."?
If there is really some other method that is better, why can't Google just use that behind the scenes to provide answers? At the end of the day, Google is what people are used to. They just go there without thinking. I do agree that the search engine part of it has become less effective, but authoritative answers are an evolving area and Google will evolve along with it.
For the use case of question answering, especially regarding technology, ChatGPT is indeed more flexible and convenient compared to Google and will surely replace a large part of this use case. However, Google is still irreplaceable as an index for the entire internet, and it will remain how we find content created by other _people_.
But the problem is, the "AI" doesn't actually know anything about the answer it is giving. It is simply brute-forcing and randomly generating based on a huge lookup table.<p>So what might appear to be an accurate answer, could in reality just be total garbage. Whereas the google answer has at least been written by an actual person.
I asked it to show me example code for a WebSocket server using Axum and it spit out some .NET code.<p>But while using it, I generally had the feeling that this could one day (3-4 years?) replace Google almost completely for all my code-related searches, which make up more than half of all my Google searches.
I asked on Twitter: "Why do you assume Google (who has one of the largest AI teams around, plus DeepMind) won't also integrate this into search too?"<p>I mean really, do people really think Google isn't also working on stuff like this?
I think these are 2 separate use cases, one for organized knowledge and one for related links. Google doesn't compile knowledge as well, but it does a good job of finding related links.
I don't get it. Why can't Google just train their own LLM and use that for answer cards?<p>The main value of Google Search is the ability to search the web for websites. Not to search for answers.
In the first example, the AI seems more focused on extraneous stuff about aligning the equation, while the search result starts off by answering the question asked.
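For comparison, the core of a good answer is only a line or two of LaTeX; a minimal example (my own, not the tweet's exact snippet) would be something like:

    \[
      \frac{\mathrm{d}y}{\mathrm{d}x} + p(x)\,y = q(x)
    \]

with amsmath's align environment only needed if you actually want several equations lined up.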
Google is literally the “Kleenex” of search<p>Aka “just Google that”<p>I imagine the brand and goodwill value will have remarkable staying power going forward as consumers decide where to do their AI search
GPT chat confidently claimed that Bill Gates never flew with Lolita Express (Jeffrey Epstein’s plane), even when I cited a New York Times article.<p>So, your mileage may vary
Yea... when being proactive, in any way that is not adversarial, ChatGPT has shown me that it's capable of providing very specific insights and knowledge when I ask about topics I'm currently curious about learning. And it works; I learn the type of information I was seeking. When the topics are technical, GPT is very good at crawl, walk, run with things like algorithms. It's great at responding to "well what about...".<p>Not only do I learn more easily, I also develop a better communication style myself while figuring out how to communicate with GPT. GPT also has a nice approach to dialog reasoning.<p>Its filter system may be annoying, but you can easily learn to play to GPT's preferred style of knowledge transfer... and it's honestly something we can learn from.<p>TLDR; IMO ChatGPT expands the concept of learning, and self-tutoring, in an extremely useful way. This is something no search engine of indexed web pages can compete with. Arguably, the utility of indexed web pages is really degraded for certain types of desired search experiences compared to ChatGPT... and it seems obvious that internet browsing will eventually be incorporated (probably for further reference and narrowed expansion of a topic).
Scaling a large language model to serve thousands of queries per second and be continuously updated is not trivial.<p>I'm sure we'll get there at some point.
I wonder what this will do to misinformation. Seems like the next big culture war will be over AI. What seems very utopian will quickly be framed as dystopian. If AI doesn’t promote “opposing positions” it will definitely become the target of politicians’ ire, if not outright banned as <insert political party here> propaganda. For example, what would AI say about the effectiveness of Ivermectin in combatting COVID-19? Or vaccine injury rates? Would AI argue that lockdowns are the most effective measure against a spreading pandemic?