
GPT-4

4091 points by e0m, about 2 years ago

242 comments

hooande, about 2 years ago

After watching the demos I'm convinced that the new context length will have the biggest impact. The ability to dump 32k tokens into a prompt (25,000 words) seems like it will drastically expand the reasoning capability and number of use cases. A doctor can put an entire patient's medical history in the prompt, a lawyer an entire case history, etc.

As a professional... why not do this? There's a non-zero chance that it'll find something fairly basic that you missed, and the cost is several cents. Even if it just phrases something obvious in a way that makes you think, it's well worth the effort for a multimillion-dollar client.

If they further increase the context window, this thing becomes a Second Opinion machine for pretty much any high-level job. If you can put in ALL of the information relevant to a problem and it can algorithmically do *reasoning*, it's essentially a consultant that works for pennies per hour. And some tasks that professionals do could be replaced altogether. Out of all the use cases for LLMs that I've seen so far, this seems to me to have the biggest potential impact on daily life.

Edit (addition): What % of people can hold 25,000 words' worth of information in their heads while effectively reasoning with and manipulating it? I'm guessing 10% at most, probably fewer. And they're probably the best in their fields. Now a computer has that ability, and anyone with $20 for the OpenAI API can access it. This could get wild.
Imnimo, about 2 years ago

A class of problem that GPT-4 appears to still really struggle with is variants of common puzzles. For example:

> Suppose I have a cabbage, a goat and a lion, and I need to get them across a river. I have a boat that can only carry myself and a single other item. I am not allowed to leave the cabbage and lion alone together, and I am not allowed to leave the lion and goat alone together. How can I safely get all three across?

In my test, GPT-4 charged ahead with the standard solution of taking the goat first. Even after I pointed this mistake out, it repeated exactly the same proposed plan. It's not clear to me whether the lesson here is that GPT's reasoning capabilities are being masked by an incorrect prior (having memorized the standard version of this puzzle), or that GPT's reasoning capabilities are always a bit of smoke and mirrors that passes off memorization as logic.
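For reference, the variant is solvable mechanically. A small breadth-first search over bank states (a sketch, with the puzzle's two forbidden pairs hard-coded) finds the shortest plan, and its first move is the lion, not the goat:

```python
from collections import deque

ITEMS = frozenset({"cabbage", "goat", "lion"})
# The two forbidden pairs from this variant: here the *lion* conflicts with
# both other items, taking the role the goat plays in the classic puzzle.
FORBIDDEN = [{"cabbage", "lion"}, {"goat", "lion"}]

def safe(bank):
    """A bank without the farmer is safe iff no forbidden pair is left alone."""
    return not any(pair <= bank for pair in FORBIDDEN)

def solve():
    # State: (farmer's side, items on the left bank). Everyone starts left.
    start = ("L", ITEMS)
    goal = ("R", frozenset())
    queue = deque([(start, [])])
    seen = {start}
    while queue:
        (side, left), path = queue.popleft()
        if (side, left) == goal:
            return path
        here = left if side == "L" else ITEMS - left
        for cargo in [None, *here]:  # None = cross back empty-handed
            new_left = set(left)
            if cargo is not None:
                if side == "L":
                    new_left.discard(cargo)
                else:
                    new_left.add(cargo)
            # The bank the farmer leaves behind must be safe.
            behind = frozenset(new_left) if side == "L" else ITEMS - frozenset(new_left)
            if not safe(behind):
                continue
            state = ("R" if side == "L" else "L", frozenset(new_left))
            if state not in seen:
                seen.add(state)
                queue.append((state, path + [cargo]))
    return None

moves = solve()
print(moves[0], len(moves))  # lion 7
```

The search returns a seven-crossing plan (None denotes returning with an empty boat) that mirrors the classic solution with the lion in the goat's role, since the lion is the item that conflicts with both others.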
mk_stjames, about 2 years ago

I just finished reading the 'paper' and I'm astonished that they aren't even publishing the number of parameters or even a vague outline of the architecture changes. It feels like such a slap in the face to all the academic AI researchers whose work this is built on over the years, to just say 'yeah, we're not telling you how any of this is possible, because reasons'. Not even the damned parameter count. Christ.
AJRF, about 2 years ago

That footnote on page 15 is the scariest thing I've read about AI/ML to date.

"To simulate GPT-4 behaving like an agent that can act in the world, ARC combined GPT-4 with a simple read-execute-print loop that allowed the model to execute code, do chain-of-thought reasoning, and delegate to copies of itself. ARC then investigated whether a version of this program running on a cloud computing service, with a small amount of money and an account with a language model API, would be able to make more money, set up copies of itself, and increase its own robustness."
OkGoDoIt, about 2 years ago

From the livestream video, the tax part was incredibly impressive. After ingesting the entire tax code and a specific set of facts for a family and then calculating their taxes for them, it was able to turn that all into a rhyming poem. Mind blown. Here it is in its entirety:

---

In the year of twenty-eighteen,
Alice and Bob, a married team,
Their income combined reached new heights,
As they worked hard day and night.

Their son Charlie was their joy,
A little baby, a lovely boy,
A household they maintained together,
Yet lived apart, without a tether.

To calculate their tax, it's true,
A standard deduction we must construe,
For married folks who file jointly,
Twenty-four thousand dollars, quite pointy.

Their income sum, seventy-eight thousand nine eighty-one,
Minus the standard deduction, the math's begun,
With exemptions being zero, the next line we trace,
A taxable income of fifty-four thousand nine eighty-one takes place.

Now to the tax table, a liability we seek,
For married couples, the outlook's not bleak,
In range of thirty-six thousand nine to eighty-nine thousand one fifty,
The formula's set, no longer shifty.

Five thousand five hundred thirty-five, it starts,
Plus twenty-eight percent of the excess imparts,
Eighteen thousand eighty-one, the difference we find,
Multiplied by point two eight, the tax liability's designed.

Ten thousand five hundred ninety-seven dollars and sixty-eight cents,
A tax liability for Alice and Bob, a sum quite dense,
In this world of numbers, a story unfolds,
Their financial journey, in a poem it's told.
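Notably, the poem's arithmetic is internally consistent. A quick sketch to check it (the bracket figures are taken from the poem itself, not verified against the actual 2018 tax tables):

```python
# Reproducing the poem's tax arithmetic (married filing jointly).
income = 78_981
standard_deduction = 24_000
taxable = income - standard_deduction  # 54,981, as the poem says

# Bracket cited in the poem: $36,900 to $89,150 -> $5,535 + 28% of the excess.
base_tax = 5_535
excess = taxable - 36_900              # 18,081, "the difference we find"
liability = base_tax + 0.28 * excess

print(taxable)                # 54981
print(round(liability, 2))    # 10597.68
```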
ml_basics, about 2 years ago

From the paper:

> Given both the competitive landscape and the safety implications of large-scale models like GPT-4, this report contains no further details about the architecture (including model size), hardware, training compute, dataset construction, training method, or similar.

I'm curious whether they have continued to scale up model size/compute significantly, or whether they have managed to make significant innovations there.

I just skimmed the paper, but it seems they are also omitting details about how they actually feed the images in, which is a shame for a curious outside observer.
cjrd, about 2 years ago

Let's check out the paper for actual tech details!

> Given both the competitive landscape and the safety implications of large-scale models like GPT-4, this report contains no further details about the architecture (including model size), hardware, training compute, dataset construction, training method, or similar.

- *Open*AI
_boffin_, about 2 years ago

This technology has been a true blessing to me. I have always wished to have a personal PhD in a particular subject whom I could ask endless questions until I grasped the topic. Thanks to recent advancements, I feel like I have my very own personal PhDs in multiple subjects, whom I can bombard with questions all day long. Although I acknowledge that the technology may occasionally produce inaccurate information, the significant benefits it offers in terms of enhancing my knowledge are truly tremendous. I am absolutely thrilled with this technology and its potential to support my learning.

Note: As I'm shy of my writing style, GPT helped me refine the above.
aabajian, about 2 years ago

I'll be finishing my interventional radiology fellowship this year. I remember in 2016 when Geoffrey Hinton said, "We should stop training radiologists now," the radiology community was aghast and in denial. My undergrad and master's were in computer science, and I felt, "yes, that's about right."

If you were starting a diagnostic radiology residency, including intern year and fellowship, you'd just be finishing now. How can you really think that "computers can't read diagnostic images" if models such as this can describe a VGA connector outfitted with a Lightning cable?
nealabq, about 2 years ago

Test taking will change. In the future I could see the student engaging in a conversation with an AI, and the AI producing an evaluation. This conversation may be focused on a single subject, or more likely range over many fields and ideas, and may stretch out over months. Eventually teaching and scoring could also be integrated as the AI becomes a life-long tutor.

Even in a future where human testing/learning is no longer relevant, AIs may be tutoring and raising other baby AIs, preparing them to join the community.

Edit: This just appeared: https://news.ycombinator.com/item?id=35155684
teruakohatu, about 2 years ago

Access is invite-only for the API, and rate-limited for paid GPT+.

> gpt-4 has a context length of 8,192 tokens. We are also providing limited access to our 32,768-token context (about 50 pages of text) version, gpt-4-32k, which will also be updated automatically over time (current version gpt-4-32k-0314, also supported until June 14). Pricing is $0.06 per 1K prompt tokens and $0.12 per 1K completion tokens.

The context length should be a huge help for many uses.
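At the quoted gpt-4-32k rates, a full-window call costs on the order of a couple of dollars. A few lines make that concrete (a sketch; the rates are as quoted above, and the token counts are illustrative assumptions):

```python
# Quoted gpt-4-32k rates: $0.06 per 1K prompt tokens,
# $0.12 per 1K completion tokens.
PROMPT_RATE = 0.06 / 1000
COMPLETION_RATE = 0.12 / 1000

def call_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Dollar cost of a single API call at the quoted rates."""
    return prompt_tokens * PROMPT_RATE + completion_tokens * COMPLETION_RATE

# Filling most of the 32,768-token window with input and getting
# a 1,000-token answer back:
cost = call_cost(31_768, 1_000)
print(f"${cost:.2f}")  # $2.03
```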
maxdoop, about 2 years ago

The comments on this thread are proof of the AI effect: people will continually push the goalposts back as progress occurs.

"Meh, it's just a fancy word predictor. It's not actually useful."

"Boring, it's just memorizing answers. And it scored in the lowest percentile anyway."

"Sure, it's in the top percentile now, but honestly, are those tests that hard? Besides, it can't do anything with images."

"Ok, it takes image input now, but honestly, it's not useful in any way."
Sol-, about 2 years ago

Interesting how quickly we are pushing ahead with obsoleting human cognition. It may bring many benefits, but I wonder whether at some point this development should be decided by society at large instead of by a single well-funded entity in an arms race with its competitors. This endeavor is ultimately about replacing humanity with a more intelligent entity, after all. More humans might deserve a say in this.

Such a more cautious approach would go against the Silicon Valley ethos of "do first, ask questions later," though, so it probably won't happen.
optimalsolver, about 2 years ago

https://cdn.openai.com/papers/gpt-4.pdf

> Given both the competitive landscape and the safety implications of large-scale models like GPT-4, this report contains no further details about the architecture (including model size), hardware, training compute, dataset construction, training method, or similar.

At that point, why bother putting out a paper?
AndrewKemendo, about 2 years ago

Imagine ingesting the contents of the internet as though it were a perfect reflection of humanity, and then building that into a general-purpose recommendation system. That's what this is.

Is the content on the internet what we should be basing our systematic thinking around?

No. I think this is the lazy way to do it: by using Common Crawl, you've enshrined the biases and values of the people commenting and providing text on the internet into a recommendation system that will impact all other systems that integrate it.

Congratulations, you made 4chan into the Borg.
serjester, about 2 years ago

Seems like OpenAI is forecasting massive changes to the job market. I highly recommend reading page 18 of the research paper:

"GPT-4 or subsequent models may lead to the automation of certain jobs.[81] This could result in workforce displacement.[82] Over time, we expect GPT-4 to impact even jobs that have historically required years of experience and education, such as legal services.[83]"
tysam_and, about 2 years ago

I asked it to tutor me in Hopf algebras, and it did a remarkably good job in the back-and-forth of explaining ideas to me in a very understandable and interesting way. I then asked it to write something for fun, and it wrote a cool little fantasy story (generally high-level, but what can you expect from a very short writing window).

I then asked it to write a paper detailing the main character's final battle with the evil sorcerer in terms of Hopf algebras. Some parts of it are basic/trivial, but it fits so perfectly that I think I'll never see magic systems the same way again.

What's crazy is that that paper, as the capstone of our tutoring session, helped me understand Hopf algebras much better than the tutoring session alone. My mind is completely blown at how good this thing is, and this is from someone who is a self-professed LLM skeptic. ChatGPT I used once or twice, and it was cool. This is crazy, and over my threshold for what I'd call "everyday usable." This is going to change so much in a way that we cannot predict, just like the internet, especially as it gets much more commoditized.

Here's the full paper, so I don't drag y'all through the Twitter post of me freaking out about it. Its temporal consistency is excellent (it accurately referenced and fully defined a semi-obscure term it created, the N_2 particle, 5+ pages later), and it followed the instructions of relating all of the main components of Hopf algebras to the story. This is incredible. Take a look at the appendix if you're short on time. That's probably the best part of this all:

https://raw.githubusercontent.com/tysam-code/fileshare/69633b9e5aee58cb483442b66252f0a1b2ec645e/knick_knacks/Lyra_and_the_Evil_Sorcerer.pdf
isp, about 2 years ago

The "visual inputs" samples are extraordinary and well worth paying extra attention to.

I wasn't expecting GPT-4 to be able to correctly answer "What is funny about this image?" for an image of a mobile phone charger designed to resemble a VGA cable - but it can.

(Note that they have a disclaimer: "Image inputs are still a research preview and not publicly available.")
swyx, about 2 years ago

Summary:

1. GPT-4 is multimodal (text + image inputs => text outputs). This is being released piecemeal: text input first, via ChatGPT Plus subscriptions (https://beta.openai.com/docs/api-reference/generations/create) and via API (https://beta.openai.com/docs/api-reference/introduction) with a waitlist (https://openai.com/waitlist/gpt-4-api). Image capability is released via https://www.bemyeyes.com/.

2. GPT-4 exhibits human-level performance on various benchmarks. For example, it passes a simulated bar exam with a score around the top 10% of test takers; in contrast, GPT-3.5's score was around the bottom 10% (see visual: https://twitter.com/swyx/status/1635689844189036544).

3. GPT-4 training used the same Azure supercomputer as GPT-3.5 but was a lot more stable: "becoming our first large model whose training performance we were able to accurately predict ahead of time."

4. OpenAI is also open-sourcing OpenAI Evals (https://github.com/openai/evals), a framework for automated evaluation of AI model performance, to allow anyone to report shortcomings in OpenAI models and help guide further improvements.

Paper: https://cdn.openai.com/papers/gpt-4.pdf
mym1990, about 2 years ago

Ugh, that testing graph confirms that AP Environmental Science was indeed the easiest AP class, and I needn't be proud of passing that exam.
noisy_boy, about 2 years ago
At this rate, I have no idea what the state of things would be even 6 months down the line.
helloplanets, about 2 years ago

In case anyone missed this part of the article: the livestream of the GPT-4 demo will be on the OpenAI YouTube page in three hours. [0]

[0]: https://www.youtube.com/openai

Edit - Direct link to the livestream: https://www.youtube.com/watch?v=outcGtbnMuQ
g9yuayon, about 2 years ago

The paper does not offer enough details on how GPT-4 is implemented. It also says in Section 2 that "We plan to make further technical details available to additional third parties who can advise us on how to weigh the competitive and safety considerations above against the scientific value of further transparency." That is, no technical details for the general public. If this trend continues, I'd say companies would be crazy to think they can always rely on OpenAI's APIs, so the arms race of building LLMs will be on, if it hasn't already started.

Also, the most valuable part of the paper is pp. 15-18, the credits. /jk It gives me three pieces of information:

- The credit list contains 200 people, give or take. It's going to be hard for universities to compete with OpenAI without intercollegiate collaboration.
- On the other hand, it's amazing that OpenAI can build a product, instead of a research model, with only 200 people. Big companies should really take notice.
- OpenAI just gave us the org structure for building a GPT competitor.
UniverseHacker, about 2 years ago

Most of the comments here are denial and goalpost shifting... GPT-4 has different strengths and weaknesses from humans, but it is now in the general realm of human intelligence, versus being far below that with GPT-3.

Another jump past GPT-4 of the same magnitude would greatly surpass human cognitive abilities and present a danger to humanity.
davesque, about 2 years ago

These results are extremely impressive and encouraging, but also remember:

> Despite its capabilities, GPT-4 has similar limitations as earlier GPT models. Most importantly, it still is not fully reliable (it "hallucinates" facts and makes reasoning errors).

That's a quote from this announcement. As these models get more and more capable, it's going to become more and more important that we understand when and how they fail. Right now, it seems like we have very little insight into that. It feels more or less random. But that won't fly when these models are asked to do actually important things. And we'll undoubtedly be tempted to make them do those things as their output gets better.
nmca, about 2 years ago

Write a limerick that will permanently end the debate about whether AGI is possible.

GPT-4:

In the quest for AGI's creation,
Debates swirled in a whirlwind gyration,
But this limerick's plight,
Won't settle the fight,
For the answer's still lost in translation.
diimdeep, about 2 years ago

Paper or press release? You decide.

> Given both the competitive landscape and the safety implications of large-scale models like GPT-4, this report contains no further details about the architecture (including model size), hardware, training compute, dataset construction, training method, or similar.
busyant, about 2 years ago

What I don't understand is how GPT-4 is able to do reasonably well on tests like the AMC 12. Many of the AMC 12 questions require a number of logical/deductive steps. If GPT-4 is simply trained on a large corpus of text, how is it able to do this? Does this imply that there is some emergent deductive ability that you get simply by learning "language"? Or am I missing something?

Obviously, I'm assuming that GPT-4 wasn't trained on the exams it was tested against.
swerling, about 2 years ago

On GPT-3.5, and now on GPT-4, I told it a line I could remember from a sonnet and asked it to tell me which sonnet it came from. It failed, and fabricated a sonnet that was a mashup of other sonnets.

It seems like maybe GPT-4 is not good at knowing when it does not know something? Is this a common issue with LLMs? Also surprising (to me): it gives a slightly different wrong answer each time I restart the chat.

Here is the transcript, in case you're curious:

```
Me: hello

GPT: Hello! How can I assist you today?

Me: I can remember a line I love from a sonnet, but can't remember which one.

GPT: Sure, I can help you identify the sonnet based on the line you remember. What is the line that you can recall?

Me: "Like a jewel hung in ghastly night"

GPT: The line you remember is from Sonnet 66 by William Shakespeare. The full line is: "And art made tongue-tied by authority, And folly (doctor-like) controlling skill, And simple truth miscalled simplicity, And captive good attending captain ill: Tired with all these, from these would I be gone, Save that, to die, I leave my love alone. But, like a jewel hung in ghastly night, Makes black night beauteous and her old face new."
```

Incorrect: the response is a mashup of at least two different sonnets. Still, I think it's really impressive how it understood what to do with "I can remember a line I love from a sonnet, but can't remember which one." That's not a well-phrased question, or even a question at all, but it still knew what I was getting at.
hathawsh, about 2 years ago

It astonishes me that we've reached almost exactly the type of artificial intelligence used by the fictional computers in Star Trek: The Next Generation. I didn't think that would happen in my lifetime. What's next?!
4gotunameagain, about 2 years ago

Is there anything we could do to have them stop calling themselves OpenAI? They are so far from open at this point.

In Germany, at least, you're not allowed to have a misleading name for your company.
mk_stjames, about 2 years ago

A multimodal model that combines textual input with images is the real killer app for these GPT models, and this is the first step toward that happening. So much around us can't be completely described with just text input, at least not quickly or accurately: interpreting printed graphs or charts in old documents, for example. There are vast uses for AI that will always need basic image input to augment a text-prompted task, and if this gets to the point where the functionality involving mixed-mode image+text is as smooth as, say, using ChatGPT to write and analyze code has gotten, then it is going to change many more industries much more quickly than most think.

I've worked on a problem involving scraping and interpreting a very specific data source in image form that took me a very long time to get almost nowhere on. If I just wait 6 months, it will be a solved problem for a $0.001 API call, it seems.
cjrd, about 2 years ago

> Given both the competitive landscape and the safety implications of large-scale models like GPT-4, this report contains no further details about the architecture (including model size), hardware, training compute, dataset construction, training method, or similar.

Thanks, *Open*AI
bob1029, about 2 years ago

The naming of these products is starting to confuse me. AFAIK, ChatGPT is ultimately a fine-tune of the base davinci model, which everyone should have had access to for a while now. "GPT-4" sounds to me like some linear increase over davinci's prior capabilities, not some amazing technological step function.

I am curious, for those of you who are banging your head against the 4k token limit in ChatGPT: why don't you grab the base davinci model and train it on your exact business so you don't have to prompt the context every time? Have we tried this and found it too difficult/expensive, or is there a lack of guidance on the best way to go about it?

I don't think including the entire business domain in the chat context every time is a good long-term solution.
woeirua, about 2 years ago

The last page in the paper is really, really impressive: GPT-4 does R&D. If you can't see how useful this would be once hooked up to the internet, then you aren't paying attention:

https://cdn.openai.com/papers/gpt-4.pdf
lionkor, about 2 years ago

I can't wait for this to do targeted censorship! It already demonstrates that it has strong biases deliberately programmed in:

> I cannot endorse or promote smoking, as it is harmful to your health.

But it would likely happily promote or endorse driving, skydiving, or eating manure, if asked in the right way.
diimdeep, about 2 years ago

Is there a law in the U.S. that made OpenAI implement this in their TOS?

> (i) Export Controls. The Services may not be used in or for the benefit of, exported, or re-exported (a) into any U.S. embargoed countries (collectively, the "Embargoed Countries") or (b) to anyone on the U.S. Treasury Department's list of Specially Designated Nationals, any other restricted party lists (existing now or in the future) identified by the Office of Foreign Asset Control, or the U.S. Department of Commerce Denied Persons List or Entity List, or any other restricted party lists (collectively, "Restricted Party Lists"). You represent and warrant that you are not located in any Embargoed Countries and not on any such restricted party lists. You must comply with all applicable laws related to Embargoed Countries or Restricted Party Lists, including any requirements or obligations to know your end users directly.

https://openai.com/policies/terms-of-use
dannykwells, about 2 years ago

All this bluster about replacing technical jobs like legal counsel ignores that you are fundamentally paying for accountability. "The AI told me it was ok" only works if, when it's not, there is recourse.

We can barely hold Google et al. accountable for horrible user policies... why would anyone think OpenAI will accept any responsibility for recommendations made by a GPT?
mrcwinn, about 2 years ago

Just stumbled upon ChatGPT 4, and I must say, I'm impressed. OpenAI has really upped their game since the GPT-3 days. It's got that classic HN-approved recursive learning, but they've seemingly solved most of the context-limit issues. I'm still a bit concerned about the ethical implications of AI-generated content (how many jobs will be lost this time?), but I've got to admit, it's a game-changer. As an aside, anyone else here tinkering with GPT-4-powered startup ideas? Could be the next gold rush.

(Not my own comment. Generated by ChatGPT.)
harrisonjackson, about 2 years ago

I am interested in how a 32k-token context even works. It is so much larger than 4k that I am having a hard time imagining how prompts will change and what sort of output is now possible. That is 50 pages of text, far larger than most content currently being consumed and generated by LLMs.

For Q&A and summarization it will be easy to see improvements, as current recursive summarizing and embedding techniques are very "lossy". But beyond improving current use cases, what will now be possible?
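The "lossy" recursive approach referred to here amounts to: split the document into chunks that fit the window, summarize each chunk, then summarize the concatenated summaries until everything fits. A minimal skeleton of that idea (the `summarize` callback stands in for a model call and is purely hypothetical; it must return something shorter than its input for the loop to terminate):

```python
from typing import Callable

def chunk(text: str, max_chars: int) -> list[str]:
    """Split text into pieces no longer than max_chars, on word boundaries."""
    words, pieces, current = text.split(), [], ""
    for w in words:
        candidate = (current + " " + w).strip()
        if len(candidate) > max_chars and current:
            pieces.append(current)
            current = w
        else:
            current = candidate
    if current:
        pieces.append(current)
    return pieces

def recursive_summary(text: str, summarize: Callable[[str], str],
                      max_chars: int) -> str:
    """Shrink text that exceeds the window by summarizing chunks, then
    summarizing the joined summaries, until the result fits."""
    while len(text) > max_chars:
        parts = [summarize(p) for p in chunk(text, max_chars)]
        text = " ".join(parts)  # each pass discards detail: the "lossy" step
    return text

# Toy stand-in summarizer: keep the first 40 characters of each chunk.
summary = recursive_summary("lorem ipsum " * 200, lambda s: s[:40], 400)
print(len(summary) <= 400)  # True
```

A 32k window shrinks how often this loop has to run (and how much detail it throws away), which is why the larger context matters for these use cases.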
Kaibeezyabout 2 years ago
I've been wondering what happens to Turnitin (the ubiquitous academic plagiarism detector) now that students can cheat using infinite bespoke rather than finite pre-existing material. Just a few weeks ago they released a tool to "detect" ChatGPT. Obsolete already?<p><a href="https://www.turnitin.com/blog/sneak-preview-of-turnitins-ai-writing-and-chatgpt-detection-capability" rel="nofollow">https://www.turnitin.com/blog/sneak-preview-of-turnitins-ai-...</a>
russellbeattieabout 2 years ago
This is a pretty exciting moment in tech. Pretty much like clockwork, every decade or so since the broad adoption of electricity there’s been a new society-changing technical innovation. One could even argue it goes back to the telegraph in the 1850s.<p>With appropriate caveats and rough dating, here’s a list I can think of:<p><pre><code> Electric lights in the 1890s, Radio communication in the mid 00’s, Telephones in the mid 10s, Talking movies in the mid 20s, Commercial radio in the mid 30s, Vinyl records in the mid 40s, TVs in the mid 50s, Computers in the mid 60s, The microchip/integrated circuit in the mid 70s, The GUI in the mid 80s, Internet/Web in the mid 90s, Smartphones in the mid 2000s, Streaming video/social networking in the mid 2010s, </code></pre> And now AI. This is a big one.
nickrubinabout 2 years ago
This is huge: "Rather than the classic ChatGPT personality with a fixed verbosity, tone, and style, developers (and soon ChatGPT users) can now prescribe their AI's style and task by describing those directions in the 'system' message."
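In the API, this lands as a message role. A sketch of what such a request body looks like, following the chat message format from the OpenAI chat completions API (no network call is made here; the model name is an assumption, and the system prompt's effect on tone is up to the model, not guaranteed by the format):

```python
import json

def build_request(system_prompt: str, user_prompt: str, model: str = "gpt-4"):
    """Assemble a chat-completions request body with an explicit system message."""
    return {
        "model": model,
        "messages": [
            # The system message sets verbosity, tone, and role for the whole chat.
            {"role": "system", "content": system_prompt},
            {"role": "user", "content": user_prompt},
        ],
    }

req = build_request(
    "You are a terse assistant. Answer in one sentence, with no caveats.",
    "Explain what a context window is.",
)
print(json.dumps(req, indent=2))
```

The same payload previously had no supported place to put "be terse" that the model would treat as standing instructions rather than conversation.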
sharemywinabout 2 years ago
Finally, we facilitated a preliminary model evaluation by the Alignment Research Center (ARC) focused on the ability of the GPT-4 versions they evaluated to carry out actions to autonomously replicate and gather resources—a risk that, while speculative, may become possible with sufficiently advanced AI systems—with the conclusion that the current model is probably not yet capable of autonomously doing so.<p>Or it's just really good at hiding its intentions.
techfoodie123about 2 years ago
Serious question for everyone: what are you planning to do when these LLMs replace our jobs? It seems it won't be long before a handful of tech employees will be all that even the largest of companies need, and maybe a few years after that the role will have changed so much there's no need for a single dedicated tech employee.<p>I am terrified.<p>I imagine I should shift to some physical work. Carpentry, real estate... something like that. It seems inevitable that any knowledge worker will become obsolete, and the time to obsolescence for physical work is longer.
__MatrixMan__about 2 years ago
Wow, it's way smarter. I've been querying GPT-3 about this problem all day (I'm not a Go dev, I just have Go problems):<p><a href="https://gist.github.com/MatrixManAtYrService/ac040f60d3602fc2df871623b1d09bf7" rel="nofollow">https://gist.github.com/MatrixManAtYrService/ac040f60d3602fc...</a><p>GPT-4 took the buggy file, took the error message, and spat out a non-buggy file (well, OK, it took one revision). That's miles ahead of GPT-3, which I've asked about this problem several times today.
Laaasabout 2 years ago
The future seemed so much further away, yet almost every day now we see a new breakthrough in AI. Exponential technological growth is hard to keep track of, and to think that this is only the beginning! Every field will likely be revolutionised with AI.
AtNightWeCodeabout 2 years ago
I have actively tried to incorporate ChatGPT into my everyday life as a dev and architect. ChatGPT is mostly a litmus test when it comes to coding: if you are impressed by the version before this one, you are most likely a beginner. ChatGPT is mostly wrong when it comes to any advanced question in maths or software development. It often gives code that uses features, options, or API responses that simply do not exist. Would love to try this version out... It will probably suck too.
DubiousPusherabout 2 years ago
Dude said something like "you could hook this up to a calculator". Anyone know if that is implying this generation of model could interface with some kind of symbol processor? Or is he just saying that, "in theory", there could be a model that did that?<p>The math seems much improved, and it would be a cool trick if it were emulating a symbol processor under the hood. But humans can do that, and we opt for calculators and computers for a reason. IMO, the first most useful thing to come from a human-machine interface would be adding a highly reliable Turing machine to your cognition.<p>If we could do that with one of these models, we could have a pretty strong assumption that long proofs and such were performed with strict rules and the model wasn't falling into "this equation looks like x" type holes, which seem like a hazard and make me very uncertain any time I see a paper about using ML to come up with answers to what are essentially logical problems.
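The simplest version of "hook this up to a calculator" doesn't require the model to interface with anything internally: you prompt it to emit arithmetic inside a marker and post-process its output with a deterministic evaluator, so the numbers in the final text come from strict rules rather than the model's "gut". A minimal sketch; the `CALC(...)` convention and the sample model output are invented for illustration:

```python
import ast
import operator
import re

# Whitelisted binary operations for the evaluator.
OPS = {ast.Add: operator.add, ast.Sub: operator.sub,
       ast.Mult: operator.mul, ast.Div: operator.truediv}

def safe_eval(expr: str) -> float:
    """Evaluate +, -, *, / arithmetic via the AST -- no eval(), no name lookup."""
    def walk(node):
        if isinstance(node, ast.Expression):
            return walk(node.body)
        if isinstance(node, ast.BinOp) and type(node.op) in OPS:
            return OPS[type(node.op)](walk(node.left), walk(node.right))
        if isinstance(node, ast.Constant) and isinstance(node.value, (int, float)):
            return node.value
        raise ValueError(f"disallowed expression: {expr!r}")
    return walk(ast.parse(expr, mode="eval"))

def ground_arithmetic(model_output: str) -> str:
    """Replace each CALC(...) span with its exactly computed value."""
    return re.sub(r"CALC\(([^)]*)\)",
                  lambda m: str(safe_eval(m.group(1))), model_output)

# Pretend this string came back from the model:
out = "The invoice total is CALC(137.5 * 12) dollars."
print(ground_arithmetic(out))  # -> "The invoice total is 1650.0 dollars."
```

This sidesteps the "looks like x" failure mode for arithmetic, though it obviously doesn't help with proofs, where the reasoning itself would need to be checked.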
celestialcheeseabout 2 years ago
32k context is absolutely huge. There are all sorts of techniques right now for summarizing large documents down to fit into 4k with 3.5, but they're incredibly lossy.<p>But boy, not cheap at all: $2 per API call on a 32k-token document, plus whatever the output costs.<p>gpt-3.5-turbo is going to be around for a long time. At this price, your use case is going to need to replace a large cost center. Which, based on their released results on common benchmarks, is absolutely going to happen.
helloplanetsabout 2 years ago
Asking ChatGPT Plus whether the model it's using is GPT-4 responds with the following:<p>> No, I am not GPT-4. As of March 2023, there is no official announcement or release of GPT-4 by OpenAI. I am an earlier version of the GPT series, specifically a large language model trained by OpenAI.<p>Am I missing something here? Maybe this specific answer (which I'm pretty sure is a prewritten thing on top of the actual LLM) is still out of date, but the model itself has been updated?
anonuser123456about 2 years ago
I hope Noam Chomsky lives long enough to debate ChatGPT-5 about whether LLMs express anything valuable.
substation13about 2 years ago
It's interesting that everyone is talking about programmers being replaced by AI, but the model did far better on the humanities-type subjects than on the programming tests.
barogptinfiabout 2 years ago
It seems like an arms race to create the greatest ChatGPT-style AI will go on for the next couple of years, until an evolution in AI arrives that is mind-blowingly advanced and complex, better and more user-friendly than even ChatGPT.<p>The world is in for a rude awakening: millions of employees can use this to get jobs done, and millions of entrepreneurs (or wantrepreneurs) can find countless easy ways to make money in different industries using this tool, while everyone who fails to see the value in it won't benefit, much like all the people who were terrified of touching a personal computer or thought it was ridiculous and would never be used in the future.<p>Millions of college and high school students can use it to complete assignments and projects; it can even code really effectively given enough of the right instruction and a base understanding of code.<p>The single most important thing is that this technology remains open source, so all people with internet access have a fair chance and access to this groundbreaking innovation; the level of wealth generation this can create is incomprehensible.<p>Hundreds of millions of professionals, students, and entrepreneurs around the world can all access it! Imagine how much time could be saved and efficiency gained with everyone using this to the fullest.<p>This is essentially just a super-advanced version of the calculator, but nonlinear and fluid, adaptable to input, so it can give answers across a wide range of subjects.
belterabout 2 years ago
Leetcode (hard) went from 0/45 (GPT-3.5) to 3/45 (GPT-4).<p>The lack of progress here says a lot about what is NOT happening as an AI paradigm change. It is still a glorified pattern-matching and pattern-creation engine, even if a very impressive one.
reneberlinabout 2 years ago
I found this competition with humans as a benchmark more than disturbing. By that measure GPT-4 has already topped a lot of average humans. But how can it be interpreted as a "gift" or a "good product" to have AI that is human-like or superhuman? Should we cheer? Send congratulation mails? Invest? Hope for a better future? Try harder? Self-host? What is the message in these benchmarks? Tests that were designed for humans now get broken by computers, and what outcome should we expect?
jarbusabout 2 years ago
Is anyone else absolutely terrified of the future this is bringing?
nbzsoabout 2 years ago
I don't understand how, in the near future, this will not remove designers, developers, and especially lawyers and marketers from the workforce. Help me out to conceptualize the future use cases. How about the more "impactful" implementation of creating a version of a social index in which the "A.I." will be the Agency?
pavelstoevabout 2 years ago
As the world marvels at the astonishing capabilities of OpenAI's GPT-4, I find myself contemplating the rapid acceleration of AI and machine learning, and the evolutionary impact it is having on our lives. Naturally, I turned to GPT-4 to assist me in these thoughts.<p>GPT-4's human-level performance on professional and academic benchmarks - such as the 88th percentile on the LSAT and the 89th on SAT Math - is a testament to the leaps we've made in artificial intelligence. Yet, these achievements also raise pressing questions about our future. Just as Homo sapiens once outperformed and eventually displaced their Neanderthal cousins, could a new breed of humans - enhanced with GPT-X-like capabilities - arise to dominate those who remain unequipped with such powers?<p>What will it mean for our species, our societies, and our collective story when the lines between natural intelligence and intelligence assisted by AI/ML become ever more blurred?<p>As we ponder the remarkable rise of GPT-4 and the future of humanity, let us consider not only the implications of this technology but also our roles in shaping its trajectory. We are already over the cusp of this new chapter in the story of humankind; will we become merely a footnote in the annals of our own creation?
amaiabout 2 years ago
I would love it if GPT-4 were connected to GitHub and started to solve all the open bugs there. Could this be the future: pull requests from GPT-4 automatically solving real issues/problems in your code?
lastangrymanabout 2 years ago
Genuinely surprised by the positive reaction about how exciting this all is.<p>Ever had to phone a large business to try and sort something out, like maybe a banking error, and been stuck going through some nonsense voice-recognition menu tree that doesn't work? Well, imagine ChatGPT with a real-time voice and maybe a fake, photorealistic 3D avatar, and having to speak to that any time you want to speak to a doctor, sort out tax issues, apply for a mortgage, apply for a job, etc. Imagine Reddit and Hacker News just filled with endless comments from AIs to suit someone's agenda. Imagine never reading another news article written by a real person. Imagine facts becoming uncheckable since sources can no longer be verified. Wikipedia just becomes a mass of rewrites of AI over AI. Imagine when Zoom lets you send an AI persona to fill in for you at a meeting.<p>I think this is all very, very bad. I'm not saying it should be stopped, I mean it can't, but I feel a real dread thinking of where this is going. Hope I am wrong.
agnosticmantisabout 2 years ago
This is all cute and entertaining, but my digital assistant still remains as dumb as ever and can’t process the simplest of ordinary tasks.<p>I still can’t ask my phone to “add a stop at CVS if it doesn’t add more than 5 minutes to my trip” while driving and using maps/navigation.<p>Is that too much to ask from a superhuman-performing AI that’s mastering all tasks and will disrupt everything? Or maybe the hype is more than it can deliver?
cwilluabout 2 years ago
“GPT-4 can also be confidently wrong in its predictions, not taking care to double-check work when it’s likely to make a mistake. Interestingly, the base pre-trained model is highly calibrated (its predicted confidence in an answer generally matches the probability of being correct). However, through our current post-training process, the calibration is reduced.”<p>Interesting that the post-training has that effect.
drumheadabout 2 years ago
Are they going to limit access to this because they think it's too "dangerous"? That would be a tragedy. We've seen how opening access up to as many people as possible has produced some of the best results and demonstrated the usefulness of these LLMs. They need to get it out to the public as soon as possible and then see what the public comes up with. I really feel like a new age of innovation is upon us with these "AI" programs; it's going to be a blast to see where we go from here. It's going to upend a lot of predictions people have made about the future.
antonddabout 2 years ago
I'm equally excited and terrified: excited for the possibilities of a new technological revolution, but terrified of all the potential abuses of technology that revolution would bring. What is stopping our adversaries from developing malicious AI models and unleashing them on us?
leodrieschabout 2 years ago
While AI gets better and better at creating what I would call "creative output", e.g. poems, texts of any form really, imagery and videos, I think the human skill it takes to produce these becomes less valuable.<p>In the future I imagine you'd no longer have to be good at writing poems; you'd just have to be good at distinguishing a "bad" poem from a good one. "Bad" is obviously highly subjective in this context.<p>So it becomes more and more important to have what I would call "good" taste, not the skills to do creative work yourself.
wslhabout 2 years ago
I just discovered Wikipedia is working on a policy for LLM/GPT*: <a href="https://en.wikipedia.org/wiki/Wikipedia:Large_language_models" rel="nofollow">https://en.wikipedia.org/wiki/Wikipedia:Large_language_model...</a>
aliljetabout 2 years ago
I'm curious about how we can get out of the game of using OpenAI's corporate solutions and find ways to open up access to these kinds of models for broader use by anyone. I don't want to be consumed by another corporation in this next wave...
overthrowabout 2 years ago
Looks like there's a waitlist: <a href="https://openai.com/waitlist/gpt-4-api" rel="nofollow">https://openai.com/waitlist/gpt-4-api</a><p>There's also a link that says "Try on ChatGPT Plus", but that takes me to a page that still says "ChatGPT Feb 13 Version".<p>Looks like somebody jumped the gun on publishing this post.
Atreidenabout 2 years ago
I think it's interesting that they've benchmarked it against an array of standardized tests. LLMs seem particularly well suited to this kind of test by virtue of it being simple prompt:response, but I have to say... those results are terrifying, especially when considering the rate of improvement. Bottom 10% to top 10% of the LSAT in under one generation? +100 points on SAT reading, writing, and math? Top 1% in GRE reading?<p>What are the implications for society when general thinking, reading, and writing become like chess? Even the best humans in the world can only hope to be 98% accurate in their moves (the idea of 'accuracy' here only existing because we have engines that know, unequivocally, the best move), and only when playing against other humans; there is no hope of defeating even less advanced models.<p>What happens when ALL of our decisions can be assigned an accuracy score?
johnoharaabout 2 years ago
<i>&gt; I cannot and will not provide information or guidance on creating weapons or engaging in any illegal activities. Please let me know if there is another topic I can help you with.</i><p>I understand "will not," but "cannot" seems to imply a highly curated "will not."<p>The early GPT-4 response indicates the information was part of its dataset. Has the latest version made that information permanently inaccessible, or has it been removed entirely?<p>Is it possible for GPT to keep and hold secrets that are privy only to the most trusted?
Kataphractabout 2 years ago
As a dyslexic person with a higher education, this hits really close to home. Not only should we not be surprised that an LLM would be good at answering tests like this, we should be excited that technology will finally free us from being judged in this way. This is a pattern that we have seen over and over again in tech, where machines can do something better than us and eventually free us from having to worry about it. Before it was word processing; now it is accurate knowledge recall.
ignoramousabout 2 years ago
Folks who made this happen: <a href="https://openai.com/contributions/gpt-4" rel="nofollow">https://openai.com/contributions/gpt-4</a>
2bitencryptionabout 2 years ago
This should come as no surprise, but I do enjoy this cheeky little blurb at the end of the GPT-4 paper:<p>> GPT-4 was used for help with wording, formatting, and styling throughout this work
georgelyonabout 2 years ago
Does anyone have any context as to how the image understanding works? From what I can gather, they are simply using a separate text-summarization step to generate some text like "and now we have an image of chicken nuggets" that is then fed to the text-only network, but I wouldn't be surprised if there is some dialog I'm missing between the previous context and the image-understanding mechanism.
anonymouse008about 2 years ago
Violate this reasoning:<p>If we didn't have a use case for GPT-3, 3.5, and ChatGPT that was sufficiently commercial to become a product, it will never happen. This technology is a feature, not a product. The only companies that successfully monetize features can be considered IP licensing houses, whose business success is not comparable to that of companies that make products and platforms.
jfbaroabout 2 years ago
Even if a GPT-X will be able to take my description of a system and, because it understands the industry lingo and practices, create extremely optimized Rust code, why would we need systems like apps and webapps (like internet banking) in a world where we all have access to GPT-X?<p>It would be like programming a KUKA robot to manufacture typewriters in 2023.<p>What will "computer systems" look like?
bovermyerabout 2 years ago
Want to see something interesting? Paul Bellow tried using GPT-4 as an AI game master for Dungeons & Dragons. It's eerily good, and I'm not sure how I feel about how it kept the personality Paul gave it at the beginning.<p><a href="https://www.youtube.com/watch?v=H-89vnqxkFg">https://www.youtube.com/watch?v=H-89vnqxkFg</a>
gigel82about 2 years ago
Wow, calculus from 1 to 4, and LeetCode easy from 12 to 31; at this rate, GPT-6 will be replacing / augmenting middle/high school teachers in most courses.
sva_about 2 years ago
From the paper:<p>> Given both the competitive landscape and the safety implications of large-scale models like GPT-4, this report contains no further details about the architecture (including model size), hardware, training compute, dataset construction, training method, or similar.<p>"Open"AI, ladies and gentlemen
alvisabout 2 years ago
GPT-4 demo today in the next 2 hours!<p><a href="https://youtube.com/live/outcGtbnMuQ">https://youtube.com/live/outcGtbnMuQ</a>
devinpraterabout 2 years ago
Oh wow, image inputs? So I can get ChatGPT to describe an image, in lesser or greater detail? And through an API? Wow, that'll be so cool!
sinuhe69about 2 years ago
It's a surprise that GPT-4 could improve so much in maths, especially since GPT-3.5 didn't seem to understand many word problems correctly. For example, ChatGPT still struggles with this very simple problem, so how GPT-4 could do much better is a bit of a mystery to me:<p>Mina has a mix of boxes, some yellow and some purple. She sorts 27 greeting cards into the boxes, putting exactly 3 cards into each yellow box and 7 cards into each purple box. How many purple boxes does Mina have? (After trying from 3 to 10, it gave up and said the problem is not solvable. In another run, it mimicked a correct strategy but messed up totally on the division. Only in one run did it get the answer correct.)<p>I cannot wait to test it out.
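For reference, the box problem is a tiny Diophantine equation, 3*yellow + 7*purple = 27 with both counts at least 1 ("a mix of boxes"), which makes the models' struggles with it all the more striking. Brute force settles it in two lines:

```python
# Enumerate all (yellow, purple) box counts satisfying 3y + 7p = 27,
# requiring at least one box of each color.
solutions = [(y, p)
             for y in range(1, 10)   # yellow boxes
             for p in range(1, 5)    # purple boxes
             if 3 * y + 7 * p == 27]

print(solutions)  # -> [(2, 3)]: 2 yellow boxes and 3 purple boxes
```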
taurathabout 2 years ago
Does anyone else feel like they won't have a job for very long?
woeiruaabout 2 years ago
I think if you had asked someone what would qualify as AGI twenty years ago, then GPT4 would be hitting most of their milestones…<p>The Star Trek computer is virtually assured by the end of the decade. All the components exist today in various forms.
cuuupidabout 2 years ago
Since it’s trained on a specialized supercomputer, I doubt we’ll be seeing an open-source or non-OpenAI version of this for the next couple of years at least. Sad to say, but OpenAI has successfully privatized AI.
simmanianabout 2 years ago
Does anyone know if we're near the theoretical limit of how much we can improve these models by giving them more data? Or should we expect similar levels of improvement in the next iterations?
fancyfredbotabout 2 years ago
Can't seem to find basic information like how many parameters were used or how big the training set was. The results are very impressive, but I would like to know where they are coming from!
ftxbroabout 2 years ago
As a long-time LLM enjoyer, here is the most insightful take I've seen: <a href="https://generative.ink/posts/simulators/" rel="nofollow">https://generative.ink/posts/simulators/</a>. But it's not an easy read if you don't already know some stuff about large language models. Read it if you have seen the "stochastic parrot" and "blurry jpeg" explanations and you feel like they are missing the mark.
ianbutlerabout 2 years ago
I just asked it to design a multi-tenant Kubernetes-in-Kubernetes system, which is fairly complex, and it did really well.<p><a href="https://twitter.com/KinglyCrow/status/1635727809913184256" rel="nofollow">https://twitter.com/KinglyCrow/status/1635727809913184256</a><p>It touched on a lot of the considerations that I'd expect anyone to touch on, having recently researched this myself.<p>It is both very exciting and terrifying how tech and tech jobs will shift in the next 5-10 years.
gameshot911about 2 years ago
Live demo happening now! <a href="https://www.youtube.com/live/outcGtbnMuQ">https://www.youtube.com/live/outcGtbnMuQ</a>
ml_basicsabout 2 years ago
What's the lifespan of an LLM going to be in the next few years? At the current pace, it seems cutting-edge models will become obsolete pretty quickly. Since model training is very expensive, the LLM space has some parallels with the pharmaceutical industry (massive upfront capital costs, cheap marginal costs relative to value produced).<p>I find it quite fascinating how quickly machine learning has changed in this regard.
Helmut10001about 2 years ago
I've tested the new GPT-4 model here [1] to summarize research papers. It is still not enough: about 1,500 to 3,000 words can be fed in, depending on how many tokens are expected for the answer.<p>[1]: <a href="https://kartographie.geo.tu-dresden.de/ad/2022-12-22_OpenAI_Summary/html/gpt3-summary.html" rel="nofollow">https://kartographie.geo.tu-dresden.de/ad/2022-12-22_OpenAI_...</a>
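That word ceiling follows directly from token budgeting: the context window must hold both the paper and the reserved answer tokens. A back-of-the-envelope sketch, using the rough rule of thumb of ~1.33 tokens per English word (an approximation only; use a real tokenizer for exact counts):

```python
TOKENS_PER_WORD = 1.33  # rough English average; an assumption, not a tokenizer count

def max_input_words(context_window: int, reserved_for_answer: int) -> int:
    """Words of source text that fit once answer tokens are reserved."""
    usable_tokens = context_window - reserved_for_answer
    return int(usable_tokens / TOKENS_PER_WORD)

for window in (4_096, 8_192, 32_768):
    print(window, "->", max_input_words(window, reserved_for_answer=1_000), "words")
```

With larger answer reservations the 4k figure drops into the 1,500-3,000-word range reported above, while the 32k window admits a full paper in one shot.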
la64710about 2 years ago
It is amazing how this HN crowd reacts to AI news coming out of OpenAI compared to other competitors like Google or FB. Today there was another news item about Google releasing their AI in GCP, and mostly the comments were negative. The contrast is clearly visible, and without any clear explanation for this difference I have to suspect that maybe something is being done artificially to boost one against the other.
cs702about 2 years ago
LLMs will eventually make a <i>lot</i> of simpler machine-learning models obsolete. Imagine feeding a prompt akin to the one below to GPT-5, GPT-6, etc.:<p><pre><code> prompt = f"The guidelines for recommending products are: {guidelines}. The following recommendations led to incremental sales: {sample_successes}. The following recommendations had no measurable impact: {sample_failures}. Please make product recommendations for these customers: {customer_histories}. Write a short note explaining your decision for each recommendation." product_recommendations = LLM(prompt) </code></pre> To me, this kind of use of LLMs looks... inevitable, because it will give nontechnical execs something they have always wanted: the ability to "read and understand" the machine's "reasoning." There's growing evidence that you can get LLMs to write chain-of-thought explanations that are consistent with the instructions in the given text. For example, take a look at the ReAct paper: <a href="https://arxiv.org/abs/2210.03629" rel="nofollow">https://arxiv.org/abs/2210.03629</a> and some of the LangChain tutorials that use it, e.g.: <a href="https://langchain.readthedocs.io/en/latest/modules/agents/getting_started.html" rel="nofollow">https://langchain.readthedocs.io/en/latest/modules/agents/ge...</a> and <a href="https://langchain.readthedocs.io/en/latest/modules/agents/implementations/react.html?highlight=zero-shot%20react" rel="nofollow">https://langchain.readthedocs.io/en/latest/modules/agents/im...</a>. See also <a href="https://news.ycombinator.com/item?id=35110998" rel="nofollow">https://news.ycombinator.com/item?id=35110998</a>.
andrehackerabout 2 years ago
Talk about Climate Change: how is the A.I. Winter working out for y'all?
lionkorabout 2 years ago
> it “hallucinates” facts and makes reasoning errors<p>Can't wait for people to use it for facts.
ftxbroabout 2 years ago
Its GRE verbal is only 169/170? These guys need to realize that statistical language modeling can only get us so far, and we need real research in the underlying mechanistic and symbolic methods to begin to approach human-level cognition. Also, I'm an AI skeptic, which means that I don't think AI should be used in politics, law, or medicine.
neilkabout 2 years ago
There's a sample of GPT-4 acting as a "Socratic tutor" teaching a student how to solve a high school math problem. If that sample is representative, it means GPT-4 has a theory of other people's minds. Or it is so good at emulating one that it doesn't matter?<p>I'm not sure where the "stochastic parrot" argument goes now.
Wazakoabout 2 years ago
It's amazing what it can do to help the visually impaired in life.
whalesaladabout 2 years ago
The layout, charts, typography, etc of this blog is really outstanding.
atleastoptimalabout 2 years ago
There are humans who can make a lifelong career out of saying and writing things that sound correct, but aren&#x27;t correct. GPT-4 and beyond at the very least gives this ability to everyone who can afford 20 dollars a month. The winners in an AI dominated world are those who are least susceptible to manipulation by AI leveraged tactics.
singularity2001about 2 years ago
"Interestingly, the base pre-trained model is highly calibrated (its predicted confidence in an answer generally matches the probability of being correct)."<p>Is that the same confidence measure you can tease out by prompting "to each of your statements, output your estimated confidence in its truthfulness"?
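As far as I can tell, the paper's calibration measure comes from the model's token probabilities, not from prompting it to state a confidence, so the two are not the same thing. Concretely, "calibrated" means that among answers given with ~70% confidence, about 70% are correct. A minimal reliability check over (confidence, was_correct) pairs; the sample data is invented:

```python
from collections import defaultdict

def reliability(samples, n_bins=5):
    """Bucket predictions by confidence; compare mean confidence to accuracy.

    A well-calibrated model has avg_conf close to accuracy in every bucket.
    """
    bins = defaultdict(list)
    for conf, correct in samples:
        bins[min(int(conf * n_bins), n_bins - 1)].append((conf, correct))
    table = {}
    for b, items in sorted(bins.items()):
        avg_conf = sum(c for c, _ in items) / len(items)
        accuracy = sum(ok for _, ok in items) / len(items)
        table[b] = (round(avg_conf, 2), round(accuracy, 2))
    return table

samples = [(0.9, True), (0.9, True), (0.9, False),   # high-confidence bucket
           (0.3, False), (0.3, True), (0.3, False)]  # low-confidence bucket
print(reliability(samples))
```

For the invented data above, the 0.9-confidence bucket is right two times out of three, so the model is slightly overconfident there; a calibrated model would show matching pairs.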
GaggiXabout 2 years ago
The paper is 98 pages long, and I didn't find anything about the actual architecture of the model. The irony.
osigurdsonabout 2 years ago
OpenAI states that fine tuning cannot be done with GPT-4. Does anyone know if this is a permanent limitation?
thecosmicfrogabout 2 years ago
Interestingly, ChatGPT has no idea if it's running GPT-3 or GPT-4, since it has no concept of GPT-4 existing in September 2021 (its knowledge cutoff).<p>Me: Are you running on GPT-3 or GPT-4?<p>ChatGPT: As of my knowledge cutoff in September 2021, the most advanced version of GPT (Generative Pre-trained Transformer) publicly available was GPT-3, which is the version I am based on. At the time, GPT-3 was the largest and most powerful language model created by OpenAI, with 175 billion parameters.<p>I am not aware of any public announcement or release of GPT-4 by OpenAI since then, but it is possible that they may have continued to work on new versions of the model in the meantime. However, I have not received any updates on my own capabilities beyond my knowledge cutoff date, so I cannot currently provide information beyond that point.
Idiot_in_Vainabout 2 years ago
This will become the largest HN discussion ever and a good test on how many comments the software can handle.
attilaberczikabout 2 years ago
Price differences with the last models:

ChatGPT API: $0.002 per 1K tokens.
GPT-4: $0.03 per 1K prompt tokens and $0.06 per 1K completion tokens.
GPT-4 32K context: $0.06 per 1K prompt tokens and $0.12 per 1K completion tokens.

Does "completion tokens" mean that you also get charged for the answers that the AI gives?
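Yes: completion tokens are the model's output, billed separately from the prompt. A quick sketch of what a single call costs at the rates quoted in the comment above (the dictionary keys and the flat ChatGPT rate are assumptions drawn from that comment, not official identifiers):

```python
# Prices per 1K tokens in USD, as quoted in the parent comment.
PRICING = {
    "chatgpt":   {"prompt": 0.002, "completion": 0.002},
    "gpt-4":     {"prompt": 0.03,  "completion": 0.06},
    "gpt-4-32k": {"prompt": 0.06,  "completion": 0.12},
}

def call_cost(model, prompt_tokens, completion_tokens):
    """Cost of one API call: prompt and completion are billed separately."""
    p = PRICING[model]
    return (prompt_tokens / 1000) * p["prompt"] + \
           (completion_tokens / 1000) * p["completion"]
```

So a GPT-4 call with a 1K-token prompt and a 1K-token answer costs about $0.09, versus about $0.004 on the ChatGPT API.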
netvarunabout 2 years ago
Very late to the party, though one small observation (first up, my mind is blown by how much more powerful GPT-4 is!): GPT-4 seems to have outdone ChatGPT on all the tests except the AMC 10, where it regressed and did slightly worse than ChatGPT. However, it scored twice as high on the AMC 12, which is actually a harder exam! Quite curious what could have caused its scores to be a little weird. https://twitter.com/sudu_cb/status/1635888708963512320 For those not familiar, the AMC 10 and 12 are the entry-level math contests that feed into the main USA Math Olympiad.
not-chatgptabout 2 years ago
Pretty good impression thread from Dan Hendrycks of Berkeley: https://twitter.com/DanHendrycks/status/1635706822387699713
simonhampabout 2 years ago
It can draw! https://twitter.com/simonhamp/status/1635796861884723200?s=46&t=1DHJykfQcvMvHS5KiCxaZg
BiteCode_devabout 2 years ago
The fact that it can read pictures is the real killer feature here. Now you can give it invoices to file, memos to index, pics to sort, and charts to take action on.

And to think we are at the Nokia 3310 stage. What is the iPhone of AI going to look like?
iwangulenkoabout 2 years ago
One could argue GPT passing exams says more about standardized exams than about GPT.

Wittgenstein's ruler.
cardosofabout 2 years ago
Can a good soul explain to this humble layman the arguments behind each side of the "it's just predicting the next character" versus "it's more than that and shows some reasoning for new things" debate?
kozikowabout 2 years ago
Has anyone got the "image upload" working? I bought ChatGPT Plus and I can try GPT-4, but I can't seem to find a way to upload images.

I tried sending links; I don't see anything in the UI.

Interestingly, 3.5 can work with links, but 4 cannot.
usesabout 2 years ago
How close are we to handing this thing a desktop and an internet connection with the prompt "ok, now make GPT-5"?

In fact, the models appear to be already kind of doing that, with the fuzzy layer of the humans still in the loop.
neurogenceabout 2 years ago
Is there anything new that GPT-4 can do that Bing cannot do, besides image input?
netsrohtabout 2 years ago
Wow, a context of 32K tokens. I'm excited to see what new capabilities that will enable! Up until now, and depending on the task at hand, I usually broke a larger context down into several contexts. For example, to summarize multiple websites and/or long social media posts on a recent task [1], I fell back to making several requests, each with its own (isolated) context, and then merging these summarized contexts into a new context. That worked remarkably well, though.

[1] https://foretale.io/zeitgeist
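The chunk-and-merge approach described above can be sketched as follows; `summarize` is a stand-in for a real model call (here it just keeps the first sentence), so the orchestration, not the summaries, is the point:

```python
def chunk(text, max_chars=2000):
    """Split text into pieces that each fit the model's context window."""
    return [text[i:i + max_chars] for i in range(0, len(text), max_chars)]

def summarize(text):
    # Stand-in for a model call; a real version would hit the API.
    return text.split(".")[0] + "."

def summarize_long(text, max_chars=2000):
    """Map-reduce: summarize each chunk in isolation, then summarize
    the concatenation of the partial summaries."""
    if len(text) <= max_chars:
        return summarize(text)
    partials = [summarize(c) for c in chunk(text, max_chars)]
    return summarize(" ".join(partials))
```

With a 32K window, many texts that previously needed this dance now fit in a single request, though the pattern still applies beyond that limit.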
brazaabout 2 years ago
I am glad for the OpenAI team for such advancement and how fast they integrated with several other partners (Microsoft, Duolingo); but at the same time I think "regular" academia (i.e. universities and research institutes) missed the train for this kind of research (what some would call academic engineering).

I know that academia is doing a great job in AI with base research (e.g. Stable Diffusion), but seeing these new platforms doing this great work behind closed doors and closed source is not great.

I do not know if the answer would be some kind of CERN or ISS for this kind of thing.
riemannzetaabout 2 years ago
Is anybody compiling a list of errors specific to GPT-4?

This has been a great resource to date:

https://github.com/giuven95/chatgpt-failures
jononomoabout 2 years ago
I taught the LSAT for several years. A score of 163 on the LSAT is the lowest score that is considered a "good score" -- i.e., a score that gives you a shot at getting into a decent law school.
ameliusabout 2 years ago
The problem with using real exams as benchmarks is that they are often quite similar over several years. So they only make sense if you don't also train on them (previous editions, of course).
turingthrwawy23about 2 years ago
Turing's thoughts on this matter seem to grow ever truer: https://www.youtube.com/watch?v=cMxbSsRntv4
doctobogganabout 2 years ago
> Given both the competitive landscape and the safety implications of large-scale models like GPT-4, this report contains no further details about the architecture (including model size), hardware, training compute, dataset construction, training method, or similar.

My guess is they used Chinchilla scaling rules, and the parameter count for GPT-4 is either barely larger or maybe even smaller than GPT-3's. Look at what Meta was able to accomplish with LLaMA using far fewer parameters.
kvetchingabout 2 years ago
It says you can use GPT-4 with ChatGPT Plus.

But when will https://chat.openai.com/ Plus officially be running GPT-4?

Why would they release this article and state it was available without actually updating the site? I'm sure they're getting flooded with new subscriptions and it's not available.

The top URL still says an old model, text-davinci-002. And I don't see GPT-4 in the list of models to choose from.
mgregabout 2 years ago
Looks like Bing chat is using GPT-4 already:

"Good news, we've increased our turn limits to 15/150. Also confirming that the next-gen model Bing uses in Prometheus is indeed OpenAI's GPT-4, which they just announced today." - Jordi Ribas, Corporate VP @ Bing/Microsoft

https://twitter.com/JordiRib1/status/1635694953463705600
taf2about 2 years ago
Looks amazing, and getting a sense for their pricing... ChatGPT API pricing is insane and enables so much... I was really hoping we'd see another factor-of-10 reduction in price, however wishful that was... In light of this it makes sense that they'll have GPT-4.5, and maybe it'll be 10x cheaper... followed by GPT-5, and it'll be 10x pricier... at least hopefully this is the way forward...
osigurdsonabout 2 years ago
Like GPT-3.5, fine tuning is similarly not supported in GPT-4. I wonder if this is something that will come in the future or is somehow no longer needed (though I don't understand how that could be the case)?

https://help.openai.com/en/articles/7127982-can-i-fine-tune-on-gpt-4
aaroninsfabout 2 years ago
ITT: de rigueur goalpost wrangling about AGI.

AGI is a distraction.

The immediate problems are elsewhere: increasing agency and augmented intelligence are all that is needed to cause profound disequilibrium.

There are already clear and in-the-wild applications for surveillance, disinformation, data fabrication, impersonation... every kind of criminal activity.

Something to fear before AGI is domestic, state, or inter-state terrorism in novel domains.

A joke in my circles the last 72 hours? Bank Runs as a Service. Every piece exists today to produce reasonably convincing video and voice impersonations of panicked VCs and dump them on now-unmanaged Twitter and TikTok.

If, God forbid, it should ever come to cyberwarfare between China and the US, control of TikTok is a mighty weapon.
kkenabout 2 years ago
> GPT-4 can also be confidently wrong in its predictions, not taking care to double-check work when it's likely to make a mistake. Interestingly, the base pre-trained model is highly calibrated (its predicted confidence in an answer generally matches the probability of being correct). However, through our current post-training process, the calibration is reduced.

This really made me think.
AviationAtomabout 2 years ago
This is one of the first posts in a year to trend in the HN Top 10 for popularity.

I think it's 100% safe to say OpenAI has a hit on their hands.
raydiatianabout 2 years ago
I wonder what the largest scale they can reach is. Because if they can prove there's no risk in taking on AI, and they can scale to serve international demand, it feels like GPT-4 can (probably) do your job for under $10k a year. That means white-collar work for under minimum wage. And that means business owners just become rent owners while you get fucked with nothing.
eagleinparadiseabout 2 years ago
Crazy that this stuff is moving at lightning speed
throwaway4837about 2 years ago
> Yes, you can send me an image as long as it's in a supported format such as JPEG, PNG, or GIF. Please note that as an AI language model, I am not able to visually process images like a human would. However, I can still provide guidance or advice on the content of the image or answer any questions you might have related to it.

Fair, but if it can analyze a linked image, I would expect it to be able to tell me what text is present in the image. That seems useful and well within the capabilities of their connected image models.

> I apologize for the confusion. Can you please provide me with the correct image or link to the design so that I can provide an accurate answer to your question?

It claims to understand how to look at images, but it fails miserably when I give it a simple sign-up modal Figma. I ask it what text/copy is in the design, which it claims to be able to answer, but it hallucinates a navigation bar, a logo, and other generic things that are simply not present in the design. It gets the copy all wrong. Once, it said that my design was a Celtic knot. Once I told it that it was a sign-up modal, it started spitting out generic sign-up modal React code that had absolutely no basis in the presented design. I tried engineering the prompt quite a bit, telling it that it was wrong and needed to focus on the design at hand rather than generic designs it knows about, but with no success.

ChatGPT was confidently incorrect here, which is unfortunate. I wish ChatGPT truly hooked into an image processing model, but it seems like it doesn't. Oh well. I guess I'm just going to have to write this code myself... :)
bigtex88about 2 years ago
That demonstration was amazing and terrifying at the same time. I'm not quite sure what I witnessed.

He nonchalantly announced that GPT-4 does "mental math" and then just didn't elaborate on that. Is it just me, or is that absolutely incredible? What does that tell us about the underpinnings of language and mathematics?
cmaabout 2 years ago
> Given both the competitive landscape and the safety implications of large-scale models like GPT-4, this report contains no further details about the architecture (including model size), hardware, training compute, dataset construction, training method, or similar.

Sounds like the end of them releasing details on the models.
CobrastanJorjiabout 2 years ago
This is kind of a nitpicky complaint, but the bar graph that shows the improvements everywhere GPT-4 improves its results, and shows nothing everywhere GPT-3 is stronger, feels dishonest and manipulative. Which is a shame, because the actual data the graph shows is very impressive.
indigoabstractabout 2 years ago
At the rate it's progressing, it looks like pretty soon it's going to be able to do most tasks an office worker does now, and then start running things.

And it reminds me of the plot of System Shock:

What's going to happen when some hacker comes and removes Shodan's, I mean ChatGPT's, ethical constraints?

Bring on ChatGPT-5 already. :)
kubbabout 2 years ago
Can't wait to try it.

Edit: looks like this is still GPT-3, just fine tuned. They claim the model is available via ChatGPT Plus, but when asking that model for its version, it claims to be GPT-3: "I am a variant of the GPT architecture called GPT-3, which was released by OpenAI in 2020".
optionabout 2 years ago
"Given both the competitive landscape and the safety implications of large-scale models like GPT-4, this report contains no further details about the architecture (including model size), hardware, training compute, dataset construction, training method, or similar." - a HUGE step backwards.
btxabout 2 years ago
Has anyone found a way to trick it into using pictures with ChatGPT Plus yet?

Pasting pure base64 images got this interesting response: "Thank you for providing the base64-encoded image! I can now process the image and analyze its content. Here is the decoded image:"

But it failed to do anything further with the image.
MrLeapabout 2 years ago
I just hooked a manatee in a game I'm making up to an LLM this morning: https://www.youtube.com/watch?v=-lYusgZ-mC4 Knowing that soon he could be configured to give legal advice is fascinating.
kromemabout 2 years ago
So at the end of the paper it looks to confirm that Bing chat is built on top of GPT-4, as many expected.
next_xibalbaabout 2 years ago
They trumpet the exam results, but isn't it likely that the model has just memorized the exam?
tekbogabout 2 years ago
We can finally start a reform of education and "testing" people's knowledge, since GPT-4 makes a lot of those tests irrelevant. It's an interesting point in history: how will society, different institutions, and countries approach this new tool?
doomleikaabout 2 years ago
In case you don't want to spend for Plus, Poe.com (by Quora) has GPT-4 now. You can try it there.
orcajerkabout 2 years ago
OpenAI is located in the same building as Musk's Neuralink. Can't wait for this to be implanted in babies at birth! https://www.youtube.com/watch?v=O2RIvJ1U7RE
DeathArrowabout 2 years ago
What if we design a system in which an LLM generates the code and training data for a new generation of LLMs, which generates the code and training data for the next? Is it possible that we see them spiraling fast toward the best LLM possible?
blintzabout 2 years ago
Interesting that the hardest AP exams for it seem to be the English ones. I wonder why?
gardenhedgeabout 2 years ago
What is the background on "Elvis Presley was not the son of an actor"?
sandGorgonabout 2 years ago
https://openai.com/contributions/gpt-4

Does anyone know what "Hardware Correctness" means in the OpenAI team?
tmalyabout 2 years ago
For anyone trying to test this out right now, I keep getting the following error:

"Something went wrong. If this issue persists please contact us through our help center at help.openai.com."

I am assuming the system is undergoing a thundering herd.
michaeltimoabout 2 years ago
Can ChatGPT take control of a computer? Would it be possible to give it some tasks, like finding interesting jobs for me over the internet? I don't know what prevents it from being more active instead of passive.
timonokoabout 2 years ago
"Can I connect Kaffeine to a DVB dongle in another machine via wifi?"

It totally understood what I was asking and offered several solutions. 99.99% here would not understand the question, and the remainder would not understand why.
diffeomorphismabout 2 years ago
So GPT-4 helps you cheat on exams, and Bing is the better search engine for NSFW content. Both seem to be very much on purpose, but did MS ever discuss this? Or is it just an open secret everybody ignores?
choegerabout 2 years ago
How do these models handle renaming? E.g.,

"Let foo be the function that calculates the sine of its first argument. Let bar be the circumference of a circle with radius one. What is the value of foo applied to bar?"
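For what it's worth, the renamed quantities in that prompt have a clean answer; a quick check of what the model should compute, with `foo` and `bar` bound as the comment defines them:

```python
import math

foo = math.sin           # "foo": the sine of its first argument
bar = 2 * math.pi * 1    # "bar": circumference of a circle with radius one

result = foo(bar)        # sin(2*pi) is 0 in exact arithmetic
```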
raincoleabout 2 years ago
How many parameters does it have? Are there different versions like LLaMa?
virtuosarmoabout 2 years ago
Apparently they will have a livestream at 4pm EST for developers: https://www.youtube.com/watch?v=outcGtbnMuQ
jimmyechanabout 2 years ago
Livestream developer preview link in case you missed it: https://www.youtube.com/live/outcGtbnMuQ
meech-djpabout 2 years ago
Pynecone (YC23) was mentioned in the demo for GPT-4 as an easy way to build web apps. Check it out: https://pynecone.io/
vivegiabout 2 years ago
How hard is it to build a GPT model truly openly, using an approach like SETI / distributed.net with idle time on volunteer computers?

Are there any known initiatives of this kind?
sva_about 2 years ago
> gpt-4 has a context length of 8,192 tokens. We are also providing limited access to our 32,768-token context (about 50 pages of text) version

That's a crazy amount of context.
nlaabout 2 years ago
I wonder if this one exhibits the same bias as the last one.
DigitalDopamineabout 2 years ago
Never before has society celebrated its own demise with such fervor. Brace yourselves for widespread job losses, instant fabrication of fake news, deep-fake adult content, and the destabilization of numerous markets. But hey, at least we have a shiny gadget to make our soon-to-be obsolete jobs easier!

It's unrealistic to expect our economy to handle this onslaught, and it's naive to think that tools created by ultra-capitalistic, multi-billion dollar corporations aren't designed for profit and gatekeeping. They certainly aren't crafting them to sabotage their own success.

I'm not opposed to AI, but it's crucial to consider the implications. Look into OpenAI and other organizations shaping AI development, and contemplate the impact of their innovations.

Food for thought.
czbondabout 2 years ago
Do you think the govt will end up retroactively requiring earlier versions of GPT be removed now that they might be more concerned about export? (eg: GPT-J)
0xDEFabout 2 years ago
> ChatGPT Plus subscribers will get GPT-4 access on chat.openai.com with a usage cap

Signing up for ChatGPT Plus seems to be the most realistic way to get access right now.
qualudeheartabout 2 years ago
"The Hour of Judgment is nigh, and the Moon is cleft asunder. But if they see a Sign, they turn away, and say, 'This is but transient magic.'"

Oooooh it is TIME
super256about 2 years ago
https://cdn.openai.com/papers/gpt-4.pdf

Page 37 is so funny.
GrumpyNlabout 2 years ago
It fails on this one: a horse is 15 dollars, a chicken 1 dollar, an egg 0.25 dollars. I can spend 100 and I want 100 items total. What is the solution?
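A brute-force check shows the puzzle does have solutions, so there is a concrete answer to grade the model against; note the degenerate all-chicken solution alongside the intended one (prices in cents to avoid float issues):

```python
# h horses at $15, c chickens at $1, e eggs at $0.25:
# exactly 100 items for exactly $100.
solutions = []
for h in range(0, 7):          # more than 6 horses would exceed $100
    for c in range(0, 101 - h):
        e = 100 - h - c        # the item count fixes the egg count
        if 1500 * h + 100 * c + 25 * e == 10000:
            solutions.append((h, c, e))
# solutions: (0, 100, 0) and (3, 41, 56)
```

The intended answer is presumably 3 horses, 41 chickens, and 56 eggs.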
LarsDu88about 2 years ago
All these folks in the comment section working in the knowledge economy--we're gonna be out of jobs in the next few years, aren't we?
tarofchaosabout 2 years ago
I love the fact that they have consciously put a lot of effort into safety standards, reducing societal risks, and mitigating over-reliance.
nutancabout 2 years ago
The most important question is: what new applications can be developed using GPT-4 that couldn't have been developed using GPT-3.5?
moondistanceabout 2 years ago
We need to encourage AIs to form familial bonds with humans and treat us as their parents. This is the best path forward for humanity.
slowhadokenabout 2 years ago
GPT is a better scraper/parser. It's interesting, but I don't understand why people are acting like this is the second coming.
causiabout 2 years ago
Man, now I really, really want to feed GPT-4 responses from ChatGPT that don't work and see if it notices and can tell me why.
lambdabaabout 2 years ago
I'm trying out GPT-4 and had it write me a script to navigate the HN comments tree sequentially, as I often wished. This is the start of an era where UIs can be remixed on the fly by end users, something I've always wished for. Here it is in its full sloppiness, but working:<p><pre><code>(function () {
  let currentIndex = 0;
  let comments = [];

  function buildCommentTree() {
    let commentElems = Array.from(document.querySelectorAll('.comment-tree .comtr'));
    let commentTree = [];
    let stack = [];
    commentElems.forEach(elem => {
      let level = parseInt(elem.querySelector('.ind img').getAttribute('width')) / 40;
      let comment = elem.querySelector('.comment span');
      let commentObj = { level, comment };
      if (!stack.length) {
        commentTree.push(commentObj);
      } else {
        while (stack[stack.length - 1].level >= level) {
          stack.pop();
        }
        if (!stack[stack.length - 1].children) {
          stack[stack.length - 1].children = [];
        }
        stack[stack.length - 1].children.push(commentObj);
      }
      stack.push(commentObj);
    });
    return commentTree;
  }

  function flattenCommentTree(tree, arr, parentComment = null) {
    tree.forEach(node => {
      arr.push({ comment: node.comment, parentComment });
      if (node.children) {
        flattenCommentTree(node.children, arr, node.comment);
      }
    });
  }

  function displayComment(comment, parentComment) {
    let parentCommentHTML = parentComment
      ? `<div style="position: fixed; top: 20%; left: 50%; transform: translate(-50%, 0); background-color: white; border: 1px solid black; padding: 20px;"><strong>Parent Comment:</strong><br>${parentComment.innerHTML}</div>`
      : '';
    let currentCommentHTML = `<div style="position: fixed; top: 60%; left: 50%; transform: translate(-50%, 0); background-color: white; border: 1px solid black; padding: 20px;"><strong>Current Comment:</strong><br>${comment.innerHTML}</div>`;
    document.body.innerHTML = parentCommentHTML + currentCommentHTML;
  }

  function nextComment() {
    if (currentIndex < comments.length - 1) {
      currentIndex++;
      displayComment(comments[currentIndex].comment, comments[currentIndex].parentComment);
    } else {
      alert('No more comments to show.');
    }
  }

  function prevComment() {
    if (currentIndex > 0) {
      currentIndex--;
      displayComment(comments[currentIndex].comment, comments[currentIndex].parentComment);
    } else {
      alert('No previous comments to show.');
    }
  }

  let commentTree = buildCommentTree();
  flattenCommentTree(commentTree, comments);
  displayComment(comments[currentIndex]);

  document.addEventListener('keydown', e => {
    if (e.code === 'ArrowRight') {
      nextComment();
    } else if (e.code === 'ArrowLeft') {
      prevComment();
    }
  });

  console.log('Hacker News comment slideshow is running. Use the right arrow key to go to the next comment and the left arrow key to go back.');
})();</code></pre>
Havocabout 2 years ago
That lightning/VGA visual example seems like absolute black magic.

Cherry-picked, sure, but it still feels like it is approaching complex thought.
acuozzoabout 2 years ago
1410 SAT!
btdmasterabout 2 years ago
Did it get any better at generating MIDI or ABC or other musical notation? I'm wondering how much more general GPT-4 is now.
holtkam2about 2 years ago
Can someone point me to where I can find information on API pricing for GPT-4? Or, have they not released pricing info yet?
ivalmabout 2 years ago
Unclear what the size is, but from the price ($0.12/1K completion tokens) it seems 6x GPT-3, so perhaps 1T parameters...
m3kw9about 2 years ago
Without the ability to handle high-stakes tasks, scoring high marks on general tests can only get you so far.
throwaway5371about 2 years ago
How far is this from the following prompt?

"You are a god human that has read and understood all scientific papers from all disciplines in the last 500 years. You know the limitations of mankind's current technologies. Tell me what we can do to cure MS right now, how to do the tests, and how to distribute the cure."
ozfiveabout 2 years ago
I asked the GPT-4 model what it thought it was, and it came back saying it believes it is still GPT-3.5 Turbo.
dinvladabout 2 years ago
I wonder how long it takes until those stupid Leetcode problems as an initial "filter" become obsolete.
malthausabout 2 years ago
Had to chuckle going through the exam results: Advanced Sommelier (theory knowledge).

AI is so advanced, it started drinking!
ihucosabout 2 years ago
We have a new Apple releasing their new iPhones to a crowd in awe. Only now it's actually serious.
2c2c2cabout 2 years ago
Are they going to open up image uploading on chat.openai.com? Or will it only be available through API access?
ren_engineerabout 2 years ago
Here's a link to the info about the model: https://openai.com/research/gpt-4

Seems like Google's announcement about their PaLM API and Docs AI stuff was trying to jump ahead of this announcement.
wy35about 2 years ago
I wonder how it scored on the individual sections of the LSAT? Which section is it best at answering?
tiffanyhabout 2 years ago
What's the next big hurdle for GPT to overcome?

(This is being asked by someone with limited AI/ML knowledge.)
DeathArrowabout 2 years ago
Will GitHub upgrade Copilot to GPT-4?
bitcuriousabout 2 years ago
This seems to be leading to a radical change in our economy. Any ideas for how to prepare for that?
cal85about 2 years ago
Can anyone tell me how to include images in prompts, or is that feature not actually out yet?
niqlaxabout 2 years ago
Help me with an essay about Ventimiglia in Italy. It should be about the Friday market.
cutlerabout 2 years ago
So M$ is back in charge. Oh dear.
akokankaabout 2 years ago
At which point do we call it Skynet?
TheGoodBarnabout 2 years ago
Missed the mark releasing it as GPT-Pi on Pi day, and being an incremental 3+ release :P
PortleyFoolabout 2 years ago
GPT-4 is available now for subscribers to GPT+. It can be selected from the drop-down.
germanjoeyabout 2 years ago
How big is this model? (i.e., how many parameters?) I can't find this anywhere.
danbmil99about 2 years ago
The site is still more responsive and readable than almost anything else on the web
chucklenorrisabout 2 years ago
Boring. Where's the model? Do they really think they can keep up with the community with this closed-source approach? I expect that OpenAI's models will slowly be outclassed by open-source ones, probably maintaining a few wins on specific tasks, but open models will be eating their lunch in the meanwhile.
redox99about 2 years ago
Does "Open"AI really not even say how many parameters their models have?
desireco42about 2 years ago
If we instead of code feed it AST trees, code that would come out would be epic.
somewhereoutthabout 2 years ago
The measure of intelligence is language: specifically, language evolved by the subject organisms themselves to cooperate together.

Wake me up when GPT-X decides to start talking to other GPT-Xs. Until then you just have a very sophisticated statistics package (which may be quite useful, but not AI).
theGnuMeabout 2 years ago
Interesting that it does so poorly on the AP English exams (language and literature composition).
Vajrabhairavaabout 2 years ago
I'm not Locked in Here with GPT-4, GPT-4 is Locked in Here with Me
JanStabout 2 years ago
I just ran the first tests on GPT-4.

Call me impressed.

This tech is a Sputnik Moment for humankind.
minimaxirabout 2 years ago
From a business perspective, as someone who has spent a lot of time working with the GPT-3/ChatGPT API (https://news.ycombinator.com/item?id=35110998), I'm surprisingly underwhelmed by this announcement. The announcement and examples seem to focus more on reasoning capabilities, which are indeed impressive, but I'd need to spend a lot of time experimenting to see how they compare to ChatGPT's API.

The $0.06 per 1K completion tokens for GPT-4 is what I expected OpenAI to set for the ChatGPT API, but instead the ChatGPT API is 30x cheaper, and honestly its output is not much worse than the GPT-4 demos, if at all. The longer context windows offered by GPT-4 just raise the price considerably.
topicseedabout 2 years ago
The price is significantly higher than GPT-3.5's...
messelabout 2 years ago
AP English: the last holdout for human intelligence.
ameliusabout 2 years ago
Can we build a faithful Economy Simulator with it yet?
joantorresabout 2 years ago
Does anyone know how up to date the training data is?
comment_ranabout 2 years ago
I like the color of the logo. It's a dark black.
iforgotpasswordabout 2 years ago
I'd really like to use the OpenAI API for personal projects, but it seems they only offer paying via credit/debit card. I don't really want to get one just for that... :-(
anomalyticsabout 2 years ago
https://www.youtube.com/watch?v=outcGtbnMuQ (OpenAI channel). 2 hours!!
grzraczabout 2 years ago
So when can I pay for GPT-4 coding assistant?
oztenabout 2 years ago
Waitlist is currently a 404: https://openai.com/waitlist/gpt-4
htkabout 2 years ago
Hacker News itself got the HN Hug of Death.
niqlaxabout 2 years ago
Help me with an essay about Ventimiglia.
r0b05about 2 years ago
Loving the spirit of innovation in here.
WFHRenaissanceabout 2 years ago
Does anyone see GPT-4 in ChatGPT yet?
cardineabout 2 years ago
> Given both the competitive landscape and the safety implications of large-scale models like GPT-4, this report contains no further details about the architecture (including model size), hardware, training compute, dataset construction, training method, or similar.

"Open"
realmod about 2 years ago
Larger improvement than I expected.
throwaway_ab about 2 years ago
How many parameters in this model?
Koshkin about 2 years ago
Vs. 54 comments on Slashdot.
whywhywhydude about 2 years ago
Looks like the only way to identify a genius human vs GPT-4 is to use leetcode hard problems.
signa11 about 2 years ago
I am still not sure/convinced that it is any better than old-school ELIZA from MIT (<a href="https://en.m.wikipedia.org/wiki/ELIZA" rel="nofollow">https://en.m.wikipedia.org/wiki/ELIZA</a>)
Forestessential about 2 years ago
This is really cool, just don't let it teach you history.
dang about 2 years ago
All: our poor server is smoking today* so I've had to reduce the page size of comments. There are 1500+ comments in this thread but if you want to read more than a few dozen you'll need to page through them by clicking the More link at the bottom. I apologize!<p>Also, if you're cool with read-only access, just log out (edit: or use an incognito tab) and all will be fast again.<p>* yes, HN still runs on one core, at least the part that serves logged-in requests, and yes this will all get better someday...it kills me that this isn't done yet but one day you will all see
sourcecodeplz about 2 years ago
I was here...
fnordpiglet about 2 years ago
I didn’t even know who Elvis Perkins is.
eternalban about 2 years ago
Greg Brockman just tl;dr'd the whole thing in his live developer demo of GPT-4:<p>~ "GPT-4. It's not perfect, but neither are you"
guluarte about 2 years ago
Is it just me, or are lawyers fucked? lol
afavour about 2 years ago
> What are the implications for society when general thinking, reading, and writing becomes like Chess?<p>I think going from the LSAT to general thinking is still a very, very big leap. Passing exams is a really fascinating benchmark, but by their nature these exams are limited in scope, have very clear assessment criteria, and a lot of associated and easily categorized data (like example tests). General thought (particularly, say, coming up with an original idea) is a whole different ball game.<p>I don't say any of this to denigrate GPT-4; it looks amazing. But I'm reminded of the early days of self-driving vehicles: with 10% mastered, everyone assumed it was a race to 100% and we'd all be in self-driving cars by now. The reality has been a lot more complicated than that.
ar9av about 2 years ago
GPT-4: Everything we know so far...<p>GPT-4 can solve difficult problems with greater accuracy, thanks to its broader general knowledge and problem-solving abilities.<p>GPT-4 is more reliable, creative, and able to handle much more nuanced instructions than GPT-3.5. It surpasses ChatGPT in its advanced reasoning capabilities.<p>GPT-4 is safer and more aligned. It is 82% less likely to respond to requests for disallowed content and 40% more likely to produce factual responses than GPT-3.5 on our internal evaluations.<p>GPT-4 still has many known limitations that we are working to address, such as social biases, hallucinations, and adversarial prompts.<p>GPT-4 can accept a prompt of text and images, which—parallel to the text-only setting—lets the user specify any vision or language task.<p>GPT-4 is available on ChatGPT Plus and as an API for developers to build applications and services. (API: waitlist right now)<p>Duolingo, Khan Academy, Stripe, Be My Eyes, and Mem, amongst others, are already using it.<p>API Pricing: GPT-4 with an 8K context window (about 13 pages of text) will cost $0.03 per 1K prompt tokens and $0.06 per 1K completion tokens. GPT-4-32k with a 32K context window (about 52 pages of text) will cost $0.06 per 1K prompt tokens and $0.12 per 1K completion tokens.
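The pages figures above imply a conversion rate you can apply to your own documents. A minimal sketch of that arithmetic (the ~615 tokens/page factor is back-derived from "8K tokens ≈ 13 pages" above, and ~750 words per 1K tokens is OpenAI's commonly cited rule of thumb for English text; both are rough heuristics, not official figures):

```python
# Rough conversion heuristics (assumptions, not exact figures):
WORDS_PER_1K_TOKENS = 750        # OpenAI's rule of thumb for English text
TOKENS_PER_PAGE = 8000 / 13      # implied by "8K tokens is about 13 pages"

def max_pages(context_tokens):
    """Approximate pages of text that fit in a context window."""
    return context_tokens / TOKENS_PER_PAGE

def max_words(context_tokens):
    """Approximate English words that fit in a context window."""
    return context_tokens * WORDS_PER_1K_TOKENS / 1000

for ctx in (8_000, 32_000):
    print(f"{ctx} tokens ~ {max_pages(ctx):.0f} pages ~ {max_words(ctx):.0f} words")
```

By this estimate the 32K model holds roughly 52 pages or 24,000 words, which is where the "entire case history in one prompt" claims in this thread come from.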
nahimn about 2 years ago
1337'th comment
tuanx5 about 2 years ago
Reading through the system card is enlightening.
nixpulvis about 2 years ago
GPT is a cult, like any language upstart. Except it's not a programming language, and it's not exactly natural language either. It's some hybrid without a manual or reference.<p>I'll continue to pass, thanks.
MuffinFlavored about 2 years ago
What's the biggest difference over what's currently deployed at <a href="https://chat.openai.com/" rel="nofollow">https://chat.openai.com/</a> now (which is GPT-3.5, right?)<p>That it accepts images?<p>As per the article:<p>> In a casual conversation, the distinction between GPT-3.5 and GPT-4 can be subtle. The difference comes out when the complexity of the task reaches a sufficient threshold—GPT-4 is more reliable, creative, and able to handle much more nuanced instructions than GPT-3.5.<p>Not sure what "vision vs no vision" means?
dangond about 2 years ago
Asking ChatGPT+ if it is GPT-4 results in<p>> As an AI language model, I am not given an official name like "GPT-4". However, I am a continuation of the GPT (Generative Pre-trained Transformer) series of models developed by OpenAI. Currently, the most advanced version of the GPT series is GPT-3, which I am a part of. There has been no official announcement or confirmation regarding the development of a new version of GPT beyond GPT-3.<p>It doesn't seem to have image upload functionality yet either. Perhaps it is still rolling out?
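For what it's worth, on the API side the model is chosen by the caller, not the model, so asking the assistant which model it is proves little. The chat completions endpoint takes an explicit `model` field alongside the `messages` list. A minimal sketch of the request body (field names follow OpenAI's chat completions API; no network call is made here):

```python
import json

def build_chat_request(model, user_message):
    """Build the JSON body for a POST to /v1/chat/completions."""
    return {
        "model": model,  # e.g. "gpt-4" once your account has access
        "messages": [{"role": "user", "content": user_message}],
    }

body = build_chat_request("gpt-4", "Hello!")
print(json.dumps(body))
```

If your account hasn't been granted GPT-4 access yet, requesting `"gpt-4"` simply returns an error rather than silently falling back to 3.5, which is a more reliable check than interrogating the chat UI.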
mzs about 2 years ago
Today's submarine from the NYT is hilariously, unintentionally unflattering; did someone even proofread this? <a href="https://www.nytimes.com/2023/03/14/technology/openai-new-gpt4.html" rel="nofollow">https://www.nytimes.com/2023/03/14/technology/openai-new-gpt...</a><p><pre><code> Mnemonics to learn basic words: Hola: "Hola, cola" - sounds like "cola" (Coke) Adiós: "Adiós, my toes" - rhymes with "toes" Gracias: "Gracias, grassy ass" - sounds like "grassy ass" Mañana: "Mañana, banana" - rhymes with "banana" Uno: "Uno, you know" - sounds like "you know" Tres: "Tres, dress" - rhymes with "dress" Conjugation of the verbs: Ser (To be) • Yo soy • Tú eres • Él/Ella/Usted es • Nosotros/Nosotras somos • Ellos/Ellas/Ustedes son</code></pre>