TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

OpenAI Spring Update [video]

56 点作者 georgehill大约 1 年前

10 条评论

dudus大约 1 年前
Anyone else find it a bit odd how they quickly set it up a day before Google I&#x2F;O?<p>They clearly want to steal the thunder from Google&#x27;s announcements. But they could only do that if they had inside information into what Google is launching. Otherwise it would be smarter to wait and react. Remember they already said this is not a search engine or gpt-5. So while this is probably cool it&#x27;s not a silver bullet and probably won&#x27;t overshadow what Google launches.<p>Unless it&#x27;s almost exactly the same main announcement. In which case a day would make a difference.<p>Chatter seem to suggest they&#x27;re announcing a voice to voice assistant, apple partnership and Google drive integration. All things Google is also rumoured to announce.<p>It definitely feels like the big guys are panicking to find their moat.
评论 #40345299 未加载
评论 #40345683 未加载
nomilk大约 1 年前
I wasn’t prepared to be this impressed.<p>Voice to voice chatGPT. You can interrupt it any time (don’t have to wait for it to finish), it’s real time (no 2-3 second lag), and it picks up on emotion (!).
modeless大约 1 年前
<a href="https:&#x2F;&#x2F;cdn.openai.com&#x2F;hello-gpt-4o&#x2F;coins-01.jpg" rel="nofollow">https:&#x2F;&#x2F;cdn.openai.com&#x2F;hello-gpt-4o&#x2F;coins-01.jpg</a><p>It&#x27;s GPT-4o, the o stands for &quot;omni&quot;, which I&#x27;m guessing means multimodal input <i>and output</i>. Rumors of an end-to-end audio-to-audio real time voice demo, that&#x27;s what I&#x27;d really like to see.<p>Source: <a href="https:&#x2F;&#x2F;x.com&#x2F;btibor91&#x2F;status&#x2F;1790053718416605335" rel="nofollow">https:&#x2F;&#x2F;x.com&#x2F;btibor91&#x2F;status&#x2F;1790053718416605335</a>
mike_hearn大约 1 年前
I wonder what the justification is for buying ChatGPT Pro now they make GPT-4 available for free.<p>I also wonder if they got a Scarlett Johansson impersonator to do the voice. It sounds eerily like the AI in Her, except for the still present problem with voices dropping out or glitching.<p>Nonetheless the real-time interaction is extremely impressive. So is the desktop app. Their optimization work must have been incredible to both speed things up and free up enough GPU capacity to make it freely available.
评论 #40345864 未加载
评论 #40345841 未加载
pants2大约 1 年前
The different voices were incredible!<p>I&#x27;m dreaming of the possibilities of an audio-to-audio model for any sort of sound. I want to be able to do things like: &quot;What&#x27;s the tune I&#x27;m humming?&quot; &quot;Why is my car making this noise?&quot; &quot;Can you separate the speech from the instruments in this clip?&quot; &quot;Can you make the sound of a steel drum in a hailstorm?&quot;
pants2大约 1 年前
I really appreciate that the demos were shown live, mistakes and all. This is in stark contrast to Google&#x27;s Gemini demos that were heavily edited and cherrypicked.
OutOfHere大约 1 年前
I am observing an extremely high rate of hallucinations with gpt-4o (gpt-4o-2024-05-13) as tested via the API. I advise extreme caution with it. In contrast, I see no such concern with gpt-4-turbo-preview (gpt-4-0125-preview).
sebastiennight大约 1 年前
Anyone watching the OpenAI livestream: did they &quot;paste&quot; the code after hitting CTRL+C ? Or did the desktop app just read the clipboard?<p>Edit: I&#x27;m asking because of the obvious data security implications of having your desktop app read from the clipboard _in the live demo_... That would definitely put a damper to my fanboyish enthusiasm about that desktop app.
评论 #40345708 未加载
39896880大约 1 年前
Spike Jonze should be flattered. The conversation mode is a straight copy of the OS in Her.
neverokay大约 1 年前
Sam Altman seems like a world class douche, is anyone else getting this vibe?<p>I need to know if my spidey senses are on point.
评论 #40345679 未加载
评论 #40345217 未加载