Google Gemini Pro API Available Through AI Studio

185 pointsby sam1234apterover 1 year ago

30 comments

I used this just-released API (of Gemini Pro) with multimodal input to test some of the things from the infamous Gemini Demo. You can see here [ <a href="https://www.youtube.com/watch?v=__nL7Vc0OCg" rel="nofollow noreferrer">https://www.youtube.com/watch?v=__nL7Vc0OCg</a> ] my GPT-4 recreation of that ad which went viral.Gemini Pro is... not great. In one test, I asked what gesture I was making (while showing a thumbs up) -- it said thumbs down and "The image is a commentary on the changing nature of truth".I just just made a heads-to-heads comparison -- you can watch it here: <a href="https://www.youtube.com/watch?v=1RrkRA7wuoE" rel="nofollow noreferrer">https://www.youtube.com/watch?v=1RrkRA7wuoE</a>Code is here: <a href="https://github.com/gregsadetsky/sagittarius">https://github.com/gregsadetsky/sagittarius</a>

评论 #38632694 未加载

chamodaover 1 year ago

Free quota looks reasonable with 60 queries per minute. On the other hand data from free quota requests will be used to improve the product.<a href="https://ai.google.dev/pricing" rel="nofollow noreferrer">https://ai.google.dev/pricing</a>

评论 #38629004 未加载

评论 #38631576 未加载

评论 #38631304 未加载

isalmonover 1 year ago

I know it's just an anecdote, but my biggest problem with Google's Bard/Gemini is that the moment I tried to ask a question about something - I started getting ads all over the internet and social media related to that.Doing this with ChatGPT 4.0 for months and months did not cause this type of behavior.

评论 #38631314 未加载

pesfandiarover 1 year ago

I like that they have a "blog post creator"[1] in their examples. There's no hope for the future of the web when the self-proclaimed stewards of its quality encourage AI spam.[1] <a href="https://makersuite.google.com/app/prompts/blog-post-creator" rel="nofollow noreferrer">https://makersuite.google.com/app/prompts/blog-post-creator</a>

评论 #38631467 未加载

dudusover 1 year ago

This is still not Gemini Ultra. That's the one they said was above state of the art. Still waiting for that one.

sam1234apterover 1 year ago

Developers can start building with our first version of Gemini Pro through Google AI Studio at ai.google.devDevelopers have a free quota and access to a full range of features including function calling, embeddings, semantic retrieval, custom knowledge grounding, chat functionality and more. It supports 38 languages across 180+ countries.

georgehillover 1 year ago

> Access restricted You do not have permission to view this page.Wait only in the US?Edit: I can access it through the Google Cloud Console.<a href="https://imgur.com/a/NXAgvFb" rel="nofollow noreferrer">https://imgur.com/a/NXAgvFb</a>

ianbickingover 1 year ago

Some thoughts comparing this to the GPT API (from a thread: <a href="https://hachyderm.io/@ianbicking/111574983914336748" rel="nofollow noreferrer">https://hachyderm.io/@ianbicking/111574983914336748</a>):It looks like a fairly easy swap-in for GPT. "messages" becomes "content". Some of the configuration parameters are slightly different (topP/etc), but I have never put in the effort to understand the practical effect of those so I never tweak their values.The messages themselves are a list of "parts", which allows mixed media messages. This feels a little cleaner than how GPT has handled messages being extended.Instead of role: "assistant" they use role: "model". There's no role: "system" – presumably you just shove everything into user messages. You can also leave off the role... and I assume that means default to "user" but it's not clear if it's 100% equivalent...?There's a bunch of moderation parameters, which seems like a good idea. OpenAI has a moderation endpoint you can use to preflight check your input, but doing it all at once makes more sense. There's four categories and you can adjust your sensitivity to each (and turn off blocking at entirely). The sensitivity is not about how extreme the violation is, but how likely it is a violation. So it's not like a G/PG/PG-13/etc rating. Just a question of how many false positives/negatives you want.There's functions, though they are in beta (whatever that means): <a href="https://ai.google.dev/docs/function_calling" rel="nofollow noreferrer">https://ai.google.dev/docs/function_calling</a> – they look very very similar to GPT functions. They don't have the "JSON response" that GPT has, but that seems mostly redundant with functions anyway.I have no idea how well prompts translate, but it feels like the API is an easy translation. And importantly everything is semantically equivalent, you don't have to make one pretend it is the other, like turning a completion API into a chat API.Given the generous free tier I feel fairly motivated to swap in Gemini and try to ship experiments that I've sat on until now.

brrrrrmover 1 year ago

why on earth did they design the Node.js and Web APIs to be slightly different and incompatible? (edit: this might just be a bug/oversight on the landing page?)Node.js:<pre><code> const model = genAI.getGenerativeModel({ model: "gemini-pro-vision"}); const result = model.generateContent({ contents: [{parts: [ {text: "What’s in this photo?"}, {inlineData: {data: imgBase64, mimeType: 'image/png'}} ] }] }) </code></pre> Web:<pre><code> const model = genAI.getGenerativeModel({ model: "gemini-pro-vision"}); const result = await model.generateContent([ "What’s in this photo?", {inlineData: {data: imgDataInBase64, mimeType: 'image/png'}} ]);</code></pre>

评论 #38631604 未加载

评论 #38631221 未加载

lovasoaover 1 year ago

You can make 1 query per second to it for free, including large queries that contain images ? This is crazy !I will happily let google buy me for that price.<a href="https://ai.google.dev/pricing" rel="nofollow noreferrer">https://ai.google.dev/pricing</a>

thedanglerover 1 year ago

Wow. what a crap site. I clicked on the option for prompt thinking I could go back and request an API key. Boy was I wrong. No matter what I do it takes me to the prompt console where I get Access Denied and it hijacked my back button.

legendofbrandoover 1 year ago

When I try to create an API key it says that "We are sorry, but you do not have access to Early Access Apps" yet my domain does allow access to early access apps....

AlmostSchurLieover 1 year ago

When I try to create an API key, all I see is "an internal error occured". Still waiting for Gemini Ultra though.

vibhajaimanover 1 year ago

Make a questionnaire senior secondary school students and mobile phone impact reply

vibhajaimanover 1 year ago

Make a questionnaire senior secondary school students and mobile phone impact

andre-zover 1 year ago

See how to use new Gemini Embeddings with Qdrant Vector Database <a href="https://qdrant.tech/documentation/embeddings/gemini/" rel="nofollow noreferrer">https://qdrant.tech/documentation/embeddings/gemini/</a>

SubiculumCodeover 1 year ago

I'd like to see this benchmarked on humaneval for coding.

roschdalover 1 year ago

How can I use Google Gemini in a Java application?

评论 #38636537 未加载

prakhar897over 1 year ago

Can someone recreate the Google Demo of gemini?

ziga9over 1 year ago

Anyone else having access restricted problem?

zlg_codesover 1 year ago

I'd like to know why the name of this AI product coincides with the alternative in-between-HTTP-and-Gopher Gemini protocol.I'm sure it's just an accident.

评论 #38634818 未加载

tanyongshengover 1 year ago

The pricing is attractive.

replwoacauseover 1 year ago

This news doesn’t excite me at all after trying Bard Gemini Pro in the browser.

fotcornover 1 year ago

I can only access <a href="https://makersuite.google.com/" rel="nofollow noreferrer">https://makersuite.google.com/</a> when using a VPN to the US. Also, it spams popups that get blocked by Firefox.Some basic prompts, which are answered correctly most of the time by ChatGPT4:There are 31 books in my house. I read 2 books over the weekend. How many books are still in my house?> 29 booksJulia has three brothers, each of them has two sisters. How many sisters does Julia have?> ThreeIf you place an orange below a plate in the living room, and then move the plate to the kitchen, where is the orange now?> Under the plate in the kitchen.So, not great.

评论 #38629861 未加载

评论 #38629973 未加载

评论 #38629376 未加载

评论 #38630230 未加载

评论 #38629415 未加载

评论 #38630394 未加载

评论 #38629705 未加载

评论 #38629643 未加载

评论 #38633874 未加载

评论 #38631196 未加载

评论 #38630872 未加载

imdsmover 1 year ago

Typical Google UX.Get API key, takes me to makersuite, where I get a create API key button that errors. Then when I reload the page, I get a straight forbidden page.HP said it best, you have to isolate the team from the bigger company to allow them to work as an effective startup. How can solo-preneurs provide better UX & onboarding while doing 16 other jobs than Google can with multi-billion dollar budgets?

评论 #38629406 未加载

评论 #38629846 未加载

评论 #38630233 未加载

评论 #38630224 未加载

评论 #38630031 未加载

评论 #38630603 未加载

评论 #38629311 未加载

评论 #38630881 未加载

verdvermover 1 year ago

Cross posting some links from another post that HNers found helpful- <a href="https://cloud.google.com/vertex-ai" rel="nofollow noreferrer">https://cloud.google.com/vertex-ai</a> (marketing page)- <a href="https://cloud.google.com/vertex-ai/docs" rel="nofollow noreferrer">https://cloud.google.com/vertex-ai/docs</a> (docs entry point)- <a href="https://console.cloud.google.com/vertex-ai" rel="nofollow noreferrer">https://console.cloud.google.com/vertex-ai</a> (cloud console)- <a href="https://console.cloud.google.com/vertex-ai/model-garden" rel="nofollow noreferrer">https://console.cloud.google.com/vertex-ai/model-garden</a> (all the models)- <a href="https://console.cloud.google.com/vertex-ai/generative" rel="nofollow noreferrer">https://console.cloud.google.com/vertex-ai/generative</a> (studio / playground)VertexAI is the umbrella for all of the Google models available through their cloud platform.You want the last link if you are looking for a ChatGPT like experience, with the ability to also adjust the parameters, so more like a UI on top of the API

评论 #38630942 未加载

评论 #38630891 未加载

alexb_over 1 year ago

When I enter into the AI, Firefox blocks an insane amount of popups. The counter for blocked pop ups quickly reaches >100 where it stops counting. What is it trying to do?

评论 #38629100 未加载

评论 #38630292 未加载

评论 #38630957 未加载

behnamohover 1 year ago

Doesn't matter—it's already available on Bard and it's not good.

评论 #38628925 未加载

martythemaniakover 1 year ago

This is very good:- 60 queries per minute free - about 1/5th the price of GPT3.5 Turbo - priced per char, not per token - same image pricing as GPT4 150x150

评论 #38630208 未加载

yeldarbover 1 year ago

We put the image portion through its paces and compared it with GPT-V here: <a href="https://blog.roboflow.com/first-impressions-with-google-gemini/">https://blog.roboflow.com/first-impressions-with-google-gemi...</a>

评论 #38630983 未加载

评论 #38631013 未加载

30 comments

gregsadetskyover 1 year ago

评论 #38632694 未加载

chamodaover 1 year ago

评论 #38629004 未加载

评论 #38631576 未加载

评论 #38631304 未加载

isalmonover 1 year ago

评论 #38631314 未加载

pesfandiarover 1 year ago

评论 #38631467 未加载

dudusover 1 year ago

This is still not Gemini Ultra. That's the one they said was above state of the art. Still waiting for that one.

sam1234apterover 1 year ago

georgehillover 1 year ago

ianbickingover 1 year ago

brrrrrmover 1 year ago

评论 #38631604 未加载

评论 #38631221 未加载

lovasoaover 1 year ago

thedanglerover 1 year ago

legendofbrandoover 1 year ago

When I try to create an API key it says that "We are sorry, but you do not have access to Early Access Apps" yet my domain does allow access to early access apps....

AlmostSchurLieover 1 year ago

When I try to create an API key, all I see is "an internal error occured". Still waiting for Gemini Ultra though.

vibhajaimanover 1 year ago

Make a questionnaire senior secondary school students and mobile phone impact reply

vibhajaimanover 1 year ago

Make a questionnaire senior secondary school students and mobile phone impact

andre-zover 1 year ago

SubiculumCodeover 1 year ago

I'd like to see this benchmarked on humaneval for coding.

roschdalover 1 year ago

How can I use Google Gemini in a Java application?

评论 #38636537 未加载

prakhar897over 1 year ago

Can someone recreate the Google Demo of gemini?

ziga9over 1 year ago

Anyone else having access restricted problem?

zlg_codesover 1 year ago

I'd like to know why the name of this AI product coincides with the alternative in-between-HTTP-and-Gopher Gemini protocol.I'm sure it's just an accident.

评论 #38634818 未加载

tanyongshengover 1 year ago

The pricing is attractive.

replwoacauseover 1 year ago

This news doesn’t excite me at all after trying Bard Gemini Pro in the browser.

fotcornover 1 year ago

评论 #38629861 未加载

评论 #38629973 未加载

评论 #38629376 未加载

评论 #38630230 未加载

评论 #38629415 未加载

评论 #38630394 未加载

评论 #38629705 未加载

评论 #38629643 未加载

评论 #38633874 未加载

评论 #38631196 未加载

评论 #38630872 未加载

imdsmover 1 year ago

评论 #38629406 未加载

评论 #38629846 未加载

评论 #38630233 未加载

评论 #38630224 未加载

评论 #38630031 未加载

评论 #38630603 未加载

评论 #38629311 未加载

评论 #38630881 未加载

verdvermover 1 year ago

评论 #38630942 未加载

评论 #38630891 未加载

alexb_over 1 year ago

When I enter into the AI, Firefox blocks an insane amount of popups. The counter for blocked pop ups quickly reaches >100 where it stops counting. What is it trying to do?

评论 #38629100 未加载

评论 #38630292 未加载

评论 #38630957 未加载

behnamohover 1 year ago

Doesn't matter—it's already available on Bard and it's not good.

评论 #38628925 未加载

martythemaniakover 1 year ago

This is very good:- 60 queries per minute free - about 1/5th the price of GPT3.5 Turbo - priced per char, not per token - same image pricing as GPT4 150x150

评论 #38630208 未加载

yeldarbover 1 year ago

评论 #38630983 未加载

评论 #38631013 未加载