TechEcho
Siri creator shows off first public demo of Viv

224 points by bbrunner about 9 years ago

25 comments

kowdermeister about 9 years ago
Maybe I'm not the exact target market, but I always pick my search parameters carefully and research my decisions. As a showpiece, sending flowers this way is impressive, but there are always minor details that break the experience: the shop is closed, or I can't pay via service X. The Hotels.com booking is great, but not everybody is rich enough to book a villa :)

As I see it, Viv aims to be the Google of service providers, which is a pretty neat goal. Nobody has done that successfully, so I'm rooting for them. The platform approach is also a good marketing strategy, since they want to make it embeddable. For example, letting people ask basic things about a monument in a remote national park. The applications are really limitless if it fits on a Raspberry Pi-like card.

As someone who likes to break things, please, Viv, process this for me:

"VIV, please find me a cheap flight to Vienna, check Ryanair and Wizzair on the 21st of May for two adults" "order the results by price" "show details for the second one" "show the previous one" "is luggage included in the price?" "book it, but skip all marketing offers from the company" "find AirBnB apartments for that day that have wireless internet" "order results by price" "ok, then search in the range of 10 to 50€ per night" "select the first place" "message host: looking forward to meeting you" "book it." "Viv, bring me a beer"
bsaul about 9 years ago
Impressive demo, yet I can't help thinking that the "big" question still remains largely unsolved for these services.

Two points:

1/ Viv keeps mentioning a "breakthrough" in computer science because they managed to create a program that scans a network of services and fills in the parameters from the query. Now, those guys aren't jokers, so I presume there is definitely something fantastic behind it, yet I can't help thinking that once you've "understood" the intent, it sounds quite close to a SQL / GraphQL query planner.

2/ Which leads to the big question: guessing the intent. Every time I hear people mention "intelligence" or "understanding", I show them this: "hey Siri, please do NOT set a timer for 4 minutes, I beg you". The fact is, it's just a trick. It recognizes words, but it doesn't understand anything; it doesn't have concepts, knowledge or any experience. It's a dumb program that never had any life or sense to understand anything.

If I say "find me a good ticket for Chicago tonight", how will it know that I'm talking about the rock band that's playing tonight in Paris, France, since there's absolutely no chance that a human being would ask for a plane ticket for such a long trip just a few hours in advance?

This is, to me, *the* big and interesting question that makers of online assistants need to solve. Now of course, Viv is aiming for a product release this year, so they're building an intermediate solution, where developers will manually associate keywords with services, in an "easy" way. And it will sort of work.

Yet I'm still waiting for the real "big" advance.
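To make the "query planner" comparison concrete, here is a minimal sketch of how a planner might chain services once the intent is known. It is not Viv's actual method, which the demo does not disclose; the service names (`geocode`, `forecast`, `find_flights`) and concept names are hypothetical. Each service declares the concepts it needs and the one it produces, and the planner works backward from the goal.

```python
# Hypothetical registry: service name -> (required input concepts, produced concept).
SERVICES = {
    "geocode": ({"city_name"}, "location"),
    "forecast": ({"location", "date"}, "weather"),
    "find_flights": ({"location", "date"}, "flight_options"),
}


def plan(goal, known, _seen=frozenset()):
    """Return an ordered list of services that produces `goal`, or None.

    Backward chaining: find a service that outputs the goal, then
    recursively plan for each of its inputs that is not already known.
    """
    if goal in known:
        return []          # nothing to call, the user already supplied it
    if goal in _seen:
        return None        # guard against cyclic service definitions
    for name, (inputs, output) in SERVICES.items():
        if output != goal:
            continue
        steps = []
        for concept in sorted(inputs):
            sub = plan(concept, known, _seen | {goal})
            if sub is None:
                break      # this service cannot be satisfied, try the next
            steps += [s for s in sub if s not in steps]
        else:
            return steps + [name]
    return None


if __name__ == "__main__":
    # The user said a city and a date, and the intent resolved to "weather".
    print(plan("weather", {"city_name", "date"}))
```

The planning step is mechanical, which is the commenter's point: the hard part is deciding that the goal is `weather` and not a concert ticket in the first place.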
ljoshua about 9 years ago
That's pretty impressive!

Maybe this isn't as big a concern as I think, but the ease of these commercial demos relies on a crucial assumption: that I am nearly completely price insensitive. I'm not a cheapskate, but I don't usually purchase the first product a vendor puts up, because there's usually something better and slightly less expensive if you put an additional minute or two into the search. But with the demos given (ordering flowers, booking hotels, etc.) it's usually a sample of 2-3 different options and, from what I've seen, usually higher-end items.

Will such assistants still maintain their usefulness when I don't have the monetary pleasure and freedom of just saying "Yes" to the first option offered?
palakchokshi about 9 years ago
EDIT: added intent in the conversation

I think one thing missing from these interactions is the idea of conversation. Let me elaborate. Distilling a complex task into a single query yields a complex query. Drilling down from a simple initial query requires manual input on the provided results. If there were some short-term memory as part of the conversation, we could potentially arrive at a much clearer intent.

e.g. "Viv I want to go to San Diego this weekend. Find me some cheap flights." (Travel, San Diego, Saturday, air fare)

"No problem. How long are you planning to stay?" (return trip)

"I want to come back on Sunday night" (return trip Sunday night)

"Will you be requiring accommodations?" (air fare, hotel combo lookup)

"Yes I want to stay near the waterfront" (area for hotel, duration 1 night based on previous reply)

"Sure no problem. Will you require a car rental?" (air fare, hotel, car combo lookup)

"No that's fine" (air fare, hotel lookup only)

"Ok here are some good deals for flights and hotels on the waterfront for Saturday"

"What are my options for a morning flight?" (Morning flight on Saturday)

"These flights are in the morning" (Subset of the flights found earlier)

"How about late Friday flights?" (Friday evening flights to San Diego)

"Here are your options for a late Friday departure with the same hotel options" (new departure schedule but same hotel options, new duration of 2 nights of hotel stay)

"Show me just the flights for Friday"

"Here are just the flight options for Friday"

"Nah let's see the Saturday options again" (go back to previous criteria)

"Here are the Saturday flight and hotel options"
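The short-term memory described in this dialogue can be sketched as a stack of slot snapshots: each turn merges new criteria into the current ones, and "go back to previous criteria" pops the latest snapshot. This is an illustrative sketch only; the class name `Conversation` and the slot names are assumptions, not anything shown in the demo.

```python
class Conversation:
    """Keeps a history of search criteria so later turns can refine or undo."""

    def __init__(self):
        # Stack of slot dictionaries; the last entry is the current criteria.
        self.history = [{}]

    @property
    def slots(self):
        return self.history[-1]

    def update(self, **changes):
        """Merge this turn's criteria into a new snapshot and make it current."""
        merged = {**self.slots, **changes}
        self.history.append(merged)
        return merged

    def go_back(self):
        """Discard the latest refinement, e.g. "let's see Saturday again"."""
        if len(self.history) > 1:
            self.history.pop()
        return self.slots


if __name__ == "__main__":
    chat = Conversation()
    chat.update(destination="San Diego", depart="Saturday", want="flights")
    chat.update(return_day="Sunday")
    chat.update(hotel_area="waterfront", nights=1)
    chat.update(depart="Friday", nights=2)   # "How about late Friday flights?"
    print(chat.go_back())                    # back to the Saturday criteria
```

Keeping whole snapshots rather than diffs makes undo trivial, at the cost of copying a small dictionary on every turn.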
sklivvz1971 about 9 years ago
"This is software writing itself". Welcome to the 70's.

The speaker comes across poorly, in my opinion.

He's doing marketing, not informing the audience, and he's not even that interesting.

He's saying a ton of stuff which is completely unsupported by reality.

Assistants are the next paradigm shift? Have you tried to use Siri in a car? Or at all? It's a joke.

Weather examples? Show me something I use more than once a week.

Show me something actually *hard* that requires intelligence. Show me a recommendation service which does not suck. Show me an assistant that figures out something nice to do this weekend, based on location inferred from my plane tickets, weather, my taste, and what's available.

Show me an assistant that *shops for the cheapest flowers*, rather than forcing me to use the service of the app maker's choice.

Please, let's stop the "wow I can speak to my telephone, and it tells me the weather" bore. Please.
musesum about 9 years ago
So, is this an island parser on top of a Bayes net?

I haven't read the papers for a few decades, but I wonder how the auto-generated code differs from General Problem Solver: https://en.wikipedia.org/wiki/General_Problem_Solver

I hope the above doesn't sound dismissive. I was working at a company incubated at SRI. We were hand-coding similar workflows. My guess is that Viv is a 10x improvement in workflow.

Be that as it may, it still seems to be constrained by humans creating ontologies. So it may suffer scaling issues. Somewhat akin to Yahoo's human-curated websites vs Google's auto-generated BackRub.
awalGarg about 9 years ago
Sort of concerned about this:

Who decides which service to use for what? What if I don't want to use Uber for my cab, but a new Uber-like service that my friend just started? If my friend adds an experience to Viv for his service, how does Viv decide which one to use? What about neutrality etc.?
stephengillie about 9 years ago
You can make your own "PowerShiri" in Powershell, by tapping into the .NET Speech Recognition library [0].

Having built out a novel bot this weekend [1], this became an obvious next step. The PowerShiri demo linked lets you set up a key/value table, and when Speech Recognition matches a key, it executes the Powershell command held in the value. In this way, you can issue the same command, e.g. "What is the time?" - and receive a dynamic response. The voice output is generated by the ubiquitous Out-Speech function.

Dynamic program generation, more like dynamic query generation, is the obvious next step. Instead of generating a massive key/value table for every possible combination of commands, you treat each word as a key, then have each value generate the next set of keys.

It's both annoying and gratifying to learn a weekend project is facing the same technical challenges as a well-funded corporation.

[0] http://stackoverflow.com/questions/9361594/powershell-can-speak-but-can-it-write-if-i-speak

[1] https://github.com/Gilgamech/PowerShiri/
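The "each word is a key, each value yields the next set of keys" idea can be sketched as a nested lookup table. The example below is in Python rather than Powershell and does not reproduce the PowerShiri code; the phrases, handlers, and the name `dispatch` are made up for illustration, and speech recognition is replaced by a plain string.

```python
from datetime import datetime

# Each word maps to the set of words that may follow it; a leaf is a
# callable that produces the (possibly dynamic) response.
GRAMMAR = {
    "what": {
        "is": {
            "the": {
                "time": lambda: datetime.now().strftime("%H:%M"),
                "date": lambda: datetime.now().strftime("%Y-%m-%d"),
            }
        }
    },
    "say": {
        "hello": lambda: "hello",
    },
}


def dispatch(phrase, grammar=GRAMMAR):
    """Walk the table one word at a time and run the handler at the leaf.

    Returns None when the phrase leaves the table or stops before a leaf.
    """
    node = grammar
    for word in phrase.lower().strip("?!. ").split():
        if not isinstance(node, dict) or word not in node:
            return None
        node = node[word]
    return node() if callable(node) else None


if __name__ == "__main__":
    print(dispatch("What is the time?"))
    print(dispatch("say hello"))
```

Compared with one flat table entry per full sentence, shared prefixes such as "what is the" are stored once, so adding "what is the weather" means adding a single leaf.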
cthulhujr about 9 years ago
> The live demo went off without any major glitches

At 3:54 - "The other thing I want to note is that Siri... err ummm Viv's audible voice is something we're still working on."
yoavm about 9 years ago
Who decides what "app" answers when I want to get flowers? I bet there are tens of apps doing it currently, and users have their own preferences. Same for anything weather-related. What would be my chances as a developer of competing with the more mainstream providers? While they aren't huge now, they seem nonexistent in this future, where the user practically doesn't even choose the tools she wants to use.

(sorry if the question was answered in the video and I missed it)
frik about 9 years ago
Sounds similar to IBM Watson that generates Prolog code on-the-fly. I wonder if Viv is using Prolog behind the scenes too (behind the visual AST representation).
piyush_soni about 9 years ago
Well, booking hotels *their* way has always been easy. Just open the website, look at the first three search results and book the one which 'looks' the best in the photo (assuming you have all the money in the world). In reality, it's not that easy; there are so many parameters to consider which they haven't. Price, smoking preference, number and size of beds, cleanliness, wi-fi and its availability and speed, other amenities, parking and so much more that you have to consider before deciding on a hotel. This personal assistant doesn't seem to do any of that. The problem is, I don't even know how we can automate or predict that, because these are very personal preferences and change with each trip.
quocble about 9 years ago
Great to see they finally unveiled Viv. They've been working in stealth for a few years. Six months ago, I met the team for an interview and was offered a position as Mobile Architect. I declined the job for reasons unrelated to Viv. I met the technical cofounders, Adam Cheyer & Chris Brigham. It was just awesome to talk to the bunch of guys who made the original Siri and personally worked with Steve Jobs. You can read up on Adam; not only is he an AI god, he's also a super nice guy. If I were to bet on one AI company, they would be it. As Kittlaus alluded to, it's going to take some time to see what use cases ordinary people use it for, but it will be exciting. Best of luck to them.
matt_wulfeck about 9 years ago
I can't help but feel bearish on any company like this that tries to exist outside of either the Android or iOS integration. It's a desert without OK Google/Siri.
edwhitesell about 9 years ago
This is interesting to me because it really pulls together a number of disparate technologies to get towards the goal of being "the AI". There's nothing he talks about that's overly complicated on its own, but integrating them all together is the key.

Once you have the voice to text (ASR) from Nuance (or any of the other companies in this space), then it's a question of properly recognizing the contextual intent. Not a trivial task, but certainly getting easier with the technology available. I think the visual models displayed in the demo are fairly telling as to how they handle this. Keywords for a domain (e.g. weather) become useful for determining the intent of the speech, as well as the variables used in queries, or the data returned for display to the user.

For example, if I ask a question that mentions temperature, and a time, but not a length of time, it's fairly obvious I'm asking for something in a weather domain rather than a recipe domain (and vice versa).

I'd guess the developer integrations he mentions then become a matter of defining those data points/variables for the different data sources so the "AI" can build the application to execute.

It's like an enhanced API integration model. You need to know all of the input/output parameters to integrate with an API. The intent/contextual piece also uses the individual data points for the contextual intent recognition in the voice-to-text area.

The other interesting aspect is actually storing the preferences for the commerce side of things. Airlines do this already for saving a preference for aisle vs window seats. They're taking things a step further to remember those types of "qualifying data" for interactions you have so they can be saved across areas (read: API calls).

I suspect there will be a ton of work on the API side to handle data in this way too, which is what he means when he says it will take some time to see the direction things go in. If I opened up my existing airline travel APIs to this today, it's unlikely anyone could correctly interact in a way that would provide all of the information needed to actually book a ticket. So, there will need to be some back-and-forth communication about those missing items. If someone finds a ticket they want to book and says "order it", then Viv will need a way for the API to communicate "That's great, but you also need to provide your TSA known traveler ID number". Then, because it's something an API has asked for, Viv will know it's a data point it should save for later.

Let's hope Viv has unbreakable encryption and security with all of those "personal preferences"...
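Two ideas from this comment lend themselves to a small sketch: scoring domains by keyword overlap, and letting an API report which required data points are still missing so the assistant can ask for them and remember the answers. Everything below is an assumption made for illustration (the keyword sets, the `book_flight` action, and its field names); it says nothing about how Viv or any airline API actually works.

```python
# Hypothetical keyword sets per domain; "temperature" is deliberately shared.
DOMAIN_KEYWORDS = {
    "weather": {"temperature", "rain", "forecast", "degrees"},
    "recipe": {"bake", "minutes", "oven", "temperature"},
}

# Hypothetical required data points for one API action.
REQUIRED_FIELDS = {
    "book_flight": ["flight_id", "passenger_name", "known_traveler_id"],
}


def guess_domain(utterance):
    """Pick the domain whose keywords overlap most with the utterance."""
    words = set(utterance.lower().replace("?", "").split())
    scores = {name: len(words & kw) for name, kw in DOMAIN_KEYWORDS.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None


def missing_fields(action, provided, saved_profile):
    """List required fields that neither this request nor the profile supplies."""
    merged = {**saved_profile, **provided}
    return [field for field in REQUIRED_FIELDS[action] if field not in merged]


if __name__ == "__main__":
    print(guess_domain("What is the forecast temperature tomorrow?"))
    profile = {"passenger_name": "A. Traveler"}
    todo = missing_fields("book_flight", {"flight_id": "X1"}, profile)
    print(todo)                      # the assistant would ask for these
    profile["known_traveler_id"] = "<answer from user>"  # saved for next time
    print(missing_fields("book_flight", {"flight_id": "X1"}, profile))
```

Bag-of-words overlap is a deliberately crude stand-in for intent recognition; it cannot use the "a time, but not a length of time" distinction the comment describes, which would need at least some parsing of the time expression.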
lowglow about 9 years ago
I pitched a very early, similar idea, Playa (http://getplaya.com/), at the Launch IoT Conference in front of the audience and was met by Jason and a panel of judges who had no idea what I was talking about.

After the pitch, representatives from NTT Docomo, Cisco, GE, and a few others came up to me, interested in chatting about ambient intelligence and staying in touch.

This is telling of how things are now changing and moving towards the Ambient Intelligent future.

We're still building towards this future, and would like to meet up with anyone in SF who wants to help out.

We want to put souls inside your devices. :)

[EDIT] Luckily, Robert Scoble was kind enough to give us some outstanding attention on his feed, and also hook us up with RackSpace's startup partnership program. Huge thank you to him for that.

[EDIT] We just set up a baqqer account to be as transparent as possible as we build out the product. https://baqqer.com/projects/playa
aroman about 9 years ago
Am I missing something, or was the demo on the Mac just a pre-recorded QuickTime video? The menu title bar said "QuickTime Player" and the title of the window was "Movie Recording".

I don't doubt it works for real, but isn't it a bit weird he didn't acknowledge the "live demo" was actually pre-recorded?
dharma1 about 9 years ago
The developer experience looks interesting; it looks like some kind of flow-based, quite restricted coding environment? Getting that ecosystem and the incentives for devs right will be the key to their success.

I've been using Amazon Alexa for a month and am really struggling to find 3rd party apps or "skills" for it that I would want to use. Also, namespace pollution quickly becomes an issue when you start having thousands of apps that you didn't install but are there to use.

I think Google will dominate the voice assistant space soon, as they have so many crucial services already there.

Good to have competition though!
skykooler about 9 years ago
This sounds a lot like SoundHound.
kcparashar about 9 years ago
Any ideas on what is used to power the graph demo used about 9 minutes in (https://youtu.be/MI07aeZqeco?t=539)? I haven't seen something so fast and seamless in a browser.
forrestthewoods about 9 years ago
Live demo starts at 9:00. I am very very impressed.

One of my top complaints with Siri is the inability to chain commands. If you want to modify a command you have to start over from scratch, which often results in it misunderstanding another word. It's very, very frustrating.
warrenmiller about 9 years ago
I always cringe when someone says 'software writing itself'. Looks to me more like software constructing a query, much like an SQL query. Not sure that is a 'computer science breakthrough'. Still, the whole thing is pretty neat.
kylehotchkiss about 9 years ago
I live by myself, so the idea of talking when another human is not around, or hearing my own voice, is sort of off-putting. So cool to see this technology evolve, but I can hardly put Siri (an amazing technology) to good use!
askit about 9 years ago
What is their business model? What about privacy for such an app that has to know everything in order to provide a smooth experience?
jadbox about 9 years ago
But will it be open source?