TechEcho
Show HN: OK-Robot: open, modular home robot framework for pick-and-drop anywhere

542 points by MahiShafiullah about 1 year ago

Hi all, excited to share our latest work, OK-Robot, an open and modular framework for navigation and manipulation with a robot assistant in practically any home, without having to teach the robot anything new! You can simply unbox the target robot, install OK-Robot, give it a "scan" (think a 60-second iPhone video), and start asking the robot to move arbitrary things from A to B. We already tested it out in 10 home environments in New York City, and one environment each in Pittsburgh and Fremont.

We based everything off of the current best machine learning models, so things don't quite work perfectly all the time, and we are hoping to build it together with the community! Our code is open: https://github.com/ok-robot/ok-robot and we have a Discord server for discussion and support: https://discord.gg/wzzZJxqKYC If you are curious what works and what doesn't work, take a quick look at https://ok-robot.github.io/#analysis or read our paper for a detailed analysis: https://arxiv.org/abs/2401.12202

P.S.: While the code is open, the project unfortunately isn't fully open source, since one of our dependencies, AnyGrasp, has a closed-source, educational license. Apologies in advance, but we used it since that was the best grasping model we could get access to!

Would love to hear more thoughts and feedback on this project!

23 comments

rapjr9 about 1 year ago
Robots like this will have a small market until they can handle obstacles. The cat toy that the cat left in the middle of the floor, the papers that an open window blew off the table, the toys the kids left scattered about, the pencil that rolled off the desk while you were away, the dirty laundry you left lying on the floor, the ridge between carpet and hardwood floors, doors left open or closed, and more. That means there may be several tasks that intervene before a primary task can be accomplished (move the toys, pick up the papers, pick up the laundry, open the door). Some obstacles will semi-permanently block a wheeled robot, such as cables, things stacked that you don't want moved, furniture, a sleeping pet, stacked unopened packages from the mail, etc. I believe this means general purpose home robots cannot have wheels; they must have legs, perhaps more than two legs for stability. It may sound weird but I think the ideal design might be somewhere between a large friendly spider and a dog. It's odd how robotics has mostly fallen into this idea that the world is two dimensional and flat. They've idealized away the really difficult problems of dealing with mobility in a 3D world. Note that everything this robot does involves only planar horizontal surfaces. Basically it looks like a person had to go through the rooms and clean them up for the robot to function. Roombas have the same problem.
dlivingston about 1 year ago
That's very cool. I have almost no experience with robotics, so excuse the silly questions:

- How does it know what objects are? Does it use some sort of realtime object classifier neural net? What limitations are there here?

- Does the robot know when it can't perform a request? I.e. if you ask it to move a large box or very heavy kettlebell?

- How well does it do if the object is hidden or obscured? Does it go looking for it? What if it must move another object to get access to the requested one?
fnordpiglet about 1 year ago
This is remarkable and could be life changing for the disabled, elderly, gamers, or profoundly lazy and their caretakers.
alx__ about 1 year ago
This is rad. I would totally buy a 25k robot if I could train it to fold and put away my laundry (serious)
ctoth about 1 year ago
For solving long term tasks like finding things that aren't there, you can turn the annotated scene into a templated description and feed it to a large-enough model trained on interactive fiction.

You are standing in a kitchen. Ahead of you to your right there is a large refrigerator with the handle on the right side. There is a set of cabinets to your left with a plate sitting on the counter above them.

> get beer

You don't see any beer here.

<< COT: I know that beer is often found in the fridge. I should try opening the refrigerator

> open fridge

Opening the refrigerator reveals 4 cans of beer.

> get beer

taken

Obviously we're still several years from this working, but it's very exciting to consider. Interactive Fiction narrative fed by real sensors plus chain-of-thought blocks as internal monologue.
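
The "annotated scene to templated description" idea in this comment can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not anything from OK-Robot: the `SceneObject` schema, `describe_scene`, and `step` are hypothetical names, the scene is hand-written rather than produced by sensors, and no language model is involved. The code plays the narrator side of the loop; the model (or a person) would supply the commands.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class SceneObject:
    """One annotated object. This schema is hypothetical, for illustration only."""
    name: str
    location: str  # e.g. "ahead of you to your right"
    contents: list[str] = field(default_factory=list)
    is_open: bool = False


def describe_scene(room: str, objects: list[SceneObject]) -> str:
    """Fill a fixed text template from the annotated objects."""
    lines = [f"You are standing in a {room}."]
    for obj in objects:
        lines.append(f"There is a {obj.name} {obj.location}.")
    return " ".join(lines)


def step(command: str, objects: list[SceneObject]) -> str:
    """Answer one parser-style command ("get X" or "open X") with narration."""
    verb, _, target = command.partition(" ")
    if verb == "open":
        for obj in objects:
            if obj.name == target:
                obj.is_open = True
                if obj.contents:
                    return f"Opening the {obj.name} reveals: {', '.join(obj.contents)}."
                return f"The {obj.name} is empty."
        return f"You don't see any {target} here."
    if verb == "get":
        for obj in objects:
            # Items are only reachable once their container has been opened.
            if obj.is_open and target in obj.contents:
                obj.contents.remove(target)
                return "Taken."
        return f"You don't see any {target} here."
    return "I don't understand that."
```

A session would call `describe_scene("kitchen", scene)` once for the opening narration, then pass each command to `step` and feed the returned text back to whatever is issuing commands. Keeping the narrator as plain templated text is the point of the design: the chain-of-thought lines stay entirely on the model's side of the exchange.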
bshah_ about 1 year ago
The failure analysis is super well done, nice work! Curious what qualifies as hardware failure, e.g. there are 5 trials where the "Realsense gave bad depth", and how that's determined.
khnov about 1 year ago
It is open source but still costs nearly 25k dollars. Why is it that expensive?
owenpalmer about 1 year ago
Isn't this the same as dobb-e?

https://dobb-e.com/
btbuildem about 1 year ago
I've been watching this project for a while now, great progress!

I envision an integration with a mobility aid (e.g., a wheelchair) for someone with limited control over their limbs. Imagine a "smart" exoskeleton that can help with otherwise impossible tasks -- it could be a game-changer for so many people.
rsync about 1 year ago
I very much want a stabilized platform vehicle that I can send point-to-point with a payload on it.

So, a gyro-stabilized platform like a segway that I can send back and forth from point A to point B on a not-terrible-but-rough (walking path) route.

I have tried to stay abreast of the options in the past and have never seen anything that matches this ... does anyone know if there is anything new that matches this use-case?

(the use-case is a tray of drinks and hors d'oeuvres that needs to go from one part of a property to another without spilling ... needs to be minimally all-terrain)
chubs about 1 year ago
A friend is working on a slightly related project. I'm curious how they map out the room in voxels. Anyone care to suggest how this is done?
taco_emoji about 1 year ago
I know nothing about robotics, but can someone ELI5 why the robot makes so many extraneous movements? E.g. the video that shows it moving Takis from the desk to the nightstand, it approaches the desk, and then the arm mechanism moves all the way down (an unnecessary maneuver), then rises again before reaching the level needed to pick up the Takis.
crawsome about 1 year ago
This looks really cool, but I immediately think of the possibility of it starting a fire, and thinking everything's fine
Geisterde about 1 year ago
I like the presentation of this: here are 10 different environments and multiple videos of each.
ingend88 about 1 year ago
For a long time, I wanted to use a robot with a gripper to make tea. Is there any 6DOF robot available at a reasonable price of <$1000 to do so?
CodeWriter23 about 1 year ago
Back in the day, my friend would lament not having “closetgrep” to find the needed thingy stored in an overfilled closet.
jamesdwilson about 1 year ago
I'd love to see this be usable as potentially a mower and/or vacuum/mop with different swappable components.
westmeal about 1 year ago
Is the title a reference to OK Computer or is it just something you all came up with?
ghostly_s about 1 year ago
Why are these general purpose robots always so slow? Intuitively we expect machines to be able to do tasks faster than humans, but even the 5x speed video is much slower than a human could do the task.
mdfriefeld about 1 year ago
congrats on the awesome work!
nxobject about 1 year ago
Take that, SHRDLU!
k4rli about 1 year ago
It's cool, but what's the point for a normal person? Useful for warehouses and manufacturing, but I don't see myself ever needing such things.
jacobsenscott about 1 year ago
It appears slow, but tests show it completes most tasks more quickly, accurately, and with less complaining than most members of the Gen Z cohort.