科技回声
Show HN: OK-Robot: open, modular home robot framework for pick-and-drop anywhere

542 points by MahiShafiullah, over 1 year ago
Hi all, excited to share our latest work, OK-Robot, an open and modular framework for navigation and manipulation with a robot assistant in practically any home, without having to teach the robot anything new! You can simply unbox the target robot, install OK-Robot, give it a "scan" (think a 60-second iPhone video), and start asking the robot to move arbitrary things from A to B. We have already tested it in 10 home environments in New York City, and one environment each in Pittsburgh and Fremont.

We based everything on the current best machine learning models, so things don't quite work perfectly all the time, and we are hoping to build it together with the community! Our code is open: https://github.com/ok-robot/ok-robot and we have a Discord server for discussion and support: https://discord.gg/wzzZJxqKYC If you are curious what works and what doesn't, take a quick look at https://ok-robot.github.io/#analysis or read our paper for a detailed analysis: https://arxiv.org/abs/2401.12202

P.S.: While the code is open, the project unfortunately isn't fully open source, since one of our dependencies, AnyGrasp, has a closed-source, educational license. Apologies in advance, but we used it since it was the best grasping model we could get access to!

Would love to hear more thoughts and feedback on this project!

23 comments

rapjr9, over 1 year ago
Robots like this will have a small market until they can handle obstacles: the cat toy that the cat left in the middle of the floor, the papers that an open window blew off the table, the toys the kids left scattered about, the pencil that rolled off the desk while you were away, the dirty laundry you left lying on the floor, the ridge between carpet and hardwood floors, doors left open or closed, and more. That means there may be several tasks that intervene before a primary task can be accomplished (move the toys, pick up the papers, pick up the laundry, open the door). Some obstacles will semi-permanently block a wheeled robot, such as cables, things stacked that you don't want moved, furniture, a sleeping pet, stacked unopened packages from the mail, etc. I believe this means general-purpose home robots cannot have wheels; they must have legs, perhaps more than two legs for stability. It may sound weird, but I think the ideal design might be somewhere between a large friendly spider and a dog. It's odd how robotics has mostly fallen into this idea that the world is two-dimensional and flat. They've idealized away the really difficult problems of dealing with mobility in a 3D world. Note that everything this robot does involves only planar horizontal surfaces. Basically, it looks like a person had to go through the rooms and clean them up for the robot to function. Roombas have the same problem.
dlivingston, over 1 year ago
That's very cool. I have almost no experience with robotics, so excuse the silly questions:

- How does it know what objects are? Does it use some sort of real-time object classifier neural net? What limitations are there here?

- Does the robot know when it can't perform a request? E.g., if you ask it to move a large box or a very heavy kettlebell?

- How well does it do if the object is hidden or obscured? Does it go looking for it? What if it must move another object to get access to the requested one?
fnordpiglet, over 1 year ago
This is remarkable and could be life changing for the disabled, elderly, gamers, or profoundly lazy and their caretakers.
alx__, over 1 year ago
This is rad. I would totally buy a 25k robot if I could train it to fold and put away my laundry (serious)
ctoth, over 1 year ago
For solving long-term tasks like finding things that aren't there, you can turn the annotated scene into a templated description and feed it to a large-enough model trained on interactive fiction.

You are standing in a kitchen. Ahead of you to your right there is a large refrigerator with the handle on the right side. There is a set of cabinets to your left with a plate sitting on the counter above them.

> get beer

You don't see any beer here.

<< COT: I know that beer is often found in the fridge. I should try opening the refrigerator

> open fridge

Opening the refrigerator reveals 4 cans of beer.

> get beer

taken

Obviously we're still several years from this working, but it's very exciting to consider: interactive fiction narrative fed by real sensors, plus chain-of-thought blocks as internal monologue.
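A minimal sketch of the templating idea in this comment, assuming the robot's perception stack already yields a list of annotated objects. The `SceneObject` fields, `describe_scene`, and `handle_command` are hypothetical names invented for illustration; they are not part of OK-Robot, and the language model that would choose the commands is left out entirely.

```python
from dataclasses import dataclass, field


@dataclass
class SceneObject:
    """Hypothetical annotation for one detected object (not an OK-Robot type)."""
    name: str                      # e.g. "refrigerator"
    relation: str                  # e.g. "ahead of you to your right"
    contents: list[str] = field(default_factory=list)  # hidden until opened
    is_open: bool = False


def describe_scene(room: str, objects: list[SceneObject]) -> str:
    """Render the annotated scene as interactive-fiction style prose."""
    sentences = [f"You are standing in a {room}."]
    for obj in objects:
        sentences.append(f"There is a {obj.name} {obj.relation}.")
    return " ".join(sentences)


def handle_command(command: str, objects: list[SceneObject]) -> str:
    """Answer a two-word 'verb target' command against the scene state."""
    verb, _, target = command.strip().partition(" ")
    if verb == "open":
        for obj in objects:
            if obj.name == target:
                obj.is_open = True
                if obj.contents:
                    return f"Opening the {obj.name} reveals {', '.join(obj.contents)}."
                return f"The {obj.name} is empty."
        return f"You don't see any {target} here."
    if verb == "get":
        # Only items inside containers that have been opened are reachable.
        for obj in objects:
            if obj.is_open and target in obj.contents:
                obj.contents.remove(target)
                return "Taken."
        return f"You don't see any {target} here."
    return "I don't understand that."


if __name__ == "__main__":
    scene = [
        SceneObject("refrigerator", "ahead of you to your right", contents=["beer"]),
        SceneObject("plate", "on the counter to your left"),
    ]
    print(describe_scene("kitchen", scene))
    for cmd in ("get beer", "open refrigerator", "get beer"):
        print(">", cmd)
        print(handle_command(cmd, scene))
```

In a real system the text returned by `handle_command` would come from sensors and action results rather than a hand-written dictionary of contents, and a model would read the transcript to pick the next command.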
bshah_, over 1 year ago
The failure analysis is super well done, nice work! Curious what qualifies as a hardware failure, e.g. there are 5 trials where the "Realsense gave bad depth", and how that's determined.
khnov, over 1 year ago
It is open source but still costs nearly 25k dollars. Why is it that expensive?
owenpalmer, over 1 year ago
Isn't this the same as dobb-e?

https://dobb-e.com/
btbuildem, over 1 year ago
I've been watching this project for a while now, great progress!

I envision an integration with a mobility aid (e.g., a wheelchair) for someone with limited control over their limbs. Imagine a "smart" exoskeleton that can help with otherwise impossible tasks -- it could be a game-changer for so many people.
rsync, over 1 year ago
I very much want a stabilized platform vehicle that I can send point-to-point with a payload on it.

So, a gyro-stabilized platform like a Segway that I can send back and forth from point A to point B on a not-terrible-but-rough (walking path) route.

I have tried to stay abreast of the options in the past and have never seen anything that matches this ... does anyone know if there is anything new that matches this use case?

(The use case is a tray of drinks and hors d'oeuvres that needs to go from one part of a property to another without spilling ... it needs to be minimally all-terrain.)
chubs, over 1 year ago
A friend is working on a slightly related project. I'm curious how they map out the room in voxels; anyone care to suggest how this is done?
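A minimal sketch of one common way to get a voxel map, assuming the scan has already been converted into 3D points in a shared world frame (for example from depth images plus camera poses). This is a generic illustration, not a description of how OK-Robot builds its map; `voxelize` and the 5 cm voxel size are assumptions made for the example.

```python
import math

Point = tuple[float, float, float]
Voxel = tuple[int, int, int]


def voxelize(points: list[Point], voxel_size: float) -> set[Voxel]:
    """Return the set of integer voxel indices occupied by a point cloud.

    Each coordinate is divided by the voxel edge length and floored, so all
    points falling inside the same cube collapse to one index.
    """
    if voxel_size <= 0:
        raise ValueError("voxel_size must be positive")
    return {
        (
            math.floor(x / voxel_size),
            math.floor(y / voxel_size),
            math.floor(z / voxel_size),
        )
        for x, y, z in points
    }


if __name__ == "__main__":
    cloud = [(0.01, 0.02, 0.03), (0.04, 0.01, 0.02), (0.26, 0.0, 0.0)]
    print(sorted(voxelize(cloud, voxel_size=0.05)))
```

Systems that need to search the map by language typically also store a feature vector or label per occupied voxel, which this sketch omits.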
taco_emoji, over 1 year ago
I know nothing about robotics, but can someone ELI5 why the robot makes so many extraneous movements? E.g., in the video that shows it moving Takis from the desk to the nightstand, it approaches the desk, then the arm mechanism moves all the way down (an unnecessary maneuver) and rises again before reaching the level needed to pick up the Takis.
crawsome, over 1 year ago
This looks really cool, but I immediately think of the possibility of it starting a fire and thinking everything's fine.
Geisterde, over 1 year ago
I like the presentation of this: here are 10 different environments and multiple videos of each.
ingend88, over 1 year ago
For a long time, I have wanted to use a robot with a gripper to make tea. Is there any 6DOF robot available at a reasonable price of <$1000 to do so?
CodeWriter23, over 1 year ago
Back in the day, my friend would lament not having “closetgrep” to find the needed thingy stored in an overfilled closet.
jamesdwilson, over 1 year ago
I'd love to see this be usable as a mower and/or vacuum/mop with different swappable components.
westmeal, over 1 year ago
Is the title a reference to OK Computer, or is it just something you all came up with?
ghostly_s, over 1 year ago
Why are these general-purpose robots always so slow? Intuitively we expect machines to be able to do tasks faster than humans, but even the 5x speed video is much slower than a human could do the task.
mdfriefeld, over 1 year ago
Congrats on the awesome work!
nxobject, over 1 year ago
Take that, SHRDLU!
k4rli, over 1 year ago
It's cool, but what's the point for a normal person? Useful for warehouses and manufacturing, but I don't see myself ever needing such things.
jacobsenscott, over 1 year ago
It appears slow, but tests show it completes most tasks more quickly, more accurately, and with less complaining than most members of the Gen Z cohort.