Solving the Rubik’s cube with a robot hand

171 pointsby gdbover 5 years ago

17 comments

daenzover 5 years ago

>We’ve trained a pair of neural networks to solve the Rubik’s Cube with a human-like robot hand. The neural networks are trained entirely in simulationSimulated training is so cool. Related, is anyone interested in a plugin for Blender that allows you to easily build physically-accurate simulation environments for robots and then apply reinforcement learning to the virtual robots? I have a hodge-podge amount of code for doing exactly this, and I'm curious if anyone else would be interested in it?

评论 #21262815 未加载

评论 #21262689 未加载

评论 #21264801 未加载

评论 #21261955 未加载

评论 #21264477 未加载

评论 #21264553 未加载

评论 #21264402 未加载

评论 #21261725 未加载

ZhuanXiaover 5 years ago

Gwern on the luck of the last mover:"Launching too early means failure, but being conservative & launching later is just as bad because regardless of forecasting, a good idea will draw overly-optimistic researchers or entrepreneurs to it like moths to a flame: all get immolated but the one with the dumb luck to kiss the flame at the perfect instant, who then wins everything, at which point everyone can see that the optimal time is past."Robotics has been a money pit for startups and corporations for a long time. Think of the billions Toytoa has spent on home robotics research, to little avail.But at some point it won't be. Some entity will "kiss the flame" at the right movement. The wealth they create will be beyond any company ever, by an almost incomparable margin.

评论 #21261222 未加载

评论 #21262838 未加载

评论 #21264511 未加载

评论 #21266628 未加载

sytelusover 5 years ago

Many caviates but impressive progress in manipulation, especially sim2real:- Only 20% attempts successful on hardest configs with 26+ moves- Solving steps are not generated by RL (but could be[1])- Cube is modified internally to transmit additional state via bluetooth- Highly calibrated and fine tuned environment+MuJoCo based sim to match simulation to reality as much as possible- Open AI Five algorithm is pretty much reused as-is- Cumulative training time = 13 thousand years, same order of magnitude as the 40 thousand years- 32+64 V100 GPUs per training cycle[1] <a href="https://arxiv.org/abs/1805.07470" rel="nofollow">https://arxiv.org/abs/1805.07470</a>

hinkleyover 5 years ago

Some of Vernor Vinge's books deal with the 'alien' in alien intelligence in ways that were quite illuminating/shocking for me at the time. They weren't just humanoids with animal instincts. He created intelligent spiders that were believable. And the only sympathetic treatment of a hive mind I've yet encountered (Card's are pale in comparison)But one of my favorite inventions of his was a creature that had somehow evolved wheels. With veins and nerves and such there is hardly a creature on earth that can rotate a limb farther much farther than 200°, and the ones that can, like owls, we treat with a certain reverence.Developing an artificial wrist that can spin arbitrarily would be, I'd think, a quite compelling compensation for someone having to use a prosthetic arm. It would also make for some wicked Rubix solving skills. I wonder how proprioception would deal with that though...

评论 #21267145 未加载

minimaxirover 5 years ago

I appreciate the plushed giraffe perturbation. Reinforcement learning needs to account for all eventualities, including giraffes.

perl4everover 5 years ago

"What people don't appreciate, when they picture Terminator-style automatons striding triumphantly across a mountain of human skulls, is how hard it is to keep your footing on something as unstable as a mountain of human skulls."...I'm not feeling so confident now.

gapoover 5 years ago

It's great that OpenAI has continued to exist as an technological organization with no clear revenue expectations. At the same time I am not sure how long they can sustain doing what they are doing OR whether there is this new found feasibility for private research organizations to exist in this space provided they produce clear high-quality output like OpenAI is doing.

评论 #21262571 未加载

评论 #21266620 未加载

jefft255over 5 years ago

As a roboticist, it's really clear to me that this sort of transfer in controlled environment is hard but doable. I think it's already been demonstrated many times and I'm not that convinced that there is anything new in there except more GPU + fancier robot.I'll be impressed by RL is a) they manage to do sim2real in open environments, think Doom -> office building or b) they manage to get data efficient enough that sim2real is still necessary but you don't have to do real data collection with 10 parallel robots for days on end.As someone in mobile robotics as opposed to pure manipulation, I read these papers and I'm like: "How the hell am I supposed to get this to work on a robot moving in the real world???". I don't see anyone being close to this right now.

评论 #21266000 未加载

评论 #21265162 未加载

est31over 5 years ago

This is cool. I wonder about the hardware. Why does the mount for the hand have a fan? Does it contain the inference computer? Power transformers?

askytbover 5 years ago

Does anyone have any experience with soft robotics? For example these guys: <a href="https://www.youtube.com/watch?v=X6CRe2ieuYE" rel="nofollow">https://www.youtube.com/watch?v=X6CRe2ieuYE</a> advertise their gripper as supposedly being able to handle weight/size variety with no training at all, just with the use of different materials in the gripper

breckover 5 years ago

A few weeks back I was at a program synthesis conference and gave a short lightning talk where I said deep learning so far has been used to solve the easy computer chess, and the easy computer go, etc...not to take away from those accomplishments at all, I was just saying that having a robot beat grandmasters at real world physics chess where you have to move the pieces with many degrees of freedom is a harder problem, but trivial for a 7 year old.I thought we were still a decade away from having machines beat humans at real chess and real go, but this makes me think maybes it’s just 5 years out. Very impressive.

评论 #21263868 未加载

imtringuedover 5 years ago

This is a pretty pathetic result and is damning for the progress of AI. Instead of focusing on efficiency, AI researchers simply throw more resources at the problem with the hope that it's enough. The end result after 13000 years of training is a robot hand that can do nothing but solve rubik's cubes and fails 40% of the time.

yCloserover 5 years ago

world record one handed is 6":88 average of 5 is 9":48 <a href="https://www.worldcubeassociation.org/results/rankings/333oh/average" rel="nofollow">https://www.worldcubeassociation.org/results/rankings/333oh/...</a>non-world-class: doing <30" one handed is very doable, anyone can do less than 1 min (yes, if you know how to solve and you trained one handed. ofc not if you never solved a cube in your life)that said... I really don't understand how the hand keeps the cube "floating" around. In one handed the technique is pretty much to keep the cube fixed holding front/back centers with thumb and index. Something like <a href="https://www.youtube.com/watch?v=mUF3aPDTO-4" rel="nofollow">https://www.youtube.com/watch?v=mUF3aPDTO-4</a>I understand the achievement, but wow, this solve is HORRIBLE. What did they train the network with to get this?!

throwaway07Ju19over 5 years ago

Around 1988, I read a book that claimed the ideal robot hand would have fingers that repeatedly bifurcate until it has a digit so small it can manipulate matter at the atomic level. Implausible but fun to think about.

ilakshover 5 years ago

I think artificial muscles that are more biomimetic with better power-to-weight ratios are going to make a huge improvement in robot capabilities at some point. Especially for humanoids.

____Sash---701_over 5 years ago

Any YC companies going after the robotics industry?

a13nover 5 years ago

Would be cool to see it done with two hands (or one), solved faster than the human world record. It's still pretty clumsy looking.