Andrej Karpathy talks about how Tesla's NNs are structured and trained [video]

415 pointsby ojnover 5 years ago

20 comments

Fun fact for all of you:Some time ago (around ~10 years) this guy (the presenter) was internet famous for being a Rubik cube speed solver and making tutorials and videos about that: <a href="https://www.youtube.com/watch?v=609nhVzg-5Q" rel="nofollow">https://www.youtube.com/watch?v=609nhVzg-5Q</a>

评论 #21498129 未加载

评论 #21499423 未加载

评论 #21499065 未加载

jacquesmover 5 years ago

The competition in this space is great but I can't help but wonder what would happen if instead all these companies pooled their resources and went after the goal collectively. There is so much duplication going on and the paths do not seem to me - as an outsider - to be all that divergent, which is usually a pre-condition for having a lot of independent efforts one of which will succeed.It's as if everybody wants to be the one to exclusively own the tech. Imagine every car manufacturer having a completely different take on what a car should be like from a safety perspective. We have standards bodies for a reason and given the fact that there are plenty of lives at stake here maybe for once the monetary angle should get a back-seat (pun intended) to safety and a joint effort is called for. That would also stop people dying because operators of unsafe software are trying to make up for their late entry by 'moving fast and breaking things' where in this case the things are pedestrians, cyclists and other traffic participants who have no share in the monetary gain.

评论 #21498303 未加载

评论 #21497679 未加载

评论 #21497297 未加载

评论 #21497321 未加载

评论 #21498324 未加载

评论 #21497810 未加载

评论 #21498045 未加载

评论 #21499640 未加载

评论 #21497627 未加载

评论 #21497665 未加载

评论 #21497406 未加载

评论 #21497618 未加载

评论 #21497332 未加载

评论 #21499850 未加载

评论 #21498030 未加载

评论 #21497654 未加载

timzamanover 5 years ago

His team is hiring;<a href="https://www.tesla.com/careers/job/software-engineerdeeplearning-49779" rel="nofollow">https://www.tesla.com/careers/job/software-engineerdeeplearn...</a><a href="https://www.tesla.com/careers/job/machine-learninginfrastructureengineerautopilot-48125" rel="nofollow">https://www.tesla.com/careers/job/machine-learninginfrastruc...</a><a href="https://www.tesla.com/careers/job/machine-learningscientistautopilot-48414" rel="nofollow">https://www.tesla.com/careers/job/machine-learningscientista...</a>

评论 #21496466 未加载

评论 #21496505 未加载

评论 #21496439 未加载

modelessover 5 years ago

Awesome presentation. Crazy that they're developing their own training hardware too. It's going to be a very crowded space very soon. Can they really stay ahead of everyone else in the industry? Can it really be cheaper to staff up whole teams to design chips for cutting edge nodes, fabricate them, build supporting hardware and datacenters and compilers, than to just rent some TPUs on Google Cloud?I can see the case for doing their own edge hardware for the cars (barely), but I really don't think doing training hardware will pay off for them. If they're serious about it, they should spin it out as a separate business to spread the development cost over a larger customer base.Also, I'm really curious whether the custom hardware in the cars is benefiting them at all yet. Every feature they've released so far works fine on the previous generation hardware with 1/10 the compute power. At some point won't they need to start training radically larger networks to take advantage of all that untapped compute power?

评论 #21497376 未加载

评论 #21499760 未加载

评论 #21497049 未加载

评论 #21496986 未加载

评论 #21497178 未加载

评论 #21497729 未加载

评论 #21496939 未加载

评论 #21497139 未加载

评论 #21497044 未加载

sdanover 5 years ago

Really liked this talk.Looks like they are really nicely orchestrating workloads and training on numerous nets asynchronously.As a person in the AV industry I think Tesla's ability to control the entire stack is great for Tesla... maybe not for everyone who can't afford/doesn't have a Tesla.

评论 #21496807 未加载

londons_exploreover 5 years ago

I'm still amazed that Teslas team isn't using a map... I know maps get outdated and are sometimes wrong, but having inaccurate knowledge of what's around the corner is far far more helpful than not having any clue whats around the corner.The smart solution would be to consider a map a probabilistic thing, which neural networks are really good at handling.

评论 #21498415 未加载

评论 #21499534 未加载

Gravitylossover 5 years ago

Interesting that they don't have a full 3D world model. I'm certainly not a machine learning expert. I'm still amazed the route from image recognition to a 2D map of "what's drivable" to autonomous driving is so direct. One would expect to hit a ceiling really soon with that approach.To me it seems we're still in really early days.

评论 #21502099 未加载

eanzenbergover 5 years ago

One thing I didn't quite understand is how training sub-graphs in parallel works. If you are editing a sub-graph of a monolith type model, aren't you affecting other graphs that have dependencies on the one you're editing? If these are independent graphs, then what's a "sub-graph" even mean?

评论 #21496887 未加载

评论 #21497405 未加载

评论 #21497302 未加载

fypover 5 years ago

For those who want to learn more, I would start with Mask-RCNN where you have a very similar architecture: one shared backbone with multiple heads that can be retrained for various tasks (bounding boxes, masks, keypoints, etc): <a href="https://youtu.be/g7z4mkfRjI4?t=628" rel="nofollow">https://youtu.be/g7z4mkfRjI4?t=628</a>

keganover 5 years ago

Anyone knows why Andrej's team chooses PyTorch (as oppose to say TensorFlow?)

评论 #21496975 未加载

评论 #21496570 未加载

评论 #21496746 未加载

评论 #21497002 未加载

laichzeit0over 5 years ago

The good news for me is that the upper bound for fully autonomous self-driving cars is no more than 50 years away. What a time to be alive. If it happens before then, that will be an absolute bonus.

diveanonover 5 years ago

Andrej Karpathy is such a treasure.He is an excellent presenter who really has a passion for teaching.Im not really involved with the industry, so I cant really speak to how he holds up to other experts. However he is by far the most digestable resource I have found for learning about NN and science behind them.If you are just discovering him now, google his name and just start reading. His work is truly binge worthy in the most meaningful way.

SloopJonover 5 years ago

The description of SmartSummon about halfway through the talk is interesting. One of the views looks like SLAM using a particle filter, but Andrej seems to say that it's done entirely within a neural net.

alexnewmanover 5 years ago

Jeeze and I can't get my pytorch to stop leaking memory. I couldn't imagine trying to drive a car with it

评论 #21496690 未加载

jfosterover 5 years ago

I wonder if the environment the car discovers includes elevation. Would be necessary for handling many carparks.

ngcc_hkover 5 years ago

Wow

adamnemecekover 5 years ago

The trick for level 5 is learning the mapping between the lidar point cloud and the video stream. It’s the best of both worlds.

评论 #21497575 未加载

评论 #21496325 未加载

评论 #21496933 未加载

评论 #21498125 未加载

评论 #21497956 未加载

评论 #21496482 未加载

mkageniusover 5 years ago

Oh he's no longer with OpenAI? Sam Altman must be worried about this..

评论 #21496196 未加载

评论 #21496207 未加载

评论 #21496531 未加载

评论 #21496195 未加载

new_realistover 5 years ago

Meanwhile Waymo is way ahead.

评论 #21496651 未加载

评论 #21497515 未加载

评论 #21500241 未加载

评论 #21497010 未加载

mindfulplayover 5 years ago

Just listening to this talk scares me. The amount of errors - even in a seemingly normal, sunny day - is mind boggling to think people trust this crap.How can we rely on the output of eight cameras? This is not a kid's science project.It's all fancy neural networks until someone dies. Pretty callous and Silicon valley-mindset for such an important and critical function of the car.Will never buy a Tesla after having seen this.

评论 #21496631 未加载

评论 #21496592 未加载

评论 #21496567 未加载

评论 #21496775 未加载

评论 #21496665 未加载

评论 #21497428 未加载

评论 #21496825 未加载

评论 #21496636 未加载

评论 #21496572 未加载