TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Tesla’s CV Approach to Autonomous Driving Built an Unassailable Lead in FSD

11 点作者 bald大约 4 年前

13 条评论

porphyra大约 4 年前
Tesla fanboys seem to frequently forget the following facts:<p>* People using lidar also use deep learning.<p>* Lidar and HD maps are totally orthogonal concepts with nothing to do with each other. Lidar helps you avoid running into trucks without HD maps. Camera-based methods can use HD maps too.<p>* Lidars are less affected by rain and snow than cameras are (thanks to larger optical aperture, multiple returns, and faster &quot;shutter speed&quot; due to nanosecond pulses rather than 1&#x2F;30 s exposures).<p>* Tesla&#x27;s &quot;big data&quot; is not more effective than a concerted active learning data collection campaign with a moderate-sized fleet.<p>* HD maps are very cheap to create.<p>* Lidars may in fact be cheaper than the GPU you&#x27;re going to need to run your monocular depth network, while being more accurate and more robust.
评论 #26633541 未加载
评论 #26628598 未加载
tmotwu大约 4 年前
I could not get through the entire article with straight face. It&#x27;s riddled with fallacies and misconceptions on the problems currently facing self driving. It was especially this horrible take that put the credibility of this article into question:<p>&gt; &quot;A simple analogy can illustrate the conceptual difference between Computer Vision and LiDAR. Imagine two students, where one is just cramming and memorizing the content (LiDAR), while the other one is trying to really understand the material and truly learn it (Tesla FSD). The student that learned the material (Tesla FSD) will be able to answer the exam questions correctly, even if the questions on the exam are swapped, the questions are rephrased, or new components are added to the questions, while the student that memorized the content (LiDAR) will likely fail the exam.&quot;<p>Tesla&#x27;s advantage isn&#x27;t even it&#x27;s computer vision systems - computer vision models don&#x27;t exactly scale like language transformers (where larger, sparser parameters make better models). There hasn&#x27;t been any significant advancements in vision models since Faster-RCNNs or YOLO, which is close to five years old now. Especially if you want to compare Tesla&#x27;s SotA against Waymo, which has an army of Captcha labelers and a large plethora of example images.<p>The goldmine is in identifying and navigating around rare edge cases - data that can only be obtained with hundreds of thousands of hours of real world driving. It has very little to do with the correct set of camera and calibration configurations. The novel research is deep into safety verification strategies. Like how do can we predict human or object behavior by using miliseconds of prior movement of an object? Or how to use human arm movements and eye pupils to determine if a pedestrian or other driver is distracted? Can we avoid an accident if we have a better behavioral understanding of the scene?
dr_faustus大约 4 年前
What lead? The main advantage of Tesla is that they have the guts to put beta quality software in cars and get away with it (from a regulatory perspective) up to now.<p>As can be witnessed in loads of videos on YouTube, the current FSD betas still pretty much require full attention (I would say even more attention than driving youself because you have to watch the environment AND the behaviour of Autopilot). And these demos are on American roads which are among the easiest to drive in the world (generally very wide, lots of traffic lights, large intersections, etc.).<p>Bad weather conditons (which are a thing outside of southern California) are even more problematic. So while the tech might or might not be more advanced than what other car companies have in their labs, reliably its still pretty much an improved cruise control and in that regard not that far ahead of the competition (all premium brands have that in one way or the other).
评论 #26628743 未加载
评论 #26628861 未加载
jsight大约 4 年前
Given the early stage, I find the idea of anyone having an &quot;unassailable lead&quot; to be laughable.<p>The idea that the one with such a lead is Tesla is even more laughable, and I say that as someone that actually likes some significant elements of Tesla&#x27;s approach. Although I pretty strongly dislike some aspects too.
评论 #26628558 未加载
评论 #26631610 未加载
NovemberWhiskey大约 4 年前
There seems to be gigantic non-sequitur here where good performance of convolutional neural networks for image recognition is seen as evidence that visual sensors suffice for full driving autonomy and that LIDAR-based approaches are heading in the wrong direction.<p>There doesn&#x27;t seem any evidence for the proposition that &quot;if only we can train the model with more data it&#x27;ll suddenly be good enough&quot;. There&#x27;s an extremely long tail of scenarios, operating in different weather conditions, in different environments, at different times of day, and the driving problem is adaptive - the network needs to predict how others will respond to its own behavior.<p>The authors dismiss the idea that sensor fusion may provide more additive capability than train-with-m0ar-data but don&#x27;t actually seem to provide any basis for that.
adflux大约 4 年前
Another dumb post by someone who believes Tesla operates in a vacuum... Tesla is a very small player in terms of cars produced. Volkswagen has started outselling Tesla in some European countries. And dont forget about GM and Toyota. No lead is unaissalable in a highly competitive market like car manufacturing. And its not like there&#x27;s no competition from the tech side either. Microsoft, Apple and Google are all working on making self driving cars. Any combination of said tech companies with a large car manufacturer would be a serious threat.
评论 #26628496 未加载
soVeryTired大约 4 年前
Has the author done any serious work in computer vision? Imagenet is one thing, but there are many many problems that need to be solved before a vision-based vehicle can run reliably. I worked at a self-driving company for a while, and they struggled with issues like multiple target tracking and reasoning about occluded vehicles.<p>And in a sense, vision is the easy part. There are whole areas like prediction and motion planning where the current state of the art isn&#x27;t really up to scratch.
verdverm大约 4 年前
hmm, I think the author missed that Waymo drove in snow 3 years ago [0], obviously outside of Pheonix. They also don&#x27;t seem to understand it&#x27;s CV + Lidar, not versus... I&#x27;d be surprised at a CV only system that can handle snow like seen in the Waymo example.<p>The rest of the article does not hold up once you realize the author is in an either-or mindset and thinks Waymo is not using CV, which they are, and has vertical vision stack, with arguably better experience at scaling ML<p>If you want to see actual FSD (without a human in the driver seat), this Waymo beta user has been video documenting on a regular basis: <a href="https:&#x2F;&#x2F;www.youtube.com&#x2F;playlist?list=PL-13jt3ZPb7X6qJTo_MEnREMe-86I4S_q" rel="nofollow">https:&#x2F;&#x2F;www.youtube.com&#x2F;playlist?list=PL-13jt3ZPb7X6qJTo_MEn...</a><p>Having seen the &quot;self-driving&quot; displays from both, Waymo&#x27;s shows Waymo information, which gives me greater trust in the system. By contrast, there are Tesla videos with some scary moments.<p>[0] <a href="https:&#x2F;&#x2F;www.engadget.com&#x2F;2018-05-08-waymo-snow-navigation.html" rel="nofollow">https:&#x2F;&#x2F;www.engadget.com&#x2F;2018-05-08-waymo-snow-navigation.ht...</a>
评论 #26628358 未加载
评论 #26628391 未加载
评论 #26628474 未加载
josefresco大约 4 年前
&quot;Immediately, the Bolt EUV is, in my experience, the best autonomous driving experience available today.&quot;<p>&quot;But for my money right now, Super Cruise beats Tesla’s Autopilot in real-world usage.&quot;<p><a href="https:&#x2F;&#x2F;electrek.co&#x2F;2021&#x2F;03&#x2F;05&#x2F;chevy-bolt-super-cruise-autopilot&#x2F;" rel="nofollow">https:&#x2F;&#x2F;electrek.co&#x2F;2021&#x2F;03&#x2F;05&#x2F;chevy-bolt-super-cruise-autop...</a>
shusson大约 4 年前
&gt; Why is Computer Vision (using neural networks) superior to LiDAR?<p>It would be better to compare LiDAR to CMOS sensors. You can apply computer vision techniques to either.
评论 #26628736 未加载
mrjet大约 4 年前
I have spent the last six years working in self-driving and disagree with the article. I am an autonomy engineer at a major self-driving focused company.<p>First: The choice is between “lidar and cameras” and “cameras alone.” I am not aware of any contenders who have are only using lidar. That means the only downside to using lidar is cost.<p>Second, the article is incorrect. lidar is extremely reliable for detecting dogs, pedestrians, and anything else you can think of. For lidars with sufficient intensity sensitivity, you can even read the text on signs.<p>Here’s a list of some tradeoffs for available sensors.<p>Cost: Sterling Anderson said in a talk at MIT a few years ago “there is no unobtanium in lidar.” Making lidar cheap is a matter of manufacturing scale. Not a matter of new physics. Cameras are still much cheaper and will remain so for some time. This alone might justify choosing cameras for consumer vehicles. The game-changing imaging radars that exist are not cheap.<p>Long-Tail Events: On a camera-based system without depth sensors, the vehicle must react based on correct identification of obstacles. Consider an image of a pedestrian painted onto the road. A system with depth sensors will not need to stop.<p>Depth estimation with multiple cameras leaves a lot to be desired. It is bad for untextured objects. Poor illumination conditions will prevent texture from being visible to the cameras. Poor illumination conditions have no effect on lidar&#x2F;radar.<p>I would not bet my life on a estimated depth from a monocular camera, no matter how many layers the DNN has.<p>Weather: Lidar works fine in the rain and snow. Degraded, but fine. Radar works fine in the rain and snow. Cameras can be made to work well, especially if placed in enclosures that self-clean. ATG’s vehicles famously made “whooshing” sounds as their pneumatic lens-cleaners forced water off of their camera lenses.<p>Time-of-Day: visible light cameras will do poorly. Every system I have seen has degraded camera performance at night. Some systems include an NIR channel to help. You cannot bring enough onboard illumunation to compete with the Sun. Most lidars choose a wavelength that leaves them completely unaffected in day vs night. Ouster has different noise characteristics during the day, but not enough to matter.<p>Range: At long distances, no commercial sensor can beat the angular resolution of cameras. This is where they shine most. That’s why you see highway-focused systems emphasize cameras so much. Blackmore was a promising path to enabling highway-range lidar capability, but they were bought by Aurora years ago now.<p>It is possible that cameras are completely sufficient. It is possible that Tesla is even ahead. But this article’s reasons won’t be the causal factor.<p>The company that builds a functional autonomous car will introduce the largest sea change in transportation since the automobile. The value delivered by each car will be massive. An additional $8,000 for a single lidar is not a dealbreaker. And that’s at today’s costs for a nice Hesai. In 2025 it will be yet smaller. In 2016, the only good lidar on the market was the Velodyne HDL64, which cost $80k. Pucks failed too often.
bpodgursky大约 4 年前
I don&#x27;t know how this will pan out, but I&#x27;ve seen a lot of criticism of Tesla &quot;only&quot; using vision, as if it&#x27;s a ridiculous concept that will never work.<p>But humans drive... only using vision.<p>Maybe it&#x27;s not possible for Tesla to get true FSD using only video data given current technology, but the idea that it&#x27;s laughable doesn&#x27;t make any sense to me. People drive in new environments, using only their eyes, all the time.
评论 #26628437 未加载
评论 #26628404 未加载
评论 #26628456 未加载
benjohnson大约 4 年前
We will need better-than-vision to drive safely - whiteness all the humans that pile up in multi-car crashes when it gets foggy or snowing.<p>Lidar, or something like it, will have to be part of the equation given that the best visual processing computers of all time (human brains) don’t get enough data from their systems to make good choices in bad weather.
评论 #26628338 未加载
评论 #26628336 未加载
评论 #26628499 未加载
评论 #26628393 未加载