Hi HN! hietalajulius and I have been working on a toolkit for solving computer vision problems.

These days there are plenty of fancy solutions to many computer vision problems, but good implementations of the algorithms are hard to come by: getting to a working solution requires figuring out lots of different steps, the tools are buggy and poorly maintained, and you often need a lot of training data to feed the algorithms. Projects easily balloon into months-long R&D efforts, even when done by seasoned computer vision engineers. With the Stray Robots toolkit, we aim to lower the barrier to deploying computer vision solutions.

Currently, the toolkit allows you to build 3D scenes from a stream of depth camera images, annotate the scenes using a GUI, and fit computer vision algorithms that infer the labels from single images, among a few other things. In this project, we used the toolkit to build a simple electric scooter detector using only 25 short video clips of electric scooters.

If you want to try it out, you can install the toolkit by following the instructions here: https://docs.strayrobots.io/installing/index.html

Going forward, we plan to add other components such as 3D keypoint detection, semantic segmentation, and 6D object pose estimation.

Let us know what you think! Both of us are here to answer any questions you may have.
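To give a feel for the end result, here is a minimal sketch of running the kind of TorchScript detector this pipeline produces. The file names are hypothetical and the exact input/output format depends on how the model was exported; this is not the toolkit's documented interface.

    # Minimal TorchScript inference sketch; file names and the I/O
    # format are assumptions, not the toolkit's documented interface.
    import torch
    from torchvision.io import read_image

    model = torch.jit.load("scooter_detector.pt")
    model.eval()

    # Load an image as a float CHW tensor scaled to [0, 1].
    image = read_image("street.jpg").float() / 255.0

    with torch.no_grad():
        detections = model([image])  # typically boxes, scores, labels
    print(detections)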
Using video to automatically build a large training set is smart! Well done! I was thinking about making a properly free and open dataset from just walking around London, and this gives me some ideas...
Super cool, especially the way it was able to differentiate that Posti box from the scooters, even though they have vaguely the same shape. Just out of curiosity, what confidence level did the classifier assign to the Posti box as a scooter?
Just so I understand the idealized pipeline here, a user does the following:

1. Use the Scanner app to capture the images and camera pose data
2. Export the scene directory (color and depth images and JSON files) somehow to your computer
3. Import (integrate, open) the directory via the Stray CLI
4. Annotate voxels via 3D bounding boxes in the Studio GUI
5. Generate labels from the annotated voxels
6. Import the data and labels, then train and test a Detectron model with PyTorch (a rough sketch of this step is below)
7. Export the trained model in TorchScript format
8. Profit

I assume you require users to "ETL" the scene directory from the phone to a desktop/laptop via some manual transfer process?

Is there any reason I couldn't stop at step 5 and push my new labeled data to my own training system?
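For anyone curious about step 6, here is a minimal sketch of what fine-tuning on the exported labels might look like with detectron2, assuming the labels come out in COCO format; the dataset name, paths, and solver settings below are hypothetical, not something the toolkit prescribes.

    # Minimal detectron2 fine-tuning sketch. Assumes COCO-format
    # annotations; all paths and names here are hypothetical.
    from detectron2 import model_zoo
    from detectron2.config import get_cfg
    from detectron2.data.datasets import register_coco_instances
    from detectron2.engine import DefaultTrainer

    # Register the exported dataset under a name detectron2 can look up.
    register_coco_instances(
        "scooters_train", {}, "dataset/annotations.json", "dataset/images"
    )

    cfg = get_cfg()
    cfg.merge_from_file(
        model_zoo.get_config_file("COCO-Detection/faster_rcnn_R_50_FPN_3x.yaml")
    )
    # Start from COCO-pretrained weights; 25 clips is not much data.
    cfg.MODEL.WEIGHTS = model_zoo.get_checkpoint_url(
        "COCO-Detection/faster_rcnn_R_50_FPN_3x.yaml"
    )
    cfg.DATASETS.TRAIN = ("scooters_train",)
    cfg.DATASETS.TEST = ()
    cfg.MODEL.ROI_HEADS.NUM_CLASSES = 1  # single "scooter" class
    cfg.SOLVER.IMS_PER_BATCH = 2
    cfg.SOLVER.BASE_LR = 0.00025
    cfg.SOLVER.MAX_ITER = 1000

    trainer = DefaultTrainer(cfg)
    trainer.resume_or_load(resume=False)
    trainer.train()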
Newbie here: where's the intersection between object detection and OCR?

For example, if I have images in different PDF files that I want to compare, or I'm trying to identify information on a wine label, what criteria should I consider when choosing which method to use?
Heads up for anyone else: I was interested in trying the Stray Scanner app on my iPhone 11, but I'm getting an error when trying to record: "unsupported device: this device doesn't seem to have the required level of ARKit support".