
TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.


Launch HN: Sieve (YC W22) – Pluggable APIs for Video Search

71 points by mvoodarla over 3 years ago
Hi HN, we're Mokshith and Abhi from Sieve (https://sievedata.com). We're building an API that lets you add video search to internal tools or customer applications, instantly. Sieve can process 24 hours of video in less than 10 minutes, and makes it easy to search video by detected objects / characteristics, motion data, and visual similarity. You can use our models out of the box, or plug your own model endpoints into our infrastructure. ('Model' here means any software that produces output given an image.)

Every industry from security, to media, supply chain, construction, retail, sports, and agriculture is being transformed by video analytics—but setting up the infrastructure to process video data quickly is difficult. Having to deal with video ingestion pipelines, computer-vision model training, and search functionality is not pretty. We're building a platform that takes care of all of this so teams can focus on their domain expertise, building industry-specific software.

We met in high school, where we were on the robotics team together. It was our first exposure to computer vision, and something we both deeply enjoyed. We ended up going to UC Berkeley together and worked on computer vision at places like Scale AI, Niantic, Ford, NVIDIA, Microsoft, and Second Spectrum. We were initially trying to solve problems for ourselves as computer vision developers, but quickly realized the unique problems in video having to do with cost, efficiency, and scale. We also realized how important video would be in lots of verticals, and saw an opportunity to build infrastructure that wouldn't have to be rebuilt by a fullstack dev at any company again.

Let's take the example of cloud software for construction, which might include tons of features from asset trackers to rental management and compliance checks. It doesn't make sense for a construction software company to build its own video processing for telematics—the density and scale of video make this a difficult task. A single 30 FPS camera generates over 2.5M frames within a day of recording. Imagine this across thousands of cameras and many weeks of footage—not to mention the actual vertical-specific software they're building for end users.

Sieve takes care of everything hard about processing and searching video. Our API allows you to process and search video with just two API calls. We use filtering, parallelization, and interpolation techniques to keep costs low while processing 24 hours of video in under 10 minutes. Users can choose from our pre-existing set of models, or use their own models with our video processing engine. Our pricing ranges from $0.08 to $0.45 per minute of video processed, based on the models clients are interested in and usage volume. Our FAQ page (https://sievedata.com/faq) explains these factors in more detail.

Our backend is built on serverless functions. We split each video into individual chunks, which are processed in parallel and passed through multiple layers of filters to determine which chunks are "important". We're able to algorithmically ignore parts of video that are static or change minimally, and focus on the parts that contain real action. We then run more expensive models on the most "important" parts of the video, and interpolate results across frames to return information to customers at 30 FPS granularity. Our customers simply push signed video URLs to our platform, and this happens automatically. You can then use our API to query for intervals of interest.

We haven't built an automated sign-up flow yet because we're focused on building out the core product for now. But we wanted to give all of you the chance to try Sieve on your own videos for free, so we've set up a special process for HN users. Try it out here: https://sieve-data.notion.site/Trying-Sieve-s-Video-Search-4bd7754bb04d468fb1a6c98225f68ccb. We'll email you a personal, limited-access API key.

Here's a video demo of using our dashboard to do video search: https://www.youtube.com/watch?v=_uyjp_HGZl4

We'd love to hear what you think about the product and vision, and ideas on how we can improve it. Thanks for taking the time to read this; we're grateful to be posting here :)
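The chunk → filter → interpolate pipeline described above can be sketched in miniature. This is a toy illustration, not Sieve's actual code: each frame is reduced to a single "activity" number, and the importance filter and interpolation are deliberately simplistic stand-ins.

```python
# Toy sketch of the backend pipeline described in the post: split video
# into chunks, cheaply filter out static chunks, and interpolate sparse
# model outputs back to per-frame granularity.

def chunk(frames, size):
    """Split a frame sequence into fixed-size chunks (processed in parallel)."""
    return [frames[i:i + size] for i in range(0, len(frames), size)]

def is_important(chunk_frames, threshold=1.0):
    """Cheap filter: keep chunks whose frame-to-frame change exceeds a threshold."""
    deltas = [abs(b - a) for a, b in zip(chunk_frames, chunk_frames[1:])]
    return max(deltas, default=0) > threshold

def interpolate(start, end, n):
    """Fill n per-frame results between two sampled model outputs."""
    step = (end - start) / (n - 1) if n > 1 else 0
    return [start + step * i for i in range(n)]

# Run the "expensive model" only on important chunks; skip static ones.
frames = [0, 0, 0, 0, 7, 9, 0, 0, 0]   # per-frame activity scores
important = [c for c in chunk(frames, 3) if is_important(c)]
```

In the real system, detection models would run only on the surviving chunks, with their outputs interpolated across neighboring frames to recover 30 FPS granularity, as the post describes.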

5 comments

plasma over 3 years ago
Neat project — you may want to consider adding audio processing too (e.g. sound detected) as part of the video.

You could go deeper and compare samples of audio that could be uploaded separately (e.g. siren sounds); check out MFCC processing (https://en.wikipedia.org/wiki/Mel-frequency_cepstrum#Applications) to do Shazam-style audio comparison.

I wonder too if you process it on a per-frame basis, or whether you can take a series of frames (e.g. analyze the last 5 seconds of frames) to detect things like a "hand wave".
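The matching step this comment suggests can be sketched as follows, assuming each clip has already been reduced to a fixed-length feature vector (MFCCs in practice, via a library such as librosa; the vectors here are made-up stand-ins):

```python
# Toy sketch of Shazam-style matching: compare feature vectors (MFCCs in
# a real system) with cosine similarity and pick the closest reference.
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

def best_match(query_vec, reference_vecs):
    """Return the index of the reference clip most similar to the query."""
    return max(range(len(reference_vecs)),
               key=lambda i: cosine_similarity(query_vec, reference_vecs[i]))
```

Real audio fingerprinting is considerably more involved (windowing, peak hashing, time alignment); this only shows the final comparison step.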
jensneuse over 3 years ago
Hey, that's a really nice product. I love that it's API first! Would you be interested in adding it to our API hub? We're currently in private beta and would love to have your API on it. Why? It's the best developer experience to integrate APIs, and we'd love to send our users your way. Here's a link if you're interested: https://hub.wundergraph.com/
applgo443 over 3 years ago
This is really cool!

I have a few questions:

1. Is there really no other competitor or company that tried to tackle this problem in the past? It feels like a really common use case and someone must've done something about it!

2. Do you have a fixed set of words that the user should use to query? I'm an AI researcher/practitioner who worked in this area. It's super difficult to search for tail objects in images/text.

3. Why API first?
PanMan over 3 years ago
Like the concept, but especially for live feeds it seems expensive, at $3,456/month/camera? Is this not your use case (it's the first one on your use-cases page)? Or am I missing something? Congrats on the launch!
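The $3,456 figure follows directly from the post's low-end rate, assuming one continuously recording camera and a 30-day month:

```python
# Back-of-envelope monthly cost for one always-on camera, using the
# $0.08–$0.45 per-minute pricing range quoted in the launch post.

MINUTES_PER_DAY = 24 * 60          # 1,440 minutes of footage per day
DAYS_PER_MONTH = 30                # assumed billing month
RATE_LOW, RATE_HIGH = 0.08, 0.45   # $ per minute of video processed

minutes_per_month = MINUTES_PER_DAY * DAYS_PER_MONTH    # 43,200 minutes
monthly_low = round(minutes_per_month * RATE_LOW, 2)    # $3,456 at $0.08/min
monthly_high = round(minutes_per_month * RATE_HIGH, 2)  # $19,440 at $0.45/min
```

The post does note that pricing depends on usage volume, which presumably matters at this scale.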
Neander over 3 years ago
This is cool. Something tangential I've always wanted from video apps and APIs is the ability to highlight any video online: to just bracket cool clips in long videos and then be able to view, and later edit, those highlights together rapidly.