I am working on a solution on top of Git, but storing diffs only, can integrate with MySQL and S3, can create version snapshots. You would have the videos in a bucket and the version-controlled history with links to the videos in another, also on S3. Data can be added as a diff commit and later merged into production datasets. You would own the history via a readable Git repo and pointers to your versioning S3, on top of any snapshots. No UI yet, though. Many open questions, some of them:
- Data ingestion: how often per person, how many new videos each time, how many field personnel?
- Dataset carveouts: how often, based on what exactly would you filter?
- Metadata: which ones per video, how often querying, on a specific version of the datasets? A few query examples would help to imagine where the metadata should live.
My email is in the profile, feel free to reach out, most likely my solution is too early stage for your needs.