Hey HN, I'm one of the creators of DAGsHub (<a href="https://DAGsHub.com" rel="nofollow">https://DAGsHub.com</a>). Data Science Pull Requests (DS PRs) expand Pull Requests (PRs) to include data, models, and experiments. The idea behind DS PRs is to automate the data science review process and enable Open Source Data Science.<p>Concretely this means:<p>- Reviewing, comparing, and commenting on your experiments (metrics, parameters, visualizations), in context<p>- Seeing what data and models have changed (not just code)<p>- Comparing and diffing notebooks<p>- After reviewing the DS PR, you can merge it in, which will merge code, data, and models all at once<p>Learning to use Data Science PRs is very straightforward, read more here: <a href="https://dagshub.com/docs/collaborating_on_dagshub/data_science_pull_requests/" rel="nofollow">https://dagshub.com/docs/collaborating_on_dagshub/data_scien...</a><p>There is a lot of work to be done, and many things to be improved. I really want to make this workflow as simple and effective for everyone, and your input would be greatly appreciated.
Looks really interesting! How do you connect the data and models in my project to the code, so you know what to merge?<p>Also, any way to easily connect to an existing project on GitHub/GitLab?