Hey HN community! Creator of DagsHub here. Hacktoberfest 2021 is well underway, but there's still a lot of time left, and I was missing some opportunities to contribute to the community on the ML/DS fronts.<p>We've decided to support Hacktoberfest by creating an open-source catalog of datasets in the audio domain. The idea is to have a bunch of audio datasets, which will be completely open-source, with the ability to view, visualize (waveform, spectrograms, etc), and download to use in your projects. Check out this dataset that I created as an example: https://dagshub.com/DagsHub/Librispeech-ASR-corpus/src/master/dev-clean/84/121123/84-121123-0000.flac.<p>You can read the full guidelines here: https://dagshub.com/blog/hacktoberfest-x-dagshub-2/
Would be happy to answer questions, but I think if you're passionate about open-source ML, this is a great opportunity to contribute.