TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Reproducible machine learning with PyTorch and Quilt

135 pointsby akarvealmost 7 years ago

7 comments

p1eskalmost 7 years ago
Oh, this resonates with me so much! I&#x27;m running 4 different DeepSpeech models right now, each using a differently processed version of LibriSpeech dataset (mfcc&#x2F;fbanks&#x2F;linear spectrograms, deltas? energy? padding? etc). Because the original DS papers didn&#x27;t bother describing it, and every implementation I found uses completely different methods and libraries.<p>Not to mention every one of those implementation packages their preprocessed version into a different data format, and then creates a different data pipeline (and I only looked at tensorflow implementations)
评论 #17554018 未加载
dkobranalmost 7 years ago
In case you missed it, here&#x27;s a link to the full training example that you can run yourself: <a href="https:&#x2F;&#x2F;www.paperspace.com&#x2F;console&#x2F;jobs&#x2F;jvqssfqawv5zn&#x2F;logs" rel="nofollow">https:&#x2F;&#x2F;www.paperspace.com&#x2F;console&#x2F;jobs&#x2F;jvqssfqawv5zn&#x2F;logs</a><p>Inference example: <a href="https:&#x2F;&#x2F;www.paperspace.com&#x2F;console&#x2F;jobs&#x2F;js4mqzm91fj2lg" rel="nofollow">https:&#x2F;&#x2F;www.paperspace.com&#x2F;console&#x2F;jobs&#x2F;js4mqzm91fj2lg</a><p>Disclosure: I work on Paperspace
infinity0almost 7 years ago
A step in the right direction for machine learning in science, but they could have done some more research into naming conflicts:<p>$ apt-cache show quilt<p>Package: quilt<p>[..]<p>Description-en: Tool to work with series of patches<p>Quilt manages a series of patches by keeping track of the changes each of them makes. They are logically organized as a stack, and you can apply, un-apply, refresh them easily by traveling into the stack (push&#x2F;pop). . Quilt is good for managing additional patches applied to a package received as a tarball or maintained in another version control system. The stacked organization is proven to be efficient for the management of very large patch sets (more than hundred patches). As matter of fact, it was designed by and for Linux kernel hackers (Andrew Morton, from the -mm branch, is the original author), and its main use by the current upstream maintainer is to manage the (hundreds of) patches against the kernel made for the SUSE distribution. . This package provides seamless integration into Debhelper or CDBS, allowing maintainers to easily add a quilt-based patch management system in their packages. The package also provides some basic support for those not using those tools. See README.Debian for more information.<p>$ zcat &#x2F;usr&#x2F;share&#x2F;doc&#x2F;quilt&#x2F;changelog.gz | tail -n3<p>Version 0.26 (Tue Oct 21 2003) - Change summary not available
评论 #17571800 未加载
jononoralmost 7 years ago
Was not aware of Quilt for hosting datasets. Is it the go-to in this area? What other alternatives are there?
评论 #17554420 未加载
评论 #17552682 未加载
评论 #17554273 未加载
评论 #17553554 未加载
评论 #17554628 未加载
评论 #17553638 未加载
评论 #17643919 未加载
评论 #17554223 未加载
cwyersalmost 7 years ago
It seems to me like the machine learning algorithm here is mostly learning how to add JPEG compression artifacts to images.
ForFreedomalmost 7 years ago
Isn&#x27;t quilt just bluring the pixels to an extend?
评论 #17579888 未加载
rhackeralmost 7 years ago
Please please please don&#x27;t kill our favorite plot device. Make sure the process takes exactly 3 days.