TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Ask HN: Are there good resources on data engineering and infra (not science)?

2 pointsby cplatabout 7 years ago
Most of Coursera&#x2F;DataCamp&#x2F;DataQuest&#x2F;Udacity have to do with the science aspects; i.e., running things on your computer. However, I&#x27;ve not yet found equivalent resources for putting data&#x2F;ML systems into production.<p>The aspects that I&#x27;m looking for are not how to set up MySQL or something like that. It has more to do with what all types of systems usually work together well, and how do you set up things like monitoring progress of training a model, deploying the model, versioning the model, etc.<p>I&#x27;m looking for courses&#x2F;books&#x2F;anything else (paid is fine) that will help me venture into this.<p>If you have a &quot;learning path&quot;, that&#x27;d be great to have too!

1 comment

sahil-kangabout 7 years ago
<i>Designing Data Intensive Applications</i> is a good read, especially if you’re interested in the “programming in the large” aspect of data engineering [1]. It does have a slightly theoretical taste to it, but I think you’ll find that helpful since some of the problems you listed don’t really have a good solution at the moment (versioning a model, for example).<p>[1] <a href="http:&#x2F;&#x2F;dataintensive.net" rel="nofollow">http:&#x2F;&#x2F;dataintensive.net</a>