TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Ask HN: Clean code to learn ML feature-extraction techniques

12 pointsby datashovelabout 9 years ago
The ML community has done an amazing job disseminating information about the field, making it accessible to large numbers of people like myself who aren&#x27;t professionally involved in the field.<p>That said, the one area that has stumped me and I haven&#x27;t been able to find good clean code to learn from is Feature Extraction. I was wondering if anyone here can point me in the direction of some clean code that a non-expert in the field could learn from.<p>I&#x27;m definitely aware that there are books &#x2F; tutorials on the subject, but none of that has &quot;made it click&quot; for me yet. To be honest I feel like most of my real &quot;advances&quot; have been while looking at code (after familiarizing myself at a high level with the theory and math).<p>For example, the tensorflow playground source code appears to be a gold mine filled with good clean code that a novice can grok.<p>EDIT: If beggars can be choosers I&#x27;m most interested in seeing a practical implementation that uses a clustering algorithm (such as kmeans) to build up a set of features from image data. Such a technique is discussed in the following video.<p>https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?v=wZfVBwOO0-k

1 comment

karterkabout 9 years ago
What task are you learning the features for? If it&#x27;s classification, instead of treating it as a two step (feature generation followed by classification) problem, I suggest that you just use a multi-layer neural network with a sigmoid function at the final layer so that you directly predict the output class from raw pixels. This way the feature engineering is taken care of by the algorithm (the weights of the hidden layers).<p>To give an idea of what I mean, see: <a href="http:&#x2F;&#x2F;karpathy.github.io&#x2F;2015&#x2F;10&#x2F;25&#x2F;selfie&#x2F;" rel="nofollow">http:&#x2F;&#x2F;karpathy.github.io&#x2F;2015&#x2F;10&#x2F;25&#x2F;selfie&#x2F;</a>
评论 #11533390 未加载