Ask HN: How to Do ML Research?

4 points by giveexamples almost 2 years ago

For the researchers out there, how do you do research?

Background:

I've been looking at how to create a recurrent seq-to-seq model that's not a transformer. The ideas I implement do not work. It seems like off the well-trodden path there are traps everywhere: how should I tune parameters, add biases, normalize? Is this dataset impossible? Gradient explosion and vanishing, etc.

From a "research = gradient descent" point of view, I'm stuck at a point with no gradient: I have no idea what I'm doing wrong, or what will get a better result. Am I missing a workflow, intuition, tools, or other things?

1 comment

tikkun almost 2 years ago

Here are some AI research links I've collected over the past while; I put them in a public gist: https://gist.github.com/TikkunCreation/5de1df7b24800cc05b4823da4116f1ad

Karpathy's post about the research process in particular may be helpful for you.