TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

The Importance of a Data Acquisition Team

1 pointsby xLaszloover 4 years ago

1 comment

xLaszloover 4 years ago
Organising labelling is a core part of productionised ML and the Domain Team has the most knowledge to do it. But are they the best suited to do it?<p>Our experience shows that because incentives differ in production and off-line labelling, the Domain Team might create biased (subjective) dataset in Human-In-The-Loop settings.<p>Adding an extra team that was trained by the Domain Team but can only influence end-to-end performance through the deployed model helped to keep the training dataset objective while improving the productionised pipeline and helping the Domain Team.<p>First time HN poster, please share your recommendations in the comments. We are working on a &quot;Machine Learning Product Manual&quot; ebook about best practices to productionise and operate ML from a product perspective. Follow me on twitter at <a href="https:&#x2F;&#x2F;twitter.com&#x2F;xLaszlo" rel="nofollow">https:&#x2F;&#x2F;twitter.com&#x2F;xLaszlo</a> for updates.