TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Show HN: Lambdo – Feature engineering and machine learning together

73 点作者 asavinov超过 6 年前

4 条评论

ericand超过 6 年前
&gt; Feature engineering is a mechanism of creating new levels of abstraction in knowledge representation because each (non-trivial) feature extract and makes explicit some piece of knowledge hidden in the data. It is almost precisely what deep learning is intended for. In this sense, feature engineering does what hidden layers of a neural network do or what the convolutional layer of a neural network does<p>Very intriguing and thoughtful statement. I hadn&#x27;t ever thought of it that way.
评论 #18592880 未加载
评论 #18595105 未加载
asavinov超过 6 年前
Lambdo is a workflow engine which simplifies data analysis by combining in one analysis pipeline<p>* Feature engineering and machine learning: Lambdo does not distinguish them and treats them as data transformations<p>* Model training and prediction: both feature definitions and ML models can be trained as part of one workflow<p>* Table population and column evaluation: workflow consists of nodes of these two types. This makes it similar to Bistro: <a href="https:&#x2F;&#x2F;github.com&#x2F;asavinov&#x2F;bistro" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;asavinov&#x2F;bistro</a><p>Lambdo is intended for the following use cases:<p>* Numerous derived features with parameters derived from the data<p>* Regular re-training is required by using the same features as those to be used during prediction<p>* Time series analysis because it is where the quality of derived features is especially important<p>* Customization via user-defined Python functions
kmax12超过 6 年前
I definitely see the need for packages like this. So much of a data scientist&#x27;s time is spent on feature engineering, but there are relatively few tools out there that are trying to improve that step in the process compared to tools for the modeling step.<p>I see this tool as a something that can help with the deployment piece of the feature engineering. As things stand, it&#x27;s &quot;easy&quot; to package and deploy modeling code, but much harder to package up your feature engineering workflow. In part because there is no agreed upon standard for developing feature engineering pipelines.<p>I&#x27;d be curious how this could be combined with a library like Featuretools (<a href="http:&#x2F;&#x2F;github.com&#x2F;featuretools&#x2F;featuretools&#x2F;" rel="nofollow">http:&#x2F;&#x2F;github.com&#x2F;featuretools&#x2F;featuretools&#x2F;</a>) which helps automate the discovery of features, but currently has less functionality related to deployment.<p>(full disclosure: I work on Featuretools)
评论 #18592101 未加载
mooneater超过 6 年前
Things I always want when looking at something new:<p>- Where does it sit in relation to other data science components?<p>- What does it integrate with, and what is it agnostic to?<p>- Smallest self-contained use case? I see some examples in the repo readme but they are not self-contained so its harder for me to imagine its use
评论 #18592379 未加载
评论 #18591848 未加载