TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Scikit-Learn Version 1.0

260 pointsby m3atover 3 years ago

10 comments

lysecretover 3 years ago
Excellent library for train_test_split. Jokes aside. This next to Numpy, Pandas Jupyter and Matplotlib + the DL libraries are the reason Python is the powerhouse it is for Data Science.
评论 #28524015 未加载
lr1970over 3 years ago
Early on, pandas made some unfortunate design decisions that are still biting hard. For example, the choice of datetime (pandas.Timestamp) represented by a 64-bit int with a fixed nanosecond resolution. This choice gives dynamic range of +- 292 years around 1970-01-01 (the epoch). This range is too small to represent the works of William Shakespeare, never mind human history. Using pandas in these areas becomes a royal pain in the neck, for one constantly needs to work around pandas datetime limitations.<p>OTOH, in numpy one can choose time resolution units (anything from attosecond to a year) tailoring time resolution to your task (from high energy physics all way to astronomy). Panda&#x27;s choice is only good for high-frequency stock traders, though.
评论 #28526981 未加载
评论 #28528631 未加载
lr1970over 3 years ago
Just to clarify, scikit-learn 1.0 has not been released yet. The latest tag in the github repo is 1.0.rc2<p><a href="https:&#x2F;&#x2F;github.com&#x2F;scikit-learn&#x2F;scikit-learn&#x2F;releases&#x2F;tag&#x2F;1.0.rc2" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;scikit-learn&#x2F;scikit-learn&#x2F;releases&#x2F;tag&#x2F;1....</a>
NeutralForestover 3 years ago
Excellent library with stellar documentation, I hope it&#x27;ll live on for a long time.
评论 #28523920 未加载
评论 #28523362 未加载
评论 #28523359 未加载
laichzeit0over 3 years ago
Great that they finally added quantile regression. This was sorely missed.<p>I’m still hoping for a mixed-effects model implementation someday, like lme4 in R. The statsmodels implementation can only do predictions on fixed effects, which limits it greatly.<p>I’ve always wondered why mixed effect type models are not more popular in the ML world.
评论 #28532297 未加载
conor_fover 3 years ago
<a href="https:&#x2F;&#x2F;0ver.org&#x2F;" rel="nofollow">https:&#x2F;&#x2F;0ver.org&#x2F;</a> will need an update!
infimumover 3 years ago
scikit-learn (next to numpy) is the one library I use in every single project at work. Every time I consider switching away from python I am faced with the fact that I&#x27;d lose access to this workhorse of a library. Of course it&#x27;s not all sunshine and rainbows - I had my fair share of rummaging through its internals - but its API design is a de-facto standard for a reason. My only recurring gripe is that the serialization story (basically just pickling everything) is not optimal.
评论 #28524192 未加载
评论 #28531785 未加载
评论 #28526143 未加载
monkeybuttonover 3 years ago
Scikit-Learn is great, and, reading the documentation for other 3rd party ML packages and seeing the words &quot;Scikit-learn API&quot; is even better.
XoS-490over 3 years ago
What about sktime? <a href="https:&#x2F;&#x2F;github.com&#x2F;alan-turing-institute&#x2F;sktime" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;alan-turing-institute&#x2F;sktime</a>
zibzabover 3 years ago
Is anyone using scikit for NN?<p>Why&#x2F;why not?
评论 #28523242 未加载
评论 #28523294 未加载
评论 #28523504 未加载
评论 #28523442 未加载
评论 #28523251 未加载