In-database R coming to SQL Server 2016

64 points by Hansi, about 10 years ago

9 comments

bladecatcher, about 10 years ago
While this is a good concept in theory, I'd be skeptical about building on top of such a system. The primary reason is the slowness of R. I built heavy-duty data mining systems using a stack of kdb/q and R. In my experience, R, when used for simple clustering algorithms like k-means and k-medoids, slowed down my system by a factor of nearly 70. This is despite running parallelized versions of these algorithms (by means of the SPRINT R package) using mpiexec.

IMO, there is a very big gap in this space. There is an urgent need for high-performance data inference languages. MATLAB is decent, but is still too clunky for my taste. Plus, I prefer the simplicity of a file-mapped, column-oriented database like the one offered by kdb. As KDB is too expensive for me right now, I'm considering building on top of the excellent J language/JDB database stack for my big data needs.
gtrubetskoy, about 10 years ago
For what it's worth, PostgreSQL has had this since 2003: http://www.joeconway.com/plr/

IMHO scripts running in a database server never work all that well: debugging is a nightmare. At least this has been my experience from trying PG plpython a few years ago.

Link to the original announcement email: http://www.postgresql.org/message-id/3E514A46.2040604@joeconway.com
fs111, about 10 years ago
How does that work when R is GPL licensed? Doesn't that make SQL Server a derivative work?
Jake232, about 10 years ago
I'm always wary of these kinds of ideas: executing code on/within my database server.

I know it's apparently sandboxed, but that didn't work out too well for ElasticSearch recently: https://jordan-wright.github.io/blog/2015/03/08/elasticsearch-rce-vulnerability-cve-2015-1427/
saosebastiao, about 10 years ago
I hope it's not as bad as PL/R. It seemed like a good idea, but the performance was so terrible that it was essentially useless.
sorokod, about 10 years ago
How is Java inside Oracle DB doing nowadays?
modarts, about 10 years ago
Shudder
atorralb, about 10 years ago
Just so you know, this is already in SAP HANA... It's nice to see programming languages being part of a database and not just an extension.
vittore, about 10 years ago
Why, Microsoft? Why? Why not Python?