TechEcho

9 comments

bladecatcher, about 10 years ago

While this is a good concept in theory, I'd be skeptical about building on top of such a system. The primary reason is the slowness of R. I built heavy-duty data mining systems using a stack of kdb/q and R. In my experience, R, when used for simple clustering algorithms like k-means and k-medoids, slowed down my system by nearly 70 times. This is despite running parallelized versions of these algorithms (by means of the SPRINT R package) using mpiexec.

IMO, there is a very big gap in this space. There is an urgent need for high-performance data inference languages. MATLAB is decent, but is still too clunky for my taste. Plus, I prefer the simplicity of a file-mapped, column-oriented database like the one offered by kdb. As kdb is too expensive for me right now, I'm considering building on top of the excellent J language/JDB database stack for my big data needs.

gtrubetskoy, about 10 years ago

For what it's worth, PostgreSQL has had this since 2003: http://www.joeconway.com/plr/

IMHO scripts running in a database server never work all that well; debugging is a nightmare. At least this has been my experience from trying PG plpython a few years ago.

Link to the original announcement email: http://www.postgresql.org/message-id/3E514A46.2040604@joeconway.com

fs111, about 10 years ago

How does that work when R is GPL-licensed? Doesn't that make SQL Server a derived work?

Jake232, about 10 years ago

I'm always wary of these kinds of ideas, of executing code on/within my database server.

I know it's apparently sandboxed, but that didn't work out too well for Elasticsearch recently: https://jordan-wright.github.io/blog/2015/03/08/elasticsearch-rce-vulnerability-cve-2015-1427/

saosebastiao, about 10 years ago

I hope it's not as bad as PL/R. Seemed like a good idea, but the performance was so terrible that it was essentially useless.

sorokod, about 10 years ago

How is Java inside Oracle DB doing nowadays?

modarts, about 10 years ago

Shudder

atorralb, about 10 years ago

Just so you know, this is already in SAP HANA... It's nice to see programming languages being part of a database and not just an extension.

vittore, about 10 years ago

Why, Microsoft? Why? Why not Python?

In-database R coming to SQL Server 2016
