I replicated this benchmark (a grouped sum over 10 million rows) in R on my laptop, and it runs in about a quarter of a second. Seems like a pretty trivial benchmark?<p>library(data.table)<p>x = data.table(a=sample(10,10e6,replace=TRUE),num=sample(100,10e6,replace=TRUE))
t1=proc.time(); x[,sum(num),by=a]; print(proc.time()-t1)<p><pre><code>   user  system elapsed
  0.209   0.032   0.245</code></pre>