TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Statistics with Julia [pdf]

470 pointsby aapelialmost 6 years ago

16 comments

superdimwitalmost 6 years ago
I&#x27;d really recommend anyone doing mildly numerical &#x2F; data-ey work in python to give Julia a patient and fair try.<p>I think the language is really solidly designed, and gives you ridiculously more power AND productivity than python for a whole range of workloads. There are of course issues, but even in the short time I&#x27;ve been following &amp; using the language these are being rapidly addressed. In particular: generally less rich system of libraries (but some Julia libraries are state of the art across all languages, mainly due to easy metaprogramming and multiple dispatch) + generally slow compile times (but this is improving rapidly with caching etc). I would also note that you often don&#x27;t really need as many &quot;libraries&quot; as you do in python or R, since you can typically just write down the code you want to write, rather than being forced to find a library that wraps a C&#x2F;C++ implementation like in python&#x2F;r.
评论 #20424282 未加载
评论 #20423880 未加载
jointpdfalmost 6 years ago
This looks like a good reference for the fundamentals of both statistics and Julia, as claimed. I have a small critique, since the authors asked for suggestions.<p>The format for the code samples goes like (code chunk —&gt; output&#x2F;plots —&gt; bullet points explaining the code line-by-line). This creates a bit of a readability issue. The reader will likely follow a pattern like: (Skim past the code chunk to the explanation —&gt; Read first bullet, referencing line X —&gt; Go back to code to find line X, keeping the explanation in mental memory —&gt; Read second bullet point —&gt; ...). In other words, too much switching&#x2F;scrolling between sections that can be pages apart. Look at the example on pages 185-187 to see what I mean.<p>I’m not sure what the optimal solution is. Adding comments in the code chunks themselves adds clutter and is probably worse (not to mention creates formatting nightmares). I think my favorite format is two columns, with the code on the left side and the explanations on the right.<p>Here’s what I have in mind (doesn’t work on mobile): <a href="https:&#x2F;&#x2F;allennlp.org&#x2F;tutorials" rel="nofollow">https:&#x2F;&#x2F;allennlp.org&#x2F;tutorials</a>. Does anyone know of a solution for formatting something like this?
评论 #20430979 未加载
评论 #20422231 未加载
评论 #20422715 未加载
xvilkaalmost 6 years ago
Note that Julia 1.2[1] is on the verge[2] of being released. Also, it is interesting to see the list[3] of GSoC and JSoC (Julia&#x27;s own Summer of Code). A lot of projects target the ML&#x2F;AI applications. Personally, I am waiting for proper GNN support[4] in FluxML, but seems not much interest in it.<p>[1] <a href="https:&#x2F;&#x2F;github.com&#x2F;JuliaLang&#x2F;julia&#x2F;milestone&#x2F;30" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;JuliaLang&#x2F;julia&#x2F;milestone&#x2F;30</a><p>[2] <a href="https:&#x2F;&#x2F;discourse.julialang.org&#x2F;t&#x2F;julia-v1-2-0-rc2-is-now-available&#x2F;26170" rel="nofollow">https:&#x2F;&#x2F;discourse.julialang.org&#x2F;t&#x2F;julia-v1-2-0-rc2-is-now-av...</a><p>[3] <a href="https:&#x2F;&#x2F;julialang.org&#x2F;blog&#x2F;2019&#x2F;05&#x2F;jsoc19" rel="nofollow">https:&#x2F;&#x2F;julialang.org&#x2F;blog&#x2F;2019&#x2F;05&#x2F;jsoc19</a><p>[4] <a href="https:&#x2F;&#x2F;github.com&#x2F;FluxML&#x2F;Flux.jl&#x2F;issues&#x2F;625" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;FluxML&#x2F;Flux.jl&#x2F;issues&#x2F;625</a>
caiocaiocaioalmost 6 years ago
Julia looked interesting to me, so I tried 1.0 after it came out. I have a oldish laptop (fine for my needs), and every time I tried to do seemingly anything, it spent ~5 minutes recompiling libraries or something. So I&#x27;ve been waiting newer versions that hopefully stop doing that, or for me to buy a better computer.
评论 #20421503 未加载
评论 #20421362 未加载
评论 #20422582 未加载
ChrisRackauckasalmost 6 years ago
This is a very good resource. The one thing I would ask is that I would like to see examples of using DifferentialEquations.jl when you get to the section on dynamical systems, especially when doing discrete event simulation and stochastic differential equations. I opened an issue in the repo and we can continue discussing there (I&#x27;ll help write the code, I want to use this in my own class :P)!
评论 #20421684 未加载
评论 #20431000 未加载
adamnemecekalmost 6 years ago
I invite everyone to check out julia. The language is pleasant and gets out of the way. The interop is nuts. To call say numpy fft, you just do<p>using PyCall<p>np = pyimport(&quot;numpy&quot;)<p>np.fft.fft(rand(ComplexF64, 10))<p>Thats it. You call it with a julia native array, the result is in a julia native array as well.<p>Same with cpp<p><a href="https:&#x2F;&#x2F;github.com&#x2F;JuliaInterop&#x2F;Cxx.jl" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;JuliaInterop&#x2F;Cxx.jl</a><p>Or matlab<p><a href="https:&#x2F;&#x2F;github.com&#x2F;JuliaInterop&#x2F;MATLAB.JL" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;JuliaInterop&#x2F;MATLAB.JL</a><p>It&#x27;s legit magic
评论 #20421369 未加载
bdod6almost 6 years ago
Can someone explain how this is more powerful than someone use an Python&#x2F;R based workflow? E.g., I currently use a combination .ipynb, python scripts, and RStudio and this feels like it covers everything I need for any data science project.
评论 #20422212 未加载
评论 #20422103 未加载
aapelialmost 6 years ago
Accompanying code here: <a href="https:&#x2F;&#x2F;github.com&#x2F;h-Klok&#x2F;StatsWithJuliaBook" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;h-Klok&#x2F;StatsWithJuliaBook</a>
Merrillalmost 6 years ago
In section &quot;1.2 Setup and Interface&quot; there is a very short description of the REPL and how it can be downloaded from julialang.org, as well as a much longer description of JuliaBox and how Jupyter notebooks can be run from juliabox.com for free.<p>Although JuliaBox has been provided for free by Julia Computing, there has been discussion that this may not be possible in the future. However, Julia Computing does provide a distribution of Julia, the Juno IDE, and supported packages known as JuliaPro for free.<p>For new users, would the free JuliaPro distribution be a good alternative to JuliaBox and&#x2F;or downloading the REPL and kernal from julialang.org?
评论 #20423384 未加载
cwyersalmost 6 years ago
For people who have more Julia experience -- is this (thinking mainly of chapter 4) representative of how most Julia users do plotting? It looks like a lot of calling out to matplotlib via PyPlot. I know Julia has a ggplot-inspired library called Gadfly.jl, is PyPlot more commonly used?
评论 #20422648 未加载
评论 #20422544 未加载
评论 #20422570 未加载
dlphn___xyzalmost 6 years ago
whats the selling point with Julia? why would i use it over something like R?
评论 #20422312 未加载
评论 #20422221 未加载
jbee618almost 6 years ago
Would love to see chapter exercises to test comprehension and reinforce learning objectives.
chakerbalmost 6 years ago
I was going to ask is there any Kindle version of this, then I skimmed over the book, and I don&#x27;t think it will be readable on a Kindle. And even if it does, the reading experience will definitely be inferior.
评论 #20431049 未加载
mrutsalmost 6 years ago
Julia is everything python could have been, and much more. I&#x27;m stuck with python right now as a lot of people in the data science&#x2F;ML community are, but it&#x27;s becoming increasingly viable to use Julia for &quot;real&quot; work. The Python-Julia interop story is pretty strong as well, which allows you to (somewhat) easily convert pandas&#x2F;pytorch&#x2F;sklearn code into Julia using Python wrappers. Julia has some unconventional things in it but they are all growing on me:<p>1. Indices by default start with 1. This honestly makes a ton of sense and off by one errors are less likely to happen. You have nice symmetry between the length of a collection and the last element, and in general just have to do less &quot;+ 1&quot; or &quot;- 1&quot; things in your code.<p>2. Native syntax for creation of matrices. Nicer and easier to use than ndarray in Python.<p>3. Easy one-line mathematical function definitions: f(x) = 2*x. Also being able to omit the multiplication sign (f(x) = 2x) is super nice and makes things more readable.<p>4. Real and powerful macros ala lisp.<p>5. Optional static typing. Sometimes when doing data science work static typing can get in your way (more so than for other kinds of programs), but it&#x27;s useful to use most of the time.<p>6. A simple and easy to understand polymorphism system. Might not be structured enough for big programs, but more than suitable for Julia&#x27;s niche.<p>Really the only thing I don&#x27;t like about the language is the begin&#x2F;end block syntax, but I&#x27;ve mentioned that before on HN and don&#x27;t need to get into it again.
评论 #20422862 未加载
评论 #20422956 未加载
评论 #20421907 未加载
评论 #20422265 未加载
abakusalmost 6 years ago
I find Julia&#x27;s .&gt; , .==, .*, .&#x2F; (dots for element-by-element ufunc)... really ugly. Numpy&#x27;s design is cleaner and better.
评论 #20425534 未加载
plouffyalmost 6 years ago
Commenting to find later.
评论 #20421208 未加载
评论 #20422156 未加载