TechEcho

8 comments

afpx, almost 8 years ago

Is anyone doing probabilistic programming with big data? I started experimenting with probabilistic programming frameworks several years ago, but couldn't get them to scale to the level of data I'm working with (~6 dimensions, ~100 trillion vectors), or even a small fraction of that. But I'm sure it's being done in scientific circles somewhere.

Are there communities to collaborate on probabilistic programming? It seems like the domain knowledge is obscure enough that all the good information is locked up in the big corporations and academics.

3pt14159, almost 8 years ago

Welcome to the project I've been waiting for years to get out of alpha. It's frustrating. If I had a hundred million dollars I'd burn a couple million getting this funded. It seems like it will be useful to humanity.

mayneack, almost 8 years ago

Previous discussions:
https://news.ycombinator.com/item?id=6864339
https://news.ycombinator.com/item?id=10750900

jarym, almost 8 years ago

I can imagine this being really great wrapped up as a Postgres extension.

stanfordkid, almost 8 years ago

This sounds kind of similar to the stuff a startup called "Prior Knowledge" was working on before being acquired by Salesforce: https://www.crunchbase.com/organization/prior-knowledge#/entity

indescions_2017, almost 8 years ago

Glad to see this is out as well! Using probabilistic frameworks has the potential to eliminate a lot of the human error that can easily creep into a large simulation. I think it's fair to say that in the future probabilistic modules will become part of the standard library in every programming language, and distribution sampling functions will be as common as trig functions in a math library.

I am curious, though, how I would build up large queries in BQL (the SQL-like query language) or MML (the meta-modeling language). For the orbital example, we conceivably have only a relatively low-dimensional space. But what about a Bayes net for investigating genetic variants in a large genomic population? Doesn't this quickly become intractable?
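To put a rough number on the intractability worry: one way to see it is to count how many distinct network structures (labeled DAGs) exist over n variables, using Robinson's recurrence. The sketch below is plain Python and has nothing to do with BayesDB's own API; the function name `count_dags` is made up for illustration. It only measures the size of the structure search space, not how any particular system actually searches it.

```python
from functools import lru_cache
from math import comb


@lru_cache(maxsize=None)
def count_dags(n: int) -> int:
    """Number of labeled directed acyclic graphs on n nodes.

    Robinson's recurrence:
        a(0) = 1
        a(n) = sum_{k=1..n} (-1)^(k+1) * C(n, k) * 2^(k*(n-k)) * a(n-k)
    """
    if n == 0:
        return 1
    return sum(
        (-1) ** (k + 1) * comb(n, k) * 2 ** (k * (n - k)) * count_dags(n - k)
        for k in range(1, n + 1)
    )


if __name__ == "__main__":
    # Hypothetical variable counts, chosen only to show the growth rate.
    for n in (3, 4, 5, 10, 20):
        print(f"{n:>2} variables -> {count_dags(n)} possible structures")
```

Three variables already allow 25 structures, four allow 543, and five allow 29281. The count grows super-exponentially from there, which is why exhaustive structure search is out of the question at genomic scale and practical approaches lean on sampling or heuristics.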

kensai, almost 8 years ago

Is there a comparison of its accuracy against traditional methods? Admittedly, this machine-assisted modeling sounds really interesting.

elvinyung, almost 8 years ago

They really missed out on an opportunity to call it DataBayes.

BayesDB: A probabilistic programming platform
