TechEcho

zerebubuthalmost 7 years ago

Sounds a lot like the "on the fly parsing" (§3.1) in Alagiannis' NoDB (See <a href="https://stratos.seas.harvard.edu/files/stratos/files/nodb-cacm.pdf" rel="nofollow">https://stratos.seas.harvard.edu/files/stratos/files/nodb-ca...</a> for details).

carterschonwaldalmost 7 years ago

I’m more interested in the work some folks are doing on using succinct structures style techniques to accelerate parsing. <a href="https://github.com/haskell-works/hw-json" rel="nofollow">https://github.com/haskell-works/hw-json</a><p>It’s still relatively immature. But it’s a more algorithmic approach that I think plays nice with pretty much any source of semistructred Data. Though simd acceleration certainly is pretty sweet too

X6S1x6Okd1stalmost 7 years ago

Anyone figure out how to get an instance of spark up with sparser working?

CalChrisalmost 7 years ago

#define PREPROCESSING 1

PaulHoulealmost 7 years ago

I have done that for a long time.

评论 #17719241 未加载

Filter Before You Parse: Faster Analytics on Raw Data with Sparser

5 comments

Filter Before You Parse: Faster Analytics on Raw Data with Sparser

5 comments