TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Show HN: ClearBrain (YC W18) – Automated Causal Analytics

65 点作者 bmahmood超过 5 年前

8 条评论

bmahmood超过 5 年前
Hi I’m Bilal, cofounder at <a href="https:&#x2F;&#x2F;www.clearbrain.com" rel="nofollow">https:&#x2F;&#x2F;www.clearbrain.com</a> . ClearBrain is a new analytics platform that helps you rank which product behaviors cause vs correlate to conversion. Think Google PageRank, but for Analytics.<p>Our founding team worked on this problem for quite a few years while at Google and Optimizely. We contributed to Google Analytics to analyze historical behaviors in seconds, but observing historical trends merely produced noisy correlations. We built Optimizely to measure true cause and effect through A&#x2F;B testing, but tests took 4-6 weeks on avg to reach significance, and so it would take years to measure the impact of every single page or feature in an app.<p>So we asked ourselves, could we estimate which in-app behaviors cause conversion, to complement (not replace) a traditional A&#x2F;B test? We spent a year in R&amp;D, and built ClearBrain as a self-serve “causal analytics” platform. All you have to do is specify a goal - signup, engagement, purchase - and ClearBrain ranks which behaviors are most likely to cause conversion.<p>Building this required a mix of real-time processing + auto ML + algorithm work. We connect to a company’s app data via Segment, and ingest their app events in real-time via Cloud Dataflow into a BigQuery backend. When a customer uses the ClearBrain UI to select a specific app event as their conversion goal, our backend will automatically run multiple observational studies to analyze how every other app event may cause that goal. This is done in parallel using SparkML, to analyze thousands of different events in minutes. (more on our algorithm here: <a href="https:&#x2F;&#x2F;blog.clearbrain.com&#x2F;posts&#x2F;introducing-causal-analytics" rel="nofollow">https:&#x2F;&#x2F;blog.clearbrain.com&#x2F;posts&#x2F;introducing-causal-analyti...</a>)<p>We’ve had beta customers like Chime Bank, InVision, and TravelBank use ClearBrain to estimate which behaviors and landing pages cause their users to convert, and in turn prioritize their actual growth and A&#x2F;B testing efforts there.<p>We’re now releasing the product into general availability in partnership with Segment - available on a free self-serve basis today! We look forward to feedback from the HN community. :)
评论 #21401509 未加载
newtothebay超过 5 年前
Can you talk about how to infer causality without running an experiment? From your description, &quot;real-time processing + auto ML + algorithm&quot; still sounds very much observational to me.<p>I&#x27;m asking not as knock against your service, but genuine curiosity about how you manage to solve this incredibly hard problem.<p>EDIT: From your white paper, it looks like you&#x27;re running a regression that controls for a bunch of confounders. You also interact the treatment variable with those confounders to get the heterogeneous treatment effect.<p>My concern with that is that we&#x27;re not controlling for unobservable confounders, which make causal inference so difficult. If we assume that controlling for observable confounders is enough (we shouldn&#x27;t!), then correlation and causation are the same.<p>White paper: <a href="https:&#x2F;&#x2F;blog.clearbrain.com&#x2F;posts&#x2F;introducing-causal-analytics" rel="nofollow">https:&#x2F;&#x2F;blog.clearbrain.com&#x2F;posts&#x2F;introducing-causal-analyti...</a>
评论 #21403473 未加载
pfbtgom超过 5 年前
I think you have an interesting product, but I&#x27;m having serious issues with your marketing.<p>Extraordinary claims require extraordinary evidence. How many of your estimated treatment effects have been supported by experiments? Do you have experiments demonstrating that your model generalizes? How accurate are your estimates compared to experimental results?<p>It&#x27;s ironic that you&#x27;re marketing a causal + analytics product without any data. Generating a narrative and basing it off of observational data is the typical trap that many causal claims fall into. Portraying yourselves as statistical experts and pushing unsubstantiated claims is misleading bordering on unethical.
seanwilson超过 5 年前
In reference to how regular AB testing needs a certain amount of data to get statistically significant results, what kind of level of traffic + conversions would you need for this to work? Would it be useful for sites with low traffic?
评论 #21400091 未加载
gingerlime超过 5 年前
Hi Bilal!<p>I think I reached out to you in early 2018. Any news about the integration without segment (e.g. Amplitude in our case)? And what’s the pricing model? Couldn’t find much on the site (maybe it’s more limited on mobile?)
评论 #21403019 未加载
polskibus超过 5 年前
How is this related to process mining theory and tools like Celonis?
puranjay超过 5 年前
Nothing about the product but man, that&#x27;s a fantastically beautiful and informational website. Reminds me of Stripe
评论 #21405628 未加载
dtran超过 5 年前
Congrats on the launch Bilal and team!