Airbnb could likely get a lot more bang for their buck by letting hosts run experiments on pricing than by testing button colors and whatnot.<p>I ran an online marketplace at a previous gig. Our service providers always complained that they didn't know what to charge to maximize their business. They couldn't see the forest for the trees. Because we had the data for all providers, we started telling them when they were under- or over-priced, and we saw more conversions and revenue.<p>Dynamic pricing (like Uber does on holidays) alone could be hugely valuable.
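For flavor, a toy Python version of that under/over-priced nudge; the function, thresholds, and numbers are all hypothetical, not our actual system:

    # Toy pricing feedback: rank a provider's price against comparable
    # listings and nudge them if they sit in the tails. All thresholds
    # here are made up for illustration.
    def price_feedback(my_price, comparable_prices, low_pct=25, high_pct=75):
        below = sum(p < my_price for p in comparable_prices)
        pct = 100.0 * below / len(comparable_prices)
        if pct < low_pct:
            return "likely underpriced vs. comparable listings"
        if pct > high_pct:
            return "likely overpriced vs. comparable listings"
        return "in line with the market"

    print(price_feedback(80, [95, 99, 105, 110, 120, 135, 150]))
    # -> "likely underpriced vs. comparable listings"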
A simple hack is to run an A-A-B-B test instead of an A-B test. Rather than splitting 50-50, use 25-25-25-25 splits. When A1 == A2 and B1 == B2 (i.e., the duplicate buckets are statistically indistinguishable), you know your randomization is sound and you have enough data to compare A to B. Depending on traffic, this could take minutes or weeks.
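A minimal Python sketch of that check, assuming you log (conversions, visitors) per bucket; the counts below are made up:

    # A-A-B-B sanity check: only compare A to B once the duplicate
    # buckets agree with each other.
    from scipy.stats import chi2_contingency

    def indistinguishable(conv1, n1, conv2, n2, alpha=0.05):
        """True if two buckets' conversion rates can't be told apart."""
        table = [[conv1, n1 - conv1], [conv2, n2 - conv2]]
        _, p, _, _ = chi2_contingency(table)
        return p > alpha

    a1, a2 = (130, 1000), (121, 1000)   # (conversions, visitors), hypothetical
    b1, b2 = (155, 1000), (149, 1000)

    if indistinguishable(*a1, *a2) and indistinguishable(*b1, *b2):
        a = (a1[0] + a2[0], a1[1] + a2[1])   # pool the duplicate buckets
        b = (b1[0] + b2[0], b1[1] + b2[1])
        print("A vs B significant?", not indistinguishable(*a, *b))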
Statisticians have been thinking about the right way to handle these sorts of problems for a long time: <a href="https://en.wikipedia.org/wiki/Sequential_analysis" rel="nofollow">https://en.wikipedia.org/wiki/Sequential_analysis</a>.<p>Funnily enough, the page they reference for calculating the right sample size actually talks about sequential analysis, but AirBnB doesn't mention this in describing their solution...
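For the curious, a toy Wald SPRT for a conversion rate, the classic tool from that page; the rates and error levels below are illustrative, not anything AirBnB uses:

    import math

    def sprt(observations, p0=0.10, p1=0.12, alpha=0.05, beta=0.05):
        """Wald's SPRT for a Bernoulli rate: stop as soon as the evidence
        crosses a boundary instead of waiting for a fixed sample size."""
        upper = math.log((1 - beta) / alpha)   # cross it -> accept H1 (rate is p1)
        lower = math.log(beta / (1 - alpha))   # cross it -> accept H0 (rate is p0)
        llr = 0.0                              # running log-likelihood ratio
        for i, converted in enumerate(observations, 1):
            llr += math.log((p1 if converted else 1 - p1) /
                            (p0 if converted else 1 - p0))
            if llr >= upper:
                return "accept H1 after %d samples" % i
            if llr <= lower:
                return "accept H0 after %d samples" % i
        return "keep sampling"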
HN user btilly has a really helpful essay on the math behind stopping tests earlier than your predetermined sample size. It calls for setting a maximum duration and provides stopping points along the way, similar to the method AirBnB describes.<p><a href="http://elem.com/~btilly/ab-testing-multiple-looks/part2-limited-data.html" rel="nofollow">http://elem.com/~btilly/ab-testing-multiple-looks/part2-limi...</a>
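In the same spirit, a crude, conservative stand-in (not btilly's exact procedure): pre-plan a maximum sample size and k looks, and spend alpha/k at each look so the overall false-positive rate stays bounded:

    # Pre-planned interim looks with a Bonferroni-style alpha split.
    # Conservative but valid; btilly's essay derives tighter boundaries.
    from scipy.stats import chi2_contingency

    def run_with_looks(stream_a, stream_b, max_n=10000, looks=5, alpha=0.05):
        per_look = alpha / looks
        checkpoints = {max_n * (i + 1) // looks for i in range(looks)}
        conv_a = conv_b = n = 0
        for xa, xb in zip(stream_a, stream_b):   # one visitor per arm at a time
            conv_a += xa
            conv_b += xb
            n += 1
            if n in checkpoints:
                table = [[conv_a, n - conv_a], [conv_b, n - conv_b]]
                _, p, _, _ = chi2_contingency(table)
                if p < per_look:
                    return "stopped early at n=%d per arm, p=%.4f" % (n, p)
        return "no significant difference by max_n"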
As a developer working on a shopping page for a major competitor to AirBnB, having implemented hundreds of experiments on my page, I can say that these guys are way too obsessed with statistical certainty.<p>Rate of deployment of experiments is a better thing to focus on: since all your competitors are bound to copy your winners anyway, you have to rely on the few months' edge you earn before they do, and constantly maintain that lead.
This article contains some serious p-value abuse. The significance threshold should be adjusted to account for multiple testing, to minimise the chance that a hypothesis gets accepted purely by random chance.<p>Try setting your threshold to your Type 1 error rate <i>divided by the number of tests you perform</i> (the Bonferroni correction). It will be <i>much</i> smaller, and this is a good thing. A significance test should really detect an effect, not reward random chance.
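A minimal sketch of that correction, with made-up p-values:

    # Bonferroni: divide the overall Type 1 error rate by the number of
    # tests, and only accept hypotheses that clear the stricter bar.
    def bonferroni(p_values, alpha=0.05):
        threshold = alpha / len(p_values)
        return [p < threshold for p in p_values]

    print(bonferroni([0.003, 0.04, 0.20, 0.011]))
    # threshold is 0.05 / 4 = 0.0125 -> [True, False, False, True]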
I wish AirBnB would make the price slider's scale logarithmic, to match how prices themselves are roughly distributed. I'm usually only using the left-most 5% of that slider.
The cult of statistical significance is alive and well. A 0.05 p-value implies a 1-in-20 chance of "alternative" performing worse once finally deployed. That's rather risk averse. It also assumes that "alternative" is worse from the get-go. When is that the case? The costs of Type 1 and Type 2 errors are much more balanced in web apps. Anyone care to show me why that's a bad mentality?
Ok, I'll be "that" guy who heckles every AirBnB post, even if this one did have some nice graphs (and ideas).<p>When is AirBnB going to experiment with helping their hosts follow the law? I bet I can predict that graph. Why, look at all those illegal rentals in SF right there in the sample screenshots--oh the irony.<p>Remember, DON'T FUCK UP THE CULTURE! But it's OK to fuck up your host city for a buck or 2 billion.