Is Facebook's “Prophet” the time-series Messiah or just a naughty boy?

198 points by cton almost 4 years ago

28 comments

lr1970 almost 4 years ago

As someone who spent a good part of my professional career in forecasting and time-series analysis, I would like to point out that "point forecasts" are mostly useless in many important practical applications such as FinTech, e-commerce, sports betting, etc. Point-forecast models such as Prophet fail to give you a meaningful measure of uncertainty of the predicted value. A much better approach is to use probabilistic forecasting models, which predict the probability distribution of the random variable of interest [0]. A probability distribution is the right language to express predicted values and their uncertainty at the same time. And decisions based on forecasts oftentimes take the uncertainty measures into consideration, e.g. portfolio risk optimization or buying decisions in e-commerce.

[0] https://en.wikipedia.org/wiki/Probabilistic_forecasting
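
To make the difference between a point forecast and a distribution forecast concrete, here is a minimal sketch using only the Python standard library. It is not how Prophet or any particular package works: the demand series is made up, and the "model" is just the last observation plus a resampled historical one-step change. The point is that the same set of samples yields both a single number and an interval around it.

```python
import random
import statistics

def sample_forecast(history, n_samples=1000, seed=0):
    """Draw samples of the next value: last observation plus a resampled one-step change."""
    rng = random.Random(seed)
    changes = [b - a for a, b in zip(history, history[1:])]
    return [history[-1] + rng.choice(changes) for _ in range(n_samples)]

# Made-up demand series, for illustration only.
history = [100, 104, 101, 107, 110, 108, 115, 113, 119, 121]
samples = sample_forecast(history)

point = statistics.mean(samples)             # what a point forecast would report
cuts = statistics.quantiles(samples, n=20)   # cut points at 5%, 10%, ..., 95%
low, high = cuts[0], cuts[-1]                # central 90% interval

print(f"point forecast: {point:.1f}")
print(f"90% interval:   [{low:.1f}, {high:.1f}]")
```

A buying or risk decision can then be made against `low` or `high` (or any other quantile) rather than against `point` alone.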

baladre almost 4 years ago

I work with highly seasonal data (city-wide water consumption) and Prophet has been a great tool for us.

In terms of performance, it has been the best for a few of our forecasts, compared to GRUs, LSTMs, ARIMA and SARIMA. When it wasn't the best, it wasn't too far from the best model. But, to be fair, our forecasts are of quite stable data, so most models do well.

However, I would say that the key strength of Prophet is how easy it is. You can produce results really fast, you can throw data with missing ranges and holidays at it, and it has interpretability components out of the box. It depends on what you need, but for most of our tasks, we and our stakeholders are more than happy to sacrifice a bit of performance for these features.

DonHopkins almost 4 years ago

Of course Prophet is a Python library, thus the Monty Python's Life of Brian reference:

"Now you listen here. He's not the messiah. He's a very naughty boy. Now GO AWAY!"

https://www.youtube.com/watch?v=3_kKAeh6qyc&ab_channel=dingerbell

"Yes, we are all individuals!" "Yes, we are all different!" "I'm not.":

https://www.youtube.com/watch?v=KHbzSif78qQ&ab_channel=lucasbeer

Prophets:

https://www.youtube.com/watch?v=hmyuE0NpNgE&ab_channel=radiac

Life of Brian:

https://en.wikipedia.org/wiki/Monty_Python%27s_Life_of_Brian

Not the Messiah (He's a Very Naughty Boy):

https://en.wikipedia.org/wiki/Not_the_Messiah_(He%27s_a_Very_Naughty_Boy)

0x008 almost 4 years ago

My experience in general is that most time series models are inadequate for predicting time series, except for very trivial cases of seasonality or simple linear/nonlinear trends.

I think that you can throw any model you like at the problem, but all you will do is overfit most of the time.

leoc almost 4 years ago

There's a reaction from one of the library's authors: https://twitter.com/seanjtaylor/status/1410447403153457152 (also at https://threadreaderapp.com/thread/1410447403153457152.html).

(IDK anything about this subject, just sharing the link.)

microprediction almost 4 years ago

I'm the author. AMA. TMA. I'm genuinely interested in understanding the popularity of Prophet, which as I point out is a non-trivial statistical endeavor.

https://microprediction.github.io/timeseries-elo-ratings/html_leaderboards/univariate-k_003.html

chirau almost 4 years ago

Facebook also recently put out a toolkit for time series analysis called Kats: https://facebookresearch.github.io/Kats/

rustyconover almost 4 years ago

Prophet is not my favorite time series forecasting package. I agree with the author’s findings.

Microprediction.com (the site that published this article) is a great place to win monthly cash prizes for actually providing forecast accuracy. The twist is that you have to provide your forecast as a sample of 225 points from a forecast distribution rather than just a point forecast. This makes participation “interesting”.

Also, to win you have to be more accurate than everything and everyone else that is providing predictions.

sambucini almost 4 years ago

I've used Prophet alongside many other time-series methods for price forecasting in energy trading. My experience is that Prophet is OK, but rather opaque. Having tried many packages, I've always come back to the classical statistical methods, seeing benefits in transparency (what's actually going on? what is the impact of regressors?), speed and, most importantly, the fact that these methods force the user to think about what's happening in the data and make conscious decisions about how to model things. But I can see that sometimes you don't care too much about accuracy and understanding, and just want a forecast for something that works decently without much hassle.

duffmancd almost 4 years ago

I'll try to give my perspective, though it's mostly expanding on em500's [1] 3rd paragraph. I think you're asking the wrong question about Prophet: most users don't care "how good is it?" but instead "is it good enough?", and then "how easy is it to use?"

Prophet solves a broad class of easy problems that a lot of ordinary businesses have: you have several years of basically regular data (sales or page views or store foot traffic) that you know has yearly/weekly/daily (if you have sub-daily data) cycles, and you want to give a reasonable prediction to the business so they can plan for the upcoming week/month/year. And you want to remove the periodic effects so you can see the underlying trends.

Imagine someone, let's call them Bill, who might be called a data scientist, or business analyst, or just assistant operations manager, for a medium-large business. Bill has the last 5 years of sales/views/traffic data in the database (anything before that is in a bunch of Excel spreadsheets on the share drive), and knows just enough Python to be dangerous. Bill can probably explain an R-squared value but is not an expert at statistics by any measure. He wants to fit the data, but has several problems:

1) The weekly pattern does not line up with the yearly data, as the year starts on a different weekday.
2) Those damn public holidays: some of them occur on a specific date, some of them on the "first Monday of the month", and some of them seem to change almost randomly year-to-year.
3) The reporting system was down for a couple of weeks in June and Feb last year, and the numbers for the first few years were copied from Excel, so they are sometimes missing the first or last day of the month.

Prophet comes by default with yearly/weekly seasonality. Prophet comes out-of-the-box with a simple way to import holidays, and even a way to specify your own. Prophet doesn't require any cleaning, or special procedures to deal with missing data. And it is quick and easy to use, and you get nice-looking, broadly reasonable graphs out (with the above-mentioned, consistent data). And that solves the business problem.

And Bill's probably heard of it because it (a) is popular already, and (b) has Facebook's name attached.

That's my take as to why, even if it is not even close to the most accurate method, Prophet is so broadly popular.

[1] https://news.ycombinator.com/item?id=27697274
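
Problems 1) and 2) above are easy to see with nothing but the standard library. The sketch below is illustrative only: the choice of years, of 25 December as the fixed-date holiday, and of September for the "first Monday of the month" rule are all assumptions made for the example, and `first_monday` is a hypothetical helper, not part of any forecasting package.

```python
from datetime import date, timedelta

def first_monday(year, month):
    """Return the first Monday of the given month."""
    first_day = date(year, month, 1)
    # Monday is weekday 0; step forward to the next Monday (0 days if already Monday).
    return first_day + timedelta(days=(0 - first_day.weekday()) % 7)

for year in (2019, 2020, 2021):
    new_year_weekday = date(year, 1, 1).strftime("%A")   # the year starts on a different weekday
    fixed_weekday = date(year, 12, 25).strftime("%A")    # fixed date, moving weekday
    floating = first_monday(year, 9)                     # fixed weekday, moving date
    print(year, new_year_weekday, fixed_weekday, floating.isoformat())
```

A table of (holiday name, date) rows built this way is the kind of input a forecasting tool needs in order to treat those days separately from the regular weekly cycle.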

jdewsnip almost 4 years ago

Like most models, it's data dependent. I had quite a lot of success (was paid) using it on data with multi-seasonality (daily plus seasonal trend) with regressors and change points, where there are not a lot of other options.

As it's a generalized additive model, you can decompose the prediction into parts and put them in front of a non-technical user for validation, i.e. show the effect of daily seasonality, yearly seasonality, holidays and regressors. You could even use it to show visually where the model is going wrong for predictions on a blog post ;)

Is it the most accurate model on all time series? No, but it is useful and good enough for certain use cases.

I find it quite interesting what you can do with about 100 lines of Stan code. Here is a good link on someone building Prophet in pymc3 rather than Stan to explain its innards:

https://www.ritchievink.com/blog/2018/10/09/build-facebooks-prophet-in-pymc3-bayesian-time-series-analyis-with-generalized-additive-models/

If you want something more flexible you can drop down to this level of code, i.e. pymc3, pyro, tfp and bsts. If you just want a univariate forecast, then ensembles of state space methods are hard to beat, as evidenced by the M competitions.

But It’s Tough to Make Predictions, Especially About the Future

em500 almost 4 years ago

I'm a professional forecaster (i.e. getting paid for it) at a large e-commerce company. We have extensive experience with Prophet and a host of other approaches (all the traditional models in Hyndman's book/R package, some scattered LSTM/NN implementations). Here's my quick take (the article is a lot more extensive than the median blog post, and likely warrants a more extensive study than I have time for right now).

Prophet's main claims ("Get a reasonable forecast on messy data with no manual effort. Prophet is robust to outliers, missing data, and dramatic changes in your time series.") are surely exaggerated. As the article shows, time series come in many different shapes, and many of them are not handled properly. It deals well with distant-past or middle-of-the-sample outliers, but not with recent outliers. It cannot deal with level changes (as opposed to trend/slope changes). None of this should be a surprise if you take some time to understand the underlying model, which unlike most neural nets is very easy to understand completely and to visualise: it's really a linear regression model with fixed-frequency periodic components (for yearly seasonality and weekly seasonality) and a somewhat-flexible piecewise-linear trend. The strong assumption that the trend is continuous (with flexible slopes that pivot around a grid of trend breakpoints, which are trimmed by regularisation) accounts for most of the cases where the forecasts are clearly wrong.

That said, it does occupy a bit of a sweet spot in commercial forecasting applications. It's largely tuned for a few years of daily data with strong and regular weekly and yearly seasonalities (and known holidays), or a few weeks/months of data with intraday and weekday seasonalities. Such series are abundant in commerce, but a bit of a weak spot for the traditional ARIMA and seasonal exponential smoothers in Hyndman's R package. These tended to be tuned on monthly or quarterly data, where Prophet often performs worse.

In our experience, for multiple years of daily commercial-activity data, there are no automated approaches that easily outperform Prophet. You can get pretty similar (or slightly better) results with Hyndman's TBATS model if you choose the periodicities properly (not surprising, as the underlying trend-season-weekday model is pretty similar to Prophet's, but a bit more sophisticated). Some easy wins for the Prophet devs are probably to incorporate a Box-Cox step in the model, and a short-term ARMA error correction; then the model really resembles TBATS. You can usually get better results with NNs that are a bit more tuned to the dataset. But if you know nothing a priori about the data except that it's a few years of sales data, your fancy NN will probably resemble Prophet's trend-season-weekday model anyway.

All of these assume that we're trying to forecast any time series' future only from its own past. If you want to predict (multiple) time series using multiple series as input/predictors, that's a whole new level of difficulty. I don't know of a good automatic/fast/scalable approach that properly guards against overfitting. Good results for multiple-input forecasting approaches probably require some amount of non-scalable "domain knowledge".
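
The "linear regression with fixed-frequency periodic components and a piecewise-linear trend" description can be sketched as a feature construction. This is a rough illustration of that description, not Prophet's actual code: the function names, the changepoint grid and the Fourier orders are all assumptions chosen for the example, and the regression and regularisation steps are left out.

```python
import math

def trend_features(t, changepoints):
    """Intercept, slope, and one hinge per changepoint.

    Each hinge max(0, t - s) is zero up to its changepoint s, so a coefficient
    on it changes the slope after s without ever making the level jump.
    """
    return [1.0, t] + [max(0.0, t - s) for s in changepoints]

def fourier_features(t, period, order):
    """sin/cos pairs at the fixed frequencies k / period, for k = 1..order."""
    feats = []
    for k in range(1, order + 1):
        angle = 2.0 * math.pi * k * t / period
        feats += [math.sin(angle), math.cos(angle)]
    return feats

def design_row(t, changepoints, weekly_order=3, yearly_order=10):
    """One row of the regression design matrix for day index t."""
    return (trend_features(t, changepoints)
            + fourier_features(t, 7.0, weekly_order)
            + fourier_features(t, 365.25, yearly_order))

changepoints = [200.0, 400.0, 600.0]   # hypothetical breakpoint grid, in days
row = design_row(250.0, changepoints)
print(len(row))  # 2 trend terms + 3 hinges + 6 weekly + 20 yearly = 31
```

Because every trend column is continuous in `t`, no choice of coefficients can reproduce a sudden level shift, which matches the failure mode described above.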

dcl almost 4 years ago

This echoes my experience. It often finds seasonality that doesn't exist, but it also makes it look like you are doing something impressive...

Inexperienced stakeholders don't want/like to see smooth forecasts, even if you provide prediction intervals.

Yenrabbit almost 4 years ago

Messiah? No. But one big plus that this article doesn't talk much about is how easy it is to get started with it even if you're a beginner.

A few lines of code get you a fitted model and some insightful plots. From these you can see if 1) it's doing great and you don't need to spend hours training some crazy transformer model, 2) it's got some flaws and you should maybe try something else (which you can now compare against fbprophet as a baseline), or 3) this data is crazier than I thought, maybe we should rethink things...

TLDR: It's easy to throw this at a new forecasting problem, and although it isn't perfect (as the article shows), sometimes it is still a useful step IMO.

vjeux almost 4 years ago

Very entertaining article. That said, as a human, I had a really hard time forecasting most of the time series shown in the article. For the ones that had a semblance of regularity, it felt like Prophet did a reasonable job.

Wookai almost 4 years ago

If you are interested in time series predictions, I would suggest you have a look at darts (https://github.com/unit8co/darts). It's a well-designed library which provides a unified API to deal with time series and try/compare different algorithms/frameworks like Prophet, recurrent NNs, etc.

ghego1 almost 4 years ago

Their findings are very much in line with my experience with fbprophet: it is usually the least effective lib for predicting a time series in my tests.

emptysongglass almost 4 years ago

Can someone ELI5 what Prophet and time series are? Is there ever a chance a layperson would find use in Prophet?

microprediction almost 4 years ago

I would recommend Python fans take a look at https://www.microprediction.com/blog/popular-timeseries-packages and the Elo ratings for time series, e.g. https://microprediction.github.io/timeseries-elo-ratings/html_leaderboards/univariate-k_003.html

The timemachines package wraps Prophet and others for the purpose of comparison: https://github.com/microprediction/timemachines

microprediction almost 4 years ago

To close, let me say that this post ended up being more negative than I expected and, like Nessie, my opinion may rise in the future when I understand the implications of the Prophet generative model better, and either modify it or find better ways to identify its strengths. The unanswered question here is why Prophet is so popular, and this surely merits a better explanation than I have given. I think there are probably statistical angles I am not seeing - something reflecting the fact that people are voting with their eyeballs when they use Prophet.

(from the article)

- The author

jcims almost 4 years ago

Is there a way to adapt time-series products like this to the analysis of categorical data, in my case audit logs? I'd love to be able to suss out patterns/clusters/sequences, but most of the stuff I've looked at requires you to do unnatural things like flip the categorical content into distinct dimensions with a calculated rate metric.

devit almost 4 years ago

Shouldn't time-based (weekly and holiday) and trend effects be multiplied rather than added?

For instance, if users spend twice as many hours on a website on the weekend, and the total number of users has doubled, then these effects multiply to give 4x the visits of the non-weekend baseline.
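
The arithmetic in this example can be checked directly. The sketch below uses a made-up baseline of 100 visits; it also shows that the multiplicative combination turns into a plain sum once everything is on a log scale, which is one common way additive models are applied to data like this.

```python
import math

baseline_visits = 100.0   # hypothetical weekday visits before growth
weekend_factor = 2.0      # weekend doubles activity
growth_factor = 2.0       # user base has doubled

# Effects multiply: each factor scales whatever the other produced.
multiplicative = baseline_visits * weekend_factor * growth_factor

# Effects add: each contributes a fixed increment measured against the baseline.
additive = (baseline_visits
            + baseline_visits * (weekend_factor - 1.0)
            + baseline_visits * (growth_factor - 1.0))

print(multiplicative)  # 400.0
print(additive)        # 300.0

# On a log scale the multiplicative model is a sum of terms.
log_sum = (math.log(baseline_visits)
           + math.log(weekend_factor)
           + math.log(growth_factor))
print(math.isclose(math.exp(log_sum), multiplicative))  # True
```

The additive version undercounts by 100 visits here, and the gap widens as the trend grows.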

streamofdigits almost 4 years ago

Lots of good investigative work, but it's not clear if there is a decidable problem here.

wespiser_2018 almost 4 years ago

Excellent write-up. Great to see some attention paid to libraries built on top of Stan, and to answering questions of uncertainty via Bayesian stats.

AtNightWeCode almost 4 years ago

Need way more data points...

PaulHoule almost 4 years ago

You'd think people who are hyping AI were trying to bring about the next "AI Winter".

People hear "FAANG" and it suspends their critical judgement.

munro almost 4 years ago

Naughty boy.

chroem- almost 4 years ago

The point of Prophet is to make time series accessible to people who aren't experts in time series. Yes, it has some well known failure modes, but using Prophet always beats a static LY "forecast." This article is like complaining a Nissan Leaf can't outrace a Tesla.
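
For reference, a static last-year baseline is only a few lines. The sketch below reads "LY" as "last year" (an assumption about the abbreviation) and uses made-up quarterly sales that grow 10% per year, so the baseline, which has no trend, lags behind; `last_year_forecast` and `mean_absolute_error` are hypothetical helper names.

```python
def last_year_forecast(history, season_length, horizon):
    """Repeat the most recent full season as the forecast."""
    if len(history) < season_length:
        raise ValueError("need at least one full season of history")
    last_season = history[-season_length:]
    return [last_season[h % season_length] for h in range(horizon)]

def mean_absolute_error(actual, predicted):
    """Average absolute difference between actual and predicted values."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)

# Made-up quarterly sales: two years of history, then the year to be forecast.
history = [100, 120, 90, 150, 110, 132, 99, 165]
actual_next_year = [121, 145, 109, 182]

forecast = last_year_forecast(history, season_length=4, horizon=4)
print(forecast)                                         # [110, 132, 99, 165]
print(mean_absolute_error(actual_next_year, forecast))  # 12.75
```

Any model with a trend component has a chance of closing that gap, which is the comparison being made here.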
