Good one, happy to add my perspective here:<p>DISCLAIMER: I've spent the last 8 months heavily building a quant-based asset management app (though still not live, currently in the final steps of syncing processes with the broker)<p>a) I tried to leverage some of this AI-voodoo stuff, though not on the level in the paper; my findings are clear (at least for me): AI-driven trading does not give you a bigger/better edge than any of the other well-known approaches<p>b) In fact, AI-based approaches are at best on par with traditional approaches, and in a lot of scenarios not even that; I haven't seen a single setup from anyone that actually outperformed one of the classic approaches.
BUT: the AI guys have much higher costs, be it infra, processing time, waiting time in front of the screen, etc.
So you have to pick carefully which one you choose.<p>c) Today I'm doing only "standard approaches" with volume/statistics/vola/price action, as this is super cost-efficient (I need only one cheap data stream) and a lightweight machine for 10-20 USD a month<p>d) It is clearly possible to outperform the market, though these approaches are not infinitely scalable - e.g., depending on the instruments used, there may not be enough liquidity to buy continuously for 100k, but maybe only for 10k.
Apply leverage of 5-10 to an asset that moved 5% in the last 10 days on a 10k position - is this outperforming? A clear >yes< in my perception.<p>e) People who have built & found a stable approach do not share it or talk about it; there is no real community. You will get details of working approaches only from people you are really friends with; there are a lot of unshared but working business tactics in the field.
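To make the arithmetic in (d) concrete, here is a quick sketch with the numbers from the example (all values are hypothetical):

```python
# Hypothetical example: 10k position, 5x leverage, underlying moves 5% in 10 days.
position = 10_000       # capital committed (USD)
leverage = 5            # leverage factor
underlying_move = 0.05  # 5% move of the asset over 10 days

pnl = position * leverage * underlying_move
return_on_capital = pnl / position

print(pnl)                # 2500.0
print(return_on_capital)  # 0.25 -> 25% on committed capital in 10 days
```

Whether that counts as "outperforming" depends on how often such setups occur and how the losses net out, but the raw number is well ahead of an index year.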
So they used an LLM with a knowledge cutoff in mid-2023 to evaluate 2023? Seems like a classic leakage problem.<p>From the paper: "testing set: January 1, 2023, to December 31, 2023"<p>From the Llama 2 doc: "(...) some tuning data is more recent, up to July 2023."
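A back-of-the-envelope check of the overlap, with the cutoff date approximated from the Llama 2 doc quoted above:

```python
from datetime import date

# Llama 2's tuning data extends "up to July 2023" per its model card (approximate);
# the paper's test window is Jan 1 - Dec 31, 2023.
model_cutoff = date(2023, 7, 31)
test_start = date(2023, 1, 1)
test_end = date(2023, 12, 31)

# Any test data from before the cutoff may already be "known" to the model.
leaky = test_start <= model_cutoff
print(leaky)  # True -> roughly the first seven months of the test set overlap
```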
> Alpha Factors incorporates 108 technical indicators and factors with their expressions, which are believed to possess predictive power regarding stock price movements.<p>Examples of the indicators are in Figure 15. The ablation studies in Table 4 suggest that market and news information made a much bigger impact than the magic indicators. Makes sense if the indicators are simple enough that the LLM can reproduce them without spending much capacity.<p>I somewhat like that they used DJI and not SPX, but 2023 was a sideways bull year with DJI +12% and SPX +23%. One year is way too short a study.<p>> Hardware: NVIDIA A5000 GPU x 4, AMD Ryzen Threadripper PRO 3975WX CPU, 256 GB RAM<p>Seems approachable.<p>> The proposed TradExpert framework utilizes a Mixture of Experts (MoE) approach, where four LLMs are specialized in processing distinct sources of financial data. All these LLMs are based on the LLaMA-2-7B Touvron et al. (2023b) model and fine-tuned using the LoRA mechanism Hu et al. (2022)<p>A relatively small LLM.<p>Overall, this does seem like an interesting study, even if just for comparing data sources.
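For intuition on why the LoRA mechanism mentioned above makes fine-tuning a 7B model feasible on that hardware: instead of updating a full weight matrix, it trains a low-rank delta on top of the frozen weights. A minimal numpy sketch (dimensions are illustrative, not the paper's):

```python
import numpy as np

rng = np.random.default_rng(0)
d, r, alpha = 64, 8, 16                  # hidden dim, LoRA rank, scaling (illustrative)

W = rng.standard_normal((d, d))          # frozen pretrained weight
A = rng.standard_normal((r, d)) * 0.01   # trainable down-projection
B = np.zeros((d, r))                     # trainable up-projection, zero-initialized

# Effective weight during fine-tuning: W + (alpha / r) * B @ A
W_eff = W + (alpha / r) * (B @ A)

x = rng.standard_normal(d)
# With B zero-initialized, the adapter starts as a no-op:
assert np.allclose(W_eff @ x, W @ x)

print("trainable params:", A.size + B.size, "vs full matrix:", W.size)
```

Only A and B get gradients, so memory and compute for the update scale with the rank r rather than with d squared.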
If I understand this correctly, we have come full circle on what MoE means.<p>MoE started out as some form of multi-model approach.<p>Afaik, in current architectures it's basically a load-balancing method that, while it increases latency, makes the model better suited for distributed operation.<p>To me this reads as if the authors use the term closer to its original meaning than its current one.
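For reference, in current transformer usage a "MoE layer" usually means a learned gate routing each input to the top-k of several feed-forward experts inside one model, rather than several separate models as in this paper. A toy numpy sketch of that routing (all dimensions illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)
d, n_experts, k = 16, 4, 2                 # hidden dim, expert count, top-k

W_gate = rng.standard_normal((d, n_experts))
experts = [rng.standard_normal((d, d)) for _ in range(n_experts)]

def moe_layer(x):
    logits = x @ W_gate
    top = np.argsort(logits)[-k:]          # indices of the k highest-scoring experts
    w = np.exp(logits[top])
    w /= w.sum()                           # softmax over the selected experts only
    # Only k of n_experts run per input -> compute stays sparse.
    return sum(wi * (experts[i] @ x) for wi, i in zip(w, top))

y = moe_layer(rng.standard_normal(d))
print(y.shape)  # (16,)
```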
How do people on HN think about the market?<p>Do you think the market is so efficient that anyone who outperforms it is merely lucky?<p>Or do you think the market is inefficient enough for a person smart enough to be able to outperform it by thinking?<p>In other words: Do you think a single person can rationally decide to invest their time into thinking about the stock market? Or would that always be a fallacy, and whatever the outcome is - we can't decide if it was just good or bad luck?
Does anyone understand how the Market Expert works? It takes in numerical OHLC data and converts it to embeddings for use by the LLM… but embeddings are also numbers, so I don't see how that's any easier for the LLM to process, since it's a language model.<p>> The Market Analyst LLM focuses on analyzing historical OHLCV (Open, High, Low, Close, Volume) data to predict stock movements. However, time series data is inherently continuous and lacks the discrete token structure that LLMs are designed to process. This misalignment poses a significant challenge in effectively utilizing LLMs on time series. To this end, we utilize a reprogramming mechanism Jin et al. (2024) to reprogram the input financial time series into text prototype representations.
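Not the paper's actual code, but the Jin et al. (2024) reprogramming idea is roughly: slice the series into patches, then cross-attend each patch over a small bank of "text prototype" embeddings, so the LLM receives vectors that live in its token-embedding space rather than raw numbers. A toy sketch under those assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)
d = 32                                     # LLM embedding dim (illustrative)
patch_len, n_patches = 4, 8
prototypes = rng.standard_normal((10, d))  # small bank of text-prototype embeddings

series = rng.standard_normal(patch_len * n_patches)  # toy stand-in for OHLCV values
patches = series.reshape(n_patches, patch_len)

W_q = rng.standard_normal((patch_len, d))  # projects each patch into embedding space

def reprogram(patches):
    Q = patches @ W_q                       # queries derived from time-series patches
    scores = Q @ prototypes.T / np.sqrt(d)  # attention scores against text prototypes
    attn = np.exp(scores - scores.max(axis=1, keepdims=True))
    attn /= attn.sum(axis=1, keepdims=True)
    return attn @ prototypes                # each patch becomes a mix of prototypes

emb = reprogram(patches)
print(emb.shape)  # (8, 32) -> one prototype-space vector per patch
```

So the output is still numbers, but numbers constrained to the span of embeddings the frozen LLM already knows how to interpret, which is the claimed point of the alignment.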
The market is like an ecosystem. There are huge mammals (investment banks, hedge funds) that look at certain types of prey, and there are smaller rodents that only eat very tiny worms.<p>In high-volatility regimes, i.e. stocks with low market cap, the market is far from efficient. Hedge funds are not even looking at stocks with a 100M market cap.<p>There are traders who act in these regimes and beat the market, exactly because they play small.<p>Anyhow, most people would be better off assuming the market is completely efficient.
fwiw, I tried something similar about 5–10 years ago. I wasn’t using LLMs like the abstract here suggests, and honestly, I’m not sure how you'd act on a signal fast enough with them. When I gave it a shot, there was some slight predictive value, but in the end it felt like noise and gambling, so I moved on.
How did it perform against a boglehead portfolio? Were fees and commissions included? Seems weird to evaluate performance over a single year for trades. Much more interested in long-term growth over one or more market cycles.
Not a very good study. I didn't look into the researchers' backgrounds, but it's as if they never consulted their respective finance departments.
I'm curious at what point stock and derivatives trading becomes something entirely for AI, and thus restricted to companies rich enough to buy tons of GPUs. Are we near that point yet?