
BloombergGPT: A Large Language Model for Finance

175 points by SerCe about 2 years ago

22 comments

hn_throwaway_99 about 2 years ago
I'm curious about how large language models will do in finance, considering the one thing LLMs do remarkably poorly is *math*.

I use ChatGPT to keep track of tasks and Todo lists. It works phenomenally well for me, and the natural language back-and-forth helps keep me motivated. I give it a set of tasks, with time estimates, and it organizes these tasks for me, and I tell it when I complete them, and it updates my task list.

The one funny mistake it makes is that when it groups my tasks (say I have 3 "Work" tasks and 2 "Personal" tasks) it sums up the total estimated time for each task group, but the totals are often wrong, especially when I start adding new tasks or completing tasks.

When so much of finance requires numeric accuracy, I'm curious how BloombergGPT handles numbers.
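[Editor's aside: for contrast, here is a minimal Python sketch of the per-group bookkeeping the comment describes, done deterministically; the task names, fields, and estimates are hypothetical, not from the comment.]

    # Group tasks by category and sum the remaining time estimates exactly --
    # the arithmetic the commenter finds the LLM getting wrong.
    from collections import defaultdict

    tasks = [  # hypothetical task list, estimates in minutes
        {"name": "Write report",  "group": "Work",     "estimate_min": 90, "done": False},
        {"name": "Review PR",     "group": "Work",     "estimate_min": 30, "done": True},
        {"name": "Book dentist",  "group": "Personal", "estimate_min": 10, "done": False},
    ]

    def remaining_by_group(tasks):
        """Sum estimated time of unfinished tasks per group."""
        totals = defaultdict(int)
        for t in tasks:
            if not t["done"]:
                totals[t["group"]] += t["estimate_min"]
        return dict(totals)

    print(remaining_by_group(tasks))  # {'Work': 90, 'Personal': 10}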
choeger about 2 years ago
I wonder if we should go back to ontologies now. It should be fairly easy to have an LLM generate entries for any ontology. Then we need to check them for truth, obviously, but that can be parallelized and potentially automated.

Say I'd like an ontology of the current stock market focusing on the relationship between natural persons and public companies, board members, well-known analysts, investors and so on. This would be tedious for anyone to do, but should be fairly simple with an LLM.

Another task, maybe a little bit further into the future, is categorizing open source intelligence. Think of oryxspioenkop.com and their famous lists of lost equipment in the Russian invasion of Ukraine. It's tedious and time-consuming but generates a valuable dataset. Here, image recognition would be necessary, but the principle is still the same, no?

Come to think of it, how does a company like TomTom generate map data nowadays?
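[Editor's aside: a minimal sketch of the triple-extraction idea in the comment above -- prompt an LLM for (subject, relation, object) lines, parse them, then verify each one separately. The prompt wording, relation names, and sample output are all illustrative assumptions.]

    # Represent ontology entries as (subject, relation, object) triples an LLM
    # could be asked to emit; verification of each triple is a separate step
    # that can be parallelized, as the commenter notes.
    from dataclasses import dataclass

    @dataclass(frozen=True)
    class Triple:
        subject: str    # e.g. a natural person
        relation: str   # e.g. "board_member_of", "analyst_covering"
        obj: str        # e.g. a public company

    PROMPT_TEMPLATE = (
        "From the text below, list every relationship between a person and a "
        "public company as lines of the form: subject | relation | object.\n\n{text}"
    )

    def parse_triples(llm_output: str) -> list[Triple]:
        """Parse the pipe-separated lines the LLM was asked to produce."""
        triples = []
        for line in llm_output.splitlines():
            parts = [p.strip() for p in line.split("|")]
            if len(parts) == 3:
                triples.append(Triple(*parts))
        return triples

    sample_output = "Jane Doe | board_member_of | Acme Corp\nJohn Roe | analyst_covering | Acme Corp"
    print(parse_triples(sample_output))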
wanderingmind about 2 years ago
No API, no code, no model to test -- are we supposed to take the paper's claims at face value?
Michelangelo11 about 2 years ago
Wait... judging by their own press release, this thing looks terrible. They didn't benchmark it on finance-specific tasks against GPT-4, or even GPT-3.5, just a bunch of more-or-less random also-ran models. It beat those models, but that's not saying much at all, and the fact that they didn't benchmark it against any OpenAI model in finance-specific tasks speaks volumes.

(They did benchmark it against GPT-3 in general-purpose tasks and, unsurprisingly, GPT-3 came out on top.)

Press release link: https://www.bloomberg.com/company/press/bloomberggpt-50-billion-parameter-llm-tuned-finance/
nl about 2 years ago
Reading the paper, it's apparent what a huge contribution HuggingFace made by doing the BLOOM work in the open. There's so much knowledge there that anyone else trying to train an LLM can use.

Also, it looks like public filings are a large dataset that isn't currently being used by other LLMs.
yanglet about 2 years ago
A few points to share:

1) Finance is highly dynamic. BloombergGPT retrains an LLM on a mixed dataset of finance and general sources, which is very expensive (1.3M hours). Lightweight adaptation is highly favorable.

2) Internet-scale finance data (timely updates using an automatic data curation pipeline) is critical. BloombergGPT has privileged data access and API access. A promising alternative is "democratizing Internet-scale finance data".

3) Another key technology is RLHF (reinforcement learning from human feedback), which is missing in BloombergGPT. RLHF enables learning individual preferences (risk-aversion level, investing habits, personalized robo-advisors, etc.).
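[Editor's aside: for the "lightweight adaptation" point, a minimal sketch using the Hugging Face peft library -- the base model name and hyperparameters are placeholders, not anything from the BloombergGPT paper.]

    # Parameter-efficient adaptation (LoRA): attach small trainable adapter
    # matrices instead of retraining all weights of a large model from scratch.
    from transformers import AutoModelForCausalLM
    from peft import LoraConfig, get_peft_model

    base = AutoModelForCausalLM.from_pretrained("gpt2")  # stand-in base model

    lora_cfg = LoraConfig(
        r=8,                         # low-rank dimension of the adapter matrices
        lora_alpha=16,               # scaling factor
        target_modules=["c_attn"],   # attention projection to adapt in GPT-2
        lora_dropout=0.05,
        task_type="CAUSAL_LM",
    )

    model = get_peft_model(base, lora_cfg)
    model.print_trainable_parameters()  # only the adapter weights are trainable
    # Fine-tune `model` on a curated finance dataset with any standard training loop.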
geenew about 2 years ago
This must be the 'big data' I heard so much about.

LLMs in general, I mean. They seem to be the first widespread application for large, unstructured datasets. Still hype-y, but maybe even a /practical/ application.
vgeek about 2 years ago
What about LTCM-GPT?
yanglet about 2 years ago
Anyone interested in building a ChatGPT for FinTech, or FinGPT? My team and I are actively developing this project: https://github.com/AI4Finance-Foundation/ChatGPT-for-FinTech
yanglet about 2 years ago
FinGPT better be open-source and open-finance.

https://news.ycombinator.com/item?id=35403167
PaulHoule about 2 years ago
I like how they used a little more than 50% domain-specific text and a little less general text. It beats the other LLMs on 4 out of 5 financial tasks, usually by a large margin, but another one squeaks past it on NER.
alecco about 2 years ago
I would've made InvestopediaGPT. Much better quality and better for training.
mightytravels about 2 years ago
Where is my 'ArbitrageGPT'? Renaissance might be working on that...
blitzo about 2 years ago
One industry I could see in trouble from this is robo-advisors.
rvz about 2 years ago
At least Bloomberg is not pretending to be 'open' or 'open source' with their GPT model, despite the same excuses. Unlike Microsoft® AI.com.
m3kw9 about 2 years ago
Link to use the model?
opisthenar84 about 2 years ago
I wonder how well this performs compared to standard GPT-4 with domain knowledge injected into the prompt.
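[Editor's aside: a minimal sketch of the "domain knowledge injected into the prompt" approach the comment mentions, using the OpenAI Python client; the context snippet, question, and model choice are placeholders, not from the paper.]

    # Prepend retrieved finance context to a GPT-4 request and constrain the
    # answer to that context.
    from openai import OpenAI

    client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

    domain_context = (
        "Company X filed its 10-K on 2023-02-15; revenue grew 12% YoY; "
        "net interest margin compressed by 40 bps."
    )
    question = "Summarize the main risks to Company X's margins."

    response = client.chat.completions.create(
        model="gpt-4",
        messages=[
            {"role": "system", "content": "You are a financial analysis assistant. "
                                          "Answer only from the provided context."},
            {"role": "user", "content": f"Context:\n{domain_context}\n\nQuestion: {question}"},
        ],
    )
    print(response.choices[0].message.content)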
steve1977 about 2 years ago
As quant finance has been confidently wrong for decades, that’s probably a great match!
OrdoAbChao about 2 years ago
What is the average P/E ratio of the stocks listed on the NASDAQ?
eismcc about 2 years ago
I've been working on http://www.dollarsign.ai and it would be interesting to have this directly integrated.
dongobread about 2 years ago
They'll release training logs but no model file or data? How does anyone trust that this isn't just overfitted garbage trained directly on their benchmarks?
meghan_rain about 2 years ago
> BloombergGPT far outclasses the models we evaluated ourselves, and is slightly behind GPT-3.

tl;dr: it's worse than GPT-3?