Building an AI that watches rugby

87 points by reddavis, about 1 month ago

18 comments

patapong, about 1 month ago
I want AIs that clean my apartment while I watch rugby, not AIs that watch rugby while I clean the apartment. ;)
In seriousness, this is a cool project and shows how sophisticated an analysis LLMs can do in a plug-and-play manner. They may not always be the best solution, but they are a fantastic baseline that can be deployed and adapted to a use case in less than an hour.
xnx, about 1 month ago
Better title: "LLM OCR on Rugby screenshots to read score and clock"
hummuscience, about 1 month ago
The moment I started reading this, I was reminded of this recent study: https://arxiv.org/html/2503.10212v1
The scope is a bit different. The study uses an LLM to interpret pose estimation data and describe the behavior in each frame. The output is text, which can be used to create embeddings of behavior. As someone who works in ethology, that's a clever (but maybe expensive) idea.
I think the author could use something similar, with multi-person pose estimation models.
goeiedaggoeie, about 1 month ago
Reading the scoreboard from a TV screen and selling that data is restricted in many jurisdictions. This work is pretty naive, I think.
mon_, about 1 month ago
Why the focus on scorekeeping? I feel like an AI model is overkill here when you have text-based sources readily available, such as news apps, Twitter feeds, and apps such as Livescore, which would be easier and cheaper to scrape. They probably cover more matches that aren't televised, too.
I'd be curious to see what useful insights could be gleaned from the match commentary. You have the main commentator giving objective play-by-play reporting and then a 'colour' commentator giving some subjective analysis during breaks in play. I bet there are a lot of interesting ways this could be used.
sebastiennight, about 1 month ago
I love that as soon as he writes,
> The plan was simple.
you know you're in for a funny read.
More seriously though, the JSON example from a vision language model is interesting, but it does not take into account how much extrapolation (hallucination) the model will insert over time.
For instance, even if it is not visible in the image, your VLM will probably start inserting details (such as the color of the team's jersey) based on knowing the team's three-letter identifier.
So the reliability of the system will go down over time, and the error probably compounds if you're using some of that info to feed further steps in the loop.
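One possible guard against that kind of drift, sketched here under assumptions rather than taken from the article: keep only the fields that can be read directly off the scoreboard and drop anything else before it reaches later steps. The field names, the `keep_visible_fields` helper, and the sample reply (including the placeholder team codes `AAA` and `BBB`) are all hypothetical.

```python
import json
from typing import Any, Dict

# Hypothetical whitelist: fields assumed to be literally visible on screen,
# mapped to the Python type each one is expected to have.
ALLOWED_FIELDS = {
    "home_team": str,
    "away_team": str,
    "home_score": int,
    "away_score": int,
    "clock": str,
}


def keep_visible_fields(raw_json: str) -> Dict[str, Any]:
    """Parse a model reply and keep only whitelisted, correctly typed fields."""
    data = json.loads(raw_json)
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object")
    cleaned: Dict[str, Any] = {}
    for name, expected_type in ALLOWED_FIELDS.items():
        value = data.get(name)
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(value, expected_type) and not isinstance(value, bool):
            cleaned[name] = value
    return cleaned


# Made-up reply containing one extrapolated detail (the jersey colour).
reply = (
    '{"home_team": "AAA", "away_team": "BBB", "home_score": 12, '
    '"away_score": 7, "clock": "43:10", "home_jersey_colour": "green"}'
)
result = keep_visible_fields(reply)
print(result)
```

This only stops invented fields from propagating; it does nothing about a whitelisted field whose value the model has guessed, which would still need a cross-check against neighbouring frames.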
dncornholio, about 1 month ago
So rugby is missing a lot of data besides the scoreline, so they created an AI that can extract the scoreline.
disjunct, about 1 month ago
I wrote a similar script that used a TV tuner during the last World Cup. Since I had an ATSC source, I was able to just pull the CTA-708 captions directly and with little delay.
4ndrewl, about 1 month ago
My observation is that watching rugby on TV is not the same as watching a rugby match. You're watching something where choices have been made about what you get to see, so your model is already restricted in what it can see.
You really need to take a 'full pitch' feed directly from the venue, rather than what is broadcast.
walthamstow, about 1 month ago
I'm not a rugger bugger, but every 5 seconds doesn't really seem like often enough to be taking screenshots. In soccer, anyway, a lot can happen in 5 seconds.
djtango, about 1 month ago
I don't quite get how diffing frames allows you to find the scores.
TFA mentions comparing a frame with and without, but how do you generate that frame without? If you can already do it, what's useful about doing that?
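For readers following this question, the comparison itself can be sketched as below. This is a minimal illustration, not the article's implementation: it assumes two otherwise similar grayscale frames are already in hand, one with the scoreboard overlay and one without, and it does not address how the frame without the overlay is obtained. The `diff_bounding_box` helper, the threshold value, and the toy frames are all hypothetical.

```python
from typing import List, Optional, Tuple

Frame = List[List[int]]  # grayscale image: rows of 0-255 pixel values


def diff_bounding_box(
    with_overlay: Frame, without_overlay: Frame, threshold: int = 30
) -> Optional[Tuple[int, int, int, int]]:
    """Return (top, left, bottom, right) of the region that differs, or None."""
    if len(with_overlay) != len(without_overlay):
        raise ValueError("frames must have the same height")
    rows: List[int] = []
    cols: List[int] = []
    for y, (row_a, row_b) in enumerate(zip(with_overlay, without_overlay)):
        if len(row_a) != len(row_b):
            raise ValueError("frames must have the same width")
        for x, (a, b) in enumerate(zip(row_a, row_b)):
            # A pixel counts as changed when its brightness moved by more
            # than the threshold between the two frames.
            if abs(a - b) > threshold:
                rows.append(y)
                cols.append(x)
    if not rows:
        return None
    return min(rows), min(cols), max(rows), max(cols)


# Toy 4x6 frames: a uniform background, plus a bright 2x3 "scoreboard" patch.
background = [[10] * 6 for _ in range(4)]
with_scoreboard = [row[:] for row in background]
for y in range(0, 2):
    for x in range(1, 4):
        with_scoreboard[y][x] = 200

box = diff_bounding_box(with_scoreboard, background)
print(box)  # (0, 1, 1, 3)
```

The point of locating the box is that later screenshots can be cropped to it, so only the small scoreboard region needs to be sent for reading rather than the full frame.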
chrsw, about 1 month ago
Does this mean there's probably AI that's already watching high-profile football (soccer) matches?
rfdearborn, about 1 month ago
> Sending a full-resolution screenshot every five seconds gets expensive fast.
For now.
itissid, about 1 month ago
Why does YOLO not work?
numpad0, about 1 month ago
Aw. I thought this would be about an AI cat that makes wrong commentary that you can make pointless arguments against. There should be one.
cewl123, about 1 month ago
I want AI that does my job while I watch rugby
petesergeant, about 1 month ago
> We can’t hire analysts to watch every match and enter data manually.
I'm surprised there aren't enough fans willing to do that if you could gamify it.
damnitbuilds, about 1 month ago
TL;DR: It extracts the score from the video and gets text from the commentary in the audio.
I was hoping for more.