TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Gated Linear Networks

150 pointsby asparaguialmost 5 years ago

4 comments

fxtentaclealmost 5 years ago
That is an amazing paper, a great result and new neutral architectures are long overdue.<p>But I don&#x27;t believe that this has any significance in practice.<p>GPU memory is the limiting factor for most current AI approaches. And that&#x27;s where the typical convolutional architectures shine, because they effectively compress the input data, then work on the compressed representation, then decompress the results. With gated linear networks, I&#x27;m required to always work on the full input data, because it&#x27;s a one step prediction. As the result, I&#x27;ll run out of GPU memory before I reach a learning capacity that is comparable to conv nets.
评论 #23531429 未加载
评论 #23529663 未加载
评论 #23535339 未加载
Immortal333almost 5 years ago
&quot;We show that this architecture gives rise to universal learning capabilities in the limit, with effective model capacity increasing as a function of network size in a manner comparable with deep ReLU networks.&quot;<p>What exactly this statement means?
评论 #23530929 未加载
评论 #23529528 未加载
评论 #23529492 未加载
评论 #23529505 未加载
nlalmost 5 years ago
I didn&#x27;t realise Hutter was on leave from ANU at DeepMind.
caretak3ralmost 5 years ago
As a relative neophyte in this realm, this is fascniating to read. Comparing this to the the models&#x2F;methods to derive said properties, is good education for me.