TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Formal Algorithms for Transformers

106 pointsby hexhowellsalmost 3 years ago

7 comments

lynguistalmost 3 years ago
I find the distinction introduced in this paper into encoder-decoder Transformers, encoder-only Transformers and decoder-only Transformers very useful for my informal understanding of the different architectures. Thank you for this clear clarification.
评论 #32185217 未加载
sva_almost 3 years ago
I like how this seems to actually be self-contained. They even have a list of notations in the end.
geysersamalmost 3 years ago
This is a fantastic resource. It's the missing piece of many machine learning articles.
tartakovskyalmost 3 years ago
Zero diagrams, but maybe they wouldn’t be helpful to clarify the concept? Guess it depends on the types of learners, I’m not sure.
评论 #32181415 未加载
godelskialmost 3 years ago
I can't tell who this paper is aimed at. It isn't formal. It isn't mathematical. It isn't a good description and doesn't have good coverage. I can only assume it is for citations.
ThrowawayTestralmost 3 years ago
I was assuming electrical transformers.
评论 #32180358 未加载
mrhetheralmost 3 years ago
familiar with basic ML terminology might be an understatement
评论 #32185230 未加载
评论 #32181509 未加载