Ask HN: Transformer alternatives that could have emergent properties when scaled
3 by s_r_n | 1 comment on Hacker News.
I am trying to identify model architecture candidates that could, like transformers, have "emergent" properties when they are scaled (see https://ift.tt/Tg926Rc). Some contenders I already know about are:

* Monarch Mixer (https://ift.tt/un0JaS8)
* Hyena (https://ift.tt/EbXpV0N)

Thanks for your help.
