Attention: More Musings

The attention model I posed in my last post is still reasonable, but the comparison model is not. (These revelations are the fallout of a fun conversation between Nikos, Sham Kakade, and me. Sham recently took a faculty position at the University of Washington, which is my neck of the woods.)

As a reminder, the attention model is a binary classifier which takes…
Original post: Attention: More Musings
Source: Machined Learnings