Svelte Hacker News logo
  • top
  • new
  • show
  • ask
  • jobs
  • about

Writing an LLM from scratch, part 32d – Interventions: adding attention bias

gilesthomas.com

3 points by gpjt 8 hours ago