Svelte Hacker News logo
  • top
  • new
  • show
  • ask
  • jobs
  • about

Benchmarking how well LLMs can play FizzBuzz

huggingface.co

2 points by _venkatasg 12 hours ago

_venkatasg 12 hours ago

I got this silly idea for a benchmark and decided to test how well models do. Repo here: https://github.com/venkatasg/fizzbuzz-bench