Serving Large Language Models with a Minimalist Python CLI (flama.dev)
4 points by vorticotech 5 hours ago