It's cool but TPS count is not a meaningful limiting factor to new software. These small models are also too dumb for QA in complex codebases (for now), but on a future timeline they are super cool. Model distillation and ablation generally is very interesting.