Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Looking at tokens they were trained on is also a really great indicator of world understanding. Llama 3 is a game changer for some usecases because there's finally a model that understands the world deeply as opposed to typical models which can be fine tuned into hyper specific tasks, but generalize poorly, especially in D2C usecases where someone might probe the model's knowledge


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: