I don't think that's the moral of the story at all. It's already challenging eno... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		enraged_camel 2 days ago \| parent \| context \| favorite \| on: "Token anxiety", a slot machine by any other name I don't think that's the moral of the story at all. It's already challenging enough to review the output from one model. Having to review two, and then comparing and contrasting them, would more than double the cognitive load. It would also cost more. I think it's much more preferable to pick the most reliable one and use it as the primary model, and think of others as fallbacks for situations where it struggles.

		help

baq 2 days ago [–]

you should always benchmark your use cases and you obviously don't review multiple outputs; you only review the consensus.

see how perplexity does it: https://www.perplexity.ai/hub/blog/introducing-model-council

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact