"this looks about right and has no obvious bugs" is my standard when reviewing human code, and it's my standard for machine-generated code too. no reason to formally verify GPT-4 outputs if I'm not formally verifying my coworker's either.
Well... after fairly long experience, we have discovered that your standard is mostly adequate for human-generated code (as long as it's not going into a critical system). That adequacy may rest on the empirically collected statistics of how human-written code fails: when it's wrong, it usually either "looks" wrong or fails obviously.
GPT-produced code may have different failure statistics, in which case the human heuristic won't carry over. It's too early to tell.
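As a hypothetical illustration of the failure mode at issue (this example is mine, not from the thread), here is the kind of off-by-one that passes a "looks about right and has no obvious bugs" review, whether it came from a human or a model:

```python
def moving_average(xs, window):
    """Return the mean of each full sliding window over xs.

    Looks plausible at a glance, but range(len(xs) - window)
    stops one window early: the final full window is silently
    dropped. The correct bound is len(xs) - window + 1.
    """
    return [sum(xs[i:i + window]) / window
            for i in range(len(xs) - window)]

# The bug only shows up if you check the output length:
result = moving_average([1, 2, 3, 4], 2)
# result == [1.5, 2.5] — the final window [3, 4] -> 3.5 is missing.
```

Whether model-generated code produces more of this class of error, or subtler ones, is exactly the open statistical question.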