“GPT-4 ranked higher than the majority of physicians in psychiatry… it performed similarly to the median physician in general surgery & internal medicine… GPT-4 performance was lower in pediatrics & OB/GYN but remained higher than a considerable fraction” of active doctors.