On 1/23/25, The New York Times wrote (click image above) about how difficult it has become to devise tests hard enough to challenge AI models, as the tests move from “undergraduate level” exams to graduate-level and world-class expert questions.
We are just starting to dabble in generative AI and LLMs for delivering healthcare. It is interesting to see what is being upended in other fields.