The newly released dataset evaluates the accuracy of AI models' medical responses, identifying top performers while raising concerns about grading transparency and safety validation.