A World of Knowledge
    News / Science News

    Artificial intelligence beats doctors in accurately assessing eye problems

    The clinical knowledge and reasoning skills of GPT-4 are approaching the level of specialist eye doctors, a study led by the University of Cambridge has found.



    Artificial intelligence beats doctors in accurately assessing eye problems. Photo: Mavocado/Getty


    GPT-4 - a ‘large language model’ - was tested against doctors at different stages in their careers, including unspecialised junior doctors, and trainee and expert eye doctors. Each was presented with a series of 87 patient scenarios involving a specific eye problem, and asked to give a diagnosis or advise on treatment by selecting from four options.

    GPT-4 scored significantly better in the test than unspecialised junior doctors, who are comparable to general practitioners in their level of specialist eye knowledge.

    GPT-4 gained similar scores to trainee and expert eye doctors - although the top performing doctors scored higher.

    The researchers say that large language models aren’t likely to replace healthcare professionals, but have the potential to improve healthcare as part of the clinical workflow.

    They say state-of-the-art large language models like GPT-4 could be useful for providing eye-related advice, diagnosis, and management suggestions in well-controlled contexts, like triaging patients, or where access to specialist healthcare professionals is limited.

    “We could realistically deploy AI in triaging patients with eye issues to decide which cases are emergencies that need to be seen by a specialist immediately, which can be seen by a GP, and which don’t need treatment,” said Dr Arun Thirunavukarasu, lead author of the study, which he carried out while a student at the University of Cambridge’s School of Clinical Medicine.

    He added: “The models could follow clear algorithms already in use, and we’ve found that GPT-4 is as good as expert clinicians at processing eye symptoms and signs to answer more complicated questions.

    “With further development, large language models could also advise GPs who are struggling to get prompt advice from eye doctors. People in the UK are waiting longer than ever for eye care.

    Large volumes of clinical text are needed to help fine-tune and develop these models, and work is ongoing around the world to facilitate this.

    The researchers say that their study is superior to similar, previous studies because they compared the abilities of AI to practicing doctors, rather than to sets of examination results.

    “Doctors aren't revising for exams for their whole career. We wanted to see how AI fared when pitted against to the on-the-spot knowledge and abilities of practicing doctors, to provide a fair comparison,” said Thirunavukarasu, who is now an Academic Foundation Doctor at Oxford University Hospitals NHS Foundation Trust.

    He added: “We also need to characterise the capabilities and limitations of commercially available models, as patients may already be using them - rather than the internet - for advice.”

    The test included questions about a huge range of eye problems, including extreme light sensitivity, decreased vision, lesions, itchy and painful eyes, taken from a textbook used to test trainee eye doctors.

    This textbook is not freely available on the internet, making it unlikely that its content was included in GPT-4’s training datasets.

    GPT-4 and GPT-3.5 – or ‘Generative Pre-trained Transformers’ - are trained on datasets containing hundreds of billions of words from articles, books, and other internet sources.

    These are two examples of large language models; others in wide use include Pathways Language Model 2 (PaLM 2) and Large Language Model Meta AI 2 (LLaMA 2).

    GPT-4 powers the online chatbot ChatGPT to provide bespoke responses to human queries. In recent months,

    ChatGPT has attracted significant attention in medicine for attaining passing level performance in medical school examinations, and providing more accurate and empathetic messages than human doctors in response to patient queries.

    APRIL 23, 2024



    YOU MAY ALSO LIKE

    Competition between species played a major role in the rise and fall of hominins, and produced a “bizarre” evolutionary pattern for the Homo lineage.
    Dr. Katherine Bemis of the National Systematics Laboratory recently helped describe a new species of fish, the papillated redbait.
    Childhood maltreatment can continue to have an impact long into adulthood because of how it effects an individual’s risk of poor physical health and traumatic experiences many years later, a new study has found.
    The USDA’s Agricultural Research Service (ARS), in collaboration with the Ecological Forestry Applications Research Centre in Spain and North Dakota State University, conducted a comparison of the physiological and molecular processes involved in the summer and winter dormancy of Osmia lignaria, also known as blue orchard bee or orchard mason bee.
    Analysing the diversity of organic compounds dissolved in freshwater provides a reliable measure of ecosystem health, say scientists.
    A team of American researchers has discovered that the DNA of a common worm species seems to be resistant to damage from chronic radiation in the Chernobyl Exclusion Zone (CEZ).

    © 1991-2024 The Titi Tudorancea Bulletin | Titi Tudorancea® is a Registered Trademark | Terms of use and privacy policy
    Contact