International Student Conference for Scientific Research

Accuracy of ChatGPT in the diagnosis and treatment of cervical disorders: a comparative study with APTA guidelines

Paper ID : 1044-ISCSR3 (R2)

Authors

Mariam Wagdy Ayoub *¹, Marwa El-kawy Ghait Mohamed², Marwa Mohammed Ahmed³, Mariam Ramdam Hares³

¹Student of Physical Therapy at Sphinx University, Assiut

²Assistant lecturer, Department of Physical Therapy for Pediatrics, Faculty of Physical Therapy, Sphinx University, Assiut, Egypt

³Student of Physical Therapy at Sphinx University

Abstract

ChatGPT is a system trained on a large amount of data, enabling it to understand human language, respond to it, and perform tasks that typically require human cognition. It is developing rapidly in all fields of society, including the medical field. The need for help with diagnosis and follow-up from home increased particularly during the COVID-19 pandemic.

Objective
This paper aims to evaluate the efficiency, accuracy, and ability of ChatGPT in generating physical therapy treatment plans.

Method
Thirty-three patients in their twenties with cervical conditions, diagnosed with spondylosis with neck pain and brachialgia, spondylosis with headache, or spondylosis with muscle spasm, were selected from the Physiotherapy Clinic of Sphinx University.
The cases were entered into ChatGPT-2, which was tasked with generating physical therapy treatment plans for them. Additionally, treatment plans were obtained from the APTA guidelines.
For each case, the treatment plan generated by ChatGPT-2 was compared with the APTA guidelines, and agreement was measured using Cohen's kappa.
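As an illustration of the agreement statistic named above, the following is a minimal sketch of how Cohen's kappa can be computed for two sets of categorical ratings. The function name `cohens_kappa`, the "yes"/"no" coding of whether a modality is recommended, and the sample lists are assumptions made for this example only; they are not the study's data or its actual coding scheme.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two equal-length sequences of categorical labels."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("label sequences must be non-empty and of equal length")
    n = len(rater_a)
    # Observed agreement: fraction of items where both sources give the same label.
    p_o = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement expected from each source's marginal label frequencies.
    count_a, count_b = Counter(rater_a), Counter(rater_b)
    p_e = sum(count_a[label] * count_b[label] for label in count_a) / (n * n)
    if p_e == 1.0:
        raise ValueError("kappa is undefined when chance agreement equals 1")
    return (p_o - p_e) / (1 - p_e)


# Hypothetical ratings (NOT study data): whether each of eight treatment
# modalities is recommended by each source for one case.
chatgpt_plan = ["yes", "yes", "no", "yes", "no", "no", "yes", "no"]
apta_plan = ["yes", "no", "no", "yes", "no", "yes", "yes", "no"]

kappa = cohens_kappa(chatgpt_plan, apta_plan)
print(f"Cohen's kappa: {kappa:.2f}")
```

On the commonly used Landis and Koch scale, kappa values from 0.41 to 0.60 are described as moderate agreement.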

Result
After comparing the physical therapy treatment plans provided by ChatGPT with the APTA guidelines for cervical conditions, the Cohen's kappa value for agreement between them was 0.5, which indicates moderate agreement between ChatGPT's recommendations and the APTA guidelines. It was also observed that the treatment plan generated by ChatGPT offers a generalized strategy for all cervical cases, regardless of the specific disease or the stage of the condition (acute, subacute, or chronic).
The APTA guidelines categorize cervical cases into four groups, ensuring evidence-based, condition-specific interventions. In contrast, ChatGPT's plan includes a broader range of modalities, some lacking strong scientific support.

Conclusion
This study suggests that physical therapy treatment plans generated by ChatGPT are not sufficiently accurate. While ChatGPT's plans incorporate APTA recommendations, they also include treatment modalities that are not scientifically proven. Therefore, further research is needed to evaluate its effectiveness.

Keywords

Cervical, artificial intelligence, treatment plan, ChatGPT

Status: Abstract Accepted