The World Health Organization (WHO) is calling for caution to be exercised when using artificial intelligence (AI)-generated large language model tools (LLMs) to protect and promote human well-being, human safety, and autonomy, and preserve public health.
LLMs include some of the most rapidly expanding platforms such as ChatGPT, Bard, BERT, and many others that imitate understanding, processing, and producing human communication. Their meteoric public diffusion and growing experimental use for health-related purposes is generating significant excitement around the potential to support people’s health needs.
It is imperative that the risks be examined carefully when using LLMs to improve access to health information as a decision-support tool or even to enhance diagnostic capacity in under-resourced settings to protect people’s health and reduce inequity.
While the WHO is enthusiastic about the appropriate use of technologies, including LLMs, to support health care professionals, patients, researchers, and scientists, there is concern that caution that would normally be exercised for any new technology is not being exercised consistently with LLMs. This includes widespread adherence to key values of transparency, inclusion, public engagement, expert supervision, and rigorous evaluation.
Precipitous adoption of untested systems could lead to errors by health care workers, cause harm to patients, erode trust in AI and thereby undermine (or delay) the potential long-term benefits and uses of such technologies around the world.
Concerns that call for rigorous oversight needed for the technologies to be used in safe, effective, and ethical ways include the following:
Data used to train AI may be biased, generating misleading or inaccurate information that could pose risks to health, equity, and inclusivity
LLMs generate responses that can appear authoritative and plausible to an end user. However, these responses may be completely incorrect or contain serious errors, especially for health-related responses
LLMs may be trained on data for which consent may not have been previously provided for such use, and LLMs may not protect sensitive data (including health data) that a user provides to an application to generate a response;
LLMs can be misused to generate and disseminate highly convincing disinformation in the form of text, audio, or video content that is difficult for the public to differentiate from reliable health content
While committed to harnessing new technologies, including AI and digital health to improve human health, the WHO recommends that policymakers ensure patient safety and protection while technology firms work to commercialize LLMs.
The WHO proposes that these concerns be addressed, and clear evidence of benefit be measured before their widespread use in routine health care and medicine—whether by individuals, care providers, health system administrators, or policymakers. The six core principles identified by the WHO are:
Promote human well-being, human safety, and the public interest
Ensure transparency, explainability, and intelligibility
Foster responsibility and accountability
Ensure inclusiveness and equity
Promote AI that is responsive and sustainable
In efforts to align with the core principles, the WHO reiterates the importance of applying ethical principles and appropriate governance—as enumerated in the WHO guidance on the ethics and governance of AI for health—when designing, developing, and deploying AI for health.
- This press release was originally published on the World Health Organization website