Preliminary programme
Wednesday, June 12th 2024
14:00-17:00 | Pre-conference workshop: Healthcare Text Analytics in the Era of Large Language Models
Recent advancements in large language models (LLMs), such as ChatGPT, has revolutionised the field of natural language processing (NLP) and opened new possibilities for healthcare text analytics. This tutorial, structured as a combination of lectures and demonstrations, aims to provide a comprehensive guide to leveraging large language models in the healthcare domain, focusing on advanced techniques and applications. The tutorial will begin with an overview of the open source LLMs, emphasising their potential in addressing complex challenges within healthcare text analytics. Special attention will be given to the unique issues surrounding privacy, security, and domain-specific nuances inherent in healthcare data. Participants will be guided through practical applications of LLMs in two distinct healthcare text domains: 1) Discharge Note Generation and 2) PubMed Abstract Information Extraction. Practical demonstrations will illustrate how LLMs can be tailored for each specific domain using prompting, in-context learning, instruction tuning (finetuning). Furthermore, we will delve into LLMs’ challenges in adapting to handle multi-modal data representations. Tutorial organisers: Yunsoo Kim, Jinge Wu and Honghan Wu (University College London, Institute of Health Informatics) |
Thursday, June 13th 2024
9:30-10:15 | Registration |
10:15-10:30 | Welcome |
10:30-11:15 | Keynote: Prof Suzan Verberne (Leiden University). Large Language Models in healthcare: should we care more? ChatGPT can do a lot for us: it can serve as a text corrector, as a source of inspiration, as a programming aid, and as an interactive search engine. ChatGPT is also widely used in the health domain, both by doctors and patients. Large language models (LLMs) such as ChatGPT can write very convincing texts, but being able to write fluently is not the same as providing correct information. Should we worry about that? In my presentation I will first discuss our work on text mining from patient experiences, highlighting the challenges of extracting medical information from informal text. Then I will discuss the opportunities of using LLMs, and go into the risks and challenges. I will also make suggestions for responsible use of LLMs for medical applications. |
11:15-11:30 | Break |
11:30-12:20 | Presentations: session 1
|
12:20-12:35 | Open community forum and discussions: session 1 This is an open slot for colleagues to briefly inform the community about any ongoing or future activities, initiatives, projects, etc. It can be used to invite collaborations, highlight opportunities and challenges, etc. Every speaker will have 3 minutes. |
12:35-14:00 | Lunch |
14:00-15:15 | Panel: Secure data environments for healthcare NLP applications Chair: Arlene and Vishnu |
15:15-16:45 | Posters and demos: session 1
|
16:45-17:30 | PhD forum session Chairs: Arlene Casey (University of Edinburgh) and Ruizhe Li (University of Aberdeen)
Panel: |
17:30-18:30 | Birds of feather meetings: session
Space will be available for colleagues to self-organise and run birds-of-feather or specific project meetings. The following groups will meet:
|
18:30-22:00 | Drinks reception (18:30) and conference dinner (from 19:00) |
Friday, June 14th 2024
09:15-09:30 | Introduction to Day 2 |
09:30-10:20 | Presentations: session 2
Chair:
|
10:20-11:40 | Posters and demos: session 2
|
11:40-12:20 | PhD forum session 2 Chairs: Arlene Casey (University of Edinburgh) and Ruizhe Li (University of Aberdeen)
Panel: |
12:20-12:35 | Open community forum and discussions: session 2 This is an open slot for colleagues to briefly inform the community about any ongoing or future activities, initiatives, projects, etc. It can be used to invite collaborations, highlight opportunities and challenges, etc. Every speaker will have 3 minutes. |
12:35-14:00 | Lunch |
14:00-14:45 | Keynote: Dr Alistair Johnson (Glowyr, Inc.) The bottleneck has always been data! The world has been in awe at the recent applications of sophisticated machine learning models derived from large datasets. Yet in medicine, we continue to use decades old algorithms to support patient care. Models for cancer progression are based upon staging guidelines defined in the 70s, patient severity of illness is estimated using a scoring system from the 90s, and our latest and greatest criteria for sepsis was a model with three input variables. The reasons for the technological naivety in medicine are multifactorial, but one aspect stands out: researchers simply do not have much data. In this talk I will highlight the MIMIC series of databases, a suite of publicly accessible deidentified medical records. I’ll give an insider’s view on how the electronic health records for thousands of individuals were comprehensively deidentified, transformed, and shared for research without harm to the individual’s themselves. I’ll overview the utility of this data, and highlight some of our own work on language modeling enabled by the broad access to deidentified free-text clinical notes. I’ll conclude with my thoughts on how the field should better balance the benefits and risks of using patient data for research. |
14:45-16:00 | Industry forum: Collaboration models for clinical large language models Chair: Dr Ben Fell (Akrivia) Panel: Richard Dobson (King’s College London) |
16:00-16:15 | Final remarks and close |
– – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – –