An AI oncologist to help cancer patients worldwide
Before performing radiation therapy, radiation oncologists first carefully review medical images of the patient to identify the gross tumour volume, the observable portion of the disease. They then design patient-specific clinical target volumes that include surrounding tissues, since these regions can hide cancerous cells and provide pathways for metastasis.
Known as contouring, this process establishes how much radiation a patient will receive and how it will be delivered. In the case of head and neck cancer, this is a particularly sensitive task due to the presence of vulnerable tissues in the vicinity.
Though it may sound straightforward, contouring clinical target volumes is quite subjective. A recent study from Utrecht University found wide variability in how trained physicians contoured the same patient's computed tomography (CT) scan, leading some doctors to suggest high-risk clinical target volumes eight times larger than their colleagues.
This inter-physician variability is a problem for patients, who may be over- or under-dosed based on the doctor they work with. It is also a problem for determining best practices, so standards of care can emerge.
Recently, Carlos Cardenas, a graduate research assistant and PhD candidate at The University of Texas MD Anderson Cancer Center in Houston, Texas, and a team of researchers at MD Anderson, working under the supervision of Laurence Court with support from the National Institutes of Health, developed a new method for automating the contouring of high-risk clinical target volumes using artificial intelligence and deep neural networks.
Cardenas' work focuses on translating a physician's decision-making process into a computer program. "We have a lot of clinical data and radiation therapy treatment plan data at MD Anderson," he said. "If we think about the problem in a smart way, we can replicate the patterns that our physicians are using to treat specific types of tumours."
In their study, they analysed data from 52 oropharyngeal cancer patients who had been treated at MD Anderson between January 2006 to August 2010, and had previously had their gross tumour volumes and clinical tumour volumes contoured for their radiation therapy treatment.
Cardenas spent a lot of time observing the radiation oncology team at MD Anderson, which has one of the few teams of head and neck subspecialist oncologists in the world, trying to determine how they define the targets.
"For high-risk target volumes, a lot of times radiation oncologists use the existing gross tumour disease and apply a non-uniform distance margin based on the shape of the tumour and its adjacent tissues," Cardenas said. "We started by investigating this first, using simple distance vectors."
Cardenas began the project in 2015 and had quickly accumulated an unwieldy amount of data to analyse. He turned to deep learning as a way of mining that data and uncovering the unwritten rules guiding the experts' decisions.
The deep learning algorithm he developed uses auto-encoders — a form of neural networks that can learn how to represent datasets — to identify and recreate physician contouring patterns.
The model uses the gross tumour volume and distance map information from surrounding anatomic structures as its inputs. It then classifies the data to identify voxels — three-dimensional pixels — that are part of the high-risk clinical target volumes. In oropharyngeal cancer cases, the head and neck are usually treated with different volumes for high, low and intermediate risk. The paper described automating the target for the high-risk areas. Additional forthcoming papers will describe the low and intermediate predictions.
In addition to potentially reducing inter-physician variability and allowing comparisons of outcomes in clinical trials, a tertiary advantage of the method is the speed and efficiency it offers. It takes a radiation oncologist two to four hours to determine clinical target volumes. At MD Anderson, this result is then peer reviewed by additional physicians to minimize the risk of missing the disease.
Using the Maverick supercomputer at the Texas Advanced Computing Center (TACC), they were able to produce clinical target volumes in under a minute. Training the system took the longest amount of time, but for that step too, TACC resources helped speed up the research significantly.
"If we were to do it on our local GPU [graphics processing unit], it would have taken two months," Cardenas said. "But we were able to parallelize the process and do the optimization on each patient by sending those paths to TACC and that's where we found a lot of advantages by using the TACC system."
Texas Advanced Computing Center