Introduction
Glaucoma is one of the leading causes of irreversible blindness worldwide. Globally, the number of adults (aged from 40 to 80) with glaucoma was estimated at around 64 million in 2013 and 76 million in 2020 and was expected to increase to 112 million in 2040.1 Glaucoma is typically asymptomatic in the early stages. Without screening, many patients with glaucoma remain undiagnosed until the late stage, especially in developing countries. For example, in the Chinese population, more than 90% of patients with primary open-angle glaucoma were undiagnosed, and nearly 30% were eventually blind in at least one eye.2 In a multiethnic Asian population of adults, more than two-thirds of glaucoma cases were undiagnosed.3 Although glaucoma screening is a significant public health concern, the early detection and treatment of glaucoma can largely reduce visual loss. Therefore, glaucoma is an attractive screening disease because it is asymptomatic, prevalent and treatable.
Historically, glaucoma screening’s main barriers have been cost-effectiveness and the lack of an appropriate test.4 The non-mydriatic retinal images are low-cost, portable, quick and straightforward to operate and interpret. In addition, artificial intelligence (AI) can learn good features from images. It may outperform glaucoma specialists in detecting the disease based on imaging.5
Many studies have proposed AI approaches to diagnosing glaucoma based on fundus retinal images. However, the large-scale datasets they used relied on subjective human gradings of colour fundus retinal images to establish the ‘ground truth’ of labels to train and validate algorithms, which only focused on the structure of glaucomatous changes in optic disc photography.6–9 Nevertheless, relying on retinal images is insufficient to detect glaucoma.10 11 Ophthalmologists’ subjective interpretation of glaucoma on fundus images is often poor-to-modest intra and interobserver agreement12–14 as well as poor reproducibility13 in interpreting images for glaucoma. Essentially, the algorithm cannot outperform the reference standard for the training. Therefore, although their algorithms perform well, their algorithms were limited to subjective gradings used as the reference and the performance may not reflect the true ability of glaucoma detection.
Optical coherence tomography (OCT) is a non-contact, non-invasive and objective structural imaging device for cross-sectional and three-dimensional (3D) viewing of the macula and optic nerve head (ONH). It is a powerful and common tool to diagnose glaucoma, especially mild to moderate glaucoma, by obtaining more objective high-resolution images for the structural assessment of glaucomatous damage11 and enabling ophthalmologists to achieve good diagnostic accuracy in clinical settings15–17 with three structures for the assessment: (1) the ONH parameters (includes disc area, rim area, average cup-to-disc ratio (C/D), vertical C/D and cup volume), (2) the retinal nerve fibre layer (RNFL) thickness and (3) the ganglion cell-inner plexiform layer (GCIPL) thickness.18 Current AI approaches use the overall features of images or only ONH features by segmentation of the optic disc. Few of them used features on the macula area. However, GCIPL thicknesses, a critical characteristic of the macula, also help detect glaucomatous eyes and even showed a higher area under the curve (AUC) and a higher sensitivity than RNFL for detecting glaucoma in the early stages.19 In this sight, features generated from the macular area can be included in developing algorithms for glaucoma detection.
Although OCT and images can provide different information about optic nerve status, both are useful adjuncts to glaucoma detection.10 Unfortunately, due to its expensive and inconvenient features, OCT can be used in the clinical setting but is unlikely to be used in a general medical office or a community screening.20 Moreover, one study found that glaucoma screening by combining OCT with fundus photography showed a 25% higher sensitivity than using fundus photography alone.21 Hence, the classification performance of AI algorithms should be more accurate and objective if we combine fundus images and OCT measurements as the reference standard rather than the grader’s subjective assessment merely based on fundus images.
Therefore, in this study, we aim to not only train the automated retinal image analysis (ARIA) method to detect glaucomatous optic neuropathy (GON) on non-mydriatic retinal images automatically but also improve the performance of GON detection by two methods: (1) using images labelled with the additional results of OCT as the reference standard and (2) combining all the features generated from the entire images, the region of interest (ROI) of the optic disc and the ROI of the macula.