Original research

Diagnostic accuracy of code-free deep learning for detection and evaluation of posterior capsule opacification

Abstract

Objective To train and validate a code-free deep learning system (CFDLS) to classify high-resolution digital retroillumination images of posterior capsule opacification (PCO) and to discriminate between clinically significant and non-significant PCO.

Methods and analysis For this retrospective registry study, three expert observers graded two independent datasets totalling 279 images, ranging from no PCO to severe PCO, three separate times, providing binary labels for clinical significance. The CFDLS was trained and internally validated using the 179 images of a training dataset and externally validated with 100 images. Model development was performed through Google Cloud AutoML Vision. Intraobserver and interobserver variabilities were assessed using Fleiss kappa (κ) coefficients, and model performance was assessed through sensitivity, specificity and area under the curve (AUC).

Results Intraobserver variability κ values for observers 1, 2 and 3 were 0.90 (95% CI 0.86 to 0.95), 0.94 (95% CI 0.90 to 0.97) and 0.88 (95% CI 0.82 to 0.93). Interobserver agreement was high, ranging from 0.85 (95% CI 0.79 to 0.90) between observers 1 and 2 to 0.90 (95% CI 0.85 to 0.94) for observers 1 and 3. On internal validation, the AUC of the CFDLS was 0.99 (95% CI 0.92 to 1.0); sensitivity was 0.89 at a specificity of 1. On external validation, the AUC was 0.97 (95% CI 0.93 to 0.99); sensitivity was 0.84 and specificity was 0.92.

Conclusion This CFDLS provides highly accurate discrimination between clinically significant and non-significant PCO, equivalent to that of human expert graders. Its clinical value as a potential decision support tool in different models of care warrants further research.

What is already known on this topic?

  • Deep learning (DL) has been proven to be a powerful tool for image analysis and has been applied to cataract-related image classification. Posterior capsule opacification (PCO), the most common complication of cataract surgery, can be detected on retroillumination images.

What this study adds?

  • Code-free DL can be used to train DL systems to detect clinically significant PCO. Clinicians with little coding experience can use code-free DL to develop clinically relevant artificial intelligence applications.

How this study might affect research, practice or policy?

  • This novel use case of code-free DL explores new areas of clinical relevance outside of the classic domains of DL in ophthalmology and serves as a proof of concept to help bridge the gap between research and potential clinical applications.

Introduction

The recent progress in artificial intelligence (AI) is mainly attributed to the development of deep learning (DL), a subdivision of machine learning, with major improvements in the diagnostic performance of image recognition, speech recognition and natural language processing.1 In medicine in particular, it has been shown to perform on par with humans in imaging-based specialities like radiology, dermatology and ophthalmology.2 Whereas traditional DL relies heavily on vast computing power and coding skills, recent developments of automated, code-free neural networks using transfer learning or neural architecture search have allowed clinicians to investigate datasets independently and to reproduce previously achieved results, such as predicting sex from colour fundus photographs.3 4

Formation of cataract is the leading cause of treatable blindness, with surgical lens removal as the only treatment option.5 Multiple aspects of cataracts and the respective surgery have been analysed with AI, including screening and grading of colour slit lamp photographs, optimisation of preoperative intraocular lens (IOL) calculations and posterior capsule opacification (PCO) prediction.6–8 The most common complication after cataract surgery with IOL implantation is the development of PCO.9 10 The incidence of PCO ranges from <5% to 50%11 and was recently reported to range between 7.1% and 22.6% at 5 years for monofocal single-piece IOLs.12 The most common effective treatment of PCO is neodymium-doped yttrium aluminium garnet laser capsulotomy, which is occasionally associated with complications such as elevated intraocular pressure, retinal detachment and endophthalmitis.13 Therefore, the evaluation of clinically significant versus non-significant PCO is of clinical relevance.

The aim of this study was to investigate a code-free deep learning system (CFDLS) trained to detect clinically significant PCO on retroillumination photography and to compare its outcome to human expert graders.

Materials and methods

This study was in compliance with the Declaration of Helsinki and with reporting guidelines for diagnostic accuracy, the Standards for Reporting of Diagnostic Accuracy (STARD).14

Study design

This was a retrospective study using previously acquired data as part of a prospective observation cohort.15 The optical system at the time of recording consisted of a Zeiss 30-slit lamp for observation and imaging, a Zeiss retrolux illumination module with illumination provided by a Zeiss anterior segment flash pack through a fibre-optic cable and beam splitters. A Kodak NC2000 digital camera with high light sensitivity, owing to a 16.0 mm×21.0 mm charge-coupled device (CCD) chip, was used, yielding a high signal-to-noise ratio in the acquired images. The CCD had a geometric resolution of 1268 pixels×1012 pixels and a radiometric resolution of 36 bits (red, green and blue). The images were directly imported into Adobe Photoshop V.5.5 and saved to a hard disk in tagged image file format (TIFF, 3.85 megabytes per image).15

The region of interest (ROI) was defined as the central 4 mm of the IOL, not containing any structures of the anterior capsule. This was accomplished by importing the images into GIMP V.2.10.14, an open-source cross-platform image editor, and manually cropping them (figure 1). Patient-identifying information was not accessible.
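
Although cropping was performed manually, an equivalent scripted central crop is sketched below using the magick package in R; the file name, the 400-pixel ROI and the assumption that the IOL optic is centred in the frame are illustrative only, and the pixel size corresponding to 4 mm depends on image magnification.

    # Minimal sketch of a scripted central crop (the study cropped manually
    # in GIMP); roi_px and the centred-optic assumption are illustrative.
    library(magick)

    crop_central_roi <- function(path, roi_px = 400) {
      img   <- image_read(path)                  # load the TIFF image
      info  <- image_info(img)                   # width and height in pixels
      x_off <- (info$width  - roi_px) / 2        # offsets that centre a
      y_off <- (info$height - roi_px) / 2        # roi_px x roi_px square
      image_crop(img, geometry_area(roi_px, roi_px, x_off, y_off))
    }

    # cropped <- crop_central_roi("retroillumination_001.tiff")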

Figure 1

Examples of non-significant (above) and significant (below) posterior capsule opacification; the central 3 mm region of interest, highlighted on the left, was available only to the human expert graders.

Datasets

The training dataset consisted of 179 images with various grades of PCO, containing at least 12 images per grade, and is described in detail in table 1. Random partition of the dataset into training, tuning and test (internal validation) sets was implemented automatically by the Google Cloud AutoML Vision application programming interface (API) in an 80:10:10 distribution.
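
For illustration, a random 80:10:10 partition analogous to the split applied automatically by the API could be reproduced in R as follows (the seed is arbitrary):

    # Sketch of an 80:10:10 random partition analogous to the automatic
    # AutoML Vision split; the seed is arbitrary.
    set.seed(42)
    n     <- 179                               # images in the training dataset
    parts <- rep(c("train", "tune", "test"), times = round(c(0.8, 0.1, 0.1) * n))
    split <- sample(parts)                     # random assignment per image
    table(split)                               # train: 143, tune: 18, test: 18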

Table 1 | Distribution of classes in the development and external validation datasets

To perform an external validation,16 a set of 100 digital images of eyes of 100 patients, with an even distribution of mild to severe PCO, manually selected by an experienced examiner for a previous study,15 was used; patient-identifying information was not accessible (table 1). All images had been imported into Adobe Photoshop V.5.5 and processed as TIFF files in 2002 in a similar fashion to the training set. The external validation dataset had been created in 2002 to assess quality assurance measures; the training dataset consisted of images taken at the same institution between 2005 and 2008.

Grading

Labels were defined in a binary fashion as clinically non-significant or significant. An opacification in the central 3 mm, previously defined as the most significant area,17 was determined to be significant; examples of clinical grades can be seen in figure 1.

For both the training and validation datasets, three randomly ordered sequences of the images were generated using an online randomisation system (https://www.randomizer.org). All three sequences of each dataset were presented to three board-certified ophthalmologists and experienced cataract surgeons. The investigators graded the images completely independently and were masked to each other's results. The final grading of each grader was determined by the majority vote of that grader's three votes for each image.
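
A minimal sketch of this per-grader majority vote in R, assuming a hypothetical matrix grades with one row per image and one column per grading session (0=non-significant, 1=significant), is given below.

    # Per-grader majority vote across the three grading sessions; `grades`
    # is a hypothetical n_images x 3 binary matrix.
    majority_vote <- function(grades) {
      apply(grades, 1, function(g) as.integer(sum(g) >= 2))  # 2 of 3 votes decide
    }

    # final_grading_observer1 <- majority_vote(grades_observer1)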

Development of the DL algorithm

Whereas DL usually requires advanced coding knowledge and intensive computing power using multiple graphical processing units, recent developments in automated neural networks allow clinicians with little coding experience to investigate datasets with AI.3 4 These techniques depend on transfer learning (retraining algorithms previously trained for a different purpose on a new task) and neural architecture search (a technique for automatic selection of the neural network architecture). APIs are available from multiple providers. In this study, we leveraged the Google Cloud AutoML Vision API (Google). Anonymised datasets are uploaded through graphical user interfaces (GUIs) in the API onto a cloud bucket for the training and validation process.18 Repeated images were removed and the datasets were handled separately to avoid overfitting. Owing to the architecture of the API, separate experiments were performed to ensure comparability.
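
Although the platform hides these steps behind its GUI, the transfer learning principle it automates can be sketched by hand; the following R keras snippet, with an arbitrary ResNet50 backbone and input size, is purely illustrative and does not reflect the undisclosed AutoML architecture.

    # Illustrative transfer learning sketch of what AutoML Vision automates;
    # the ResNet50 backbone and 224x224 input size are arbitrary choices.
    library(keras)

    base <- application_resnet50(weights = "imagenet",   # pretrained weights
                                 include_top = FALSE,
                                 input_shape = c(224, 224, 3))
    freeze_weights(base)                                 # keep backbone fixed

    inputs  <- layer_input(shape = c(224, 224, 3))
    outputs <- inputs %>%
      base() %>%                                         # reuse learnt features
      layer_global_average_pooling_2d() %>%
      layer_dense(units = 1, activation = "sigmoid")     # significant vs not

    model <- keras_model(inputs, outputs)
    model %>% compile(optimizer = "adam",
                      loss = "binary_crossentropy",
                      metrics = "AUC")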

Statistical analysis

Intraobserver and interobserver variabilities of categorical variables (significant vs non-significant PCO) were assessed using the Fleiss kappa (κ) statistic for categorical ratings by multiple graders, interpreted as described by Landis and Koch, with 95% bootstrap CIs estimated through Monte Carlo simulation using 1000 iterations.19
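
A sketch of this procedure is given below; the widely used irr package stands in for the raters package used in the study, and ratings denotes a hypothetical images-by-gradings data frame.

    # Bootstrap CI for Fleiss' kappa; irr::kappam.fleiss stands in for the
    # raters package used in the study, and `ratings` is hypothetical.
    library(irr)

    boot_kappas <- replicate(1000, {               # 1000 Monte Carlo iterations
      idx <- sample(nrow(ratings), replace = TRUE) # resample images
      kappam.fleiss(ratings[idx, ])$value
    })

    kappam.fleiss(ratings)$value                   # kappa point estimate
    quantile(boot_kappas, c(0.025, 0.975))         # 95% bootstrap CI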

Model performance was assessed through sensitivity, specificity and area under the curve (AUC), with 95% CIs estimated using 2000 stratified bootstrap replicates. Where appropriate, fourfold confusion matrices for the internal and external validations and receiver operating characteristic (ROC) curves are shown. All analyses were conducted in R V.4.1.0 (R Core Team, R Foundation for Statistical Computing, Vienna, Austria) with the caret, pROC and raters packages for analysis and ggplot2 for visualisations.
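
A minimal sketch of these computations with pROC, assuming hypothetical vectors truth (binary reference labels) and score (model confidence scores) for a validation set, is given below.

    # Sensitivity/specificity and AUC with 2000 stratified bootstrap
    # replicates; `truth` and `score` are hypothetical inputs.
    library(pROC)

    roc_obj <- roc(truth, score)
    auc(roc_obj)                                   # point estimate of the AUC
    ci.auc(roc_obj, method = "bootstrap",
           boot.n = 2000, boot.stratified = TRUE)  # 2000 stratified replicates
    coords(roc_obj, "best",
           ret = c("sensitivity", "specificity"))  # operating point metrics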

Results

The development dataset consisted of 179 images, 67 of which showed no or non-significant PCO and 76 of which showed significant PCO (table 1). Intraobserver variability κ values for the three gradings for observers 1, 2 and 3 were 0.90 (95% CI 0.86 to 0.95), 0.94 (95% CI 0.90 to 0.97) and 0.88 (95% CI 0.82 to 0.93), respectively. Interobserver κ for the final grading of all three observers was 0.84 (95% CI 0.78 to 0.89) and that for all nine gradings was 0.82 (95% CI 0.76 to 0.86). Pairwise comparisons between each observer as well as the majority vote are shown in table 2. Interobserver agreement was generally high, ranging from 0.85 (95% CI 0.79 to 0.90) between observers 1 and 2 to 0.90 (95% CI 0.85 to 0.94) between observers 1 and 3.

Table 2 | Fleiss κ between observers and majority vote

Fourfold confusion matrices for the internal and external validation sets are shown in figure 2. On the internal validation dataset, sensitivity was 0.89 at a specificity of 1 and the AUC was 0.9861 (95% CI 0.92 to 1.0). The external validation dataset consisted of 100 images, of which 63 showed visually significant PCO. On external validation, sensitivity was 0.84 and specificity was 0.92, with an AUC of 0.9661 (95% CI 0.93 to 0.99). ROC curves for the internal and external validation datasets are shown in figure 3.

Figure 2

Fourfold confusion matrices for the internal validation and external validation sets.

Figure 3

Receiver operating characteristic curves showing model performance on the internal and external validation test sets across different thresholds. The boundary of no discrimination is shown as a dotted red line. AUC, area under the curve.

Error auditing

A qualitative review of algorithmic misclassifications was carried out by the three observers. In the cases where the CFDLS falsely predicted the PCO to be significant (false positives), peripheral PCO outside the 3 mm ROI could be observed. When analysing the cases the DLS wrongly predicted as non-significant (false negatives), two things were striking: first, none of the cases presented with pearls; second, in the majority of cases, posterior capsule folds could be observed (online supplemental figure 1).

Discussion

In this study, we developed and validated a CFDLS that classifies retroillumination images as showing clinically significant or non-significant PCO. The CFDLS showed robust performance in detecting clinically significant PCO, with a sensitivity of 0.84, a specificity of 0.92 and an AUC of 0.9661 (95% CI 0.93 to 0.99) on external validation. This proof of concept shows that a CFDLS can be used to develop potential decision support tools and enables clinicians to expand into clinical AI research and explore novel use cases of AI applications.

The validation dataset was created in 2002 and consists of images with different degrees of PCO. Findl et al previously used the dataset to compare four methods of PCO quantification (subjective analysis, Evaluation of Posterior Capsule Opacification [EPCO], Posterior Capsule Opacity [POCO] and Automated Quantification of After-Cataract [AQUA] I).15 Moreover, Kronschläger et al applied the same dataset in creating an automated qualitative and quantitative assessment tool for PCO, AQUA II, which already showed excellent validity and repeatability.20 Projecting light into the eye causes Purkinje spots at each tissue interface; in addition, internal reflections within the optics of the system used may appear. These light spots and reflections (figure 1) obscure pathological changes of the posterior capsule, which are therefore missed in the grading process. Because the Purkinje spots change their position with slightly different directions of gaze, merging images taken at different gaze positions enables the removal of light-reflection artefacts.21 However, this method had not yet been published at the time of data collection.

A rigorous and sound grading process to establish a ground truth is especially important when labels are provided to develop DL classifiers.22 Krause et al demonstrated the importance of arbitration in improving algorithmic performance for diabetic retinopathy grading.23 Whereas previous grading approaches for PCO focused on quantitative human grading,15 for this study, we decided to proceed with a binary grading aimed at labelling clinical significance. The rationale behind this is based on the visual significance of the inner 3 mm area; binary labels were used to reduce the risk of PCO underestimation.24 Good intraobserver and interobserver agreement was achieved by the three expert graders.

Applications using AI are expanding into all fields of medicine. A recent survey from the American College of Radiology showed that 30% of radiologists were using AI in some form in their clinical practice.25 Teleophthalmology may serve as a solution to increasing demands and stretched services in the field of cataract surgery.8 Wu et al have presented a universal AI model for the collaborative management of cataract, with referral decisions for preoperative and postoperative grading, requiring a large dataset for training and bespoke modular architectures.8 The model performance in our study on external validation was respectable, with a sensitivity of 0.84 and a specificity of 0.92 at an AUC of 0.9661, warranting further research using larger datasets.

With CFDLSs, clinicians now have the opportunity to explore clinical datasets using cloud-based APIs and GUIs. The ability to understand the complexity of clinical data, in combination with code-free platforms, will allow clinicians to further explore clinical use cases. Although few coding skills are required to train such models, data preparation remains a major part of such studies. Furthermore, clinicians need a good understanding of the importance of labelling, grading, training, well-balanced distributions and potential hidden confounders when developing a CFDLS.26 Automated CFDLSs have been shown to perform comparably with bespoke classifiers in ophthalmology and other fields of medicine. On the other hand, the lack of adjustable model architectures during training as well as the ‘black box’ phenomenon may serve as limitations.27 As explainability methods remain challenging, Ghassemi et al have argued that rigorous internal and external validation serves as a more achievable goal in evaluating the performance of DL systems.28 Classifiers such as the one developed in this study could help to exclude PCO in triage settings and could be externalised into smartphone-based home screening applications. Once revalidated, such a classifier may serve as a decision support tool in a referral refinement process. This proof of concept shows that clinicians can use AI to explore novel applications in ophthalmology outside the classic domains of retinal imaging and glaucoma.

Interestingly, error auditing showed that peripheral PCO was present in the cases predicted as false positives. This could be refined by incorporating a preprocessing step of peripheral cropping; formal occlusion testing of the periphery would bolster this justification but was outside the remit of this project. The importance of error auditing in AI to identify and prevent algorithmic bias, both inside and outside of healthcare, cannot be overstated.29 30

The limitations of our study include the size of the datasets and the single-centre setting with a mainly Caucasian population. No multifocal lenses were included in the dataset, as its curation predated multifocal IOLs. The model design of the CFDLS in terms of model architecture and hyperparameters is not transparent, which has the potential to diminish machine learning explainability even further owing to a lack of understanding of the model architectures and parameters used. Preprocessing of the images limits scalability but could be automated within a more user-friendly application. To explore generalisation, further evaluation using a larger dataset representing a multiethnic population is therefore warranted.

In conclusion, we trained a CFDLS to discriminate between significant and non-significant PCO on retroillumination images with high sensitivity and specificity. Moreover, the CFDLS equalled human expert graders in reliability. This CFDLS for PCO serves as a proof of concept to support the decision of whether PCO needs to be addressed by yttrium aluminium garnet laser capsulotomy, possibly even in a teleophthalmological or triage setting.