Discussion
In this study, we aimed to analyse the compatibility of two SD OCT machines for manual measurements of FTMH, determining whether they could be used interchangeably for clinical and research purposes. Although we found that, on careful side-by-side comparisons, the calliper function appears to be calibrated identically between both OCT software; there are several factors that lead to differences depending on the OCT scanning protocols and measurement techniques which are configurable on both machines.
Optical resolution in OCT has been defined as the ability of the machine to distinguish two closely spaced points as physically different from a single point.15 In the scientific literature, a lower resolution has been proposed as an important cause of variability when comparing OCT measurements with different machine15–17 while other studies stated that the differences in measurements could not be fully explained by a lower resolution.1 18 In our study, we confirm that the differences in resolution resulted in different reference points for measuring. Indeed, observer 1 repeatedly measured less than observer 2 for small horizontal distances such as MLD relative to measurements of OCTs performed with Triton (figure 1). The same did not happen with a machine with higher resolution, as Spectralis offers 5.7 µm/pixel lateral resolution and 3.9 µm/pixel axial resolution,19 and Triton has 20 µm lateral resolution with in-depth digital and optical function resolution of 2.6 µm and 8 µm, respectively.20 As shown in figure 4, the two observers measured differently the MLD, due to lower lateral resolution on Triton. Differently, there was less ambiguity in reference points while measuring BD: no difference was found in the larger horizontal measurements between observers on the Bland-Altman plot (figure 1).
Another contributor to interobserver variability was the selection of different OCT horizontal ‘slices’; particularly evident in smaller FTMH and in larger spacing between scans. In the case of the same protocol of OCT acquisition, and therefore, the same scanning density, interobserver variability is higher for smaller FTMHs. Indeed, as shown in figure 3A, the absence of an OCT linear scan that dissects the FTMH exactly at its maximum diameter, frequently in patients with poor vision or eccentric fixation; a wider difference could be observed in smaller FTMHs between apparent (or measured) and true (or anatomical) MLD or between MLDs measured by two observers. This could be explained as they chose to acquire measurements from distinct linear scans. In the case of OCT acquisition of the same, or equally sized, FTMH; higher interobserver variability could be observed for protocols with larger spacing between horizontal b-scans. Indeed, higher-density scans allow better possibility to dissect the FTMH in its maximum diameter, or to detect a smaller difference between apparent and true MLD or between MLDs measured by two observers who chose to acquire measurements from distinct linear scans. This may happen despite the observers’ efforts to choose the widest diameter to measure, which is not always obvious even for experienced observers, particularly in high myopes and oblique scans. This is evident in the wide interobserver variability in horizontal measurements for Spectralis relative to Triton (figure 1), due to its lower scan density (49 horizontal b-scans separated by 125 µm vs 149 scans separated by 50 µm of Triton). For the same reason, figure 3B shows why Spectralis resulted in consistently lower horizontal measurements than Triton (figure 2) particularly for small FTMH, as a small MLD is most sensitive to undermeasurement relative to true width (figure 3B). Different raster scans are available for OCT machines, including horizontal and vertical grid patterns; high-density raster scans, with higher spatial sampling and finer details compared with standard patterns. This is offset by longer acquisition time, increased patient compliance, effective eye tracking facility and higher storage. In addition, the information technology infrastructure may not enable loading high-density scans in day-to-day clinical practice. As such, a balance must be struck between various image acquisition parameters and practicality. Macular radial scans are rapid and consist of a series of radial scan lines originating from the fovea and extending outward in different directions. Among the different types of scans, radial OCT scans have been previously described in the scientific literature as a better option for assessing FTMH.21 Indeed the use of a radial raster could potentially better find the true maximum diameter in case of irregularly shaped FTMHs. However, in case of poor vision/fixation and low density of scans, if the geometric centre of the radial raster is not exactly allocated in the centre of the FTMH, measurement accuracy and precision could be reduced. Differently, the present study demonstrates the importance of high-density raster scans in order to reduce the difference between apparent and true diameter, as well as the interobserver variability.
Finally, a high vertical-to-horizontal scaling is useful in clinical assessment as it allows for vertical magnification of retinal layers. However, as demonstrated in figure 5, this may result in highly erroneous vertical measurements, leading to accidental oblique measurements that are obvious only when viewing images in 1:1 scaling, and therefore, exaggerating vertical height measurements. As Spectralis has higher scaling than Triton by default, this resulted in higher vertical measurements relative to Triton (figure 2). When both software viewing systems were set to 1:1 scaling and measurements repeated, this led to identical vertical measurements between the different OCT machines. Garcia Garrido et al discussed the issue of OCT scaling in murine eyes and advised equal scaling to improve the efficiency of OCT image analysis.22 To the author’s best knowledge, the issue of vertical measurement error due to scaling has not been previously discussed in the literature pertaining to FTMH measurement and we speculate that most researchers measure on the default viewing setting, due to the lack of discussion in the methodology of such papers. Although our paper primarily discusses measurements for FTMH, the scaling vertical measurement effect will be applicable to any condition that requires manual vertical measurements on OCT. An example of this is the thickness of acute submacular haemorrhage secondary to underlying choroidal neovascular membrane, where the vertical height of the haemorrhage at the fovea can be a determinant of the management strategy. There is evidence that smaller bleeds especially with a thickness less than 450 µm do well with anti-VEGF monotherapy, as against a combination treatment of anti-VEGFs with expansile gas and tissue plasminogen activator used in thicker bleeds.23 As such, we strongly recommend that all vertical measurements are performed with 1:1 scaling (labelled 1:1 µm as opposed to 1:1 pixel in Spectralis), especially in cases with clinical implications.
In the scientific literature, other studies have compared intravariability and intervariability in measurements of MLD or BD in FTMH between different observers using the same machine (Spectralis or Topcon), finding no significant difference.24 25 Similarly, Chen et al reported no significant difference for MLD (p=0.69) in FTMH detected between two observers using Spectralis11; however, a significant difference in measurements of BD (mean of the differences 38.75 µm, with observer 2 overestimating BD compared with observer 1; p<0.0001) was described.11 This is the first study to compare the compatibility of two different OCT machines in the acquisition of measurements for FTMHs. Although we describe sources of error and their effect on interobserver variability, we find a very high ICC between both observers and OCT machines.
The limitations of our study include its reduced sample size of 128 measurements in 8 eyes across 2 observers. However, these data were sufficient to highlight sources of repeatable variability between OCT machines. Finally, the scan protocols used were standard protocols used in our clinical settings and higher-density scan protocols, although available, were not acquired. The study evaluated exclusively horizontal grid patterns and did not take in consideration other OCT raster patterns such as the radial one. Horizontal patterns can measure the maximum diameter of an FTMH, which might not be the true MLD in case of irregular shaped FTMH. Finally, the two OCT protocols compared use different horizontal:vertical ratio, both of which can be set to identical scaling. Scans with higher density with Spectralis and 1:1 horizontal:vertical ratio would mitigate many of the differences in measurements we address in the paper.
In conclusion, we find good interobserver and OCT machine agreement in measurements with identically calibrated calliper functionality. However, the paper highlights possible factors that could influence correct measuring and interobserver variability for acquisition of quantitative linear parameters in FTMH: resolution, density scanning protocol of the OCT machine and vertical scaling when measuring, as well as dimension of the hole. Therefore, to increase the reliability of measurements in eyes with full-thickness macular holes, we recommend optimal, use of a a higher scanning density protocol, possible with most OCT manufacturers, to minimise the error caused by the selection of the incorrect/different OCT linear scar dissecting the centre of the FTMH, with an OCT device that enables high resolution, to better identify the reference points for measuring. In the case of acquisition of vertical measurements, 1:1 horizontal:vertical scaling should be deployed as a or postacquisition modification before taking the measurements, to reduce exaggerated vertical measurements relative to true size. Finally, awareness that caution is advised for the interchangeable use of OCT scans for research purposes, particularly for smaller horizontal measurements.