Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Repeatability of flicker modulation sensitivity measured using the Flicker-Plus test

  • Aiman Hafeez ,

    Roles Conceptualization, Data curation, Investigation, Methodology, Validation, Visualization, Writing – original draft, Writing – review & editing

    * aiman.hafeez@citystgeorges.ac.uk

    Affiliation School of Health and Medical Sciences, Department of Optometry and Visual Science, City, St. George’s University of London, London, United Kingdom

  • Alison Binns,

    Roles Conceptualization, Methodology, Validation, Visualization, Writing – review & editing

    Affiliation School of Health and Medical Sciences, Department of Optometry and Visual Science, City, St. George’s University of London, London, United Kingdom

  • Sajni Bohra,

    Roles Conceptualization, Data curation, Investigation, Methodology, Validation, Visualization

    Affiliation School of Health and Medical Sciences, Department of Optometry and Visual Science, City, St. George’s University of London, London, United Kingdom

  • Irene Ctori,

    Roles Conceptualization, Methodology, Validation, Visualization, Writing – review & editing

    Affiliation School of Health, Medicine, and Life Sciences, University of Hertfordshire, Hatfield, London, United Kingdom

  • John L. Barbur

    Roles Conceptualization, Methodology, Validation, Visualization, Writing – review & editing

    Affiliation School of Health and Medical Sciences, Department of Optometry and Visual Science, City, St. George’s University of London, London, United Kingdom

Abstract

Purpose

To evaluate the test-retest variability of the Flicker-Plus test for each of the two protocols measuring rod and cone-enhanced flicker modulation thresholds (FMT) in healthy individuals. A secondary aim was to evaluate the within-subject variability in repeated measurements.

Methods

Thirty healthy participants aged 19–71 years were examined. None had any history or signs of ocular disease. Monocular FMT were measured at the fovea (0°) and at an eccentricity of 5° in each quadrant, twice by the same investigator under identical conditions within a 2-week period under stimulus conditions that favoured either rods or cones to evaluate the between session repeatability. To assess the within-subject variability, binocular measurements for cone and rod-enhanced FMT were carried out on 15 different occasions over a period of 3-weeks in three of the participants. Coefficient of Repeatability (CoR) was calculated for inter-session repeatability and Bland Altman plots were created for graphical representation. Inter-class correlation coefficient (ICC) with 95% confidence intervals were estimated.

Results

Bland and Altman analysis shows that the mean bias is greater than zero in all 5 testing locations for both rod and cone-enhanced FMTs, suggesting that the threshold at the second visit tended to be lower than at the first, however the difference between visits was not statistically significant for any test condition (paired t-test, p < 0.05). In a sub analysis for those CoR was found to be higher in those aged <45years, compared to those aged ≥45 years. The correlation and agreement between the two measurements estimated by ICC analysis shows good (0.75–0.9) to excellent (>0.9) test-retest reliability of Flicker-Plus test for all measures.

Conclusion

The findings show good to excellent test-retest repeatability for the Flicker-Plus test. This was the case at all locations (foveal and peripheral), under both cone and rod-enhanced conditions. There was no evidence of significant learning effects.

Introduction

Flicker sensitivity can be measured by assessing the smallest flicker modulation a subject just needs to see flicker when viewing a temporally modulated stimulus. Most tests systematically adjust the light modulation amplitude at a fixed temporal frequency without changing the mean light level of the stimulus [1]. A staircase procedure is normally employed to measure a threshold modulation amplitude that corresponds to a predetermined probability of a correct response. The reciprocal of this threshold modulation signal is a measure of flicker sensitivity [2]. The retina serves as one of the most highly active metabolic tissues in the body, necessitating a continuous supply of oxygen by the retinal and choroidal vessels [3]. Studies investigating enzyme activity and oxygen utilising micro-electrodes have shown that the layer of retina containing the outer segments of photoreceptors is the most metabolically active [4,5]. Flickering light stimuli presented at temporal frequencies above 10 Hz have been shown to increase neural activity, blood flow, and haemoglobin oxygen saturation in retinal veins [6,7]. As a result, retinal oxygen metabolism increases in response to the physiological challenge of high frequency temporal stimulation [8]. It follows that loss of rapid flicker sensitivity may provide useful information for early detection of retinal pathologies such as age-related macular degeneration (AMD) [911], glaucoma [12,13], and diabetic retinopathy [14], where hypoxia is a contributing factor in disease progression. Rapid flicker thresholds have high sensitivity and reproducibility and can be useful in assessing the integrity of photoreceptors [15] and any changes in visual function that may arise early in retinal diseases such as AMD [8]. Similarly in glaucoma, the measurement of increased optic nerve head blood flow caused by flicker driven neural activity can be useful for understanding the pathogenesis of glaucomatous optic nerve damage [13]. It is reported that flicker modulation sensitivity is substantially altered in patients with ocular hypertension (OHT) and in early open angle glaucoma [13,1618]. Despite these potential clinical applications and interest in flicker sensitivity, the number of published studies on test-retest repeatability of flicker thresholds are few with only small number of testing methods investigated [9,19,20]. A knowledge of expected inter-subject and within-subject variability in flicker modulation thresholds (FMT) as well as normal limits of flicker thresholds as a function of age would be of great value in the detection of abnormal responses and for the management of patients with retinal disease.

The Flicker-Plus test measures temporal contrast sensitivity under photopic (cone-enhanced) and mesopic (rod-enhanced) conditions. FMTs are measured in central vision and at four discrete diagonal, peripheral locations (5° in each quadrant). The two protocols isolate the function of the rods and cones by appropriate selection of test parameters including retinal illuminance, spectral composition, and spatio-temporal characteristics, that favour either rods or cones. In addition to measuring thresholds at five discrete locations in the visual field, the test requires minimal dark adaptation time.

The primary aim of this study was to assess the variability in test-retest repeats of the AVOT Flicker-Plus test for each of the two protocols measurements of rod- and cone-enhanced FMT in healthy individuals. A secondary aim was to evaluate the within subject variability in repeated measurements.

Methods

Participants

Thirty healthy participants were recruited through adverts placed in cafes, supermarkets, libraries, GP surgeries, the City University eye clinic, through social media and advertising platforms and from City, St. George's University of London students and staff members between 01st November 2023–31st March 2024. To be eligible for inclusion, participants were required to be ≥ 18 years of age, with best corrected visual acuity within the normal range in their test eye for the corresponding age [21], refractive error ≤ ± 6 dioptres and/or ± 3 dioptres of astigmatism with open-anterior chamber angle (grade 3 or 4 with Van-Herick technique) [22]. The status of the ocular media was also checked. Fundus imaging and disc examination was carried out using a slit lamp. Only those subjects with ≤ 2 on all Lens opacities and classification system Ⅲ (LOCS Ⅲ) criteria [23], intraocular pressure ≤ 21 mmHg (on non-contact tonometry), and no history of major systematic disease or medication known to affect retinal structure and function were recruited for the study.

Ethical consideration and consent

This study was approved by the Senate Research Ethics Committee of School of Health and Medical Sciences City, St. George’s University of London (Ethics reference number: ETH2122−1185) and the study fully complied with the standards stated in the Declaration of Helsinki. All participants were given an information leaflet 24 hours before their first visit. The investigator talked through the information leaflet with the participant at the start of the visit and encouraged each participant to ask any questions they may have before consent was obtained. A written informed consent form was received from each subject before any measurements were carried out.

Testing procedure

Data collection took place at City, St. George's University of London. Data were collected by two investigators, AH and SB. In order to evaluate the between session repeatability of monocular FMT, the Flicker-Plus test was carried out twice with each participant by the same investigator under identical conditions within a 2-week period. To carry out an in-depth evaluation of the within subject variability of the test, measurements for cone and rod-enhanced FMT were carried out on 15 different occasions over a period of 3-weeks in three of the participants.

Screening data were collected at the start of the first visit to confirm that eligibility conditions were fulfilled. The evaluation included a patient’s medical and ocular history, brief refraction, monocular logMAR visual acuity (ETDRS chart), K-readings, axial length (ALADDIN (Version 1.6.2, Topcon, Tokyo, Japan)), central corneal thickness and non-contact IOP measurement (Topcon TRK-1P (version 1.46.14) Auto Kerato-Refracto-Tonometer instrument (Topcon, Tokyo, Japan)), non-dilated fundus imaging and macular OCT scan (Topcon Maestro), visual field assessment (SITA standard 24−2, Humphrey Field Analyser), non-dilated slit lamp examination of anterior and posterior segment along with lens clarity assessment and evaluated according to the LOCS III. Both eyes were evaluated to determine eligibility and, in the event that both eyes met the inclusion criteria, the eye with better visual acuity was included for further assessment or, in the case of equal visual acuity, a randomisation schedule was used to select the test eye.

To measure the flicker thresholds, the Flicker-Plus test was run on a desktop computer. The standard protocols for mesopic (rod-enhanced) and photopic (cone-enhanced) stimulus conditions were used which vary in stimulus presentation duration (longer for rod-enhanced stimuli), temporal modulation frequency (lower for rod-enhanced stimuli), background luminance and spectral composition. The flickering disc stimulus was presented randomly at one of five possible locations in the visual field on a fully calibrated high-resolution monitor (Eizo CS2420). The display supports 10-bit dynamic range with a spatial resolution of 1920 x 1200 pixels at a frame rate of 60 Hz. The disc stimulus luminance was modulated sinusoidally at 5 Hz for the rod-enhanced condition and at 15 Hz (with square-wave modulation) for the cone-enhanced condition. The mean luminance of the stimulus was equal to that of the adjacent background (0.5 cd/m2 for mesopic and 22 cd/m2 for photopic protocol). The photopic central stimulus was of 0.5° diameter (30 minute of arc) and the parafoveal stimuli subtended a visual angle of 1° (60 minute of arc). Photopic stimuli were presented for 344ms. Under the mesopic protocol, the stimulus size for central and parafoveal locations was 0.75° (45 minute of arc) and 1.5° (90 minute of arc), respectively, and stimuli were presented for 600 ms. A curtain was used to isolate the stimulus display from a second display used by the examiner. A small, 3W LED lamp placed on the examiner's desk was used to provide mesopic ambient lighting. The examiner's display background also provided some ambient illumination. Within the subject's area, almost all background lighting was derived from the uniform background of the stimulus display. The spectral composition of the light employed for the two stimulus conditions was selected to generate a scotopic to photopic (S/P) luminance ratio of 0.9 for cone-enhanced conditions and 9 for rod-enhanced conditions [24]. Each participant was seated at 1m from the stimulus monitor, with the hood placed in front to position the forehead and chin at a fixed distance. The location of the stimulus was selected randomly. Before each stimulus, the central fixation target (consisting of an outline square and cross) flashed briefly to inform the subject of the next stimulus which followed with a delay varying from 600 to 1000ms. The participants were instructed to look at the central fixation target and to keep fixation on the centre of the screen as indicated by pointing arrows (Fig 1). The display background consisted solely of mid-to-long wavelength light (CIE ϰ = 0.413, y = 0.507) to reduce variations in short-wavelength light absorption caused by the crystalline lens and macular pigment. Monocular thresholds were measured with rod and cone enhanced stimuli at the fovea (0 degrees) and at four parafoveal locations, diagonally displaced 5° away from fixation into each quadrant. The flicker thresholds were measured at each of the five locations randomly using five interleaved 2-down 1-up adaptive staircases with 10 reversals. The threshold was based on the average of the last six reversals. Participants were provided with a numeric keypad with 5 keys that matched spatially the stimulus positions on the screen. A separate key (green) placed well above the others was used to indicate when the subject was completely unaware of the stimulus position in the visual field. This response button, when pressed, instructs the program to allocate randomly the response to any of the five possible stimulus locations. The task was explained to each participant and the instructions were to indicate the location of the stimulus presented on the screen by pressing the corresponding keypad button. The ‘learning mode’ (which employed suprathreshold stimuli) was used initially to familiarise each participant with the procedure before the full test was administered. A brief fixation stimulus made up of a dark square outline and a cross preceded each stimulus presentation. This brief flash attracted the subject's fixation to the centre of the screen in anticipation for the next stimulus. The participant was required to keep their gaze fixed on this marker throughout the test in order to see the peripherally displayed stimuli at 5° eccentricity. Each subject, before being tested in the rod-enhanced condition, had to adapt to the background field for ~2 minutes while wearing spectrally calibrated neutral density filters (Oakley Half Jacket 2.0 –Black Iridium, USA) selected to reduce approximately 10-fold the light flux entering the eye, which were worn throughout the rod-enhanced testing phase. The use of these filters made it possible to achieve a luminance of 0.5 cd/m2 without compromising the expected visual display performance at very low light levels. Preliminary experiments showed that extending the background adaptation period for rod-enhanced condition from 1 to 16 minutes had no significant effect on the thresholds [25]. Each protocol took ~8 minutes to complete. The same testing procedure was repeated on follow-up visit(s). The FMTs are expressed on a linear scale as a percentage of the background luminance.

thumbnail
Fig 1. Schematic presentation of cone-enhanced (A) and rod-enhanced (B) testing panel.

The circles with numbers in (A) indicate the locations of stimulus in four peripheral locations. The central fixation square is located at the location of foveal stimulus presentation point (B).

https://doi.org/10.1371/journal.pone.0333012.g001

Statistical analysis

Data were imported into Microsoft Excel 2017 and analysed using IBM SPSS® Statistical version 29 for Windows (SPSS, Chicago, USA). The Shapiro-Wilk test was used to evaluate the normality of data distribution. The pre-set level of significance was α = 0.05, with p< 0.05 indicating non-normal distributions. Normally distributed data were described by the mean and standard deviation (SD), whilst non normally distributed data were represented by the median and IQR. Flicker thresholds from one eye were used for inter-session repeatability analysis. The coefficient of repeatability (CoR) was calculated by multiplying the standard deviation (SD) of the differences between the FMTs recorded at both visits by 1.96 [26]. Bland Altman plots were created to graphically represent the repeatability of FMTs [26], demonstrating the agreement between the two measurements obtained on the same subject, under identical conditions within two weeks. The y-axis displays the difference between two measurements, while the x-axis represents the average of the repeated measurements. The middle horizontal solid line on the graph represents the average difference in measurements, also referred to as bias. The 95% upper and lower limits of agreement (LoA) represented by dashed lines (mean bias ± 1.96 x SD) were calculated based on the standard deviation of the inter-session differences between the FMT measurements. The effect of healthy ageing on FMT becomes more prominent after 45 years of age, with increasing age resulting in elevated thresholds, especially with respect to rod mediated responses [25], therefore an additional sub analysis was carried out to compare the CoR of FMT measured in individuals aged <45 years, and those of individuals aged 45 years or older. In order to compare CoR in proportional terms, it was also expressed as a percentage of the mean FMT for younger and older subgroups (% CoR). Inter-class correlation coefficients (ICC) with 95% confidence intervals were estimated based on a mean-rating (k = 2), absolute-agreement, and 2-way mixed effect model [27,28]. A paired sample t-test was conducted prior to Bland Altman plot construction to check if there was statistically significant difference between the mean cone and rod FMTs for visit 1 and visit 2, which might indicate a learning effect. An independent sample t-test was conducted to compare FMTs between younger (<45 years) and older age groups (≥45 years).

For intra subject variability, standard deviation (SD) and coefficient of variation (CoV; CoV = SD/mean) of all measurements was calculated. The CoV is a measure of the dispersion of data relative to the mean and provides a measure of the degree of variation detected between repeated measurements that were taken from the same person on multiple occasions [29].

Results

Thirty healthy participants (24 females and 6 males) aged 19–71 years (mean 44 years ±15SD were tested using Flicker-Plus test (Table 1). The average time between visits was 8 days (±2 SD). All data were normally distributed, hence parametric tests were used in the analysis presented below.

thumbnail
Table 1. Total number of participants recruited and participants per decade with mean age (M) and standard deviation (SD).

https://doi.org/10.1371/journal.pone.0333012.t001

Mean values for FMT at all locations, in rod and cone-enhanced conditions are given in Tables 2 and 3, respectively. The CoR tended to be higher for rod-enhanced than cone enhanced conditions, and for foveal compared to parafoveal locations. When CoR was expressed as a proportion of the mean FMT to facilitate comparison (% CoR), the cone- and rod-enhanced stimuli showed broadly comparable repeatability (mean % CoR across locations: 44.1% vs. 41.8% for cones and rods, respectively). Additionally, foveal locations exhibited notably higher % CoR than parafoveal locations, especially for cones, indicating poorer repeatability at the fovea.

thumbnail
Table 2. Inter visit repeatability of cone-enhanced FMT at all testing locations.

https://doi.org/10.1371/journal.pone.0333012.t002

thumbnail
Table 3. Inter visit repeatability of rod-enhanced FMT at all testing locations.

https://doi.org/10.1371/journal.pone.0333012.t003

Between session repeatability data are represented graphically in the Bland Altman plots in Figs 2A2E and 3A3E. Thresholds are presented as percentages. The mean bias is greater than zero in all 5 testing locations for both rod- and cone-enhanced FMT, suggesting that the threshold at the second visit tended to be lower than at the first. However, paired sample t-tests showed the mean difference between visits to be statistically non-significant (p < 0.05) for all five-testing locations in cone and rod mediated responses, suggesting that there was no statistically significant learning effect. The 95% LoAs were wider for rod-mediated thresholds in all 5 testing locations, and foveal rod and cone thresholds had wider spread as compared to peripheral locations. Mean bias, upper and lower 95% LoA, and CoR values are given in Tables 2 and 3, respectively.

thumbnail
Fig 2. Bland Altman plots to show between session repeatability.

(A) inter-visit agreement of inferonasal (−135) cone-enhanced FMT, (B) inter-visit agreement of inferotemporal (−45) cone-enhanced FMT, (C) inter-visit agreement of foveal (0) cone-enhanced FMT, (D) inter-visit agreement of superotemporal (45) cone-enhanced FMT, (E) inter-visit agreement of superonasal (135) cone-enhanced FMT. Middle horizontal solid line represents mean difference between visits cone-enhanced. Upper and lower LoA (mean±1.96) represented by dashed line.

https://doi.org/10.1371/journal.pone.0333012.g002

thumbnail
Fig 3. Bland Altman plots to show between session repeatability.

(A) inter-visit agreement of inferonasal (−135) rod-enhanced FMT, (B) inter-visit agreement of inferotemporal (−45) rod-enhanced FMT, (C) inter-visit agreement of foveal (0) rod-enhanced FMT, (D) inter-visit agreement of superotemporal (45) rod-enhanced FMT, (E) inter-visit agreement of superonasal (135) rod-enhanced FMT. Middle horizontal solid line represents mean difference between visits 1 and 2 for rod-enhanced FMT. Upper and lower LoA (mean±1.96) represented by dashed line.

https://doi.org/10.1371/journal.pone.0333012.g003

When sub analysis was carried out for different age groups, the CoR was found to be higher in the older age group for all conditions and locations (Table 4). It can also be seen that thresholds were higher for all locations in those people (n = 15) aged 45 years and over, compared to younger individuals (n = 15) (p < 0.05, independent sample t-test). In part, the effect of age on CoR is likely to reflect the higher thresholds expressed in older adults. This was further investigated by expressing the CoR as a proportion of the mean FMT for each condition for each age group. It can be seen that the repeatability was more consistent between age groups when expressed in proportional terms but was still poorer for the older participants for 7/10 locations.

thumbnail
Table 4. Age related CoR values for cone and rod mediated responses for 5 testing locations.

https://doi.org/10.1371/journal.pone.0333012.t004

The correlation and agreement between the two FMT measures was also estimated by ICC analysis (Table 5). Based on the ICC results, the test-retest reliability of Flicker-Plus test is good (0.75–0.9) to excellent (>0.9) for all measures [28]. ICC was similar in all locations and for cone and rod-enhanced conditions.

thumbnail
Table 5. ICC estimates with 95% confidence interval for cone and rod-enhanced FMT.

https://doi.org/10.1371/journal.pone.0333012.t005

The three participants who took part in the intra-subject variability analysis were aged 29, 30, and 36 years (subjects 1, 2 and 3, respectively). Intra subject variability of Flicker-Plus test was assessed for both rod and cone-enhanced stimulus conditions, for each of the three subjects repeating the test 16 times, binocularly. Four test runs were performed per visit. Tables 6 and 7 show the mean cone and rod-enhanced FMT for each of the 5 locations repeated 16 times, SD, and the coefficient of variability (CoV) values respectively. The results showed that the foveal cone-enhanced FMT had a variability of 18.82%, 26.28% and 25.04%, whilst the results for rod-enhanced conditions had a variability of 29.29%, 26.01% and 22.48% for subjects 1, 2 and 3 respectively. Figs 4A and 4B shows the foveal cone and rod mediated FMTs for three subjects, in cone and rod enhanced conditions, respectively. Four test runs were carried out per visit. For participants 1 and 2, there was a tendency towards reducing thresholds over the visits, with the values leveling out after visit 2 (8 runs). This finding suggests that, for some individuals, at least 2 visits with multiple iterations are required to eliminate the learning effect. A similar trend was observed in the peripheral locations for both cone and rod mediated FMTs.

thumbnail
Table 6. Showing mean cone-enhanced flicker modulation thresholds for 5 locations tested by flicker-plus test, SD, and the CoV (%) for each subject.

https://doi.org/10.1371/journal.pone.0333012.t006

thumbnail
Table 7. Showing mean rod-enhanced flicker modulation thresholds for 5 locations tested by flicker-plus test, SD, and the CoV (%) for each subject.

https://doi.org/10.1371/journal.pone.0333012.t007

thumbnail
Fig 4. Scatter plot showing foveal cone and rod mediated FMTs in subject 1, 2 and 3.

(A) Foveal cone mediated FMTs in each subject. (B) Foveal rod mediated FMTs in each subject.

https://doi.org/10.1371/journal.pone.0333012.g004

Discussion

The Flicker-Plus test was used to examine the repeatability of flicker modulation sensitivity (FMS) to enhance its potential value in clinical and research settings. This is the first paper demonstrating the repeatability of the Flicker-Plus test for assessing FMTs in healthy individuals across two sessions. The ICC values obtained for the between session repeatability suggested ‘good’ to ‘excellent’ agreement between visits [28]. This was the case at all locations, under both cone and rod-enhanced conditions. However, from a practical viewpoint, a researcher employing this test needs to have a quantifiable measure of the expected variation in the parameters between visits in order to appropriately power study design, and to determine what magnitude of difference constitutes a clinically significant change [27,30]. To enable this, analysis based on the techniques described by Bland and Altman was conducted [26].

According to the Bland and Altman graphical analysis, a bias was seen in all measurements, with visit 2 showing a lower mean threshold for all conditions. This indicates the possible existence of a learning effect. Although the paired t-test showed that this difference was not statistically significant in this sample, the sample size of 30 is only powered to detect a medium effect size of 0.5 so a smaller effect may be present [31]. This is of importance in studies monitoring flicker thresholds over time, as sufficient training will be required to ensure that any perceived changes are not due to the learning effect alone. Multiple measurements were taken from three individuals to explore this further. These data showed a decrease in threshold over subsequent visits in two of the three participants, which only leveled out after four repeats over two visits. However, thresholds tended to increase again after multiple repeats in one visit, suggesting that fatigue effects may become significant upon prolonged repeated testing. It should also be noted that the participants who took part in the intra-subject variability test were familiar with psychophysical testing. The time course of diminution of learning effects may be different for non-trained observers. The learning effect is considered an important artefact in several visual psychophysical tests [32]. Previous literature investigating the learning effect between examinations using standard automated perimetry (SAP) shows an improvement in thresholds, with increasing familiarity of the test in both normal and glaucoma subjects [30,32,33]. Yenice and Temel (2005) reported a statistically significant difference in visual field indices (mean deviation and pattern standard deviation) on the 2nd visit (repeated within a week), due to a learning effect. Some studies report a minimal learning effect in patients with glaucoma and previous experience with perimetry, with more prominent learning effect between first and second examination and a persistent but stable effect between third and fourth visits [32]. However, there is significant variation between individuals, and individualised interpretation of results should be practiced in clinical settings. The Bland and Altman plots demonstrate a certain level of variability between visits, as evidenced by the spread of the LoA around the mean.

Characterising the CoR is a crucial statistical tool in study design. Whether or not a test is considered to be ‘sufficiently’ repeatable depends on the expected magnitude of the disease effect, as a change over time exceeding the CoR may be considered to be a clinically significant decline in visual performance. For example, according to the CoR values obtained, a change in foveal cone enhanced FMT exceeding 3.25 (or ~64%) would be considered to exceed the test-retest variability, and so would be considered significant. This means that in a cohort study design it would be inappropriate to power the study to detect a mean change over time smaller than the test-retest CoR [26]. The CoR for the more peripheral locations was lower, suggesting that the test is able to detect a smaller magnitude of change with the stimulus presented at peripheral locations. For example, for the cone-enhanced test in the superotemporal location, the CoR was 1.81, indicating that a change exceeding 1.81 (38%) in FMT would be clinically significant. For the rod enhanced test, a similar pattern was noted. Whilst the absolute repeatability of rod FMTs tended to be poorer than for cones, the CoR were more comparable when expressed as a percentage of the mean FMT. For example, the % CoR for rod FMT was 34.6% at the inferonasal location, as compared to 32.2% for the same location for the cone enhanced condition.

It is also important to consider that CoR may not be a consistent value for all populations. In the sub analysis conducted we demonstrated that CoR was elevated in people ≥45 years of age. FMT itself has also been shown to increase with increased age [25], so we also evaluated CoR as a proportion of mean FMT for each sub group (see Table 4). The % CoR remained higher in the older age group for 7/10 conditions. This is consistent with other literature in different aspects of psychophysical testing [1,25]. It is also likely that the CoR would be further elevated in people with pathology [34]. It has been shown, for example, in SAP that the variability was higher with visual field loss in glaucoma patients [35,36]. Hence, it would be of value for researchers to carry out further work to determine the CoR in their population group of interest. The study conducted by McKeague, et al., (2014), reported a CoR of 53.4% of the mean threshold of 14 Hz flicker test. Our study demonstrated comparable repeatability, with mean % CoR across locations of 44.1% and 41.8% for cone and rod enhanced stimuli, respectively.

Whilst there is no evidence to date on the expected magnitude of elevation of FMT in people with different pathologies, it is possible to examine other flicker detection studies to determine what change might be expected in proportional terms. Temporal contrast sensitivity (TCS) thresholds are reported to be higher in the people with glaucoma as compared to controls [17,18,20,34,3742]. Although the number of studies investigating TCS in glaucoma are limited, there is still some evidence that this deficit may occur prior to perimetric defects [18], and of an increasing decline in flicker sensitivity with increasing disease severity [17]. Fidalgo et al., (2018) tested peripheral photopic flicker thresholds in glaucoma patients with varying disease severity, showing 123% increase in thresholds as disease progress from early to moderate stage, and 58% increase between moderate and severe disease stages, which is greater than the % CoR for peripheral cone-enhanced condition (32%−44%). Flicker sensitivity is also reduced in individuals with early stages of AMD, and it decreases further with the disease severity [9,10,15]. Another study reported the mean flicker thresholds measured using Flicker-Plus test in individuals with neovascular AMD (n = 13) were significantly higher (262%), which significantly exceeds the % CoR of 43.4% reported here for a comparable test condition [43]. Luu et al., (2013) assessed flicker sensitivity using flicker perimetry in AMD patients and reported the test-retest reproducibility with fixed bias of −0.41dB and 95% LoA were ± 5dB, suggesting that smaller changes should be interpreted carefully. The FMT for foveal rod-enhanced condition showed a fixed bias of 1.36 with LoAs −1.98 and 4.70.

Conclusion

The findings show good to excellent test-retest repeatability of Flicker-Plus test. The CoR was higher (indicating poorer repeatability) for rod-enhanced conditions and at the fovea. There are many factors that can cause this increased variability, and these will be examined in future studies.

Decrease in temporal contrast sensitivity occur as the number of photoreceptors diminishes [10]. Most of the flicker tests have focused on foveal location which is more populated with cone photoreceptors hence assessing the cone mediated flicker sensitivity. It is also reported that the number of cones remains relatively constant throughout human life span [[44]] and thus the effect of age and disease cannot be separated very precisely using these tests. There is evidence that temporal sensitivity assessed at peripheral locations would provide a better measure of visual function loss in diseases such as AMD and glaucoma, in which early changes are not confined to central retina [11]. The excellent repeatability of the rod and cone FMT assessed in the parafoveal retinal locations suggest that this test would be valuable in assessing this functional loss. Sub-analysis also indicated that CoR was higher in older adults (>45 years), both in absolute terms and when expressed as a proportion of the mean FMT. This should be considered in sample size calculations for clinical trial design.

This study will assist clinicians in determining whether reported changes over time are caused by measurement inaccuracy or disease advancement, assuming consistent experimental setting and psychophysical techniques.

Acknowledgments

We acknowledge the J&P Wolf Trust for supporting AH and SB with bursaries for their doctoral studies. No other funding or sponsorship was received to support this study or publication of this article. All named authors meet the International Committee of Medical Journal Editors (ICMJE) criteria for authorship for this article, take responsibility for the integrity of the work as a whole, and have given their approval for this version to be published.

References

  1. 1. Tyler CW. Two processes control variations in flicker sensitivity over the life span. J Opt Soc Am A. 1989;6(4):481–90. pmid:2709193
  2. 2. Tyler CW. Analysis of normal flicker sensitivity and its variability in the visuogram test. Investigative Ophthalmology & Visual Science. 1991;32(9):2552–60.
  3. 3. Kur J, Newman EA, Chan-Ling T. Cellular and physiological mechanisms underlying blood flow regulation in the retina and choroid in health and disease. Prog Retin Eye Res. 2012;31(5):377–406. pmid:22580107
  4. 4. Buttery RG, Hinrichsen CF, Weller WL, Haight JR. How thick should a retina be? A comparative study of mammalian species with and without intraretinal vasculature. Vision Res. 1991;31(2):169–87. pmid:2017880
  5. 5. Alder VA, Ben-Nun J, Cringle SJ. PO2 profiles and oxygen consumption in cat retina with an occluded retinal circulation. Invest Ophthalmol Vis Sci. 1990;31(6):1029–34. pmid:2354908
  6. 6. Shakoor A, Blair NP, Mori M, Shahidi M. Chorioretinal vascular oxygen tension changes in response to light flicker. Invest Ophthalmol Vis Sci. 2006;47(11):4962–5. pmid:17065514
  7. 7. Felder AE, Wanek J, Blair NP, Shahidi M. Retinal Vascular and Oxygen Temporal Dynamic Responses to Light Flicker in Humans. Invest Ophthalmol Vis Sci. 2017;58(13):5666–72. pmid:29098297
  8. 8. McKeague C, Binns AM, Margrain TH. An evaluation of two candidate functional biomarkers for AMD. Optom Vis Sci. 2014;91(8):916–24. pmid:24978867
  9. 9. Luu CD, Dimitrov PN, Wu Z, Ayton LN, Makeyeva G, Aung K-Z, et al. Static and flicker perimetry in age-related macular degeneration. Invest Ophthalmol Vis Sci. 2013;54(5):3560–8. pmid:23620428
  10. 10. Mayer MJ, Spiegler SJ, Ward B, Glucs A, Kim CB. Mid-frequency loss of foveal flicker sensitivity in early stages of age-related maculopathy. Invest Ophthalmol Vis Sci. 1992;33(11):3136–42. pmid:1399418
  11. 11. Brown B, Lovie-kitchin J. Temporal function in age related maculopathy. Clin Exp Optom. 1987;70(4):112–6.
  12. 12. Gugleta K, Türksever C, Polunina A, Orgül S. Effect of ageing on the retinal vascular responsiveness to flicker light in glaucoma patients and in ocular hypertension. Br J Ophthalmol. 2013;97(7):848–51. pmid:23624271
  13. 13. Riva CE, Salgarello T, Logean E, Colotto A, Galan EM, Falsini B. Flicker-evoked response measured at the optic disc rim is reduced in ocular hypertension and early glaucoma. Invest Ophthalmol Vis Sci. 2004;45(10):3662–8. pmid:15452074
  14. 14. Zele AJ, Dang TM, O’Loughlin RK, Guymer RH, Harper A, Vingrys AJ. Adaptation mechanisms, eccentricity profiles, and clinical implementation of red-on-white perimetry. Optom Vis Sci. 2008;85(5):309–17. pmid:18451735
  15. 15. Dimitrov PN, Robman LD, Varsamidis M, Aung KZ, Makeyeva GA, Guymer RH, et al. Visual function tests as potential biomarkers in age-related macular degeneration. Invest Ophthalmol Vis Sci. 2011;52(13):9457–69. pmid:22003115
  16. 16. Tyler CW. Specific deficits of flicker sensitivity in glaucoma and ocular hypertension. Invest Ophthalmol Vis Sci. 1981;20(2):204–12. pmid:7461923
  17. 17. Fidalgo BR, Jindal A, Tyler CW, Ctori I, Lawrenson JG. Development and validation of a new glaucoma screening test using temporally modulated flicker. Ophthalmic Physiol Opt. 2018;38(6):617–28. pmid:30575071
  18. 18. Huchzermeyer C, Horn F, Lämmer R, Mardin C, Kremers J. Summation of Temporal L-Cone- and M-Cone-Contrast in the Magno- and Parvocellular Retino-Geniculate Systems in Glaucoma. Invest Ophthalmol Vis Sci. 2021;62(6):17.
  19. 19. Hathibelagal AR, Bharadwaj SR, Jalali S, Subramanian A, Barbur JL. Evaluation of photoreceptor function in inherited retinal diseases using rod- and cone-enhanced flicker stimuli. Ophthalmic Physiol Opt. 2021;41(4):874–84. pmid:33834501
  20. 20. Pearson P, Swanson WH, Fellman RL. Chromatic and achromatic defects in patients with progressing glaucoma. Vision Res. 2001;41(9):1215–27. pmid:11292509
  21. 21. Radner W, Benesch T. Age-related course of visual acuity obtained with ETDRS 2000 charts in persons with healthy eyes. Graefes Arch Clin Exp Ophthalmol. 2019;257(6):1295–301. pmid:30989362
  22. 22. Van Herick W, Shaffer RN, Schwartz A. Estimation of width of angle of anterior chamber. Incidence and significance of the narrow angle. Am J Ophthalmol. 1969;68(4):626–9. pmid:5344324
  23. 23. Chylack LT, Wolfe JK, Singer DM, Leske MC, Bullimore MA, Bailey IL, et al. The Lens Opacities Classification System III. Archives of Ophthalmology. 1993;111(6):831–6.
  24. 24. Wyszecki G, Stiles WS. Color science: concepts and methods, quantitative data and formulae. 2nd ed. Wiley. 1982.
  25. 25. Hathibelagal AR, Bharadwaj SR, Yadav AR, Subramanian A, Sadler JRE, Barbur JL. Age-related change in flicker thresholds with rod- and cone-enhanced stimuli. PLoS One. 2020;15(7):e0232784. pmid:32639956
  26. 26. Bland JM, Altman DG. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet. 1986;1(8476):307–10. pmid:2868172
  27. 27. Vaz S, Falkmer T, Passmore AE, Parsons R, Andreou P. The case for using the repeatability coefficient when calculating test-retest reliability. PLoS One. 2013;8(9):e73990. pmid:24040139
  28. 28. Koo TK, Li MY. A Guideline of Selecting and Reporting Intraclass Correlation Coefficients for Reliability Research. J Chiropr Med. 2016;15(2):155–63. pmid:27330520
  29. 29. Reed GF, Lynn F, Meade BD. Use of coefficient of variation in assessing variability of quantitative assays. Clin Diagn Lab Immunol. 2002;9(6):1235–9. pmid:12414755
  30. 30. Matheson GJ. We need to talk about reliability: making better use of test-retest studies for study design and interpretation. PeerJ. 2019;7:e6918. pmid:31179173
  31. 31. Lakens D. Calculating and reporting effect sizes to facilitate cumulative science: a practical primer for t-tests and ANOVAs. Front Psychol. 2013;4:863. pmid:24324449
  32. 32. Werner EB, Adelson A, Krupin T. Effect of patient experience on the results of automated perimetry in clinically stable glaucoma patients. Ophthalmology. 1988;95(6):764–7.
  33. 33. Yenice O, Temel A. Evaluation of two Humphrey perimetry programs: full threshold and SITA standard testing strategy for learning effect. Eur J Ophthalmol. 2005;15(2):209–12. pmid:15812761
  34. 34. Heijl A, Lindgren G, Olsson J. The Effect of Perimetric Experience in Normal Subjects. Arch Ophthalmol. 1989;107(1):81–6.
  35. 35. Chauhan BC, Johnson CA. Test-retest variability of frequency-doubling perimetry and conventional perimetry in glaucoma patients and normal subjects. Invest Ophthalmol Vis Sci. 1999;40(3):648–56. pmid:10067968
  36. 36. Heijl A, Lindgren G, Olsson J. Normal variability of static perimetric threshold values across the central visual field. Arch Ophthalmol. 1987;105(11):1544–9.
  37. 37. Bierings RAJM, Kuiper M, van Berkel CM, Overkempe T, Jansonius NM. Foveal light and dark adaptation in patients with glaucoma and healthy subjects: A case-control study. PLoS One. 2018;13(3):e0193663. pmid:29509787
  38. 38. Cursiefen C, Horn F, Jünemann AG, Korth M. Reduced recovery of temporal contrast sensitivity after flicker stress in patients with glaucoma. J Glaucoma. 2000;9(4):296–302. pmid:10958602
  39. 39. Stroux A, Korth M, Jünemann A, Jonas JB, Horn F, Ziegler A, et al. A statistical model for the evaluation of sensory tests in glaucoma, depending on optic disc damage. Invest Ophthalmol Vis Sci. 2003;44(7):2879–84. pmid:12824226
  40. 40. Martus P, Junemann A, Wisse M, Budde WM, Horn F, Korth M. Multivariate approach for quantification of morphologic and functional damage in glaucoma. Investigative Ophthalmology & Visual Science. 2000;41(5):1099–110.
  41. 41. Loughman J, Davison P, Flitcroft I. Diagnostic sensitivity/specificity of preattentive vision tests in glaucoma. Optom Vis Sci. 2008;85(7):543–6. pmid:18594347
  42. 42. Joao CAR, Scanferla L, Jansonius NM. Retinal contrast gain control and temporal modulation sensitivity across the visual field in glaucoma at photopic and mesopic light conditions; Canon [Japan], Carl Zeiss Meditec [United States]. Invest Ophthalmol Vis Sci. 2019;60(13):4270–6.
  43. 43. Yadav G, Narayanan R, Hathibelagal AR. Chromatic and flicker threshold changes in age-related macular degeneration following anti-VEGF treatment. Clin Exp Optom. 2022;105(3):313–9. pmid:33941047
  44. 44. Curcio CA, Owsley C, Jackson GR. Spare the rods, save the cones in aging and age-related maculopathy. Invest Ophthalmol Vis Sci. 2000;41(8):2015–8. pmid:10892836