Introduction

Multifocal visual-evoked potentials make it possible to obtain the individual responses produced in either the retina (multifocal electroretinogram: mfERG) or at cortical level (multifocal visual-evoked potential: mfVEP) by light excitation of a large number of sectors of the visual field (typically between 60 and 120).

The basic principles of the multifocal visual excitation technique are the result of research [1] into pseudorandom sequences conducted in the 1990s. Subsequent advances have facilitated its application in research and clinical environments [2].

These responses can be studied individually by sector, although the signal-to-noise ratio improves and the analysis time decreases if the responses are studied by cluster of sectors. Another advantage of the clustering approach is that it facilitates clinical topographic interpretation of the results.

The guidelines on mfERG [3] indicate the possibility of clustering the responses by quadrant, hemiretinal area, normal and abnormal regions of two eyes, or successive rings from centre to periphery. Responses from stimulus sectors associated with a local area of interest can be averaged for comparison with a similar area in an unaffected eye or with data from control subjects.

In a large number of publications, clustering by ring is used to analyse mfERG recordings, especially in diseases that produce altered responses with approximate radial symmetry. Examples include [4] (for diagnosis of glaucoma) and [5] (for identifying retinal toxicity due to treatment with hydroxychloroquine). Another study [6] concludes that using the central ring in mfERGs is an appropriately sensitive technique with which to study the progression of age-related macular degeneration over short periods of time.

Analysis by quadrant makes it possible to carry out comparisons between mfERG studies and optical coherence tomography (OCT) [7, 8]. It is also common to find the results of clustering by ring and quadrant in publications that analyse the technical aspects of mfERG [9, 10]. More recently, [11] detected altered responses in clustering by ring and quadrant in patients with nasopharyngeal carcinoma after radiotherapy.

Due to the irregular structure of the eye, which lacks symmetry of any kind, there is no basis for arguing that the predefined regions are in all cases those that possess the greatest capacity to discriminate between patients and control subjects. We therefore hypothesize that maximum affectation by a particular disease can present in any cluster within the visual field and can have an arbitrary shape and size and may not necessarily present in rings, quadrants or hemispheres.

To date, guidelines for mfVEP procedures have not been published. Clustering of mfVEP responses in various eccentric rings makes it possible to identify patients with multiple sclerosis (MS) [12, 13]. Analysis of mfVEP responses in the inner (0.87°–5.67°) and outer rings (5.68°–24°) is proposed as a method for assessing optic nerve dysfunction in patients with optic disc drusen [14]. The best results achieved in implementing computer-aided diagnosis of multiple sclerosis are obtained by analysing the characteristics of the mfVEP responses when clustered by ring [15]. Clustering mfVEP responses in quadrants are useful in the study of amblyopic eyes [16] or for comparing OCT against mfVEP [17], [18].

The purpose of this paper is to propose a method of grouping the multifocal visual responses with the greatest capacity to discriminate between control subjects and patients affected by any disease identifiable in these responses. This grouping technique can be used for any pathology with neuroretinal or macular impact, and can be applied to various electrophysiological techniques (both in isolation and in combination).

This paper analyses mfERG signals in a comparative cohort of control subjects and patients affected by MS. Although MS is not an ophthalmological disease per se, eye function is very frequently affected. Moreover, as a neurodegenerative disease, MS-derived damage to the retinal nerve fibre layer (RNFL) has been detected using OCT, reflecting the progressive axonal damage caused by the development of the disease which, in addition, has been linked to patient disability, even in those without a history of optic neuritis [19,20,21].

This paper also evaluates the diagnostic capacity of mfERG sector clustering, a technique that has been shown to be effective with mfVEP recordings.

Materials and methods

Patients and method

The system proposed may be used for any multifocal electrophysiology technique (e.g. mfERG or mfVEP) and for diagnosing any type of disease. In this paper, as proof of concept, the method may be useful to diagnose multiple sclerosis (MS) from mfERG recordings.

Subject database and mfERG acquisition

The study procedures were performed in accordance with the tenets of the Declaration of Helsinki, and ethical approval was obtained from the local ethics committee [Aragón Clinical Research Ethics Committee (CEICA, Zaragoza, Spain)]. All subjects were over the age of 18 and signed informed consent prior to study procedures.

The mfERG recordings from both eyes of 15 subjects (mean age 44.46 ± 8.24, M:F = 4:11, n = 30 eyes) with newly diagnosed MS (less than 6 months) and no history of optic neuritis, and those from six control subjects (mean age 35.83 ± 10.65, M:F = 3:3, n = 12 eyes), were used.

A complete neuro-ophthalmic examination was performed on all subjects in order to detect any ocular alteration that might affect functional vision or mfERG results. MS was diagnosed based on the 2010 revision of the McDonald Criteria [22]. The patients had no concomitant ocular diseases, nor any previous history of retinal pathology, glaucoma or significant refractive errors (more than 5 dioptres of spherical equivalent refraction or 3 dioptres of astigmatism), strabism or systemic conditions that could affect the visual system.

The mfERGs were recorded using the RETI-port/scan 21 (Roland Consult, Berlin, Germany) visual electrophysiology system, according to the ISCEV standard [3]. The stimulus array consisted of 61 sectors, arranged hexagonally, displayed at a 60-Hz frame rate. The luminance of each sector was independently alternated between black (< 2 cd/m2 of luminance) and white (200 cd/m2 of luminance) according to a pseudorandom binary m-sequence. The active channels were recorded using Dawson–Trick–Litzkow (DTL) electrodes fabricated from a flexible, lightweight nylon wire impregnated with silver and placed in contact with the conjunctival sac. The electrodes were placed below the lower eyelid, with one end being attached to the external canthus and the other end being attached at the height of the conjunctival sac. Three spoon electrodes were used, two as reference electrodes placed on each temple and the third as an earth electrode placed at the height of the nasion. An amplifier with 104 gain and 10–200 Hz bandwidth was used. The signals were digitized at a sample rate of 1017 samples/s, with the number of samples from each signal being 84 (82.61 ms long). The first-order mfERG kernel was analysed.

Before recording the mfERG signals, the characteristics of the test were explained to the subjects in order to achieve satisfactory fixation. During the test, both the acquisition device and the staff monitored the recordings to ensure that they were taken correctly. The recordings were examined by an expert ophthalmologist to identify potential issues relating to eccentric fixation, among others, before conducting a blind analysis of the signals.

Clustering

The method proposed requires the following inputs:

  • A database of multifocal recordings taken from patients (MS) and control subjects.

  • Definition of one or more parameters used to evaluate the capacity to discriminate between patients and control subjects. For mfERG records, potential parameters could be wave amplitude (AN1, AP1), latencies (LN1, LP1) or other suitable parameters (for example, obtained from the signals’ decomposition wavelet).

  • In order to obtain a minimum signal-to-noise ratio value, each cluster is considered to comprise at least NMIN = 5 sectors. The maximum number of sectors in a possible cluster is equal to the number of sectors in the visual field: NMAX.

  • The capacity to discriminate between patients and control subjects is evaluated using the area under the receiver operator characteristic curve (AUC). The AUC can be taken from one of the parameters or from the mean value of several.

Definition of valid clusters

Given a value of N (NMIN = 5 ≤ N ≤ NMAX = 61), a cluster defined as valid contains N-contiguous sectors (i.e. all the sectors belonging to the cluster are adjacent to at least two others in the cluster). Figure 1a shows two valid clusters while Fig. 1b shows two invalid clusters. In both cases, N = 7.

Fig. 1
figure 1

Examples of possible clusters (N = 7). a Valid clusters. b Invalid clusters

All possible sector clusters were analysed as per the conditions described above, including conventional clustering by ring, with the exception of Ring 1, which comprises a single sector.

Implementation of the method

Drawing on the database of patients and control subjects, for every value of N (NMIN ≤ N ≤ NMAX): (i) the valid clusters are identified; (ii) the mean of the multifocal responses of the sectors that comprise the cluster is calculated; (iii) the parameter or parameters of interest are obtained from the mean signal; (iv) the AUC between patients and control subjects is calculated; and (v) if the value of the AUC is the highest obtained so far, its details (N value, cluster identification) are saved until the procedure is completed.

It is possible to analyse two or more of the recordings’ parameters. To do so, the value of the AUC must be obtained for each of them. For example, Eq. (1) shows the AUC for the mean value of the amplitudes; Eq. (2) shows the mean value of the AUC obtained in analysis of the latencies; and Eq. (3) shows the mean value of the joint analysis of those 4 parameters.

$${\text{AUC}}_{\text{AMP}} = \frac{{{\text{AUC}}_{{A_{{{\text{P}}1}} }} + {\text{AUC}}_{{A_{{{\text{N}}1}} }} }}{2}$$
(1)
$${\text{AUC}}_{\text{LAT}} = \frac{{{\text{AUC}}_{{L_{{{\text{P}}1}} }} + {\text{AUC}}_{{L_{{{\text{N}}1}} }} }}{2}$$
(2)
$${\text{AUC}}_{\text{MEDIA}} = \frac{{{\text{AUC}}_{{A_{{{\text{P}}1}} }} + {\text{AUC}}_{{A_{{{\text{N}}1}} }} + {\text{AUC}}_{{L_{{{\text{P}}1}} }} + {\text{AUC}}_{{L_{{{\text{N}}1}} }} }}{4}$$
(3)

It would also be possible to apply a different weighting to the partial AUC values that make up the overall AUC. Equation (4) shows a hypothetical case in which the discriminant capacity of the latencies is prioritised over that of the amplitudes.

$${\text{AUC}}_{\text{GLOBAL}} = 0.2 * {\text{AUC}}_{{A_{{{\text{P}}1}} }} + 0.2 * {\text{AUC}}_{{A_{{{\text{N}}1}} }} + 0.3 * {\text{AUC}}_{{L_{{{\text{P}}1}} }} + 0.3 * {\text{AUC}}_{{L_{{{\text{N}}1}} }}$$
(4)

The arranged hexagonal sectors of the mfERG response are codified with the system of coordinates (q, r) shown in Fig. 2. The central sector (number 31) has the coordinates (q = 0, r = 0). These coordinates are able to efficiently codify the six neighbouring sectors for every single hexagon (each hexagon has six equidistant neighbours). As example, q and r positive values from Fig. 2 codify sectors nearly corresponding to the inferior nasal quadrant.

Fig. 2
figure 2

System of coordinates used to find clusters

Figure 3 shows the flowchart of the method implemented. For all N values between NMIN and NMAX, a search is conducted for all N-size clusters, taking each of the 61 sectors (Si) as a seed. In Step 1, the coordinates of the seed sector are obtained as per Fig. 2, while in Step 2, all the N-size clusters originating in sector Si are obtained. The condition of Step 3 dictates that for every hexagon in the cluster analysed there must be at least two other hexagons at a distance equal to 1. The distance between two hexagons (H1, H2) in the system of coordinates proposed is calculated as follows:

Fig. 3
figure 3

Flowchart of the proposed method

$${\text{D}}\left( {{\text{H}}_{1} ,{\text{H}}_{2} } \right) = \frac{{\left| {q_{1} - q_{2} } \right| + \left| {\left( {q_{1} + r_{1} } \right) - \left( {q_{2} + r_{2} } \right)} \right| + \left| {r_{1} - r_{2} } \right|}}{2}$$
(5)

If the cluster fulfils the conditions, the signals of the sectors are averaged in both the control subjects and the patients and the value of the AUC (Step 4) is obtained until it is verified that all possibilities have been analysed (Step 5).

The number of clusters formed and analysed depends on the N-size. For example, for a value of N = 5, a total of 2 217 groups is analysed, while for a value of N = 7 the number of groups analysed rises to 26 224.

Results

Table 1 shows the AUC values obtained from conventional analysis by ring. Although clustering by ring (except Ring 1) is considered in cluster formation, these results make it possible to evaluate the advantages of the method proposed.

Table 1 AUC values obtained from analysis by ring

Figure 4 shows the clusters with greatest discriminant capacity (N = 5…N = 16), considering the amplitude of wave P1 (AP1) as the mfERG signal analysis parameter. The greatest discriminant value is obtained for N = 9 (\({\text{AUC}}_{{\left( {A_{P1} , N = 9} \right)}} = 0.7042\)), located in the inferior perifoveal quadrant (Rings 2, 3). For values of N > 16, the AUC values obtained continue to decrease.

Fig. 4
figure 4

Algorithm performance in selecting the best cluster for parameter AP1

Figure 5 shows the results produced by several clusters, taking into account the four parameters (amplitudes and latencies of waves N1 and P1), and therefore, Eq. (3). According to this criterion, the maximum capacity to discriminate between patients with MS and control subjects is \({\text{AUC}}_{{\left( {A_{\text{N1}} , A_{\text{P1}} , L_{\text{N1}} ,L_{\text{P1}} , N = 8} \right)}} = 0.6917\). The AUC values obtained for N > 16 are less than 0.6917.

Fig. 5
figure 5

Changes in the clusters, taking into account the maximum discriminant value for the mean of the two amplitudes and two latencies of waves N1 and P1

In recent publications, [10] and [23] observed that the latency of wave P1 is higher in MS patients. Figure 6 shows the results for this parameter, with the greatest discriminant capacity being \({\text{AUC}}_{{\left( {L_{\text{P1}} , N = 12} \right)}} = 0.8854\), which corresponds to the nasal perifoveal area. In this case as well, clusters comprising more than 16 sectors do not obtain better AUC values.

Fig. 6
figure 6

AUC results for clusters identified using latency P1

Discussion

One of the advantages of the multifocal electrophysiology techniques is that they obtain a high number of responses in the visual field. However, in order to obtain a clear clinical interpretation of the results, it is necessary to apply clustering methods to the responses. Although studies usually group responses into predetermined shapes (rings and quadrants) and sizes (number of sectors) depending on the technique used and the disease to diagnose, it may be beneficial to identify clusters of arbitrary shapes and sizes. When analysing multifocal visual responses, it is very common to use pre-established cluster shapes (e.g. rings, quadrants or hemispheres). With our method, all possible options within the visual field are automatically and arbitrarily explored on the understanding that, for some signals and diseases, the optimal analysis areas do not necessarily coincide with pre-established clusters.

This article proposes an automatic method of selecting those areas of a topographic map of multifocal visual responses that present the greatest differences between patients and control subjects. The method seeks the zone with greatest discriminant capacity, exceeding the AUC values obtained from clustering by ring (Table 1). Its efficacy in diagnosing MS from mfERG signals and using the amplitude and latency parameters of the first-order kernel responses to perform analysis has been demonstrated.

Among the options tested in our database, the highest AUC value is obtained by analysing the latency of wave P1 \(({\text{AUC}}_{{\left( {L_{\text{P1}} , N = 12} \right)}} = 0.8854\)); this result is in agreement with [10] and [23]. Also, for our database, the highest AUC values were obtained by analysing clusters with fewer than 16 sectors.

The potential uses of the method proposed are listed below:

  • With multifocal visual-evoked potentials, it can be used to analyse mfERG, mfVEP or mfPERG signals and can even be used to analyse areas of the automated perimetry. In general, it is applicable to any sector-based technique.

  • The method is valid for any type of disease (MS, glaucoma, etc.) identifiable in these types of recordings.

  • The AUC of one or several valid analysis parameters can be used. If the AUC is obtained from more than two parameters, each of them can be weighted with different coefficients.

  • In studies of temporal evolution, the method can also be effective in detecting the region in which the most significant alterations appear.

It is also feasible to use it to analyse other types of tests, such as OCT. In line with this, [24] detects the retinal nerve fibre layer (RNFL) thicknesses found in angular sectors, which offer the best capacity for diagnosis in glaucoma patients.

With the method proposed in this paper it would be possible to determine which regions (of arbitrary shapes and sizes) are the most relevant for diagnosis. For example, with a Triton device with SS-OCT technology, it is possible to obtain grids of up to 60 * 45 points of measurement of the thickness of various layers of the retina. Detecting the areas suffering greatest alteration would be performed as follows:

  • For one of the layers of the retina for which thickness measurements are available (e.g. RNFL), cells or basic sectors are formed. For example, if each cell comprises 5 * 5 measurements, NMAX = 108 sectors would be obtained.

  • The mean value of the thickness in each of the sectors defined is then calculated.

  • Starting from a minimum N value, a search is conducted for coherent cell clusters up to a maximum size of NMAX. As the cells are square, it may be beneficial to consider that, to qualify as a valid cluster, all the cells should be adjacent to at least one other cell in the cluster. For each of the valid clusters, the AUC is calculated and the algorithm selects the cluster and the N value with the highest AUC.

Our method found clusters that differentiated, with a high AUC, between patients with MS (as proof of concept) and healthy control subjects. However, because of the small size of our database, it is not possible to affirm that the clusters identified as having greatest discriminant capacity are specific to this disease. Our group considers it worthwhile to corroborate the results with further studies involving a higher number of patients and to compare them with other diseases. In such case, if it were possible to obtain a specific cluster for each disease, it would be possible to diagnose and monitor each of them individually. In this regard, the clusters found in MS could be used as a valuable means of strengthening diagnosis or confirming non-definitive suspicions raised in earlier phases, such as when MRI does not reveal any damage.

In conclusion, this paper opens up the opportunity to identify sectors and clusters other than those arbitrarily chosen by the equipment manufacturer, basing them on the actual topographical affectation of a particular pathology at neuroretinal and cell level. This will enable better understanding of the physiopathological mechanisms of the diseases and expedite early diagnosis of them, which is the main objective of electrophysiological tests.