Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Inter-laboratory agreement on embryo classification and clinical decision: Conventional morphological assessment vs. time lapse

  • Luis Martínez-Granados ,

    Roles Conceptualization, Formal analysis, Investigation, Methodology, Visualization, Writing – original draft

    Luismargra@gmail.com

    Affiliation U. Reproducción, UGC de Laboratorio Clínico y UGC Obstetricia y Ginecología, Hospital Universitario Virgen de las Nieves, Instituto de Investigación Biosanitaria de Granada (IIBG), Granada, Spain

  • María Serrano,

    Roles Investigation, Project administration, Writing – review & editing

    Affiliation Clínica IFEM, Córdoba, Spain

  • Antonio González-Utor,

    Roles Investigation, Project administration, Writing – review & editing

    Affiliation Centro MasVida Reproducción, Sevilla, Spain

  • Nereyda Ortíz,

    Roles Investigation, Project administration, Writing – review & editing

    Affiliation Instituto Europeo de Fertilidad, Madrid, Spain

  • Vicente Badajoz,

    Roles Data curation, Resources, Writing – review & editing

    Affiliation Ginefiv, Madrid, Spain

  • Enrique Olaya,

    Roles Data curation, Resources, Writing – review & editing

    Affiliation Clínica Tambre, Madrid, Spain

  • Nicolás Prados,

    Roles Data curation, Resources, Writing – review & editing

    Affiliation IVI, Sevilla, Spain

  • Montse Boada,

    Roles Writing – review & editing

    Affiliation Women’s Health Dexeus, Barcelona, Spain

  • Jose A. Castilla,

    Roles Conceptualization, Supervision, Validation, Writing – original draft

    Affiliations U. Reproducción, UGC de Laboratorio Clínico y UGC Obstetricia y Ginecología, Hospital Universitario Virgen de las Nieves, Instituto de Investigación Biosanitaria de Granada (IIBG), Granada, Spain, CEIFER Biobanco, Granada, Spain, Departamento de Anatomía y Embriología Humana, Facultad de Medicina, Universidad de Granada, Granada, Spain

  • on behalf of Special Interest Group in Quality of ASEBIR (Spanish Society for the Study of Reproductive Biology)

    The complete membership of the Special Interest Group in Quality of ASEBIR is provided in the Acknowledgments.

Abstract

The aim of this study is to determine inter-laboratory variability on embryo assessment using time-lapse platform and conventional morphological assessment. This study compares the data obtained from a pilot study of external quality control (EQC) of time lapse, performed in 2014, with the classical EQC of the Spanish Society for the Study of Reproductive Biology (ASEBIR) performed in 2013 and 2014. In total, 24 laboratories (8 using EmbryoScope™, 15 using Primo Vision™ and one with both platforms) took part in the pilot study. The clinics that used EmbryoScope™ analysed 31 embryos and those using Primo Vision™ analysed 35. The classical EQC was implemented by 39 clinics, based on an analysis of 25 embryos per year. Both groups were required to evaluate various qualitative morphological variables (cell fragmentation, the presence of vacuoles, blastomere asymmetry and multinucleation), to classify the embryos in accordance with ASEBIR criteria and to stipulate the clinical decision taken. In the EQC time-lapse pilot study, the groups were asked to determine, as well as the above characteristics, the embryo development times, the number, opposition and size of pronuclei, the direct division of 1 into 3 cells and/or of 3 into 5 cells and false divisions. The degree of agreement was determined by calculating the intra-class correlation coefficients and the coefficient of variation for the quantitative variables and the Gwet index for the qualitative variables. For both EmbryoScope™ and Primo Vision™, two periods of greater inter-laboratory variability were observed in the times of embryo development events. One peak of variability was recorded among the laboratories addressing the first embryo events (extrusion of the second polar body and the appearance of pronuclei); the second peak took place between the times corresponding to the 8-cell and morula stages. In most of the qualitative variables analysed regarding embryo development, there was almost-perfect inter-laboratory agreement among conventional morphological assessment (CMA), EmbryoScope™ and Primo Vision™, except for false divisions, vacuoles and asymmetry (users of all methods) and multinucleation (users of Primo Vision™), where the degree of agreement was lower. The inter-laboratory agreement on embryo classification according to the ASEBIR criteria was moderate-substantial (Gwet 0.41–0.80) for the laboratories using CMA and EmbryoScope™, and fair-moderate (Gwet 0.21–0.60) for those using Primo Vision™. The inter-laboratory agreement for clinical decision was moderate (Gwet 0.41–0.60) on day 5 for CMA users and almost perfect (Gwet 0.81–1) for time-lapse users. In conclusion, time-lapse technology does not improve inter-laboratory agreement on embryo classification or the analysis of each morphological variable. Moreover, depending on the time-lapse platform used, inter-laboratory agreement may be lower than that obtained by CMA. However, inter-laboratory agreement on clinical decisions is improved with the use of time lapse, regardless of the platform used.

Introduction

Precise embryo selection is crucially important to obtaining good results in assisted reproduction techniques. Currently, the two most widespread methods of embryo selection are conventional morphological assessment (CMA) and embryonic morphokinetics using time-lapse (TL) platforms. Inter-laboratory variability in the application of conventional morphology has been demonstrated in embryo classification and in the clinical decision taken [1,2]. In consequence, various scientific societies recommend participation in external quality control (EQC) programmes of embryo evaluation [3,4].

Although TL methods provide a great deal of information on embryo development, their analysis continues to be performed manually [5]. As regards the measurement of times on TL platforms, only one study has evaluated intra- and inter-observer accuracy (among three observers), reporting little variability in this respect [6]. To our knowledge, no study has yet been conducted, or EQC programme implemented, to evaluate inter-laboratory variability in the determination of these times.

Most studies seeking to develop classification algorithms using TL platforms only take into account the timing of embryo development [712]. However, these platforms also allow embryos to be selected by evaluating classical morphological characteristics, such as pronuclei features, blastomere size, symmetry, fragmentation and multinucleation [13]. Ahlstrom et al. [14] showed that this strategy is clinically more useful than an embryo selection method in which only the timing is analysed. Other authors, using various strategies, have incorporated these characteristics into algorithms based on morphokinetic events [8,15,16]. At present, little is known about the performance of TL in the assessment of morphological characteristics such as fragmentation or multinucleation. Sundvall et al. [6], in a study of three observers, obtained a ‘fair’ intraclass correlation coefficient for evenness and multinucleation.

Taking into account the above considerations, the aims of this study are to determine inter-laboratory variability in the assessment of morphokinetic events and morphological characteristics using TL and to compare the inter-laboratory variability in embryo classification and clinical decision after using TL with that observed using CMA. To achieve these study goals, the results of a pilot EQC morphokinetic trial were compared with those obtained by a CMA EQC programme.

Materials and methods

External quality control of embryo morphology using time-lapse technology

ASEBIR (Spanish Society for the Study of Reproductive Biology) e-mailed all its members inviting them to participate in a pilot EQC test of embryo morphology using TL technology, informing them that participation would be voluntary, by each laboratory, and that to participate the laboratory must possess and use one of the two platforms evaluated (EmbryoScope™ [Unisense FertiliTech, Denmark] or Primo Vision™ Version 4.3.1.5 [Vitrolife, Sweden]).

In June 2014, each participating laboratory was sent three batches (on a USB device), with videos of the development up to the blastocyst stage of 9–14 embryos from oocytes inseminated by ICSI. Each batch consisted of images of embryos obtained from a different couple. These videos were made from images taken every 5–20 minutes, at different focal planes, of 31 embryos for the users of EmbryoScope™ and of 35 embryos for those of Primo Vision™.

The videos of the embryos used for the trial were obtained from the ICSI programmes at the following ART centres, each of which had the morphokinetic platforms stipulated to be analysed in this study: GineFIV, Madrid; Clínica Tambre, Madrid; Unidad de Reproducción CHU, Granada. The embryos were cultured in vitro, according to the protocol established at each laboratory. All patients signed informed consent authorising the use of anonymised data and images of their embryos for the dissemination of scientific knowledge. This is a retrospective study of anonymised data and images; it therefore does not require the approval of an ethical committee for clinical investigation, according to Spanish law (OrdenSAS/3470/2009).

For the EmbryoScope™ platform, the videos were obtained using the software provided with the system, which generates a TL video from high-definition static images captured every 10 minutes, on seven focal planes. The embryos were cultured at an oxygen concentration of 5%. The TL videos for the Primo Vision™ platform were generated from images consisting of seven 200 μm focal planes, captured every 5 minutes for the first 24 hours and then every 20 minutes until the day of embryo transfer. These embryos were cultured at an oxygen concentration of 21%.

Together with the images, instructions were sent to each participating laboratory with explanations on the operation of the pilot quality control test and a database for recording the times and other morphological characteristics with which the embryos would be classified, in accordance with the ASEBIR criteria [3].

The following morphokinetic events (quantitative variables) were recorded: time elapsed until the extrusion of the second polar body (tPB2), time to the appearance (tPNa) and fading (tPNf) of the pronuclei, time to the first signs of first division (tSt2), time to cell division (t2, t3, t4, t5, t6, t7, t8, t9), time to end of compaction process (tM), time to the first signs of blastulation (tSB), time to full blastocyst stage (tB), time to start of expansion (tE) and time to start of hatching (tHN).

In addition, the following morphological characteristics (qualitative variables) were recorded: number of pronuclei (PN), opposition of PN (yes or no), size of PN (symmetrical or asymmetrical), whether there was a direct division of 1 into 3 cells and/or of 3 into 5 cells, the highest rate of cell fragmentation reached during development (≤10%, >10–25%, >25–36% or >36%), the absence or presence of vacuoles, the asymmetry of blastomeres (symmetry or asymmetry at the 2, 4 or 8 cell stages), multinucleation from the 2-cell stage to the 4-cell stage (absence or presence) and false cleavage (yes or no).

From the morphological characteristics evaluated (cell fragmentation, presence of vacuoles, blastomere asymmetry and multinucleation), together with the number of cells at the time points 16–18 hours (day 1), 43–45 hours (day 2), 67–69 hours (day 3) and 114–118 hours (day 5) after sperm microinjection, the embryos were classified according to the ASEBIR system, using a flow chart to indicate embryo quality and implantation potential, as follows: A—very high; B—high; C—low; D—very low [3].

The laboratories, after classifying the embryos on days 2, 3 and 5 according to the ASEBIR system, had to decide which embryo was optimal on day 5, per batch, for transfer. For the embryos that were not transferred, they also had to decide whether they should be cryopreserved or discarded. The TL users were not asked for a clinical decision on day 2 or 3.

External quality control programme using conventional morphological assessment

The laboratories that participate in this external quality control program have done so on a voluntary basis, in response to the annual call promoted by ASEBIR since 2003 and which it has organised since 2012 [2,17]. The data analysed in this paper correspond to the 2013–14 editions, in which 39 laboratories took part.

In this programme, the laboratories had online access to a platform, with three batches with videos of zygotes, day 2 and day 3 embryos and two batches with videos of day 4 and day 5 embryos. The optic and magnification used for the recording were Hoffman and 400x, respectively. The recordings were carried out as a daily routine assessment of embryo quality. Each batch consisted of embryo images obtained from different couples. The first three batches were divided into three groups, each one with five videos, of zygotes, day 2 embryos and day 3 embryos, respectively. The last two batches were divided into two groups, one with five videos of day 4 embryos and the second with five videos of day 5 embryos. In the annual ASEBIR external quality control programme, the following morphological characteristics are analysed in day 2 and day 3 embryos: percentage of cell fragmentation, presence or absence of vacuoles, blastomere asymmetry and multinucleation (presence or absence).

The laboratories were asked to classify each day 2, 3 and 5 embryo in accordance with the ASEBIR criteria and to decide, for the last two batches (day 5), which embryo should be transferred. For the last two batches, the laboratories were asked to assume that all embryos were classified as type A at day 3. For the embryos that were not transferred, the laboratories were asked to decide which should be cryopreserved and which should be discarded.

Statistics methods

For both quality control programmes, a high degree of agreement was considered to have been achieved when over 75% of the laboratories reported the same classification or took the same clinical decision for the embryo.

The data obtained in this trial were analysed with the statistical package SPSS 19.0. For each time analysed, the coefficient of variation (CV) and the intraclass correlation coefficient (ICC), together with the corresponding 95% confidence interval, were calculated. To determine the degree of inter-observer agreement in evaluating the qualitative variables, the AgreeStat2015 [Advanced Analytics, LLC (USA)] program was used to calculate the Gwet AC2 (weighted) concordance index for several observers and several categories, together with the respective 95% confidence intervals. The strength of agreement for the Gwet AC2 statistic was expressed using the following intervals: 0 = no agreement; 0.01–0.20 = slight agreement; 0.21–0.40 = fair agreement; 0.41–0.60 = moderate agreement; 0.61–0.80 = substantial agreement; 0.81–1.00 = almost perfect agreement. The mean times of each event for the embryos analysed with each platform were compared by the Student test. Yates’ chi-square tests were used to compare the categoric data. All tests were conducted at the 5% significance level.

Results

Each of the embryo events analysed with the EmbryoScope™ and Primo Vision™ platforms (Fig 1) presented similar mean times, although from the onset of blastulation these times were rather shorter in the embryos cultured in EmbryoScope™. Raw data are shown in S1S6 Tables.

thumbnail
Fig 1. Mean times of the quantitative variables for both morphokinetic platforms (maximum and minimum).

https://doi.org/10.1371/journal.pone.0183328.g001

Both platforms revealed two periods of greater inter-laboratory variability in the timing of embryo development events: first, regarding the initial embryo events (extrusion of the second polar body and the appearance of pronuclei) and then between the 8-cell and morula stages (Figs 2 and 3).

thumbnail
Fig 2. Mean CV and 95% CIs (error bars) obtained at different times of embryo development for both morphokinetic platforms.

https://doi.org/10.1371/journal.pone.0183328.g002

thumbnail
Fig 3. ICC and 95% CIs (error bars) obtained at different times of embryo development, for both morphokinetic platforms.

https://doi.org/10.1371/journal.pone.0183328.g003

In most of the qualitative variables of embryo development, almost perfect inter-laboratory agreement was observed, both among those using CMA and among those using EmbryoScope™ or Primo Vision™, except for false cleavages, vacuoles and asymmetry, for all the methods analysed, and for multinucleation among the users of Primo Vision™, where the degree of agreement was lower (Table 1).

thumbnail
Table 1. Degree of inter-laboratory agreement (Gwet AC2 index) on the evaluation of embryo variables, by TL and CMA.

https://doi.org/10.1371/journal.pone.0183328.t001

The level of inter-laboratory agreement on embryo classification according to the ASEBIR criteria was similar (moderate-substantial) among users of CMA and of EmbryoScope™, for days 2, 3 and 5. However, there was less agreement (fair-moderate) among the users of Primo Vision™ (Table 2). Similar conclusions were obtained concerning the percentage of embryos presenting a high degree of agreement: the laboratories that used CMA (30.4%, 24/79) and EmbryoScope™ (35.2%, 25/71) obtained similar results, which in both cases were higher than those obtained by the users of Primo Vision™ (20%, 17/85) (Table 3).

thumbnail
Table 2. Degree of inter-laboratory agreement (Gwet AC2 index) on embryo classification by ASEBIR criteria and clinical decision.

https://doi.org/10.1371/journal.pone.0183328.t002

thumbnail
Table 3. Percentage of embryos with a high degree of agreement on embryo classification according to ASEBIR criteria.

https://doi.org/10.1371/journal.pone.0183328.t003

The percentage of embryos presenting a high degree of inter-laboratory agreement when classified according to ASEBIR criteria, both with CMA and with the TL platforms, was higher when the embryo was classified as very low quality (type D) by the majority of the participating centres, than for categories A, B and C (Table 3).

With respect to the clinical decision, at day 5, the level of agreement on the clinical decision was higher among the users of TL (almost perfect) than among the laboratories using CMA (moderate) (Table 2). In addition, the percentage of embryos for which there was a high level of agreement regarding the clinical decision was higher among users of the TL platforms than among CMA users (Table 4).

thumbnail
Table 4. Percentage of embryos with a high degree of agreement on clinical decision.

https://doi.org/10.1371/journal.pone.0183328.t004

The percentage of embryos presenting a high level of inter-laboratory agreement regarding the clinical decision, both with CMA and with the TL platforms, was higher for the embryos that were to be discarded than for those to be transferred (Table 4).

Discussion

In evaluating an embryo, the observer factor is one of the most important points, since the subjective component–observation of images of the embryo–is inherent to the assessment. In theory, with platforms based on morphokinetics, this factor should be attenuated, due to the greater information thus obtained and the possibility of viewing the images as many times as necessary without prejudice to the status of the embryo. However, according to Lundin et al. [18], the fact that cell development events can be defined by the starting point, by the mid-event status or by the last photogram observed could hamper the comparison of results obtained by different laboratories. Our results seem to confirm this hypothesis, as events that take longer to occur (30–45 min.) such as the extrusion of the second polar body or the appearance of pronuclei, coincide with greatest inter-laboratory variability. By contrast, in events that occur almost instantaneously, such as cell division or the disappearance of pronuclei, the inter-laboratory variability observed was significantly lower.

The other peak of inter-laboratory variability observed in our study took place following the 8-cell stage, possibly because the cells are then in greater proximity, making it difficult to count the blastomeres as the compaction process begins. Furthermore, this compaction may be only partial, such that some cells are excluded, making it impossible to distinguish clearly whether the cells can still be counted or whether the morula stage has begun.

The systematic review by Kirkegaard et al. [19], on the morphokinetic parameters related to achieving pregnancy or embryo euploidy, concluded that the only parameters with predictive capacity, presented by more than one laboratory, were outside the periods of high inter-laboratory variability discussed above. This suggests that the parameters for these periods of high inter-laboratory variability have little external validity and should not be used in embryo classification algorithms. In fact, one of the most commonly used algorithms, that of Meseguer et al. [7], updated by Basile et al. [10], does not include any of the parameters of high inter-observer variability described in the present study.

The high degree of agreement observed with respect to morphokinetic events coincides with the findings of Sundvall et al. [6]. In our study, the validity of this conclusion was strengthened by the participation of 24 laboratories, compared to the three observers at the single laboratory studied by Sundvall et al. [6].

There was no improvement in the level of inter-laboratory agreement between users of TL technology and of CMA, for any independent morphological characteristic or in embryo classification according to ASEBIR criteria. This may be because the criteria for assessing morphological characteristics, which are often transient, are not well defined [20,21]. This means that although there is a lot of information, there are no clear criteria for its assessment. In addition, it is noteworthy that among the users of Primo Vision™ the level of agreement was significantly lower concerning fragmentation and multinucleation than among those of EmbryoScope™ and CMA. This may be due to technical (optical) limitations of this platform, hampering the evaluation of these characteristics. This finding corroborates previous recommendations [18,22] that any new embryo selection technology should be assessed prior to its implementation in assisted reproduction laboratories. In this respect, our results show that the key performance index depends on the technology used and on the laboratory that uses it.

It is noteworthy that the absence of any improvement in the level of inter-observer agreement among users of TL extends to the evaluation of asymmetry, an area in which a lower level of agreement among CMA users is to be expected, since the users of CMA are not equipped with the accurate instruments needed to evaluate cell asymmetry. In contrast, many morphokinetic platforms do have these instruments. Our results could be explained by reference to three factors. First, not all laboratories use these instruments. Second, not all of them employ a clear-cut definition of embryonic asymmetry. Finally, the above-mentioned transient nature of the morphological characteristics, including asymmetry, would hinder their analysis [23].

Kirkegaard et al. [19] suggested that embryo classification should be used for ranking rather than selection, to avoid the risk of discarding usable embryos, taking into account that no model has yet demonstrated 100% sensitivity and specificity in the prediction of implantation potential. Indeed, Mastenbroek et al. [24], in view of this risk, questioned the need for any embryo selection at all and recommended, at least, the redefinition of embryo selection. To minimise this negative effect of embryo selection, it is crucial to control the accuracy of the embryo selection criteria, which, as our results show, vary according to the technology used.

Unlike in embryo classification, the level of agreement on the clinical decision is higher with TL platforms than with CMA. This result, may be due to the high degree of agreement observed in qualitative variables, which can only be observed by TL [25,26], and that have been related to poor embryo quality [2729]. However, the level of agreement on the variables that can be observed with both methods is lower than that found in the above-described case. This would appear to be a logical outcome, since in their decision-making CMA users only use variables for which there is a lower level of agreement; hence, the clinical decision based on these variables presents a lower level of agreement than when other variables are also taken into consideration (as is the case at laboratories that use TL).

Some authors have suggested that the use of algorithms based on morphokinetic events should be complemented by the observation of classical morphological characteristics by TL [23] or by CMA [30]. However, our results show that this practice remains subject to inter-laboratory variability. The use of TL technology to evaluate classical morphological variables requires these variables to be clearly defined beforehand. For example, the degree of fragmentation to be taken into account can be assessed in various ways: as the degree of maximum fragmentation reached during the entire process of embryo development; as the degree of fragmentation in the last photogram obtained before each cell cycle; or as the degree of fragmentation on the day of the transfer [14]. According to our data, the use of morphokinetic platforms does not guarantee a higher level of inter-laboratory agreement in the assessment of morphological variables than when these same variables are evaluated by CMA. Therefore, we believe that before recommending the incorporation of these variables into the corresponding algorithms, the definition of parameters such as asymmetry, multinucleation and vacuoles, under TL technology, must be standardised.

The results of this study show that assisted reproduction laboratories tend to be in closer agreement on the evaluation of low-quality embryos than of high-quality ones, and of embryos that are to be discarded than on those to be transferred, independently of the embryo evaluation technique used, as reported previously by Arce et al. [1] and Castilla et al. [2] for the CMA method. The high degree of accuracy observed in the classification of low-quality embryos would produce a low rejection rate of viable embryos. On the other hand, the heterogeneity of results observed in the classification of high-quality embryos makes it difficult to obtain reproducible models that would enable a substantial increase in implantation rates [31]. We believe that these factors could be behind the slow incorporation of time-lapse platforms into clinical practice.Our study has several limitations. The embryos evaluated were different in each method and subjected to different culture conditions. This could lead to differences both in embryo development and in embryo morphology, which could make the laboratories more likely to coincide in their assessments with one method than the other. However, the development times of the embryos analysed were similar for each of the platforms and although no significant differences were obtained, from the morula stage onwards there was a decrease in the rate of development for the embryos examined using the Primo Vision™ system. This may be due to the fact that the embryos in the EmbryoScope™ system were cultured in an atmosphere with 5% oxygen, while those cultured in the Primo Vision™ system had an atmosphere with 21% oxygen. This circumstance would be related to previous reports [32,33] that embryo development is faster when it takes place at low oxygen concentrations.

Another limitation is that it would have been preferable to know the implantation rates of the embryos evaluated and to determine whether the inter-laboratory variability on the clinical decision and on embryo quality influenced the effectiveness of assisted reproduction cycles. Furthermore, in this study, no analysis was made of intra-observer agreement, although Sundvall et al. [6] previously reported that this parameter is low, in a similar situation. Finally, the possibility of participation bias cannot be ruled out. As participation in the trial was voluntary, the participating laboratories may have been those which were most motivated and aware of the importance of quality management. Therefore, the inter-laboratory variability results obtained may not represent those actually present in all the centres that were invited to participate, i.e. our results could be under-estimates.

In view of the findings of this study, we conclude that it is necessary to develop guidelines to define and interpret the classical morphological characteristics observed by TL before incorporating these characteristics into diagnostic algorithms based on TL.

Supporting information

S1 Table. Mean times and standard deviations of the morphokinetic events of the embryos analysed with EmbryoScope.

B: Batch, O: Oocyte.

https://doi.org/10.1371/journal.pone.0183328.s001

(PDF)

S2 Table. Mean times and standard deviations of the morphokinetic events of the embryos analysed with Primo Vision.

B: Batch, O: Oocyte.

https://doi.org/10.1371/journal.pone.0183328.s002

(PDF)

S3 Table. Morphological characteristics and clinical decision of the embryos analysed with EmbryoScope.

Majority response (percentage of centres giving the majority response). PN:Pronuclei, NC:No consensus.

https://doi.org/10.1371/journal.pone.0183328.s003

(PDF)

S4 Table. Morphological characteristics and clinical decision of the embryos analysed with Primo Vision.

Majority response (percentage of centres giving the majority response). PN:Pronuclei, NC:No consensus.

https://doi.org/10.1371/journal.pone.0183328.s004

(PDF)

S5 Table. Morphological characteristics of the Day 2 and Day 3 embryos analysed with CMA.

Majority response (percentage of centres giving the majority response).NC: No consensus.

https://doi.org/10.1371/journal.pone.0183328.s005

(PDF)

S6 Table. Morphological classification and clinical decision of the Day 5 embryos analysed with CMA.

Majority response (percentage of centres giving the majority response).

https://doi.org/10.1371/journal.pone.0183328.s006

(PDF)

Acknowledgments

The authors thank the following centres for their participation in the pilot external quality control test of embryo morphology based on time-lapse technology: Clínica Vistahermosa (Alicante), Cefiva Oviedo (Asturias), Instituto de Reproducción Cefer (Barcelona), IVI (Barcelona), Women’s Health Dexeus (Barcelona), Ibilab Palma de Mallorca (Islas Baleares), Clínica Ginefiv (Sevilla), IVI (Sevilla), Biogest (Tarragona), using the EmbryoScope™ platform; and Almería Fiv (Almería), UR Clínica Mediterráneo (Almería), Imara Mataró (Barcelona), Clínica Ginecológica Doctores Povedano (Córdoba), Hospital Virgen de las Nieves (Granada), Clínica Fiv (Madrid), Instituto Ginecológico la Cigüeña (Madrid), Unilabs Reproducción Humana (Madrid), Clínica Fertia Fuengirola (Málaga), Hospital Quirón (Málaga), Clínica Ginefiv (Sevilla), Clínica Ginemed (Sevilla), Institut Conceptum Reus (Tarragona), MasVida Reproducción (Sevilla), Clínica Ginemed (Valencia), Hospital Nuestra Señora de Fátima Vigo (Pontevedra), using the Primo Vision™ platform. The authors also would like to thank ASEBIR Steering Committee and its secretary, María-José Prieto, for their assistance. This article is related to the Ph.D. doctoral thesis of Luis Martínez–Granados.

List of membership of Special Interest Group in Quality of ASEBIR Ardoy M, Bou C, Cabello Y, Delgado S, Esbert M, Farreras A, Freijomil B, Gómez MJ, Iglesias M, López-Regalado ML, Lozano BM, Luna M, Mangrane C, Mantilla A, Marina F, Mauri A. Minguéz Y, Molina-González I, Moreno JM, Muñoz J, Orozco I, Rabanal A, Rámirez JP, Sánchez-Castro L, Uriarte N, Vilches MA, Zamora S.

References

  1. 1. Arce JC, Ziebe S, Lundin K, Janssens R, Helmgaard L, Sørensen P. Interobserver agreement and intraobserver reproducibility of embryo quality assessments. Hum Reprod. 2006;21:2141–2148. pmid:16606640.
  2. 2. Castilla JA, Ruiz de Assin R, Gonzalvo MC, Clavero A, Ramírez JP, Vergara F, et al. External quality control for embryology laboratory. Reprod. Biomed Online. 2010;20:68–74. pmid:20158990.
  3. 3. ASEBIR Special Interest Group of Embryology. Cuadernos de Embriología Clínica. Criterios ASEBIR de valoración morfológica de oocitos, embriones tempranos y blastocistos humanos, 2nd ed. ASEBIR, Madrid; 2008.
  4. 4. ESHRE Guideline Group on Good Practice in IVF Labs, De los Santos MJ, Apter S, Coticchio G, Debrock S, Lundin K, Plancha CE, et al. Revised guidelines for good practice in IVF laboratories (2015). Hum Reprod. 2016;31:685–686. pmid:26908842.
  5. 5. Molder A, Drury S, Costen N, Hartshorne GM, Czanner S. Semiautomated analysis of embryoscope images: Using localized variance of image intensity to detect embryo developmental stages. Cytometry A. 2015;87:119–128. pmid:25614363.
  6. 6. Sundvall L, Ingerslev HJ, Knudsen UB, Kirkegaard K. Inter- and intraobserver variability of time-lapse annotations. Hum Reprod. 2013;28:3215–3221. pmid:24070998.
  7. 7. Meseguer M, Herrero J, Tejera A, Hilligsoe KM, Ramsing N, Remohí J. The use of morphokinetics as a predictor of embryo implantation. Hum Reprod. 2011;26:2658–2671. pmid:21828117
  8. 8. Conaghan J, Chen AA, Willman SP, Ivani K, Chenette PE, Boostanfar R, et al. Improving embryo selection using a computer-automated time-lapse image analysis test plus day 3 morphology: results from a prospective multicenter trial. Fertil Steril. 2013;100:412–419. pmid:23721712.
  9. 9. VerMilyea MD, Tan L, Anthony JT, Conaghan J, Ivani K, Gvakharia M, et al. Computer-automated time-lapse analysis results correlate with embryo implantation and clinical pregnancy: a blinded, multi-centre study. Reprod Biomed Online. 2014;29:729–736. pmid:25444507.
  10. 10. Basile N, Vime P, Florensa M, Aparicio Ruiz B, García Velasco JA, Remohí J, et al. The use of morphokinetics as a predictor of implantation: a multicentric study to define and validate an algorithm for embryo selection. Hum Reprod. 2015;30:276–283. pmid:25527613.
  11. 11. Milewski R, Kuć P, Kuczyńska A, Stankiewicz B, Łukaszuk K, Kuczyński W. A predictive model for blastocyst formation based on morphokinetic parameters in time-lapse monitoring of embryo development. J Assist Reprod Genet. 2015;32:571–579. pmid:25690157.
  12. 12. Petersen BM, Boel M, Montag M, Gardner DK. Development of a generally applicable morphokinetic algorithm capable of predicting the implantation potential of embryos transferred on day 3. Hum Reprod. 2016;31:2231–2244. pmid:27609980.
  13. 13. Gardner DK, Balaban B. Assessment of human embryo development using morphological criteria in an era of time-lapse, algorithms and 'OMICS': is looking good still important? Mol. Hum Reprod. 2016;22:704–718. pmid:27578774.
  14. 14. Ahlstrom A, Park H, Bergh C, Selleskog U, Lundin K. Conventional morphological assessment performs better than morphokinetics for prediction of live birth after day 2 transfer. Reprod Biomed Online. 2016;33:61–70. pmid:27090968.
  15. 15. Diamond MP, Suraj V, Behnke EJ, Yang X, Angle MJ, Lambe-Steinmiller JC, et al. Using the Eeva Test™ adjunctively to traditional day 3 morphology is informative for consistent embryo assessment within a panel of embryologists with diverse experience. J Assist Reprod Genet. 2015;32:61–68. pmid:25331427.
  16. 16. Liu Y, Chapple V, Feenan K, Roberts P, Matosn P. Time-lapse deselection model for human day 3 in vitro fertilization embryos: the combination of qualitative and quantitative measures of embryo growth. Fertil Steril. 2016;105:656–662. pmid:26616439.
  17. 17. Ruiz de Assin R, Clavero A, Gonzalvo MC, Ramirez JP, Zamora S, Fernández A, et al. Comparison of methods to determine the assigned value in an external quality control programme for embryo evaluation. Reprod Biomed Online. 2009;19:824–829. pmid:20031024.
  18. 18. Lundin K, Ahlstrom A. Quality control and standardization of embryo morphology scoring and viability markers. Reprod Biomed Online. 2015;31:459–471. pmid:26277586.
  19. 19. Kirkegaard K, Ahlström A, Ingerslev HJ, Hardarson T. Choosing the best embryo by time-lapse versus standard morphology. Fertil Steril. 2015;103:323–332. pmid:25527231.
  20. 20. Hardarson T, Löfman C, Coull G, Sjögren A, Hamberger L, Edwards RG. Internalization of cellular fragments in a human embryo: Time-lapse recordings. Reprod Biomed Online. 2002;5:36–38. pmid:12470543.
  21. 21. Chavez SL, Loewke KE, Han J, Moussavi F, Colls P, Munne S, et al. Dynamic blastomere behaviour reflects human embryo ploidy by the four-cell stage. Nat Commun. 2012;13:1251. PMID: 23212380.
  22. 22. Harper J, Magli MC, Lundin K, Barratt CL, Brison D. When and how should new technology be introduced into the IVF laboratory? Hum Reprod. 2012;27:303–313. pmid:22166806.
  23. 23. Aparicio-Ruiz B, Basile N, Pérez Albalá S, Bronet F, Remohí J, Meseguer M. Automatic time-lapse instrument is superior to single-point morphology observation for selecting viable embryos: retrospective study in oocyte donation. Fertil Steril. 2016;106:1379–1385. pmid:27530063.
  24. 24. Mastenbroek S, Van der Veen F, Aflatoonian A, Shapiro B, Bossuyt P, Repping S. Embryo selection in IVF. Hum Reprod. 2011;26:964–966. pmid:21372045.
  25. 25. Kirkegaard K, Agerholm IE, Ingerslev HJ. Time-lapse monitoring as a tool for clinical embryo assessment. Hum Reprod. 2012;27:1277–1285. pmid:22419744.
  26. 26. Ciray HN, Campbell A, Agerholm IE, Aguilar J, Chamayou S, Esbert M, et al. Time-Lapse User Group. Proposed guidelines on the nomenclature and annotation of dynamic human embryo monitoring by a time-lapse user group. Hum Reprod. 2014;29:2650–2660. pmid:25344070.
  27. 27. Rubio I, Kuhlmann R, Agerholm I, Kirk J, Herrero J, Escribá MJ, et al. Limited implantation success of direct-cleaved human zygotes: a time-lapse study. Fertil Steril. 2012;98:1458–1463. pmid:22925687.
  28. 28. Athayde Wirka K, Chen AA, Conaghan J, Ivani K, Gvakharia M, Behr B, et al. Atypical embryo phenotypes identified by time-lapse microscopy: high prevalence and association with embryo development. Fertil Steril. 2014;101:1637–1648. pmid:24726214.
  29. 29. Liu Y, Chapple V, Feenan K, Roberts P, Matson P. Clinical significance of intercellular contact at the four-cell stage of human embryos, and the use of abnormal cleavage patterns to identify embryos with low implantation potential: a time-lapse study. Fertil Steril. 2015;103:1485–1491. pmid:25914097.
  30. 30. Kieslinger DC, De Gheselle S, Lambalk CB, De Sutter P, Kostelijk EH, Twisk JW, et al. Embryo selection using time-lapse analysis (Early Embryo Viability Assessment) in conjunction with standard morphology: a prospective two-center pilot study. Hum Reprod. 2016;31:2450–2457. pmid:27619774.
  31. 31. Kirkegaard K, Campbell A, Agerholm I, Bentin-Ley U, Gabrielsen A, Kirk J, et al. Limitations of a time-lapse blastocyst prediction model: a large multicentre outcome analysis. Reprod Biomed Online. 2014;29:156–158. pmid:24912418.
  32. 32. De los Santos MJ, Gámiz P, Albert C, Galán A, Viloria T, Pérez S, et al. Reduced oxygen tension improves embryo quality but not clinical pregnancy rates: a randomized clinical study into ovum donation cycles. Fertil Steril. 2013;100:402–407. pmid:23628109.
  33. 33. Kirkegaard K, Hindkjaer JJ, Ingerslev HJ. Effect of oxygen concentration on human embryo development evaluated by time-lapse monitoring. Fertil Steril. 2013;99:738–744. pmid:23245683.