Beruflich Dokumente
Kultur Dokumente
89 96, 2012
Advanced Access publication on November 7, 2011 doi:10.1093/humrep/der373
background: Near infrared (NIR) spectroscopy is a technology proposed to facilitate non-invasive screening for the most optimal
human embryo for uterine transfer. It has been proposed that the NIR spectral prole of an embryos spent culture medium can be
used to generate a viability score that correlates to implantation potential. As the initial proof of principle studies were all retrospective,
our aim was to investigate whether NIR spectroscopy on spent embryo culture medium in an on-site, prospective setting could improve
the ongoing single embryo transfer (SET) pregnancy rate after Day 2 and 5 transfers.
methods: We conducted a single-centre, double-blinded, randomized controlled trial in which the NIR group was compared with a
control group. The primary outcome was the clinical pregnancy rate after 67 weeks of gestation per randomized patient. In the control
group embryo selection was based only on traditional morphological evaluation while in the treatment group NIR spectroscopy was
added to the morphological evaluation.
results: The study was terminated early as the analysis of the Data Safety Monitoring Board showed a very low conditional power of
superiority for the primary outcome. Of the 752 patients calculated to be included in the study, 164 and 163 patients were randomized into
the NIR and control groups, respectively. No signicant difference in the ongoing pregnancy rate per randomized patient was found between
the NIR and the control group, 34.8 versus 35.6%, (P 0.97). The proportional difference between the study groups mean was 20.8% (95%
condence interval 11.4 to 10.2).
conclusions: This study shows that adding NIR spectroscopy, in its present form, to embryo morphology does not improve the
chance of a viable pregnancy when performing SET. The NIR technology appears to need further development before it can be used as
an objective marker of embryo viability.
clinical trials identifier: ISRCTN23817363.
Key words: RCT / NIR / embryo selection / pregnancy / metabolomic proling
Introduction
During recent years several non-invasive methods have been introduced for choosing the most optimal embryo for uterine transfer
(Brison et al., 2004; Seli et al., 2007; Katz-Jaffe et al., 2009). Among
these new methods is a rapid non-invasive screening technology
using near infrared (NIR) spectroscopy to analyse the chemical
content of spent culture medium. This technology examines NIR
spectra between 920 and 1675 nm and takes advantage of the fact
that molecules absorb specic frequencies that are characteristic of
their structure. Therefore an embryos medium sample will generate
a spectrum due to vibrations of specic functional groups including
& The Author 2011. Published by Oxford University Press on behalf of the European Society of Human Reproduction and Embryology. All rights reserved.
For Permissions, please email: journals.permissions@oup.com
Submitted on July 6, 2011; resubmitted on September 26, 2011; accepted on October 5, 2011
90
comparison these studies have used technologies not available to prospectively test in a clinical setting. Several proof of principle studies
examining results from Day 2, 3 and 5 transfers, including studies
from our clinic, have shown, through spectral analysis of large
numbers of medium samples from embryos with known pregnancy
outcomes (positive versus negative), that a predictive algorithm can
be created to predict the outcomes of a blinded sample population
(Hardarson et al., 2007; Seli et al., 2007; Scott et al., 2008;
Vergouw et al., 2008). These studies have, however, all been retrospective in their design and have relied largely on sending spent
frozen culture medium to a single central laboratory for measurement.
The aim of this study was to investigate if the NIR spectroscopy
system, utilizing an on-site instrument loaded with previously established predictive algorithms, was able to improve the ongoing single
embryo transfer (SET) pregnancy rate.
Study design
This was a blinded randomized controlled study comparing the NIR group
with a control group. All consecutive IVF patients at Fertility Center
Scandinavia, Carlanderska Hospital, Gothenburg, between April 2008
and December 2010 were assessed for eligibility. Those patients who fullled the inclusion criteria and signed informed consent forms were
included in the study. Ethical approval was obtained from the ethics committee at Goteborg University. The study was registered as an RCT with
the number of ISRCTN23817363.
The primary end-point was the number of ongoing pregnancies per randomized patient (intention-to-treat, ITT), based on fetal heart activity by
means of ultrasound around 45 days after embryo transfer.
Secondary end-points were the total number of biochemical pregnancies per randomized patient, based on a positive urine pregnancy test
18 days after embryo transfer, and the total number of ongoing pregnancies per randomized patient, based on fetal heart activity around 45 days
after embryo transfer on Day 2 and Day 5, respectively.
The randomization into the two study groups was performed using a
computerized randomization program balancing for all of the following
prognostic variables by way of minimization (1:1): mean female age,
patient age group categorized as ,34, 34 38 and .38; mean number
of good quality embryos (GQE) on the day of transfer; number of GQE
available for transfer categorized by groups 2, 3 4 and .5; mean
number of previous fresh IVF cycles leading to embryo transfer and
reason for infertility (tubal factor, male factor, unexplained and other).
The randomization was performed by the laboratory staff and blinded
from both the patient and physician.
Statistical methods
The following demographics and baseline variables were analysed by treatment group and subsequently corrected for in the statistical analysis as
possible confounders: maternal age, BMI, number of previous IVF treatment cycles, number of fertilized oocytes, method of fertilization,
number of aspirated follicles and oocytes, number of GQE on the day
of transfer, number of embryos transferred, type of pituitary regulation,
total dosage of gonadotrophins administered, day of transfer, medical
cause of infertility, number of cells on transfer Day 2, degree of fragmentation on transfer Day 2, degree of expansion on transfer Day 5, morphological grade of the inner cell mass (ICM) and trophectoderm cells (TE)
and viability score (only NIR group).
All data were analysed from all of the randomized couples, i.e. ITT analysis and from couples who were without any protocol violations that
might have had a signicant effect on the analysis, i.e. as per protocol
(PP) analysis.
The main analysis performed compared between the two groups of the
ITT population. Unadjusted comparison between the two groups was analysed using the Fishers exact test for dichotomous variables, Mann
Whitney U-test for continuous variables, Mantel Haenszel x 2-test for
ordered categorical variables and x 2-test for non-ordered categorical variables. The two groups could be either the two study groups or pregnancy/not pregnancy. Complementary unadjusted analyses were also
performed between the two groups on the PP population. A confounder
was dened as a baseline variable signicantly correlated with the outcome
Hardarson et al.
91
NIR spectroscopy
In the NIR group, all GQE on Day 2 or Day 5 were analysed using a prototype NIR spectroscopy system intended for IVF use (Molecular Biometrics
Inc., Norwood, MA, USA). From the two or more GQE randomized to
the NIR group, a small aliquot (13 ml) was aspirated from the culture
medium droplet of each GQE and injected into a sample cell (Molecular
Biometrics Inc.) for analysis by the instrument. A sample cell was only
used for one embryo and then discarded. The instrument used the NIR
spectroscopic technology as previously described (Seli et al., 2007;
Botros et al., 2008; Scott et al., 2008). An algorithm had previously
been developed using 62 samples for the Day 2 and Day 5 algorithms;
both algorithms were trained from 50% fetal cardiac positive and negative
SET medium samples. The Day 2 algorithm contained four predictive
wavelet regions, two below the water band (1300 1400 nm) and two
above the water band (1500 1600 nm), while the Day 5 algorithm contained three predictive wavelet regions all below the water band
(1300 1400 nm). Both algorithms were developed on a different instrument from that used in the trial. When the NIR analysis had been performed on the GQE cohort, the embryo with the highest numerical
score was chosen for SET. The other GQEs were cryopreserved according
Results
The DSMB analysis was performed when 287 patients had been randomized. The analysis was performed earlier than planned due to a
slower enrolment rate than originally expected. After comparing the
two groups, the advice from the DSMB was to stop the study in
accordance with alternative one in the pre-set study protocol. Recruitment into the study was therefore stopped. During the study period, a
total of 1430 patients underwent IVF treatment at the clinic. A total of
1072 (75%) patients met the inclusion criteria and 972 (68%) patients
signed an informed consent. On the day of randomization, 327 (23%)
had two or more GQEs available for transfer and were randomized
(Fig. 1). The two major reasons for drop-outs between those that
were eligible and those that were randomized were either lack of
two or more GQEs at the day of transfer or administrative reasons.
Of the randomized patients, 164 were allocated to the NIR group
and 163 patients to the control group. No signicant differences
between these groups were found for patient and treatment characteristics (Table I). However, there were signicant differences in the
morphological quality of embryos transferred on Day 5 with higher
grades found in the control group, i.e. grade of expansion
(P 0.0003), ICM (P 0.0096) and TE (P 0.025). The degree of
fragmentation and cell number were also improved in the Day 2
control group; however, these failed to reach signicance. From all
embryos (n 484) randomized to the NIR group, spent medium
samples were successfully analysed by NIR spectroscopy for 454
(93.8%) embryos. For 12 patients, failed NIR analysis led to embryo
selection by morphology alone.
As shown in Table II, no signicant difference in the ongoing pregnancy rate per randomized patient was found between the NIR and
the control group for both the ITT and PP populations, 34.8 versus
35.6%, (P 0.97) and 36.2 versus 36.3% (P 1.00), respectively.
The proportional difference between the study groups mean was
20.8% [95% condence interval (CI) 11.4 to 10.2]. Furthermore,
no signicant difference was found between the two study groups
for the biochemical pregnancy rate (P 0.35) or the ongoing pregnancy rate for Day 2 (P 0.63) or Day 5 transfer (P 0.55).
When adjusting for morphology on Day 5, the odds ratio for the
primary efcacy variable, ongoing pregnancies or live birth was 1.06
(95% CI: 0.54 2.08, P 0.86) and the odds ratio for pregnancies
was 0.62 (95% CI: 0.31 1.26, P 0.19) between the two groups.
For the NIR group, viability scores of transferred embryos ranged
from 20.015 to 1.677. No statistical signicance was found for the
odds of biochemical pregnancy and ongoing pregnancy separated by
the day of embryo transfer in relation to the viability score, as
shown in Table III.
Discussion
The DSMB analysis resulted in a recommendation to stop the study in
accordance with the criteria set out in the study protocol. The analysis
showed that the study had a very small chance of showing any
improvement in the pregnancy rate by using the prototype NIR
92
Hardarson et al.
results from this study failed to support the concept that the NIR technique, in its current state, can improve implantation rates for SET. Our
results are somewhat surprising given the background of initial proof of
principle reports that suggested that the NIR technique should work
to select the best embryo for transfer (Hardarson et al., 2007; Seli
et al., 2007; Scott et al., 2008; Vergouw et al., 2008). Although the
use of the NIR technology showed some benet in retrospective
93
.............................................................................................................................................................................................
Female age
BMI
ICSI
43.3% (71)
41.7% (68)
STD IVF
56.7% (93)
58.3% (95)
0.0% (0)
1.8% (3)
100% (164)
98.2% (160)
Cetrotide
1.8% (3)
1.2% (2)
Orgalutran
72.0% (118)
76.1% (124)
Suprecur
18.9% (31)
15.3% (25)
Suprefact
0.0% (0)
1.2% (2)
Synarela
7.3% (12)
6.1% (10)
53.0% (87)
50.9% (83)
47.0% (77)
49.1% (80)
30.5% (50)
32.5% (53)
Other
22.6% (37)
24.5% (40)
Tubal factor
11.0% (18)
9.2% (15)
Unexplained
36.0% (59)
33.7% (55)
90.8% (79)
97.5% (79)
6.9% (6)
1.2% (1)
2.3% (2)
1.2% (1)
66.7% (58)
80.2% (65)
33.3% (29)
19.8% (16)
33.8% (26)
8.9% (7)
64.9% (50)
89.9% (71)
1.3% (1)
1.3% (1)
61.0% (47)
81.0% (64)
39.0% (30)
19.0% (15)
45.5% (35)
64.6% (51)
54.5% (42)
35.4% (28)
94
Hardarson et al.
NIR
(n 5 164)
NIR control
(n 5 163)
P-value
.............................................................................................................................................................................................
Primary efcacy variable
Number of ongoing pregnancies or live births
34.8% (57)
35.6% (58)
0.97
54.3% (89)
48.5% (79)
0.35
31.0% (27)
26.5% (22)
0.63
39.0% (30)
45.0% (36)
0.55
NIR
(n 5 152)
NIR Control
(n 5 160)
P-value
36.2% (55)
36.3% (58)
1.00
PP population
.............................................................................................................................................................................................
Primary efcacy variable
20.1% (211.2; 11.1)
Number of pregnancies
55.9% (85)
49.4% (79)
0.30
33.3% (27)
27.2% (22)
0.49
39.4% (28)
45.6% (36)
0.55
For categorical variables % (n) is presented. For comparison between groups Fishers exact test was used for dichotomous variables. The condence interval for dichotomous variables
is the unconditional exact condence limits. If no exact limits can be computed the asymptotic Wald condence limits with continuity correction is calculated instead.
P-value
Area
under the
curve
........................................................................................
Ongoing pregnancies or
live births, embryo
transfer Day 2
0.97
0.52
Ongoing pregnancies or
live births, embryo
transfer Day 5
0.22
0.57
studies, it appears that a wide scale application is at this stage not feasible. Instrument variability, threshold of the analytical signal in the
medium and instrument noise to signal may have all inuenced both
algorithm development and its utilization. Variability between different
clinics may also play a signicant role in the global application of this
technology.
It is not uncommon to observe discrepancies between proof of
principle studies and larger RCT studies. This concept was recently
experienced when also examining the use of preimplantation genetic
screening, using uorescence in situ hybridization, in women undergoing IVF (Staessen et al., 2004; Mastenbroek et al., 2007; Hardarson
et al., 2008). Preliminary ndings suggesting a benet of a technique
are often based on smaller retrospective studies. The problem with
the NIR spectroscopy for predicting both disease, and in this case
pregnancy, is mainly related to the creation of the algorithm to be
95
could be argued that the ability to select the best Day 5 embryo compared with one of the cohort of good quality Day 5 embryos does,
however, provide benet. It is apparent from Table II that even
though morphologically lower quality embryos were selected in the
NIR group, pregnancy rates were not signicantly altered. This again
draws attention to the need for secondary measures in addition to
morphology that will provide precise physiological information about
an individual embryo to facilitate selection for SET. The premise of
any technology examining metabolic, metabolomic or proteomic differences between embryos is that this information provides a
greater ability to establish which is the viable embryo within a
cohort of GQEs. From our results it would appear that the current
technology was unable to improve on selection of the most viable
embryo when compared with blastocyst morphology. It therefore
appears that the morphological criteria used to perform SET on
Day 5 are superior compared with a combination of morphology
and metabolomics. On the contrary, the same may not be true for
selection of a SET on Day 2 and the extra information supplied by
this platform may be benecial.
In conclusion, this study has shown that adding NIR spectroscopy to
embryo morphology in its present form does not improve the chance
of a viable pregnancy. The NIR technology appears to need further development before being used as an objective marker of embryo
viability.
Authors roles
All authors played a role in study conception and design. T.H., A.A.
and L.R. took part in collection and assembly of data. T.H., A.A.,
D.S. and M.W. carried out data analysis, interpreted the ndings
and drafted the manuscript. All authors critically reviewed and
approved the nal version of the manuscript.
Acknowledgements
The authors thank Nils-Gunnar Phersson for his and his colleagues invaluable help with the statistical part of the study. The authors also
thank staff members of Molecular Biometrics Inc. for their assistance.
Conict of interest
D.S. has shares in and consults for Molecular Biometrics Inc.
Funding
This study was supported by an unconditional grant from Molecular
Biometrics Inc., USA.
References
Ahlstrom A, Wikland M, Rogberg L, Barnett JS, Tucker M, Hardarson T.
Cross-validation and predictive value of near-infrared spectroscopy
algorithms for day-5 blastocyst transfer. Reprod Biomed Online 2011;
27:477 483.
Botros L, Sakkas D, Seli E. Metabolomics and its application for
non-invasive embryo assessment in IVF. Mol Hum Reprod 2008;
14:679 690.
96
Hardarson et al.