External validation of models for predicting cumulative live birth over multiple complete cycles of IVF treatment.
Can two prediction models developed using data from 1999 to 2009 accurately predict the cumulative probability of live birth per woman over multiple complete cycles of IVF in an updated UK cohort?
After being updated, the models were able to estimate individualized chances of cumulative live birth over multiple complete cycles of IVF with greater accuracy.
The McLernon models were the first to predict cumulative live birth over multiple complete cycles of IVF. They were converted into an online calculator called OPIS (Outcome Prediction In Subfertility) which has 3000 users per month on average. A previous study externally validated the McLernon models using a Dutch prospective cohort containing data from 2011 to 2014. With changes in IVF practice over time, it is important that the McLernon models are externally validated on a more recent cohort of patients to ensure that predictions remain accurate.
A population-based cohort of 91 035 women undergoing IVF in the UK between January 2010 and December 2016 was used for external validation. Data on frozen embryo transfers associated with these complete IVF cycles conducted from 1 January 2017 to 31 December 2017 were also collected.
Data on IVF treatments were obtained from the Human Fertilisation and Embryology Authority (HFEA). The predictive performances of the McLernon models were evaluated in terms of discrimination and calibration. Discrimination was assessed using the c-statistic and calibration was assessed using calibration-in-the-large, calibration slope, and calibration plots. Where any model demonstrated poor calibration in the validation cohort, the models were updated using intercept recalibration, logistic recalibration, or model revision to improve model performance.
Following exclusions, 91 035 women who underwent 144 734 complete cycles were included. The validation cohort had a similar distribution age profile to women in the development cohort. Live birth rates over all complete cycles of IVF per woman were higher in the validation cohort. After calibration assessment, both models required updating. The coefficients of the pre-treatment model were revised, and the updated model showed reasonable discrimination (c-statistic: 0.67, 95% CI: 0.66 to 0.68). After logistic recalibration, the post-treatment model showed good discrimination (c-statistic: 0.75, 95% CI: 0.74 to 0.76). As an example, in the updated pre-treatment model, a 32-year-old woman with 2 years of primary infertility has a 42% chance of having a live birth in the first complete ICSI cycle and a 77% chance over three complete cycles. In a couple with 2 years of primary male factor infertility where a 30-year-old woman has 15 oocytes collected in the first cycle, a single fresh blastocyst embryo transferred in the first cycle and spare embryos cryopreserved, the estimated chance of live birth provided by the post-treatment model is 46% in the first complete ICSI cycle and 81% over three complete cycles.
Two predictors from the original models, duration of infertility and previous pregnancy, which were not available in the recent HFEA dataset, were imputed using data from the older cohort used to develop the models. The HFEA dataset does not contain some other potentially important predictors, e.g. BMI, ethnicity, race, smoking and alcohol intake in women, as well as measures of ovarian reserve such as antral follicle count.
Both updated models show improved predictive ability and provide estimates which are more reflective of current practice and patient case mix. The updated OPIS tool can be used by clinicians to help shape couples' expectations by informing them of their individualized chances of live birth over a sequence of multiple complete cycles of IVF.
This study was supported by an Elphinstone scholarship scheme at the University of Aberdeen and Aberdeen Fertility Centre, University of Aberdeen. S.B. has a commitment of research funding from Merck. D.J.M. and M.B.R. declare support for the present manuscript from Elphinstone scholarship scheme at the University of Aberdeen and Assisted Reproduction Unit at Aberdeen Fertility Centre, University of Aberdeen. D.J.M. declares grants received by University of Aberdeen from NHS Grampian, The Meikle Foundation, and Chief Scientist Office in the past 3 years. D.J.M. declares receiving an honorarium for lectures from Merck. D.J.M. is Associate Editor of Human Reproduction Open and Statistical Advisor for Reproductive BioMed Online. S.B. declares royalties from Cambridge University Press for a book. S.B. declares receiving an honorarium for lectures from Merck, Organon, Ferring, Obstetric and Gynaecological Society of Singapore, and Taiwanese Society for Reproductive Medicine. S.B. has received support from Merck, ESHRE, and Ferring for attending meetings as speaker and is on the METAFOR and CAPRE Trials Data Monitoring Committee.
N/A.
Ratna MB
,Bhattacharya S
,McLernon DJ
《-》
Predicting cumulative live birth for couples beginning their second complete cycle of in vitro fertilization treatment.
Can we develop an IVF prediction model to estimate individualized chances of a live birth over multiple complete cycles of IVF in couples embarking on their second complete cycle of treatment?
Yes, our prediction model can estimate individualized chances of cumulative live birth over three additional complete cycles of IVF.
After the completion of a first complete cycle of IVF, couples who are unsuccessful may choose to undergo further treatment to have their first child, while those who have had a live birth may decide to have more children. Existing prediction models can estimate the overall chances of success in couples before commencing IVF but are unable to revise these chances on the basis of the couple's response to a first treatment cycle in terms of the number of eggs retrieved and pregnancy outcome. This makes it difficult for couples to plan and prepare emotionally and financially for the next step in their treatment.
For model development, a population-based cohort was used of 49 314 women who started their second cycle of IVF including ICSI in the UK from 1999 to 2008 using their own oocytes and their partners' sperm. External validation was performed on data from 39 442 women who underwent their second cycle from 2010 to 2016.
Data about all UK IVF treatments were obtained from the Human Fertilisation and Embryology Authority (HFEA) database. Using a discrete time logistic regression model, we predicted the cumulative probability of live birth from the second up to and including the fourth complete cycles of IVF. Inverse probability weighting was used to account for treatment discontinuation. Discrimination was assessed using c-statistic and calibration was assessed using calibration-in-the-large and calibration slope.
Following exclusions, 49 314 women with 73 053 complete cycles were included. 12 408 (25.2%) had a live birth resulting from their second complete cycle. Cumulatively, 17 394 (35.3%) had a live birth over complete cycles two to four. The model showed moderate discriminative ability (c-statistic: 0.65, 95% CI: 0.64 to 0.65) and evidence of overprediction (calibration-in-the-large = -0.08) and overfitting (calibration slope 0.85, 95% CI: 0.81 to 0.88) in the validation cohort. However, after recalibration the fit was much improved. The recalibrated model identified the following key predictors of live birth: female age (38 versus 32 years-adjusted odds ratio: 0.59, 95% CI: 0.57 to 0.62), number of eggs retrieved in the first complete cycle (12 versus 4 eggs; 1.34, 1.30 to 1.37) and outcome of the first complete cycle (live birth versus no pregnancy; 1.78, 1.66 to 1.91; live birth versus pregnancy loss; 1.29, 1.23 to 1.36). As an example, a 32-year-old with 2 years of non-tubal infertility who had 12 eggs retrieved from her first stimulation and had a live birth during her first complete cycle has a 46% chance of having a further live birth from the second complete cycle of IVF and an 81% chance over a further three cycles.
The developed model was updated using validation data that was 6 to 12 years old. IVF practice continues to evolve over time, which may affect the accuracy of predictions from the model. We were unable to adjust for some potentially important predictors, e.g. BMI, smoking and alcohol intake in women, as well as measures of ovarian reserve such as antral follicle count. These were not available in the linked HFEA dataset.
By appropriately adjusting for couples who discontinue treatment, our novel prediction model will provide more realistic chances of live birth in couples starting a second complete cycle of IVF. Clinicians can use these predictions to inform discussion with couples who wish to plan ahead. This prediction tool will enable couples to prepare emotionally, financially and logistically for IVF treatment.
This study was supported by an Elphinstone scholarship scheme at the University of Aberdeen and Aberdeen Fertility Centre, University of Aberdeen. The authors have no conflict of interest.
N/A.
Ratna MB
,Bhattacharya S
,van Geloven N
,McLernon DJ
... -
《-》
A systematic review of the quality of clinical prediction models in in vitro fertilisation.
What are the best-quality clinical prediction models in IVF (including ICSI) treatment to inform clinicians and their patients of their chance of success?
The review recommends the McLernon post-treatment model for predicting the cumulative chance of live birth over and up to six complete cycles of IVF.
Prediction models in IVF have not found widespread use in routine clinical practice. This could be due to their limited predictive accuracy and clinical utility. A previous systematic review of IVF prediction models, published a decade ago and which has never been updated, did not assess the methodological quality of existing models nor provided recommendations for the best-quality models for use in clinical practice.
The electronic databases OVID MEDLINE, OVID EMBASE and Cochrane library were searched systematically for primary articles published from 1978 to January 2019 using search terms on the development and/or validation (internal and external) of models in predicting pregnancy or live birth. No language or any other restrictions were applied.
The PRISMA flowchart was used for the inclusion of studies after screening. All studies reporting on the development and/or validation of IVF prediction models were included. Articles reporting on women who had any treatment elements involving donor eggs or sperm and surrogacy were excluded. The CHARMS checklist was used to extract and critically appraise the methodological quality of the included articles. We evaluated models' performance by assessing their c-statistics and plots of calibration in studies and assessed correct reporting by calculating the percentage of the TRIPOD 22 checklist items met in each study.
We identified 33 publications reporting on 35 prediction models. Seventeen articles had been published since the last systematic review. The quality of models has improved over time with regard to clinical relevance, methodological rigour and utility. The percentage of TRIPOD score for all included studies ranged from 29 to 95%, and the c-statistics of all externally validated studies ranged between 0.55 and 0.77. Most of the models predicted the chance of pregnancy/live birth for a single fresh cycle. Six models aimed to predict the chance of pregnancy/live birth per individual treatment cycle, and three predicted more clinically relevant outcomes such as cumulative pregnancy/live birth. The McLernon (pre- and post-treatment) models predict the cumulative chance of live birth over multiple complete cycles of IVF per woman where a complete cycle includes all fresh and frozen embryo transfers from the same episode of ovarian stimulation. McLernon models were developed using national UK data and had the highest TRIPOD score, and the post-treatment model performed best on external validation.
To assess the reporting quality of all included studies, we used the TRIPOD checklist, but many of the earlier IVF prediction models were developed and validated before the formal TRIPOD reporting was published in 2015. It should also be noted that two of the authors of this systematic review are authors of the McLernon model article. However, we feel we have conducted our review and made our recommendations using a fair and transparent systematic approach.
This study provides a comprehensive picture of the evolving quality of IVF prediction models. Clinicians should use the most appropriate model to suit their patients' needs. We recommend the McLernon post-treatment model as a counselling tool to inform couples of their predicted chance of success over and up to six complete cycles. However, it requires further external validation to assess applicability in countries with different IVF practices and policies.
The study was funded by the Elphinstone Scholarship Scheme and the Assisted Reproduction Unit, University of Aberdeen. Both D.J.M. and S.B. are authors of the McLernon model article and S.B. is Editor in Chief of Human Reproduction Open. They have completed and submitted the ICMJE forms for Disclosure of potential Conflicts of Interest. The other co-authors have no conflicts of interest to declare.
N/A.
Ratna MB
,Bhattacharya S
,Abdulrahim B
,McLernon DJ
... -
《-》
Elective freezing of embryos versus fresh embryo transfer in IVF: a multicentre randomized controlled trial in the UK (E-Freeze).
Does a policy of elective freezing of embryos, followed by frozen embryo transfer result in a higher healthy baby rate, after first embryo transfer, when compared with the current policy of transferring fresh embryos?
This study, although limited by sample size, provides no evidence to support the adoption of a routine policy of elective freeze in preference to fresh embryo transfer in order to improve IVF effectiveness in obtaining a healthy baby.
The policy of freezing all embryos followed by frozen embryo transfer is associated with a higher live birth rate for high responders but a similar/lower live birth after first embryo transfer and cumulative live birth rate for normal responders. Frozen embryo transfer is associated with a lower risk of ovarian hyperstimulation syndrome (OHSS), preterm delivery and low birthweight babies but a higher risk of large babies and pre-eclampsia. There is also uncertainty about long-term outcomes, hence shifting to a policy of elective freezing for all remains controversial given the delay in treatment and extra costs involved in freezing all embryos.
A pragmatic two-arm parallel randomized controlled trial (E-Freeze) was conducted across 18 clinics in the UK from 2016 to 2019. A total of 619 couples were randomized (309 to elective freeze/310 to fresh). The primary outcome was a healthy baby after first embryo transfer (term, singleton live birth with appropriate weight for gestation); secondary outcomes included OHSS, live birth, clinical pregnancy, pregnancy complications and cost-effectiveness.
Couples undergoing their first, second or third cycle of IVF/ICSI treatment, with at least three good quality embryos on Day 3 where the female partner was ≥18 and <42 years of age were eligible. Those using donor gametes, undergoing preimplantation genetic testing or planning to freeze all their embryos were excluded. IVF/ICSI treatment was carried out according to local protocols. Women were followed up for pregnancy outcome after first embryo transfer following randomization.
Of the 619 couples randomized, 307 and 309 couples in the elective freeze and fresh transfer arms, respectively, were included in the primary analysis. There was no evidence of a statistically significant difference in outcomes in the elective freeze group compared to the fresh embryo transfer group: healthy baby rate {20.3% (62/307) versus 24.4% (75/309); risk ratio (RR), 95% CI: 0.84, 0.62 to 1.15}; OHSS (3.6% versus 8.1%; RR, 99% CI: 0.44, 0.15 to 1.30); live birth rate (28.3% versus 34.3%; RR, 99% CI 0.83, 0.65 to 1.06); and miscarriage (14.3% versus 12.9%; RR, 99% CI: 1.09, 0.72 to 1.66). Adherence to allocation was poor in the elective freeze group. The elective freeze approach was more costly and was unlikely to be cost-effective in a UK National Health Service context.
We have only reported on first embryo transfer after randomization; data on the cumulative live birth rate requires further follow-up. Planned target sample size was not obtained and the non-adherence to allocation rate was high among couples in the elective freeze arm owing to patient preference for fresh embryo transfer, but an analysis which took non-adherence into account showed similar results.
Results from the E-Freeze trial do not lend support to the policy of electively freezing all for everyone, taking both efficacy, safety and costs considerations into account. This method should only be adopted if there is a definite clinical indication.
NIHR Health Technology Assessment programme (13/115/82). This research was funded by the National Institute for Health Research (NIHR) (NIHR unique award identifier) using UK aid from the UK Government to support global health research. The views expressed in this publication are those of the author(s) and not necessarily those of the NIHR or the UK Department of Health and Social Care. J.L.B., C.C., E.J., P.H., J.J.K., L.L. and G.S. report receipt of funding from NIHR, during the conduct of the study. J.L.B., E.J., P.H., K.S. and L.L. report receipt of funding from NIHR, during the conduct of the study and outside the submitted work. A.M. reports grants from NIHR personal fees from Merck Serono, personal fees for lectures from Merck Serono, Ferring and Cooks outside the submitted work; travel/meeting support from Ferring and Pharmasure and participation in a Ferring advisory board. S.B. reports receipt of royalties and licenses from Cambridge University Press, a board membership role for NHS Grampian and other financial or non-financial interests related to his roles as Editor-in-Chief of Human Reproduction Open and Editor and Contributing Author of Reproductive Medicine for the MRCOG, Cambridge University Press. D.B. reports grants from NIHR, during the conduct of the study; grants from European Commission, grants from Diabetes UK, grants from NIHR, grants from ESHRE, grants from MRC, outside the submitted work. Y.C. reports speaker fees from Merck Serono, and advisory board role for Merck Serono and shares in Complete Fertility. P.H. reports membership of the HTA Commissioning Committee. E.J. reports membership of the NHS England and NIHR Partnership Programme, membership of five Data Monitoring Committees (Chair of two), membership of six Trial Steering Committees (Chair of four), membership of the Northern Ireland Clinical Trials Unit Advisory Group and Chair of the board of Oxford Brain Health Clinical Trials Unit. R.M. reports consulting fees from Gedeon Richter, honorarium from Merck, support fees for attendance at educational events and conferences for Merck, Ferring, Bessins and Gedeon Richter, payments for participation on a Merck Safety or Advisory Board, Chair of the British Fertility Society and payments for an advisory role to the Human Fertilisation and Embryology Authority. G.S. reports travel and accommodation fees for attendance at a health economic advisory board from Merck KGaA, Darmstadt, Germany. N.R.-F. reports shares in Nurture Fertility. Other authors' competing interests: none declared.
ISRCTN: 61225414.
29 December 2015.
16 February 2016.
Maheshwari A
,Bell JL
,Bhide P
,Brison D
,Child T
,Chong HY
,Cheong Y
,Cole C
,Coomarasamy A
,Cutting R
,Hardy P
,Hamoda H
,Juszczak E
,Khalaf Y
,Kurinczuk JJ
,Lavery S
,Linsell L
,Macklon N
,Mathur R
,Pundir J
,Raine-Fenning N
,Rajkohwa M
,Scotland G
,Stanbury K
,Troup S
,Bhattacharya S
... -
《-》