Skip to main content

Genome wide association study of response to interval and continuous exercise training: the Predict-HIIT study



Low cardiorespiratory fitness (V̇O2peak) is highly associated with chronic disease and mortality from all causes. Whilst exercise training is recommended in health guidelines to improve V̇O2peak, there is considerable inter-individual variability in the V̇O2peak response to the same dose of exercise. Understanding how genetic factors contribute to V̇O2peak training response may improve personalisation of exercise programs. The aim of this study was to identify genetic variants that are associated with the magnitude of V̇O2peak response following exercise training.


Participant change in objectively measured V̇O2peak from 18 different interventions was obtained from a multi-centre study (Predict-HIIT). A genome-wide association study was completed (n = 507), and a polygenic predictor score (PPS) was developed using alleles from single nucleotide polymorphisms (SNPs) significantly associated (P < 1 × 10–5) with the magnitude of V̇O2peak response. Findings were tested in an independent validation study (n = 39) and compared to previous research.


No variants at the genome-wide significance level were found after adjusting for key covariates (baseline V̇O2peak, individual study, principal components which were significantly associated with the trait). A Quantile–Quantile plot indicates there was minor inflation in the study. Twelve novel loci showed a trend of association with V̇O2peak response that reached suggestive significance (P < 1 × 10–5). The strongest association was found near the membrane associated guanylate kinase, WW and PDZ domain containing 2 (MAGI2) gene (rs6959961, P = 2.61 × 10–7). A PPS created from the 12 lead SNPs was unable to predict V̇O2peak response in a tenfold cross validation, or in an independent (n = 39) validation study (P > 0.1). Significant correlations were found for beta coefficients of variants in the Predict-HIIT (P < 1 × 10–4) and the validation study (P <  × 10–6), indicating that general effects of the loci exist, and that with a higher statistical power, more significant genetic associations may become apparent.


Ongoing research and validation of current and previous findings is needed to determine if genetics does play a large role in V̇O2peak response variance, and whether genomic predictors for V̇O2peak response trainability can inform evidence-based clinical practice.

Trial registration Australian New Zealand Clinical Trials Registry (ANZCTR), Trial Id: ACTRN12618000501246, Date Registered: 06/04/2018,


Cardiorespiratory fitness (CRF) is measured by peak oxygen uptake (V̇O2peak) during a graded exercise test, and is strongly associated with a reduced risk of cardiometabolic diseases and mortality [1]. Improving V̇O2peak can generally be achieved by regular endurance exercise training, in a dose-dependent manner [2]. Data typically supports the notion that a higher dose of exercise (volume and intensity) will elicit greater V̇O2peak gains [3,4,5,6,7]. Interval training, such as sprint interval training (SIT) and high-intensity interval training (HIIT) have shown comparable [8] and greater [9,10,11,12,13] group mean V̇O2peak changes, respectively, compared with moderate-intensity continuous training (MICT). However, there is considerable inter-individual variability in observed V̇O2peak improvements following apparently similar exercise training [7, 14]. Identifying the genetic and environmental determinants that can predict exercise response may pave the way to personalised exercise programs that can maximise health outcomes.

An early genome wide association study (GWAS) using data from the HEalth, RIsk factors, exercise Training And GEnetics (HERITAGE) Family Study reported that 21 variants contributed to 49% of the variance in V̇O2peak response [15]. However, very few of these variants have been replicated in further testing or other studies suggesting that the variants identified in the HERITAGE study were overfitted to the specific population. In a recent systematic review, we identified 35 studies describing 15 cohorts that found 97 possible variants associated with V̇O2peak training response [16]. Only 13 genetic variants were replicated by more than two authors [15, 17,18,19,20,21,22,23,24,25], and none reached genome-wide significance. A lack of replication and significance in previous research is mostly likely due to underpowered studies that have predominantly been candidate-gene focused [26, 27]. Furthermore, a comparator arm is necessary to discriminate true inter-individual variability from random and technical variability, yet very few studies included such a group, nor did they investigate or control for population stratification. This evidence to-date questions the validity of using currently available commercial genetic tests to prescribe exercise interventions.

Larger sample sizes are needed to build upon current research and to overcome random error in V̇O2peak measurement at the individual level. Greater collaboration between research centres using a discovery driven approach free from pre-existing bias is warranted [26]. V̇O2peak response between different population groups and training interventions along with assessing how individual factors modulate response, should also be explored [28]. The aim of this study was to use one of the largest cohorts to-date (multi-centre Predict-HIIT [7] study) to complete a GWAS to investigate genetic variants associated with V̇O2peak response following exercise training interventions. In addition, we attempted to replicate candidate variants from previous studies, and aimed to build and validate a genetic prediction model for V̇O2peak response (polygenic predictor score, PPS) based on the genetic data.



Discovery cohort—‘Predict HIIT’

Predict-HIIT participant characteristics, recruiting and study intervention details have been previously outlined [7]. Ethical approval was obtained from the Bellberry ethical committee at the University of Queensland (#2016-02-062-A-1), and from all the institutions involved. Participant data was collated from 18 exercise training interventions across eight universities from three continents. As outlined in our previous paper [7], participant change in objectively measured V̇O2peak (indirect calorimetry from a graded exercise test to volitional fatigue on a treadmill or cycle ergometer) was obtained following high-volume HIIT (sessions contained ≥ 15 min of high-intensity efforts in total, n = 225), low-volume HIIT/SIT (sessions contained < 15 min of high-intensity efforts in total, n = 76), or MICT (sessions contained 30 + minutes of continuous exercise at 64–76% maximum heart rate, n = 206). The characteristics of the 507 participants from predominantly European descent used in our GWAS are outlined in Table 1 (24% female, age 55.9 ± 16.9 years, 83% with pathologies and/or elderly). The deoxyribonucleic acid (DNA) extraction, preparation and genotyping are outlined below. These details varied based on the study site where the sample was collected, and whether DNA extraction and/or genotyping had already been completed prior to this study. Our quality control measures have limited bias associated with different DNA preparation, extraction methods and genotyping.

Table 1 Genome-wide association study participant characteristics. Mean ± standard deviation

Validation cohort—‘Improve-HIIT’

For replication of our results, we utilised the unpublished findings from an independent study recently performed in our laboratory (Improve-HIIT). The ‘Improve-HIIT’ study examined the response to high-volume HIIT by randomly allocating 40 sedentary (< 1 h of structured exercise each week) but apparently healthy Caucasian adults (age 18–50) to one of two groups: (i) 6 weeks of supervised high-volume HIIT (5 min warm up, 4 min 90–95% heart rate maximum followed by 3 min recovery repeated 4 times, 3×/week) + prebiotic fibre (oligofructose-enriched inulin) supplementation (12 g/day) or (ii) 6 weeks of supervised high-volume HIIT (3×/week) + placebo (maltodextrin) supplementation (12 g/day). There was no difference in the average V̇O2peak response, or the inter-individual variability in V̇O2peak response between study groups; as such, this study was deemed appropriate for validating findings from the Predict-HIIT GWAS. Ethical approval was obtained from the Institutional Human Research Ethics Approval committee at the University of Queensland (#2018000398).

Each participant completed a series of tests and several measures were collated before and after the intervention. Tests relevant to this analysis included the completion of an incremental V̇O2peak test to exhaustion on a treadmill (Ramped Bruce Protocol) using indirect calorimetry (Parvo Medica True One 2400 System, Parvo Medics, Inc., Sandy, UT, USA) before and after the intervention period, and provision of a saliva sample for genetic analysis (Oragene DNA collection kit, DNA Genotek, Ontario, Canada).

Genotyping, imputation and quality control were completed with the same protocol as for the Predict-HIIT cohort. One sample was removed due to high missing genotyping rate, leaving 39 samples for further analysis. V̇O2peak response (post intervention V̇O2peak—pre intervention V̇O2peak) was calculated for each participant. Fibre/placebo supplement, age, sex, body fat percentage and baseline V̇O2peak were not correlated with response and were not included as covariates for analysis. Using PLINK, the top ranked loci (P < 1 × 10−5) from the Predict-HIIT study were compared in the Improve-HIIT study. Lower ranking loci (P < 1 × 10−4) were also examined between cohorts (see Table 2 for study characteristics).

Table 2 Validation study (Improve-HIIT) participant characteristics. Mean ± standard deviation

DNA preparation

DNA extraction from whole blood

Genomic DNA from 58 whole blood samples [29] was extracted using a QIAamp DNA blood midi kit (Qiagen, Hilden, Germany) according to the manufacturer’s instructions. The DNA samples were quantified using a Qubit fluorometer 3.0 and all samples were diluted to 100 ng/µL for genotyping.

DNA extraction from buffy coat

DNA from 93 buffy coats [30] was extracted using a QIAsymphony DSP DNA Mini Kit according to manufacturer’s instructions [31]. The purified genomic DNA was stored at −20 degrees until genotyped.

DNA extraction from saliva samples

DNA from 289 saliva samples from two studies [32, 33] were extracted using a QIAsymphony (Qiagen) DNA MIDI Kit according to manufacturer’s instructions. The yield and purity were measured using a Trinean DropSense-96. DNA from a further 93 saliva samples from 10 studies [34,35,36,37,38,39,40,41,42,43] were extracted using the protocol outlined on the DNA Genotek website [44].


DNA from 417 samples [29, 30, 32, 34,35,36,37,38,39,40,41,42,43] were genotyped using Illumina CoreExome chips 24v1.1 following standard protocols at the Australian Translational Genomics Centre, Princess Alexandra Hospital, Brisbane. A further 116 samples from Norway [33] were genotyped using Illumina CoreExome chips 24v1.2 at the Genomics Core Facility, NTNU. Bead intensity data was processed and normalised for each sample, and genotypes were identified using the Illumina Genome Studio software with corresponding manifest files. SNP coordinates were annotated to the GRCh37 genome build.

Data quality control

Genotypes at individual SNPs from all cohorts were merged according to the manifest and plink files. Quality control was completed separately on individual cohorts, and included assessment of missingness by individual (threshold < 5%), missingness by genotype (threshold < 5%), Hardy–Weinberg equilibrium in controls (P < 1 × 10−6), extreme heterozygosity (threshold > 3 standard deviations from mean) and identity by descent threshold at 0.2 of PI_HAT score (n = 13 excluded individuals). For each pair of related samples (PI_HAT > threshold), the sample with the higher missingness rate was removed (n = 3 excluded individuals). Along with quantitative GWAS analysis, we further defined groups based on their relative change in V̇O2peak (mL/kg/min) for additional comparisons. Samples were classified as a ‘likely-responder’, ‘likely non-responder/adverse responder’ and ‘uncertain responder’ based on their relative change in V̇O2peak (mL/kg/min) following training. As outlined in Williams et al. [7], a likely responder achieved a V̇O2peak response above one minimal clinically important difference (3.5 mL/kg/min) associated with a 10–25% improvement in survival over a 10-year period, plus one technical error of measurement (average baseline V̇O2peak multiplied by coefficient variation of 5.6%; calculated for each study). This high threshold for response was used to increase the confidence in the ‘likely responder’/’likely non-responder’ classification. The thresholds are provided in Table 3.

Table 3 Thresholds for response

SNPs with Minor Allele Frequency (MAF) > 0.05 were then used to perform principal component analysis (PCA) for ethnicity identification using SHELLFISH [45]. Ethnic and ancestry outliers (more than 6 standard deviations from the mean on either of the two first principal components (PCs)) were excluded (n = 10). Then, data was imputed with the Haplotype Reference Consortium (HRC) reference panel 1.1 [45] using the Sanger imputation server. SNPs with low imputation quality (INFO score ≤ 0.6) were excluded from further analysis. In total, 26 samples were removed due to large ethnicity deviations from the group, leaving 507 samples for association testing (Table 1). Genomic inflation factor λ and quantile–quantile (Q–Q) plots were used to compare the genome-wide distribution of the test statistic with the expected null distribution. The genomic inflation factor λ is defined as the median of the observed chi-squared test statistic divided by the expected median of the corresponding chi-squared distribution. A λ close to 1 reflects no evidence of inflation, while values up to 1.10 are generally considered acceptable for a GWAS. Baseline V̇O2peak, the individual study and PC6 (the 6th principal components from the PCA analysis, which was significantly associated with the phenotype) were included as covariates.

Statistical analysis

V̇O2peak response

Normality for V̇O2peak was assessed using the Shapiro–Wilk test. An analysis of variance was used to compare average group V̇O2peak response between training interventions (high-volume HIIT, MICT, low-volume HIIT/SIT). Variability in response was measured by the range of responses for each intervention. A chi-squared test was used to compare the proportion of likely responders, likely non-responders and those participants classified as uncertain between training groups.

Association testing of independent V̇O2peak responses

Similar to previous studies in this area [15] investigating polygenic phenotypes (i.e. V̇O2peak trainability), we used a quantitative approach rather than a case–control analysis to identify variants associated with V̇O2peak response. Association testing was conducted in PLINK [46], using a linear regression. Baseline V̇O2peak, the individual study and the PC6 from the principal component analysis were found to be significantly associated with V̇O2peak response and were included as covariates in analysis. Age and sex were not associated with the trait. Our findings did not change when age and sex were also included in the association analysis. Thus, we included covariates based on a posteriori instead of a priori knowledge. Association analyses of imputed SNPs was assessed with PLINK best-guess genotypes. Genome-wide significance was set at the standard GWAS threshold of P < 5 × 10−8 and suggestive significance was set at P < 1 × 10−5. The single most significant SNP (the lead SNP) was used to represent each of the loci. The cluster plots of the genotyped lead SNP, or supported genotyped SNPs of imputed lead SNP, were checked manually to eliminate poor signals. An analysis of covariance was used to compare the average V̇O2peak response for each genotype of the top-ranked SNPs, including baseline V̇O2peak, the individual study and PC6 as covariates. Statistical analysis was completed using SPSS (version 23.0, SPSS Inc., Chicago, IL, USA).

Polygenic predictor score

A polygenic predictor score (PPS) was calculated for each participant using the beta coefficient of the selected SNPs. The PPS was an extension of the ‘summary predictor score’ outlined by Bouchard et al. [15] using data from the HERITAGE study. In our study, we sought to improve on this model to ensure the ‘high response training alleles’/‘effect’ alleles were weighted by the effect size (beta coefficient) derived from our GWAS (see Eq. 1).

$$PPS = \mathop \sum \limits_{i}^{k} \beta_{i} *n_{i}$$

where i is the index of the SNP in k selected SNPs used to calculate the PPS. \(\beta_{i}\) is the effect size (beta coefficient of linear regression) of SNPi in the PPS model, \(n_{i}\) is the number of effect alleles of SNPi.

Scores were then added across k SNPs to yield a final PPS and a comparison was made between likely responders, likely non-responders and those deemed uncertain (as described earlier). To avoid over-training (inability of model to be generalised to new data), we did a tenfold cross-validation (using MultiBLUP [47]) with the discovery cohort samples and merged the results of 10 test folds for the analysis. The tenfold cross validation was to test the PPS model’s ability to predict V̇O2peak response in new data not related to the development of the PPS model internally.

Replication of candidate loci

The 97 loci identified as candidate loci for V̇O2peak response in our recent systematic review [16] were analysed and compared with the top-ranking loci (α < 1 × 10−5) from the Predict-HIIT study. Lead SNPs from all associated loci were used to calculate the PPS, as well as the 97 genetic variants found previously, were mapped to the nearest gene and submitted as a batch query to the ToppGene pathway analysis software [48]. Biological processes and pathways that appeared in both groups were selected. Genetic variants were also submitted to the GTEx Portal to identify if any SNPs were expressive quantitative trait loci (eQTL) [49].

Power calculation

Power calculations were performed using the Genome-wide Complex Trait Analysis—Genomic-Relatedness-Based Restricted Maximum Likelihood (GCTA-GREML) calculator [50].


V̇O2peak response was normally distributed. Participants included in the GWAS from high-volume HIIT interventions had a greater V̇O2peak response at the group level than participants from a low-volume HIIT/SIT intervention (1.6 mL/kg/min, 95% CI 0.6 to 2.5, P = 0.002), but a comparable group V̇O2peak response to participants from MICT interventions (0.6 mL/kg/min, 95% CI − 0.1 to 1.3, P = 0.1). Participants from MICT and low-volume HIIT/SIT interventions had similar group responses (1.0 mL/kg/min, 95% CI − 0.1 to 2.0, P = 0.05). Despite these group mean changes, there was large variability in individual V̇O2peak training response within each intervention (see Fig. 1 and Table 1 for ranges). High-volume HIIT had more likely responders than MICT (9%, 0.7 to 17.0, P = 0.03) and low-volume HIIT/SIT (14%, 2.7 to 23.1, P = 0.02). However, high-volume HIIT had similar likely non-responders to MICT (− 2%, 95% CI − 7.2 to 11.1, P = 0.7) and less likely non-responders to low-volume HIIT/SIT (− 17%, 95% CI − 4.1 to − 29.3, P = 0.01). Furthermore, high-volume HIIT had similar uncertain responders to MICT (− 7%, 95% CI − 2.0 to 15.9, P = 0.1) and low-volume HIIT/SIT (3%, − 9.4 to 14.0, P = 0.6). To establish the genetic contribution towards this variance in response to each exercise training intervention, we completed a GWAS that was adjusted for significant covariates (individual study that the participant completed, baseline V̇O2peak and PC6).

Fig. 1

Variability in V̇O2peak response between participants included in the GWAS

No SNPs reached the typical threshold for genome-wide significance (P < 5 × 10–8). The Q-Q plot and a genomic inflation factor of 1.002 indicated there was very minor inflation in the study (i.e. population stratification or DNA sample quality), and minor overdispersions of test statistics when compared to the null distribution (Fig. 2). Twelve loci were associated with V̇O2peak response at suggestive significance (P < 1 × 10–5, Fig. 3 and Table 4). The most significant association was found for rs6959961 near the membrane associated guanylate kinase, WW and PDZ domain containing 2 (MAGI2) gene (P = 2.61 × 10–7). Homozygotes for the response allele (TT, n = 93) had a 2.4 mL/kg/min greater (P = 2.8 × 10–7) average V̇O2peak response than those homozygote for the non-response allele (CC, n = 152) and a 1.3 mL/kg/min greater (P = 0.002) average V̇O2peak response than heterozygotes (TC, n = 262). The second most significant association (P = 2.75 × 10–7) was found for rs730747755 near the Unc-80 Homolog, NALCN Channel Complex Subunit (UNC80) gene. Homozygotes for the response allele (AA, n = 66) had a 2.6 mL/kg/min greater (P = 1.2 × 10–7) average V̇O2peak response than those homozygote for the non-response allele (GG, n = 229), and a 1.8 mL/kg/min greater (P = 2.5 × 10–4) average V̇O2peak response than heterozygotes (AG, n = 212).

Fig. 2

Quantile–Quantile (QQ) plot and genomic inflation factor λ. λ is the observed median of test statistic distribution divided by the expected median of the test statistic distribution. A genomic inflation factor greater than 1.1 indicates there may be some inflation of the GWAS P-values; resulting from factors such as population stratification or DNA sample quality. λ = lambda, base = baseline VO2peak, study = individual study participant completed, PC6 = 6th principal component

Fig. 3

Manhattan Plot of whole Training Cohort. The X-axis represents genomic coordinates, with the negative logarithm of the association p-value for each variant displayed on the Y-axis. Different chromosomes are shown with different colours. The blue line indicates the suggestive significance threshold 1 × 10–5

Table 4 Lead SNP at each locus showing a trend for association with V̇O2peak response

A tenfold cross validation found the Pearson correlation coefficient between subject polygenic predictor score (PPS) and V̇O2peak response (likely responder, likely non responder or uncertain) was not significant (R2 = 0.027, P-value = 0.76, see Fig. 4). Similarly, the PPS was not able to predict V̇O2peak training response in the validation (Improve-HIIT) cohort (R2 = 0.001, P = 0.8). None of the 12 lead SNPs from our GWAS had a P-value < 0.05 in the Improve-HIIT study. Furthermore, from the 992 variants with a P-value < 1 × 10–4 in the Predict-HIIT cohort, a correlation of beta coefficients in the discovery (Predict-HIIT) cohort and the Improve-HIIT cohort was found to be significant (R2 = 0.156, P-value = 7.62 × 10–7). This suggests these variants in the Improve-HIIT cohort have a significant similar trend of effect as they do in the Predict-HIIT cohort.

Fig. 4

Tenfold cross validation—no correlation between Polygenic Predictor Score (PPS) and V̇O2peak response (R2 = 0.027, P = 0.76). Red, green and blue dots represent likely non-responders, likely responders and uncertain responders, respectively

Whilst none of the 12 lead SNPs from our GWAS validated SNPs found in previous research, several of our lead 12 SNPs were found near genes that are in similar biological pathways and processes to predictor genes found in previous research (Table 5). Additionally, we were able to validate a number of SNPs from previous research at a nominal level (4 SNPs at P-value < 0.05, see Table 6). Furthermore, we found several SNPs to be eQTL in tissues that may influence training adaptations. For example, rs11647343, is an eQTL of zinc finger DHHC-type palmitoyltransferase 7(ZDHHC7) in whole blood (P = 1.8 × 10–5). The SNP, rs2657147, is an eQTL of tripartite motif containing 68 (TRIM68) in subcutaneous adipose tissue (P = 4.8 × 10–8).

Table 5 Gene interactions and common biological pathways and processes between this study and previous findings
Table 6 Findings from previous studies found significant at a nominal level in the Predict HIIT study

The GCTA power calculator found a cohort of 2960 samples would have 80% power to detect a quantitative trait with a true heritability of 30%.


To our knowledge this is one of the largest multi-centre GWAS to investigate CRF response following exercise training. Compared to previous genetic studies in this field of research, we were able to user newer technology and methodologies that increased the validity and accuracy of our results. Across the 507 participants and irrespective of the intervention completed, there was large variability in individual V̇O2peak response to high-volume HIIT, MICT and low-volume HIIT/SIT. We were unable to identify genetic variants at a genome-wide significant level that explained this variability in response to each training intervention. However, 12 SNPs were found at a suggestive level of significance and warranted further investigation. Several of our lead SNPs seemed possible candidate genes for predicting V̇O2peak response due to their association with previously identified predictor genes, and related biological pathways and processes that may influence training adaptations.

The most significantly associated SNP, MAGI2, can influence neuronal cell activin-mediated signalling, and may supress AKT Serine/Threoine Kinase 1 (AKT1) activation [51]. AKT1 is a V̇O2peak response predictor gene identified from previous research, and is one of three genes from the protein kinase B family that can influence growth, differentiation and metabolism [52]. The SNP, rs1130214, found near AKT1, was significantly associated (P < 0.05) with V̇O2peak response in previous research [52] and was found at a nominal level in our Predict-HIIT cohort (P = 0.06). One of our lead SNPs, rs79687662, is found near the IQ Motif Containing GTPase Activating Protein 1 (IQGAP1) gene. IQGAP1 and AKT1 genes are both found in the E-cadherin signalling in the nascent adherens junction biological pathway, and together with Transcription Factor Hypoxia-Inducible Factor-1 (HIF1A) and Neuropilin 2 (NRP2) (predictor genes from previous research), are found in the signalling events mediated by the Vascular Endothelial Growth Factor Receptor 1 (VEGFR1) and Vascular Endothelial Growth Factor Receptor 2 (VEGFR2) biological pathway. Furthermore, a rat model found the catenin (cadherin-associated protein) gene was upregulated in higher responders to HIIT, which helps to regulate angiogenesis, neurogenesis and tissue development [53]. Another recent rodent study found loss of Iqgap1 may lead to defective AKT and Extracellular Signal-Regulated Kinase 1/2 (ERK1/2) signalling and impaired cardiomyocyte hypertrophy [54].

Our second strongest associated lead SNP, rs730747755, is found near the Unc-80 Homolog, NALCN Channel Complex Subunit (UNC80) gene. UNC80 is a gene that contributes to a large ion channel complex (the ‘NALCN channelosome”), which includes the Sodium Leak Channel, Non Selective (NALCN) gene [55]. NALCN is a V̇O2peak response predictor gene found in previous research [18], and similar to the UNC80 gene, may influence the resting membrane potential of neuronal cells [55]. There is evidence that genes encoding the NALCN channelosome may contribute to the susceptibility for several diseases, including cardiac diseases, some cancers and psychiatric disorders [56].

Two of our associated lead SNPs were found near genes related to peroxisome proliferator-activated (PPAR) activity. The SNP rs14932370 is found near the ASXL Transcriptional Regulator 1 (ASXL1) gene. Overexpression of ASXL1 may reduce adipogenesis by decreasing Peroxisome Proliferator-Activated Receptor y (PPARy) activity [57]. The SNP, rs2236368, is found near the Transcription Factor EB (TFEB) gene. TFEB may regulate mitophagy, and in addition to Peroxisome Proliferator-Activated Receptor Gamma, Coactivator 1 alpha (PGC-1α), is considered important for mitochondrial biogenic regulation [58]. TFEB may also regulate insulin sensitivity, glucose homeostasis and lipid oxidation [59]. Overexpression of TFEB may increase mitochondrial biogenesis and ATP production in skeletal muscle, independently from PCG-1α [59]. A study also found PGC-1α expression can be increased through the dephosphorylation and nuclear translocation of TFEB [60]. With these points in mind, a recent study found completing two high-intensity exercise sessions within a short time frame (2 h) increased the nuclear abundance of TFEB and the transcription of PCG-1α in 8 healthy young men [61]. Furthermore, overexpression of PGC-1α has been associated with improved V̇O2peak at baseline and following endurance training in several studies [62, 63].

Several of our other associated SNPs are found in the same biological processes and pathways to variants identified in previous research [16]. The SNP rs73193458 is found near the Claudin 14 (CLDN14) gene, and together with previously identified V̇O2peak response predictor genes (Neuroligin 1 (NLGN1), Integrin Subunit Beta 8 (ITGB8) and Cluster of Differentiation 6 (CD6)) is involved in the cell adhesion molecules (CAMs) biological pathway. The SNP rs11874598 found near the Piezo Type Mechanosensitive Ion Channel Component 2 (PEIZO2) gene (which is a mechanosensitive ion channel involved in touch, proprioception, and respiratory function [64]); and rs11647343 found near the ATPase Secretory Pathway Ca2 + Transporting 2 (ATP2C2) gene (which is related to nucleotide binding and calcium transporting ATPase activity and cardiac conduction [64]); along with several other genes identified from previous studies, are involved in cation transmembrane transport biological processes. We also found rs11647343 and rs2657147 to be eQTLs of genes associated with whole blood (ZDHHC7) and subcutaneous tissue (TRIM68), respectively [49]. In mice, ZDHHC7 plays a role in glucose transporter type 4 (Glut4) palmitoylation, contributing to glucose homeostasis [65], possibly contributing to metabolic adaptions required for V̇O2peak improvements. TRIM68 variants have been associated with early onset obesity [66] and is upregulated following aerobic exercise [67]. TRIM68 is associated with ubiquitination [67] and may potentially play a role in proteolytic activity and exercise induced muscle damage. Body composition has been associated with exercise capacity, including maximal workload and oxygen uptake [68, 69]. However, more work is needed to go beyond association and to identify causal variants/genes. Future functional studies are needed.


We created a PPS from the top-12 associated loci from the discovery cohort to identify who was more likely to be a responder or non-responder to different forms of training. It was hypothesised that those identified as a lower responder may need a greater training dose than reported in our study to elicit a clinically meaningful response, or other environmental influences may need to be considered. For example, Montero and Lundby [4] have shown non-responders can become responders by increasing the dose of exercise training. Despite many of the suggestively associated SNPs showing a strong connection to previously identified genes, processes and pathways that may influence training adaptions to exercise, there was no significant correlation between the PPS score and V̇O2peak response following the tenfold cross-validation. The variants and model could not accurately explain the variance in V̇O2peak response or predict who may be a lower or higher responder to each of the training interventions. Likewise, an independent cohort validation, from the Improve-HIIT study, did not support an association of the lead 12 SNPs with variance in V̇O2peak response when considered individually, or in the PPS model. This may be due to a relatively small sample size, or in fact that genetics plays a smaller role than previous research has alluded to. Our power calculation found we need at least 2960 samples to detect signals of common variants with a heritability of 30%.

Additionally, we were unable to replicate variants (V̇O2peak response predictor genes) identified from previous research [16] at a genome wide or suggestive level of significance. However, we were able to replicate several genetic variants from previous research at a nominal significance level within the Predict-HIIT cohort, including: rs10751308, rs10921078, rs1535628 and rs2003298 (P < 0.05). Two of these SNPs (rs10921078 and rs2003298) had the same ‘response’ allele in the Predict-HIIT cohort and previous research [15]. These SNPs warrant investigation in future studies. Furthermore, a significant Pearson correlation coefficient was found for the beta coefficient of variants with a P-value < 1 × 10–4 in the Predict-HIIT cohort the Improve-HIIT cohort. This indicates general effects of the loci (as a group) exist, and a larger sample size may detect many of these effects as statistically significant.


Several limitations may have prevented the finding of more significant associations and validating the proposed PPS model. Firstly, V̇O2peak response is considered a complex trait that may result from multiple interactions between genes (epistasis) and epigenetic changes that can affect gene expression [27]. This was made evident with several of the lead Predict-HIIT SNPs sharing common biological pathways and processes to predictor genes identified from previous studies. Larger sample sizes than reported in our study (tens of thousands) are often needed to investigate these gene interactions via a GWAS, and to identify rare variants that may be contributing to overall response [70, 71]. A lack of detail in previous publications prevented some select variants from being replicated. Previous studies have predominantly been candidate-gene focused, and similar to our study, have lacked the necessary statistical power [16]. The validation study also lacked statistical power and the population studied was different to the Predict-HIIT study. The Predict-HIIT study included a mix of healthy, young, older and clinical European population groups from studies with a variety of exercise doses; whereas the validation study was a high volume HIIT intervention on young, healthy but inactive predominantly Caucasian adults, and included a nutrition intervention. Previous studies have predominantly investigated endurance interventions, including participants from a mix of nationalities, and mainly inactive but healthy populations [16]. Moreover, there may be differences in the accuracy of findings between studies based on participant compliance to study protocols. These factors may have influenced the gene expression and the significance of variants discovered in previous research, the validation study and the Predict-HIIT study. If our study had a larger sample size, we could have stratified our analysis to see if associations were different when clustered according to healthy and clinical populations, and training doses. We tried to combat this by including significant covariates in our GWAS model, including the individual study.

Despite limited research, the declining cost of genetic testing has created an abundance of direct-to-consumer (DTC) DNA testing companies [71]. These testing companies often base recommendations on single or very few genes. For example, Alpha-actinin-3 (ACTN3) is a common ‘fitness gene’ found in many DTC tests, whereby consumers are encouraged to modify the intensity, volume or frequency of their exercise training to suit their ACTN3 genotype. Potential ACTN3 ‘genotype-based training protocols’ for strength and endurance training improvements have been outlined in previous research [72]. For V̇O2peak improvement, the authors suggest RR allele homozygotes and RX allele heterozygotes are resistant to muscle damage and better suited to HIIT; whereas XX allele homozygotes have lower skeletal muscle function and poorer recovery, and subsequently are better suited to MICT [72]. Our analysis and other research has shown that exercise-related phenotypes, such as change in V̇O2peak response, is a polygenic trait where multiple genes influence various cellular pathways [3, 14] and each gene may contribute only a small percentage to the overall change [16, 26, 73]. We have established that the significance of these genes and associated variants remain uncertain, questioning the importance of genetics in predicting individual response and the validity of commercial tests reliant on limited variants used for personalised exercise prescription.

An even larger study with more participants is needed to advance this field of research. In other areas of genetic research, this is achieved by combining datasets and completing a meta-analysis of many genome-wide association studies. As outlined by Zeggini and Ionnidis [74], combining many GWAS datasets would require a consortium with various institutions and laboratories combining to develop a robust protocol that addresses selection bias, quality control, heterogeneity of populations studied and the replication of biologically plausible previous findings. Based on our findings, we have calculated that at least 2960 participants would need to be included in well-controlled exercise interventions to measure V̇O2peak response. Future research should also focus on more than just the genome by using epigenomics, transcriptomics and metagenomics. Having large datasets with this information may help to identify with greater confidence the gene and pathway interactions, and epigenetic changes resulting from environmental influences [75]. Analysis of how exercise dose and quantitative traits including diet, sleep, recovery between training sessions, clinical conditions (e.g. coronary artery disease, type 2 diabetes) and how the microbiome may affect epistasis could also be explored. The Athlome Project Consortium is a collaborative initiative between several institutions to find genetic variants associated with athletic performance [76]. A similar concept could be developed specifically for finding genetic variants associated with V̇O2peak response in non-athletes to aerobic training interventions.


In conclusion, we found 12 novel genetic variants associated with V̇O2peak response in the Predict-HIIT study group. These SNPs have common biological pathways and processes to previous research findings, but could not be replicated in a small independent study. Furthermore, cross-validation found the PPS created from the top-associated SNPS did not show significant correlation with whom was likely to be a responder or non-responder to exercise training. Heterogeneity and a lack of power in the discovery (Predict-HIIT) and validation (Improve-HIIT) cohorts may have prevented lead SNPs from being reproduced between studies. Our results highlight the possible risks associated with predictive scores for complex traits. Larger sample sizes with well-prescribed, controlled and accurately measured exercise interventions are required to identify rare variants, gene interactions and epigenetic changes that may influence gene expression and V̇O2peak response, and to find the ideal exercise dose to negate non-response. Ongoing research and validation of current and previous findings is needed to confirm if genetics does play a large role in V̇O2peak response variance, and whether genomic predictors for V̇O2peak response trainability can inform evidence-based clinical practice.

Availability of data and materials

The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.





AKT Serine/Threoine Kinase 1


ASXL Transcriptional Regulator 1


ATPase Secretory Pathway Ca2 + Transporting 2


Cell adhesion molecules


Cluster of Differentiation 6


Claudin 14


Deoxyribonucleic acid


Direct to consumer


Expression quantitative trait loci


Extracellular Signal-Regulated Protein Kinase


Glucose Transporter Type 4


The Genome-wide Complex Trait Analysis – Genomic-Relatedness-Based Restricted Maximum Likelihood


Genome Wide Association Study


HEalth, RIsk factors, exercise Training and GEnetics


Hypoxia-Inducible Factor 1-Alpha


High intensity interval training


IQ Motif Containing GTPase Activating Protein 1


Integrin Subunit Beta 8


Moderate intensity continuous training


Sodium Leak Channel, Non-Selective


Neuropilin 2


Principal Component Six


Piezo Type Mechanosensitive Ion Channel Component 2


Peroxisome proliferator-activated receptor gamma coactivator 1-alpha


Peroxisome proliferator-activated receptor


Peroxisome proliferator-activated receptor y


Polygenic predictor score




Sprint interval training


Single nucleotide polymorphisms


Transcription Factor EB


Tripartite Motif Containing 68


Unc-80 Homolog, NALCN Channel Complex Subunit


Vascular Endothelial Growth Factor Receptor-1


Vascular Endothelial Growth Factor Receptor-2


Cardiorespiratory fitness (maximal oxygen uptake)


Zinc Finger DHHC-Type Palmitoyltransferase 7


  1. 1.

    Kodama S, Saito K, Tanaka S, Maki M, Yachi Y, Asumi M, et al. Cardiorespiratory fitness as a quantitative predictor of all-cause mortality and cardiovascular events in healthy men and women: a meta-analysis. JAMA. 2009;301(19):2024–35.

    CAS  Article  Google Scholar 

  2. 2.

    WHO. Chronic Diseases and Health Promotion: the World Health Organisation; 2015. Accessed 20 January 2019.

  3. 3.

    Pickering C, Kiely J. Understanding personalized training response: can genetic assessment help? Open Sports Sci J. 2017;10:191–213.

    Article  Google Scholar 

  4. 4.

    Montero D, Lundby C. Refuting the myth of non-response to exercise training: “non-responders” do respond to higher dose of training. JAP. 2017;595:3377–87.

    CAS  Google Scholar 

  5. 5.

    Bonafiglia JT, Rotundo MP, Whittall JP, Scribbans TD, Graham RB, Gurd BJ. Inter-individual variability in the adaptive responses to endurance and sprint interval training: a randomized crossover study. PloS One. 2016;11(12):e0167790.

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  6. 6.

    Bacon A, Carter R, Ogle E, Joyner M. VO2max trainability and high intensity interval training in humans: a meta-analysis. PloS One. 2013;8(9):e73182.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  7. 7.

    Williams CJ, Gurd BJ, Bonafiglia JT, Voisin S, Li Z, Harvey N, et al. A multi-center comparison of VO2peak trainability between interval training and moderate intensity continuous training. Front Physiol. 2019;10:19.

    PubMed  PubMed Central  Article  Google Scholar 

  8. 8.

    Gist NH, Fedew MV, Dishman RK, Cureton KJ. Sprint interval training effects on aerobic capacity: a systematic review and meta-analysis. J Sports Med. 2014;44:269–79.

    Article  Google Scholar 

  9. 9.

    Boff W, da Silva AM, Farinha JB, Rodrigues-Krause J, Reischak-Oliveira A, Tschiedel B, et al. Superior effects of high-intensity interval vs. moderate-intensity continuous training on endothelial function and cardiorespiratory fitness in patients with type 1 diabetes: a randomized controlled trial. Front Physiol. 2019;10:450.

    PubMed  PubMed Central  Article  Google Scholar 

  10. 10.

    Bouaziz W, Malgoyre A, Schmitt E, Lang PO, Vogel T, Kanagaratnam L. Effect of high-intensity interval training and continuous endurance training on peak oxygen uptake among seniors aged 65 or older: a meta-analysis of randomized controlled trials. Int J Clin Pract. 2020;74:e13490.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  11. 11.

    Xie B, Yan X, Cai X, Li J. Effects of high-intensity interval training on aerobic capacity in cardiac patients: a systematic review with meta-analysis. Biomed Res Int. 2017;2017:5420840.

    PubMed  PubMed Central  Google Scholar 

  12. 12.

    Ramos J, Dalleck L, Tjonna A, Beetham K, Coombes J. The impact of high-intensity interval training versus moderate-intensity continuous training on Vascular Function: a systematic review and meta-analysis. Sports Med (Auckland, NZ). 2015;45:679–92.

    Article  Google Scholar 

  13. 13.

    Weston K, Wisloff U, Coombes J. High-intensity interval training in patients with lifestyle-induced cardiometabolic disease: a systematic review and meta analysis. Br J Sports Med. 2014;48:1227–34.

    PubMed  PubMed Central  Article  Google Scholar 

  14. 14.

    Vellers HL, Kleeberger SR, Lightfoot JT. Inter-individual variation in adaptations to endurance and resistance exercise training: genetic approaches towards understanding a complex phenotype. Mamm Genome. 2018;29(1–2):48–62.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  15. 15.

    Bouchard C, Sarzynski M, Rice TK, Kraus WE, Church TS, Sung YJ, et al. Genomic predictors of the maximal oxygen uptake response to standardized exercise training programs. J App Physiol. 2011;110:1160–70.

    CAS  Article  Google Scholar 

  16. 16.

    Williams CJ, Williams MG, Eynon N, Ashton KJ, Little JP, Wisloff U, et al. Genes to predict VO2max trainability: a systematic review. BMC Genomics. 2017;18(Suppl 8):831.

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  17. 17.

    Defoor J, Vanhees L, Martens K, Matthijs G, Van Vlerken A, Zielinska D, et al. The CAREGENE study: ACE gene I/D polymorphism and effect of physical training on aerobic power in coronary artery disease. Heart. 2006;92(4):527–8.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  18. 18.

    Ghosh S, Vivar J, Sarzynski M, Sung Y, Timmons J, Bouchard C, et al. Integrative pathway analysis of a genome-wide association study of (V)O(2max) response to exercise training. J Appl Physiol. 2013;115(9):1343–59.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  19. 19.

    Rankinen T, Perusse L, Gagnon J, Chagnon Y, Leon A, Skinner J, et al. Angiotensin-converting enzyme ID polymorphism and fitness phenotype in the HERITAGE Family Study. J Appl Physiol. 2000;88(3):1029–35.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  20. 20.

    Rico-Sanz J, Rankinen T, Joanisse D, Leon A, Skinner J, Wilmore J, et al. Associations between cardiorespiratory responses to exercise and the C34T AMPD1 gene polymorphism in the HERITAGE Family Study. Physiol Genomics. 2003;14(2):161–6.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  21. 21.

    Rivera M, Dionne A, Fance T, Simoneau J, Perusse L, Chagnon M, et al. Muscle-specific creatine kinase gene polymorphism and VO2max in the HERITAGE Family Study. Med Sci Sports Exerc. 1997;29(10):1311–7.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  22. 22.

    Rivera M, Perusse L, Simoneau J, Gagnon J, Dionne F, Leon A, et al. Linkage between a muscle-specific CK gene marker and VO2max in the HERITAGE Family Study. Med Sci Sports Exerc. 1999;31(5):698–701.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  23. 23.

    Thomaes T, Thomis M, Onkelinx S, Fagard R, Matthijs G, Buys R, et al. A genetic predisposition score for muscular endophenotypes predicts the increase in aerobic power after training: the CAREGENE study. BMC Genet. 2011;12:84.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  24. 24.

    Thompson PD, Tsongalis GJ, Seip RL, Biblie C, Miles M, Zoeller R, Visich P, Gordon P, Angelopoulos TJ, Pescatello L, Bausserman L, Moyna N. Apolipoprotein E genotype and changes in serum lipids and maximal oxygen uptake with exercise training. Metabolism. 2004;53(2):193–202.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  25. 25.

    Xu Y, Yang H, Ren Z, Yi L. Delta-aminolevulinate synthase 2 polymorphism is associated with maximal oxygen uptake after living-high exercise-high training-low in a male Chinese population. Int J Clin Exp Med. 2015;8(11):21617–22.

    CAS  PubMed  PubMed Central  Google Scholar 

  26. 26.

    Eynon N, Voisin S, Lucia A, Wang G, Pitsiladis Y. Preface: genomics and biology of exercise is undergoing a paradigm shift. BMC Genomics. 2017;18(8):825.

    PubMed  PubMed Central  Article  Google Scholar 

  27. 27.

    Sarzynski MA, Ghosh S, Bouchard C. Genomic and transcriptomic predictors of response levels to endurance exercise training. J Physiol. 2017;595(9):2931–9.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  28. 28.

    Bouchard C, Antunes-Correa LM, Ashley EA, Franklin N, Hwang PM, Mattsson CM, et al. Personalized preventive medicine: genetics and the response to regular exercise in preventive interventions. Prog Cardiovasc Dis. 2015;57(4):337–46.

    PubMed  Article  PubMed Central  Google Scholar 

  29. 29.

    Yan X, Eynon N, Papadimitriou ID, Kuang J, Munson F, Tirosh O, et al. The gene SMART study: method, study design, and preliminary findings. BMC Genomics. 2017;18(Suppl 8):821.

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  30. 30.

    Pattyn N, Vanhees L, Cornelissen VA, Coekelberghs E, De Maeyer C, Goetschalckx K, et al. The long-term effects of a randomised trial comparing aerobic interval training versus continuous training in coronary artery disease: 1-year data from the SAINTEX-CAD study. Eur J Prev Cardiol. 2016;23(11):1154–64.

    PubMed  Article  PubMed Central  Google Scholar 

  31. 31.

    QIAsymphony. QIAsymphony DSP DNA Instructions for Use (Handbook). 2015. file:///C:/Users/camil/Downloads/HB-0977–004_1069185_151035723_R4_HB_QS_DSP_DNA_Kit_0815_WW.pdf. Accessed 20 June 2019.

  32. 32.

    Taylor JL, Holland DJ, Keating SE, Leveritt MD, Gomersall SR, Rowlands AV, et al. Short-term and long-term feasibility, safety, and efficacy of high-intensity interval training in cardiac rehabilitation: the FITR heart study randomized clinical trial. JAMA Cardiol. 2020;5:1382−9.

    PubMed  Article  PubMed Central  Google Scholar 

  33. 33.

    Stensvold D, Viken H, Steinshamn SL, Dalen H, Støylen A, Loennechen JP, et al. Effect of exercise training for five years on all cause mortality in older adults—the Generation 100 study: randomised controlled trial. BMJ. 2020;371:m3485.

    PubMed  PubMed Central  Article  Google Scholar 

  34. 34.

    Bonafiglia JT, Edgett BA, Baechler BL, Nelms MW, Simpson CA, Quadrilatero J, et al. Acute upregulation of PGC-1α mRNA correlates with training-induced increases in SDH activity in human skeletal muscle. Appl Physiol Nutr Metab. 2017;42(6):656–66.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  35. 35.

    Bonafiglia JT, Rotundo MP, Whittall JP, Scribbans TD, Graham RB, Gurd BJ. Inter-individual variability in the adaptive responses to endurance and sprint interval training: a randomised crossover study. PloS One. 2016;11:e0167790.

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  36. 36.

    Boyd CJ, Simpson CA, Jung ME, Gurd BJ. Reducing the intensity and volume of interval training diminishes cardiovascular adaptation but not mitochondrial biogenesis in overweight/obese men. PloS One. 2013;8(7):e68091.

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  37. 37.

    Ma JK, Scribbans TD, Edgett BA, Boyd C, Simpson CA, Little JP, et al. Extremely low-volume, high intensity interval training improves exercise capacity and increases mitochondrial protein content in human skeletal muscle. Eur J Mol Integr Physiol. 2013;3(4).

  38. 38.

    Preobrazenski N, Bonafiglia JT, Nelms MW, Lu S, Robins L, LeBlanc C, et al. Does blood lactate predict the chronic adaptive response to training: a comparison of traditional and talk test prescription methods. Appl Physiol Nutr Metab. 2018.

  39. 39.

    Raleigh JP, Giles MD, Scribbans TD, Edgett BA, Sawula LJ, Bonafiglia JT, et al. The impact of work-matched interval training on VO2peak and VO2 kinetics: diminishing returns with increasing intensity. Appl Physiol Nutr Metab. 2016;41(7):706–13.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  40. 40.

    Scribbans TD, Edgett BA, Vorobei K, MItchell AS, Lioanisse SD, Matusiak JBL, et al. Fibre-specific response to endurance and low-volume high intensity interval training: striking similarities in acute and chronic adaptation. PloS One. 2014;9(6):e98119.

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  41. 41.

    Forbes SC, Sletten N, Durrer C, Myette-Cote E, Candow D, Little JP. Creatine monohydrate supplementation does not augment fitness, performance, or body composition adaptations in response to four weeks of high-intensity interval training in young females. Int J Sport Nut Exer. 2017;27(3):285–92.

    CAS  Article  Google Scholar 

  42. 42.

    Francois M, Pistawka KJ, Halperin FA, Little JP. Cardiovascular benefits of combined interval training and post-exercise nutrition in type 2 diabetes. J Diabetes Complicat. 2017;32(2):226–33.

    Article  Google Scholar 

  43. 43.

    Ramos JS, Dalleck LC, Borrani F, Mallard AR, Clark B, Keating SE, et al. The effect of different volumes of high-intensity interval training on proinsulin in participants with the metabolic syndrome: randomised trial. Diabetologia. 2016;59(11):2308–20.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  44. 44.

    Genotek. Laboratory protocol for manual purification of DNA from 0.5mL sample. 2019 Accessed 17 May 2019.

  45. 45.

    Univeristy of Oxford. Shellfish: Parallel PCA and data processing for genome-wide SNP data, UK. 2017. Accessed 17 May 2019.

  46. 46.

    COG-Genomics. PLINK 1.90 beta [v.1.90b3.36]. 2020. Accessed 10 Novr 2020.

  47. 47.

    Speed D, Balding DJ. MultiBLUP: improved SNP-based prediction for complex traits. GenomeR. 2014;24(9):1550–7.

    CAS  Article  Google Scholar 

  48. 48.

    Division of Biomedical Informatics Cincinnatit Children’s Hospital Medical Center. ToppGene: a one-stop portal for gene list enrichment analysis and candidate gene prioritization based on functional annotations and protein interactions network, USA. 2019. Accessed 13 Nov 2020.

  49. 49.

    Broad Insititue of MIT and Harvard. The GTEx Portal, USA & UK. 2020. Accessed 10 Nov 2020.

  50. 50.

    Yang J, Lee Hong S, Goddard ME, Visscher PM. GCTA: a tool for genome-wide complex trait analysis. Am J Hum Genet. 2011;7(88):76–82.

    Article  CAS  Google Scholar 

  51. 51.

    Wu X, Hepner K, Castelino-Prabhu S, Do D, Kaye MB, Yuan XJ, et al. Evidence for regulation of the PTEN tumor suppressor by a membrane-localized multi-PDZ domain containing scaffold protein MAGI-2. Proc Natl Acad Sci USA. 2000;97(8):4233–8.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  52. 52.

    McKenzie J, Witkowski S, Ludlow A, Roth S, Hagberg J. AKT1 G205T genotypeinfluences obesity-related metabolic phenotypes and their responses to aerobic exercise training in older Caucasians. Exp Physiol. 2011;96(3):338–47.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  53. 53.

    Wisløff U, Bye A, Stølen T, Kemi OJ, Pollott GE, Pande M, et al. Blunted cardiomyocyte remodeling response in exercise-resistant rats. J Am Coll Cardiol. 2015;65(13):1378–80.

    PubMed  Article  PubMed Central  Google Scholar 

  54. 54.

    Sbroggiò M, Carnevale D, Bertero A, Cifelli G, De Blasio E, Mascio G, et al. IQGAP1 regulates ERK1/2 and AKT signalling in the heart and sustains functional remodelling upon pressure overload. Cardiovasc Res. 2011;91(3):456–64.

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  55. 55.

    Bramswig NC, Bertoli-Avella AM, Albrecht B, Al Aqeel AI, Alhashem A, Al-Sannaa N, et al. Genetic variants in components of the NALCN-UNC80-UNC79 ion channel complex cause a broad clinical phenotype (NALCN channelopathies). Hum Genet. 2018;137(9):753–68.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  56. 56.

    Cochet-Bissuel M, Lory P, Monteil A. The sodium leak channel, NALCN, in health and disease. Front Cell Neurosci. 2014;8:132.

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  57. 57.

    Park UH, Yoon SK, Park T, Kim EJ, Um SJ. Additional sex comb-like (ASXL) proteins 1 and 2 play opposite roles in adipogenesis via reciprocal regulation of peroxisome proliferator-activated receptor {gamma}. J Biol Chem. 2011;286(2):1354–63.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  58. 58.

    Erlich AT, Brownlee DM, Beyfuss K, Hood DA. Exercise induces TFEB expression and activity in skeletal muscle in a PGC-1α-dependent manner. Am J Physiol. 2018;314(1):C62-c72.

    Article  CAS  Google Scholar 

  59. 59.

    Mansueto G, Armani A, Viscomi C, D’Orsi L, De Cegli R, Polishchuk EV, et al. Transcription factor EB Controls Metabolic Flexibility During Exercise. Cell Metab. 2017;25(1):182–96.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  60. 60.

    Theeuwes WF, Gosker HR, Schols A, Langen RCJ, Remels AHV. Regulation of PGC-1α expression by a GSK-3β-TFEB signaling axis in skeletal muscle. Biochim Biophys Acta Mol Cell Res. 2020;1867(2):118610.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  61. 61.

    Andrade-Souza VA, Ghiarone T, Sansonio A, Santos Silva KA, Tomazini F, Arcoverde L, et al. Exercise twice-a-day potentiates markers of mitochondrial biogenesis in men. FASEB J. 2020;34(1):1602–19.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  62. 62.

    Petr M, Stastny P, Zajac A, Tufano JJ, Maciejewska-Skrendo A. The role of peroxisome proliferator-activated receptors and their transcriptional coactivators gene variations in human trainability: a systematic review. Int J Mol Sci. 2018;19(5):1472.

    PubMed Central  Article  CAS  Google Scholar 

  63. 63.

    Yaghoob Nezhad F, Verbrugge SAJ, Schönfelder M, Becker L, Hrabě de Angelis M, Wackerhage H. Genes whose gain or loss-of-function increases endurance performance in mice: a systematic literature review. Front Physiol. 2019;10(262).

  64. 64.

    Stelzer G, Rosen N, Plaschkes I, Zimmerman S, Twik M, Fishilevich S, et al. The GeneCards Suite: from gene data mining to disease genome sequence analyses. Curr Protoc Bioinformatics. 2016;54(1).

  65. 65.

    Du K, Murakami S, Sun Y, Kilpatrick C, Luscher B. DHHC7 palmitoylates Glut4 and regulates Glut4 membrane translocation. J Biol Chem. 2017.

    Article  PubMed  PubMed Central  Google Scholar 

  66. 66.

    Serra-Juhé C, Martos-Moreno GÁ, de Pieri F, Flores R, González JR, Rodríguez-Santiago B, et al. Novel genes involved in severe early-onset obesity revealed by rare copy number and sequence variants. PLoS Genetics. 2017;13(5):e1006657.

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  67. 67.

    Dickinson J, DiLugos A, Naymik M, Wolfe A, Curtis D, Huentelman M, Carroll C. Transcriptome response of human skeletal muscle to divergent exercise stimuli. JAP. 2018;124(6):1529–40.

    CAS  Google Scholar 

  68. 68.

    Durkalec-Michalski K, Nowaczyk PM, Podgórski T, Kusy K, Osiński W, Jeszka J. Relationship between body composition and the level of aerobic and anaerobic capacity in highly trained male rowers. J Sports Med Phys Fitness. 2019;59(9):1526–35.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  69. 69.

    Radzimiński Ł, Szwarc A, Padrón-Cabo A, Jastrzębski Z. Correlations between body composition, aerobic capacity, speed and distance covered among professional soccer players during official matches. J Sports Med Phys Fitness. 2020;60(2):257–62.

    PubMed  Article  PubMed Central  Google Scholar 

  70. 70.

    Blanco-Gomez A, Castillo-Lluva S, Del Mar S-F, Hontecillas-Prieto L, Mao JH, Castellanos-Martin A, et al. Missing heritability of complex diseases: enlightenment by genetic variants from intermediate phenotypes. BioEssays : News Rev Mol Cell Dev Biol. 2016;38(7):664–73.

    Article  Google Scholar 

  71. 71.

    Webborn N, Williams A, McNamee M, Bouchard C, Pitsiladis Y, Ahmetov I, et al. Direct-to-consumer genetic testing for predicting sports performance and talent identification: consensus statement. Br J Sports Med. 2015;49(23):1486–91.

    PubMed  PubMed Central  Article  Google Scholar 

  72. 72.

    Kikuchi N, Nakazato K. Effective utilization of genetic information for athletes and coaches: focus on ACTN3 R577X polymorphism. J Exerc Nutrition Biochem. 2015;19(3):157–64.

    PubMed  PubMed Central  Article  Google Scholar 

  73. 73.

    MacArther D, North K. Genes and human elite athletic performance. Hum Genet. 2005;116:331–9.

    Article  CAS  Google Scholar 

  74. 74.

    Zeggini E, Ioannidis J. Meta-analysis in genome-wide association studies. Pharmocogenomics. 2009;10(2):191–201.

    Article  Google Scholar 

  75. 75.

    Hoppeler H, Deciphering V. O2, max: limits of the genetic approach. J Exp Biol. 2018;221:21.

    Article  Google Scholar 

  76. 76.

    Pitsiladis YP, Tanaka M, Eynon N, Bouchard C, North KN, Williams AG, et al. Athlome Project Consortium: a concerted effort to discover genomic and other “omic” markers of athletic performance. Physiol Genomics. 2016;48(3):183–90.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

Download references


Professor Ben Levine for providing access to participants from his study, Dr Sarah Voisin for guidance and manuscript edits, and Professor Jian Yang for his assistance with GTCA-GREML calculator.


This research was made possible from the funding received through the Collaborative Research Network for Advancing Exercise & Sports Science (CRN-AESS) – Bond University, Robina, Australia. NH was supported by a PhD stipend also provided by Bond University CRN-AESS. This research was supported by infrastructure purchased with Australian Government EIF Super Science Funds as part of the Therapeutic Innovation Australia-Queensland Node project (LG).

Author information




CW, ZL, NH, RL, JC and NE contributed to the conception and design of the study. CW collated the data for the Predict-HIIT and Improve-HIIT study and completed baseline statistical analysis of data. CW wrote the first draft of the manuscript. ZL completed the genome wide association study for the Predict-HIIT study. ZL completed the cross-validation analysis of the Predict-HIIT study, and validation analysis using data from the Improve-HIIT study. NH provided assistance with manuscript revisions, and guidance with statistical analysis. RL provided guidance with manuscript improvements and statistical analysis. IC, NH, JT, TG, EC, JR, RF, MJ, JL, MF, BG, JB, CH, SS, SK, SJ, EC, PB, VC, NP, EH, UW, AB, DS, DB, IP, XY, MB, LH, LG, KA were investigators involved with the Predict-HIIT study analysis. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Jeff S. Coombes.

Ethics declarations

Ethics approval and consent to participate

Predict-HIIT study: Ethical approval was obtained from the Bellberry ethical committee at the University of Queensland (#2016-02-062-A-1), and from all the institutions involved. Improve HIIT study: Ethical approval was obtained from the Institutional Human Research Ethics Approval committee at the University of Queensland (#2018000398).

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no conflicts of interest that are relevant to this research paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Williams, C.J., Li, Z., Harvey, N. et al. Genome wide association study of response to interval and continuous exercise training: the Predict-HIIT study. J Biomed Sci 28, 37 (2021).

Download citation


  • Genetics
  • V̇O2peak training response
  • Individual variability
  • GWAS
  • Polygenic predictor score