Archives

  • 2018-07
  • 2018-10
  • 2018-11
  • 2019-04
  • 2019-05
  • 2019-06
  • 2019-07
  • 2019-08
  • 2019-09
  • 2019-10
  • 2019-11
  • 2019-12
  • 2020-01
  • 2020-02
  • 2020-03
  • 2020-04
  • 2020-05
  • 2020-06
  • 2020-07
  • 2020-08
  • 2020-09
  • 2020-10
  • 2020-11
  • 2020-12
  • 2021-01
  • 2021-02
  • 2021-03
  • 2021-04
  • 2021-05
  • 2021-06
  • 2021-07
  • 2021-08
  • 2021-09
  • 2021-10
  • 2021-11
  • 2021-12
  • 2022-01
  • 2022-02
  • 2022-03
  • 2022-04
  • 2022-05
  • 2022-06
  • 2022-07
  • 2022-08
  • 2022-09
  • 2022-10
  • 2022-11
  • 2022-12
  • 2023-01
  • 2023-02
  • 2023-03
  • 2023-04
  • 2023-05
  • 2023-06
  • 2023-08
  • 2023-09
  • 2023-10
  • 2023-11
  • 2023-12
  • 2024-01
  • 2024-02
  • 2024-03
  • 2024-04
  • Most of the unigenes that could be mapped

    2018-10-23

    Most of the unigenes that could be mapped to the pseudomolecules with high sequence similarity hits (threshold value 1E-30) could be mapped to both A and C genomes, enabling a detailed analysis of the collinearity of the genomes. The positions in the respective GSK503 of the 57,246 unigenes that could be mapped to both are illustrated in Fig. 1. This shows the expected high degree of conservation of gene order, but there are numerous segmental rearrangements differentiating them. The extent of rearrangement is perhaps surprising given that they shared a common ancestor only 3.7 MYA [9], but this genome plasticity may be accounted for by their polyploid origins [10]. There is also a background of unigenes mapping to non-homoeologous positions in the A and C genomes, including two prominent segments shadowing each of the collinear blocks. To investigate the possible basis of these, 60 unigenes anchored to linkage group A1 were selected randomly and sequence similarity in the A and C genomes assessed GSK503 by visual inspection of BLAST alignments. The results of this analysis (Additional file 8) showed that the apparent non-collinear mapping is most often an artefact caused by repetitive sequences, but sequences missing from one genome (i.e. copy number variation) are common, too. The latter explains the “shadow” collinear blocks as the best sequence match remaining after loss of a homoeologous sequence will be that of a paralogue; the hexaploid basis of each of the A and C genomes then results in two such paralogous segments being detected.
    Acknowledgements This work was supported by UK Department for Environment, Food and Rural Affairs (Defra IF0144).
    Specifications table
    Value of the data
    Supplementary methods
    MRM data
    Data Supplementary Fig. 1 is a Venn diagram showing differentially-expressed peptides between treatment responders and non-responders in the discovery stage for subjects treated with PR (peg-interferon/ribavirin) and T/PR (telaprevir/peg-interferon/ribavirin). Differentially-expressed peptides are defined as those with false discovery rate less than 0.05 and fold differential expression greater than 2. The overlap between the two treatments is highly statistically significant (p<2×10−16). Supplementary Fig. 2 shows ROC curves for predicting SVR for PR (A) and T/PR (B) subjects using only the five identified peptides as predictors. The model was fitted using only data from the PR subjects and is independently validated on the T/PR subjects. AUCs for A and B are 0.97 and 0.81, respectively. Supplementary Fig. 3 shows the ROC curve for predicting SVR for PR subjects in using MRM data. The AUC for this curve is 0.8. Supplementary Fig. 4 is a comparison of PR week 4 viral declines for the (A) discovery and (B) MRM analysis stages. Supplementary Table 1 lists subject characteristics according to treatment group and response (SVR). Supplementary Table 2 lists oefficients in the model predicting treatment outcome from discovery phase. Supplementary Table 3 lists the clinical trials from which the samples used in the proteomic analyses were obtained. Supplementary Table 4 lists the numbers of differentially-expressed components identified from statistical comparisons in the discovery stage. Supplementary Table 5 lists the gene symbols for the proteins identified in the discovery stage. Supplementary Table 6 lists the coefficients in the model predicting treatment outcome from discovery stage data using only components that are identified. Supplementary Table 7 lists references for liver expression of the proteins that are significantly-differentially expressed between responders to PR. Supplementary Table 8 lists pairwise pearson correlations among differentially-expressed proteins and other clinical covariates. Supplementary Table 9[2–7,10–12] are the references for Fig. 4 in [1].
    Value of the data