Integrating Internal Fragments in the Interpretation of Top-Down Sequencing Data of Larger Oligonucleotides.
Thomas KenderdineWilliam McIntyreGhazaleh YassaghiDaniele RolloAlexander BunkowskiLukas GoerlachDetlev SuckauGuillaume TremintinMichael GreigChristopher BellDaniele FabrisPublished in: Journal of the American Society for Mass Spectrometry (2023)
In the context of direct top-down analysis or concerted bottom-up characterization of nucleic acid samples, the waning yield of terminal fragments as a function of precursor ion size poses a significant challenge to the gas-phase sequencing of progressively larger oligonucleotides. In this report, we examined the behavior of oligoribonucleotide samples ranging from 20 to 364 nt upon collision-induced dissociation (CID). The experimental data showed a progressive shift from terminal to internal fragments as a function of size. The systematic evaluation of experimental factors, such as collision energy, precursor charge, sample temperature, and the presence of chaotropic agents, showed that this trend could be modestly alleviated but not suppressed. This inexorable effect, which has been reported also for other activation techniques, prompted a re-examination of the features that have traditionally discouraged the utilization of internal fragments as a source of sequence information in data interpretation procedures. Our simulations highlighted the ability of internal fragments to produce self-consistent ladders with either end corresponding to each nucleotide in the sequence, which enables both proper alignment and correct recognition of intervening nucleotides. In turn, contiguous ladders display extensive overlaps with one another and with the ladders formed by terminal fragments, which unambiguously constrain their mutual placement within the analyte sequence. The experimental data borne out the predictions by showing ladders with extensive overlaps, which translated into uninterrupted "walks" covering the entire sequence with no gaps from end to end. More significantly, the results showed that combining the information afforded by internal and terminal ladders resulted in much a greater sequence coverage and nucleotide coverage depth than those achievable when either type of information was considered separately. The examination of a series of 58-mer oligonucleotides with high sequence homology showed that the assignment ambiguities engendered by internal fragments did not significantly exceed those afforded by the terminal ones. Therefore, the balance between potential benefits and perils of including the former makes a compelling argument for the development of integrated data interpretation strategies, which are better equipped for dealing with the changing fragmentation patterns obtained from progressively larger oligonucleotides.
Keyphrases
- nucleic acid
- electronic health record
- big data
- multiple sclerosis
- amino acid
- single cell
- data analysis
- healthcare
- oxidative stress
- endothelial cells
- machine learning
- optical coherence tomography
- climate change
- atrial fibrillation
- deep learning
- quantum dots
- risk assessment
- ultrasound guided
- drug induced
- single molecule
- fluorescent probe
- affordable care act
- health insurance