相关概念视频
Gene Duplication and Divergence
8.1K
The seminal work of Ohno in 1970 popularized the idea of gene duplication and divergence. DNA sequence comparison studies reveal that a large portion of the genes in bacteria, archaebacteria, and eukaryotes was generated by gene duplication and divergence, indicating its critical role in evolution.
The duplicated copies of the gene are called Paralogs. Paralogs with similar sequences and functions form a gene family. Across several species, a large number of gene families are...
The duplicated copies of the gene are called Paralogs. Paralogs with similar sequences and functions form a gene family. Across several species, a large number of gene families are...
8.1K
Genome Copying Errors
5.2K
DNA replication is a well-evolved process that copies millions of base pairs with high fidelity during each cell division. Occasionally a wrong base or a long stretch of wrong bases may get added to the daughter strands. If the errors are left unchecked, cells might accumulate several mutations that might endanger their survival. Therefore, the copying errors are checked and repaired at three levels.
5.2K
Point and Frameshift Mutations
1.3K
Point mutations are genetic alterations involving the change of a single nucleotide base pair in DNA. Depending on how the alteration affects protein synthesis, they can lead to various consequences.Point mutations fall into the following types:Silent mutations occur when a nucleotide change does not alter the amino acid sequence due to the redundancy of the genetic code. For instance, changing ACC to ACA still encodes threonine, leaving the protein function unaffected. This occurs because...
1.3K
Evolutionary Relationships through Genome Comparisons
7.1K
Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...
7.1K
Sanger Sequencing
776.0K
DNA sequencing is a fundamental technique that is routinely used in the biological sciences. This method can be applied to a range of questions at different scales - from the sequencing of a cloned DNA fragment or the study of a mutation in a gene up to whole-genome sequencing. However, despite the widespread use of sequencing today, it was not until 1977 that Fredrick Sanger and his collaborators developed the chain-termination method to decode DNA sequences. It relies on the separation of a...
776.0K
Genome Annotation and Assembly
21.2K
The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.
21.2K
您也可能阅读
相关文章
通过共同作者、期刊和引用图与本文相关的文章。
排序
Same author
Population-scale Y chromosome assemblies reveal recurrent remodeling within constrained architectures.
bioRxiv : the preprint server for biology·2026
Same author
Origin and evolution of acrocentric chromosomes in human and great apes.
bioRxiv : the preprint server for biology·2026
Same author
Multispecies pangenomes reveal a pervasive influence of population size on structural variation.
Science (New York, N.Y.)·2025
Same author
Cell line-matched reference enables high-precision functional genomics.
Nature communications·2025
Same author
A complete diploid human genome benchmark for personalized genomics.
bioRxiv : the preprint server for biology·2025
Same journal
Layered social competition coordinates reproductive hierarchy formation in ants.
bioRxiv : the preprint server for biology·2026
Same journal
Combination epigenetic-targeted therapy increases the immunogenicity of poorly immunogenic sarcomas.
bioRxiv : the preprint server for biology·2026
Same journal
Loss of LanC-like proteins delays post-injury regeneration of aging skeletal muscles.
bioRxiv : the preprint server for biology·2026
Same journal
Integrative Transfer Network: Deep Transfer Learning Across Populations and Prediction Targets.
bioRxiv : the preprint server for biology·2026
Same journal
Confidence-supported label-free metabolic imaging with FPhaS phase autofluorescence microscopy.
bioRxiv : the preprint server for biology·2026
Same journal
Sequence-encoded autoinhibition couples mRNA decapping activity to phase separation.
bioRxiv : the preprint server for biology·2026
适应性跟踪点用于骨对齐压缩压缩
bioRxiv : the preprint server for biology
|February 27, 2026
概括
适应性跟踪点通过基于复杂性的智能细分数据,为基因组序列对齐提供了卓越的压缩. 与传统的固定大小编码相比,这种方法显著提高了大规模基因组比较的存储效率.
科学领域:
- 基因组学就是基因组学.
- 生物信息学是一种生物信息学.
- 计算生物学 计算生物学
背景情况:
- 存储大规模的基因组比较数据需要有效的压缩.
- 固定尺寸对齐编码缺乏适应可变序列复杂性的适应性,限制了压缩.
- 在基因组序列中保留的区域提供了错过的压缩机会.
研究的目的:
- 开发一种复杂性意识的对齐编码方法.
- 为了提高序列对齐的压缩效率.
- 确保对齐重建保持或改善对齐得分.
主要方法:
- 引入了自适应跟踪点,一个对齐编码方法.
- 使用可配置复杂度指标 (如编辑距离或对角距离) 的细分对齐.
- 通过与主对角线的差异或偏差来界定的段落,以适应当地特征.
主要成果:
- 对角边界的跟踪点在模拟的长序列上实现了10.5-13.7倍更好的压缩.
- 有编辑限制的跟踪点提供了可调节的压缩和重建成本之间的权衡.
- 真正的泛基组数据显示了23-139倍的压缩,没有得分退化和线性重建时间.
结论:
- 适应性跟踪点为基因组序列对齐提供了显著的压缩收益.
- 该方法适应局部对齐特性,优于固定尺寸方法.
- 该方法在模拟和真实泛基因组数据上得到验证,确保效率和准确性.


