Jove
Visualize
联系我们
JoVE
x logofacebook logolinkedin logoyoutube logo
关于 JoVE
概览领导团队博客JoVE 帮助中心
作者
出版流程编辑委员会范围与政策同行评审常见问题投稿
图书馆员
用户评价订阅访问资源图书馆顾问委员会常见问题
研究
JoVE JournalMethods CollectionsJoVE Encyclopedia of Experiments存档
教育
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab Manual教师资源中心教师网站
使用条款与条件
隐私政策
政策

相关概念视频

Conservation of Protein Domains Over Different Proteins02:26

Conservation of Protein Domains Over Different Proteins

10.9K
Protein domains are small structurally independent units that are part of a single amino acid chain.  Although these domains are often structurally independent, they may rely on synergistic effects to perform their functions as part of a larger protein. Protein domains may be conserved within the same organism, as well as across different organisms.
A limited set of protein domains often duplicate and recombine during evolution. These domains can be organized in different combinations to...
10.9K
Conservation of Protein Domains02:26

Conservation of Protein Domains

3.1K
3.1K
Improving Translational Accuracy02:07

Improving Translational Accuracy

11.7K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
11.7K
Multi-species Conserved Sequences02:51

Multi-species Conserved Sequences

4.0K
Next-generation sequencing technologies have created large genomic databases of a variety of animals and plants. Ever since the human genome project was completed, scientists studied the genome of primates, mammals, and other phylogenetically distant living beings. Such large-scale  studies have provided new insights into the evolutionary relationship between organisms.
Although the genome of each species varies greatly from each other, a few sequences are highly conserved. Such conserved...
4.0K
Leaky Scanning02:28

Leaky Scanning

5.2K
During most eukaryotic translation processes, the small 40S ribosome subunit scans an mRNA from its 5' end until it encounters the first start AUG codon. The large 60S ribosomal subunit then joins the smaller one to initiate protein synthesis. The location of the translation initiation is largely determined by the nucleotides near the start codon as there may be multiple translation initiation sites present on the mRNA.  Marilyn Kozak discovered that the sequence RCCAUGG (where R...
5.2K
Protein and Protein Structure02:15

Protein and Protein Structure

79.8K
Proteins are one of the most abundant organic molecules in living systems and have the most diverse range of functions of all macromolecules. Proteins may be structural, regulatory, contractile, or protective. They may serve in transport, storage, or membranes; or they may be toxins or enzymes. Their structures, like their functions, vary greatly. They are all, however, amino acid polymers arranged in a linear sequence.
A protein's shape is critical to its function. For example, an enzyme...
79.8K

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序
Same author

Fast, accurate construction of multiple sequence alignments from protein language embeddings.

bioRxiv : the preprint server for biology·2026
Same author

Evaluating Pretrained Protein Language Model Embeddings as Proxies for Functional Similarity.

Journal of molecular evolution·2025
Same author

Locality-aware pooling enhances protein language model performance across varied applications.

Bioinformatics (Oxford, England)·2025
Same author

VerteBrain reveals novel neural and non-neural protein assemblies conserved across vertebrate evolution.

bioRxiv : the preprint server for biology·2025
Same author

RECOMB 2024 Special Issue.

Journal of computational biology : a journal of computational molecular cell biology·2024
Same author

Alternative proteoforms and proteoform-dependent assemblies in humans and plants.

Molecular systems biology·2024
Same journal

Complete sequencing of medaka genomes reveals the architecture of centromeric satellites, giant mobile elements, and sex chromosomes.

Genome research·2026
Same journal

Convergence and conflict among telomere specialized transposons across 60 million years of Drosophilid evolution.

Genome research·2026
Same journal

A unified analysis of cell type- and trajectory-associated pathways in single-cell data using Phoenix.

Genome research·2026
Same journal

Resf1 is required for proper placental development and configuration of trophoblast cell-specific heterochromatin.

Genome research·2026
Same journal

Telomere-driven replicative crisis is driven by large-scale changes in genomic architecture.

Genome research·2026
Same journal

Spatially informed reference-free cell-type deconvolution for spatial transcriptomics with SpatialCD.

Genome research·2026
查看所有相关文章

相关实验视频

Updated: Jul 24, 2025

A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

68.8K

利用蛋白质语言模型进行精确的多重序列对齐.

Claire D McWhite1, Isabel Armour-Garb2,3, Mona Singh1,3

  • 1Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey 08544, USA; cmcwhite@princeton.edu mona@cs.princeton.edu.

Genome research
|July 6, 2023
PubMed
概括
此摘要是机器生成的。

本研究引入了一种使用蛋白质语言模型的新多重序列对齐 (MSA) 方法. 它通过分析氨基酸嵌入,绕过传统的对齐步骤来实现对具有低序列身份的蛋白质更高的准确性.

更多相关视频

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues
07:08

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues

Published on: July 14, 2015

7.3K
A Practical Guide to Phylogenetics for Nonexperts
12:00

A Practical Guide to Phylogenetics for Nonexperts

Published on: February 5, 2014

35.4K

相关实验视频

Last Updated: Jul 24, 2025

A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

68.8K
Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues
07:08

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues

Published on: July 14, 2015

7.3K
A Practical Guide to Phylogenetics for Nonexperts
12:00

A Practical Guide to Phylogenetics for Nonexperts

Published on: February 5, 2014

35.4K

科学领域:

  • 生物信息学是一种生物信息学.
  • 计算生物学 计算生物学
  • 结构生物学 结构生物学

背景情况:

  • 多个序列对齐 (MSA) 对于理解蛋白质序列和功能至关重要.
  • 传统的MSA方法与表现出低序列标识 (暮色区) 的蛋白质进行斗争.
  • 蛋白质语言模型通过生成能够捕捉氨基酸性质的上下文嵌入来提供一种新的方法.

研究的目的:

  • 开发一种新的多重序列对齐 (MSA) 方法,利用蛋白质语言模型.
  • 为了提高低序列身份的蛋白质的对齐精度.
  • 为了规避传统MSA算法的局限性.

主要方法:

  • 来自蛋白质语言模型的氨基酸上下文嵌入的聚类和排序.
  • 开发一种基于蛋白质组的语义一致性的新型MSA方法.
  • 避免使用传统的MSA组件,如导向树,对齐对齐,差距处罚和替换矩阵.

主要成果:

  • 新的MSA方法对具有较低氨基酸相似度的结构相似蛋白质表现出更高的准确性.
  • 这种方法有效地利用了来自上下文嵌入的信息.
  • 基于语义一致性的蛋白质组的成功对齐.

结论:

  • 蛋白质语言模型为MSA提供了一个强大的新信息来源.
  • 拟议的方法为对齐具有挑战性的蛋白质集提供了更准确的替代方案.
  • 这种方法预计将成为未来MSA算法的基本组成部分.