A hybrid model based on transformer and Mamba for enhanced sequence modeling

  • Jiangxi Academy of Sciences, Institute of Energy, Nanchang, 330029, Jiangxi, China. zhuxiaocui@jxas.ac.cn.
Scientific Reports

Abstract

State Space Models (SSMs) have made remarkable strides in language modeling in recent years. With the introduction of Mamba, these models have garnered increased attention, often surpassing Transformers in specific areas. Nevertheless, despite Mamba's unique strengths, Transformers remain essential due to their strong representational capabilities and proven effectiveness. In this paper, we propose a novel model that effectively integrates the strengths of both Transformers and Mamba. Specifically, our model uses a Transformer encoder for encoding and a Mamba-based decoder for decoding. We introduce a feature fusion technique that combines the features produced by the encoder with the hidden states generated by the decoder. This approach successfully merges the advantages of the Transformer and Mamba, resulting in enhanced performance. Comprehensive experiments across various language tasks demonstrate that our proposed model consistently achieves competitive results, outperforming existing baselines.
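To make the described architecture concrete, the following is a minimal PyTorch sketch of the overall idea: a Transformer encoder, an SSM-style decoder standing in for Mamba, and a fusion step that mixes encoder features into the decoder's hidden states. The class names, the gated cross-attention fusion, the simplified SSM block (used so the sketch runs without the CUDA-only `mamba_ssm` package), and all hyperparameters are illustrative assumptions, not the authors' released implementation; the paper's exact fusion mechanism and block design may differ.

```python
# Hedged sketch of a Transformer-encoder / Mamba-style-decoder hybrid with
# encoder-decoder feature fusion. All names and choices here are assumptions.
import torch
import torch.nn as nn


class SimpleSSMBlock(nn.Module):
    """Stand-in for a Mamba block: causal depthwise conv + gated projection.
    Used only so this sketch runs without the `mamba_ssm` CUDA package."""

    def __init__(self, d_model: int):
        super().__init__()
        self.conv = nn.Conv1d(d_model, d_model, kernel_size=4, padding=3, groups=d_model)
        self.gate = nn.Linear(d_model, d_model)
        self.proj = nn.Linear(d_model, d_model)
        self.norm = nn.LayerNorm(d_model)

    def forward(self, x):                      # x: (batch, seq, d_model)
        residual = x
        x = self.norm(x)
        # causal depthwise convolution over the sequence dimension
        h = self.conv(x.transpose(1, 2))[..., : x.size(1)].transpose(1, 2)
        return residual + self.proj(h * torch.sigmoid(self.gate(x)))


class HybridEncoderDecoder(nn.Module):
    """Transformer encoder + SSM-style decoder with encoder/decoder feature fusion."""

    def __init__(self, vocab_size=32000, d_model=512, n_heads=8, n_enc=6, n_dec=6):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)
        enc_layer = nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
        self.encoder = nn.TransformerEncoder(enc_layer, num_layers=n_enc)
        self.decoder = nn.ModuleList([SimpleSSMBlock(d_model) for _ in range(n_dec)])
        # Fusion: cross-attention from decoder hidden states to encoder features,
        # combined through a learned gate (an assumed mechanism, see lead-in).
        self.cross_attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.fuse_gate = nn.Linear(2 * d_model, d_model)
        self.lm_head = nn.Linear(d_model, vocab_size)

    def forward(self, src_ids, tgt_ids):
        enc = self.encoder(self.embed(src_ids))        # encoder features
        h = self.embed(tgt_ids)
        for block in self.decoder:
            h = block(h)                               # decoder hidden states
        ctx, _ = self.cross_attn(h, enc, enc)          # attend to encoder output
        gate = torch.sigmoid(self.fuse_gate(torch.cat([h, ctx], dim=-1)))
        fused = gate * h + (1.0 - gate) * ctx          # fuse the two streams
        return self.lm_head(fused)


if __name__ == "__main__":
    model = HybridEncoderDecoder()
    src = torch.randint(0, 32000, (2, 16))
    tgt = torch.randint(0, 32000, (2, 12))
    print(model(src, tgt).shape)                       # torch.Size([2, 12, 32000])
```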