Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Understanding Memory01:19

Understanding Memory

569
Memory is the retention of information or experiences over time, facilitated through three main processes: encoding, storage, and retrieval. Encoding is the process of inputting information into the memory system. For instance, when listening to a lecture, watching a play, reading a book, or having a conversation, the brain is actively encoding information. This initial stage involves transforming sensory input into a form that can be processed and stored by the brain. Various factors, such as...
569
System of Memory01:23

System of Memory

6.3K
Memory is categorized into three major systems: sensory memory, short-term memory (STM), and long-term memory (LTM). These systems differ in their capacity and the duration for which they can hold information. Sensory memory captures raw sensory input from the environment, holding it for just a few seconds or less. For example, on hearing a brief, loud sound, like a car horn honking, the sound seems to linger in the mind for a moment even after it stops. This is an instance of sensory memory...
6.3K
Interference and Decay01:16

Interference and Decay

182
Forgetting is a complex cognitive phenomenon influenced by several factors, among which interference and decay are particularly prominent. These processes explain why individuals often struggle to retrieve specific information from memory, leading to lapses in recall that can be observed in everyday situations.
Interference occurs when competing memories hinder the retrieval of particular information. It can be classified into two types: proactive and retroactive interference. Proactive...
182
Multiple Comparison Tests01:13

Multiple Comparison Tests

4.0K
Multiple comparison test, abbreviated as MCT, is a post hoc analysis generally performed after comparing multiple samples with one or more tests. An MCT will help identify a significantly different sample among multiple samples or a factor among multiple factors.
It would be easy to compare two samples using a significance alpha level of 0.05. In other words, there is only one sample pair to be compared. However, it would be difficult to identify a significantly different sample if the number...
4.0K
Forgetting01:21

Forgetting

104
Forgetting is an intrinsic aspect of human memory, characterized by the gradual loss or inaccessibility of information over time. Hermann Ebbinghaus, a pioneering psychologist, extensively studied this phenomenon and formulated the forgetting curve. This curve illustrates that memory loss occurs rapidly immediately after learning and then decelerates over time. Several mechanisms contribute to forgetting, including encoding failure, storage decay, retrieval failure, and interference.
Encoding...
104
Flashbulb Memory01:16

Flashbulb Memory

115
A flashbulb memory is a highly vivid and detailed memory, often linked to events of significant emotional impact. These memories stand out in contrast to everyday memories due to their clarity and the precision with which they are recalled. The strong emotions associated with the event act as a catalyst, ensuring that specific details, such as one's location, actions, and even peripheral elements, are etched into memory with remarkable accuracy. For example, many people can vividly recall...
115

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Evaluation of Endobronchial Ultrasound-Guided Transbronchial Needle Aspiration (EBUS-TBNA) Samples from Advanced Non-Small Cell Lung Cancer for Whole Genome, Whole Exome and Comprehensive Panel Sequencing.

Cancers·2024
Same author

Multi-omic features of oesophageal adenocarcinoma in patients treated with preoperative neoadjuvant therapy.

Nature communications·2023
Same author

Comparative Genomics Provides Etiologic and Biological Insight into Melanoma Subtypes.

Cancer discovery·2022
Same author

DNA methylation patterns identify subgroups of pancreatic neuroendocrine tumors with clinical association.

Communications biology·2021
Same author

Whole-genome sequencing of acral melanoma reveals genomic complexity and diversity.

Nature communications·2020
Same author

Using whole-genome sequencing data to derive the homologous recombination deficiency scores.

NPJ breast cancer·2020
Same journal

conMItion: an R package adjusting confounding factors for associations in multi-omics.

Bioinformatics (Oxford, England)·2026
Same journal

SpaMFG: a Spatial Multi-omics Integration Method based on Feature Grouping.

Bioinformatics (Oxford, England)·2026
Same journal

CSCN: Inference of Cell-Specific Causal Networks Using Single-Cell RNA-Seq Data.

Bioinformatics (Oxford, England)·2026
Same journal

Sparse CCA-Based Mediation Analysis with High-Dimensional Exposures and Mediators.

Bioinformatics (Oxford, England)·2026
Same journal

Enhancing Cross-Context Generalization in Drug Perturbation Prediction with a Multimodal Conditional Diffusion Framework.

Bioinformatics (Oxford, England)·2026
Same journal

Primer Design through Submodular Function Estimation.

Bioinformatics (Oxford, England)·2026
See all related articles

Related Experiment Video

Updated: Aug 3, 2025

Highly Multiplexed, Super-resolution Imaging of T Cells Using madSTORM
08:43

Highly Multiplexed, Super-resolution Imaging of T Cells Using madSTORM

Published on: June 24, 2017

7.5K

streammd: fast low-memory duplicate marking using a Bloom filter.

Conrad Leonard1

  • 1Department of Genome Informatics, QIMR Berghofer Medical Research Institute, Herston, QLD 4006, Australia.

Bioinformatics (Oxford, England)
|April 7, 2023
PubMed
Summary
This summary is machine-generated.

streammd is a new tool for identifying duplicate DNA sequences in large datasets. It uses a Bloom filter for fast, memory-efficient processing, outperforming existing methods like SAMBLASTER.

More Related Videos

Multiplex Detection of Bacteria in Complex Clinical and Environmental Samples using Oligonucleotide-coupled Fluorescent Microspheres
11:09

Multiplex Detection of Bacteria in Complex Clinical and Environmental Samples using Oligonucleotide-coupled Fluorescent Microspheres

Published on: October 23, 2011

16.4K
Making Precise and Accurate Single-Molecule FRET Measurements using the Open-Source smfBox
07:12

Making Precise and Accurate Single-Molecule FRET Measurements using the Open-Source smfBox

Published on: July 5, 2021

3.5K

Related Experiment Videos

Last Updated: Aug 3, 2025

Highly Multiplexed, Super-resolution Imaging of T Cells Using madSTORM
08:43

Highly Multiplexed, Super-resolution Imaging of T Cells Using madSTORM

Published on: June 24, 2017

7.5K
Multiplex Detection of Bacteria in Complex Clinical and Environmental Samples using Oligonucleotide-coupled Fluorescent Microspheres
11:09

Multiplex Detection of Bacteria in Complex Clinical and Environmental Samples using Oligonucleotide-coupled Fluorescent Microspheres

Published on: October 23, 2011

16.4K
Making Precise and Accurate Single-Molecule FRET Measurements using the Open-Source smfBox
07:12

Making Precise and Accurate Single-Molecule FRET Measurements using the Open-Source smfBox

Published on: July 5, 2021

3.5K

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • Duplicate template identification is crucial for bulk sequence analysis.
  • Processing large sequencing libraries can be computationally expensive and memory-intensive.

Purpose of the Study:

  • To introduce streammd, a novel tool for efficient duplicate marking in large sequence datasets.
  • To provide a faster and more memory-efficient alternative to existing duplicate identification methods.

Main Methods:

  • streammd utilizes a Bloom filter for single-pass duplicate marking.
  • The algorithm is implemented in C++ for high performance.

Main Results:

  • streammd achieves performance comparable to Picard MarkDuplicates.
  • It demonstrates significantly faster processing speeds and reduced memory requirements compared to SAMBLASTER.

Conclusions:

  • streammd offers a highly efficient solution for duplicate template identification in large-scale sequencing projects.
  • The tool's speed and memory efficiency make it suitable for resource-intensive bioinformatics workflows.