Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Confidence Interval for Estimating Population Mean

Confidence Interval for Estimating Population Mean

A point estimate of the population mean is obtained from a single sample. Such a point estimate does not represent a population well because it needs to account for variability in the population. Single point estimate can also be biased despite the sample being selected randomly. Thus, a point estimate is often unreliable. A confidence interval is needed to reduce this unreliability.
A confidence interval for the mean is a range of values that provides an estimate of the population mean. As the...

Confidence Coefficient

Confidence Coefficient

The confidence coefficient is also known as the confidence level or degree of confidence. It is the percent expression for the probability, 1-α, that the confidence interval contains the true population parameter assuming that the confidence interval is obtained after sufficient unbiased sampling; for example, if the CL = 90%, then in 90 out of 100 samples the interval estimate will enclose the true population parameter. Here α is the area under the curve, distributed equally under...

Exceptions to the Octet Rule

Exceptions to the Octet Rule

Many covalent molecules have central atoms that do not have eight electrons in their Lewis structures. These molecules fall into three categories:

Confidence Intervals

Confidence Intervals

An unbiased point estimate is often insufficient to predict a population estimate, such as population mean or population proportion. In this scenario, a confidence interval is used. A confidence interval is an estimate similar to a sample proportion. However, unlike the point estimate which is a single value, the confidence interval contains a range of values. These values have lower and upper limits, known as confidence limits, and can be designated as L1 and L2, respectively.
A...

Lewis Symbols and the Octet Rule

Lewis Symbols and the Octet Rule

Chemical bonds are complex interactions between two or more atoms or ions, which reduce the potential energy of the molecule. Gilbert N. Lewis developed a model called the Lewis model that simplified the depiction of chemical bond formation and provided straightforward explanations for the chemical bonds seen in most common compounds.

The Aufbau Principle and Hund's Rule

The Aufbau Principle and Hund's Rule

To determine the electron configuration for any particular atom, we can build the structures in the order of atomic numbers. Beginning with hydrogen, and continuing across the periods of the periodic table, we add one proton at a time to the nucleus and one electron to the proper subshell until we have described the electron configurations of all the elements. This procedure is called the aufbau principle, from the German word aufbau (“to build up”). Each added electron occupies the...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Retraction Note: sFRP2 in the aged microenvironment drives melanoma metastasis and therapy resistance.

Nature·2025

Same author

Constraints and tunability of antigen-agnostic memory durability.

bioRxiv : the preprint server for biology·2025

Same author

A unified metric of human immune health.

Nature medicine·2024

Same author

Transcriptional changes in the rat brain induced by repetitive transcranial magnetic stimulation.

Frontiers in human neuroscience·2023

Same author

Sequential Early-Life Infections Alter Peripheral Blood Transcriptomics in Aging Female Mice but Not the Response to De Novo Infection with Influenza Virus or M. tuberculosis.

ImmunoHorizons·2023

Same author

NF-κB subunits direct kinetically distinct transcriptional cascades in antigen receptor-activated B cells.

Nature immunology·2023

Same journal

Novel Gene Discovery in the Human Malaria Parasite using Nucleosome Positioning Data.

Computational systems bioinformatics. Computational Systems Bioinformatics Conference·2014

Same journal

Proceedings of Computational Systems Bioinformatics 2008. August 26-29, 2008. Palo Alto, California, USA.

Computational systems bioinformatics. Computational Systems Bioinformatics Conference·2009

Same journal

Graph wavelet alignment kernels for drug virtual screening.

Computational systems bioinformatics. Computational Systems Bioinformatics Conference·2009

Same journal

Fast multisegment alignments for temporal expression profiles.

Computational systems bioinformatics. Computational Systems Bioinformatics Conference·2009

Same journal

Knowledge representation and data mining for biological imaging.

Computational systems bioinformatics. Computational Systems Bioinformatics Conference·2009

Same journal

Efficient haplotype inference from pedigrees with missing data using linear systems with disjoint-set data structures.

Computational systems bioinformatics. Computational Systems Bioinformatics Conference·2009

See all related articles

Search research articles

Related Experiment Video

Updated: Jan 25, 2026

P300-Based Brain-Computer Interface Speller Performance Estimation with Classifier-Based Latency Estimation

P300-Based Brain-Computer Interface Speller Performance Estimation with Classifier-Based Latency Estimation

Published on: September 8, 2023

Rule-based human gene normalization in biomedical text with confidence estimation.

William W Lau¹, Calvin A Johnson, Kevin G Becker

¹Center for Information Technology, National Institutes of Health, Bethesda, MD 20892-5624, USA.

Computational Systems Bioinformatics. Computational Systems Bioinformatics Conference

|October 24, 2007

Summary

This summary is machine-generated.

This study introduces a novel rule-based algorithm for accurately identifying and normalizing gene mentions in text. The gene normalization method achieves high precision and recall, crucial for bioinformatics text mining.

More Related Videos

Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information

Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information

Published on: August 15, 2019

Synthesis of Keratin-based Nanofiber for Biomedical Engineering

Synthesis of Keratin-based Nanofiber for Biomedical Engineering

Published on: February 7, 2016

Related Experiment Videos

Last Updated: Jan 25, 2026

P300-Based Brain-Computer Interface Speller Performance Estimation with Classifier-Based Latency Estimation

P300-Based Brain-Computer Interface Speller Performance Estimation with Classifier-Based Latency Estimation

Published on: September 8, 2023

Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information

Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information

Published on: August 15, 2019

Synthesis of Keratin-based Nanofiber for Biomedical Engineering

Synthesis of Keratin-based Nanofiber for Biomedical Engineering

Published on: February 7, 2016

Area of Science:

Bioinformatics
Computational Biology
Genomics

Background:

Accurate gene identification and normalization are essential for downstream text mining in bioinformatics.
Existing methods may struggle with the complexity of gene symbol and name variations.

Purpose of the Study:

To develop and evaluate a robust rule-based algorithm for gene mention identification and normalization.
To improve the accuracy of mapping gene mentions to unique identifiers.

Main Methods:

A two-step rule-based algorithm combining pattern matching for gene symbols and approximate term searching for gene names.
Utilized novel features: uniqueness, inverse distance, and coverage for confidence estimation.
Optimized feature weights using the Nealder-Mead simplex method.

Main Results:

Achieved an F-score of 0.7622 on the BioCreAtIvE test dataset.
Obtained an Area Under the Curve (AUC) of 0.7461 for the recall-precision curve.
Demonstrated effective performance using optimized feature weights.

Conclusions:

The developed algorithm provides a reliable method for gene normalization in bioinformatics.
The novel features contribute to improved confidence estimation in gene mention identification.
The approach shows significant potential for enhancing automated literature analysis in genomics.