Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Confidence Interval for Estimating Population Mean01:25

Confidence Interval for Estimating Population Mean

8.8K
A point estimate of the population mean is obtained from a single sample. Such a point estimate does not represent a population well because it needs to account for variability in the population. Single point estimate can also be biased despite the sample being selected randomly. Thus, a point estimate is often unreliable. A confidence interval is needed to reduce this unreliability.
A confidence interval for the mean is a range of values that provides an estimate of the population mean. As the...
8.8K
Confidence Coefficient01:24

Confidence Coefficient

10.5K
The confidence coefficient is also known as the confidence level or degree of confidence. It is the percent expression for the probability, 1-α, that the confidence interval contains the true population parameter assuming that the confidence interval is obtained after sufficient unbiased sampling; for example, if the CL = 90%, then in 90 out of 100 samples the interval estimate will enclose the true population parameter. Here α is the area under the curve, distributed equally under...
10.5K
Exceptions to the Octet Rule02:55

Exceptions to the Octet Rule

37.3K
Many covalent molecules have central atoms that do not have eight electrons in their Lewis structures. These molecules fall into three categories:
37.3K
Confidence Intervals01:21

Confidence Intervals

10.2K
An unbiased point estimate is often insufficient to predict a population estimate, such as population mean or population proportion. In this scenario, a confidence interval is used. A confidence interval is an estimate similar to a  sample proportion. However, unlike the point estimate which is a single value, the confidence interval  contains a range of values. These values have lower and upper limits, known as confidence limits, and can be designated as L1 and L2, respectively.
A...
10.2K
Lewis Symbols and the Octet Rule02:36

Lewis Symbols and the Octet Rule

80.4K
Chemical bonds are complex interactions between two or more atoms or ions, which reduce the potential energy of the molecule. Gilbert N. Lewis developed a model called the Lewis model that simplified the depiction of chemical bond formation and provided straightforward explanations for the chemical bonds seen in most common compounds.
80.4K
The Aufbau Principle and Hund's Rule03:02

The Aufbau Principle and Hund's Rule

72.4K
To determine the electron configuration for any particular atom, we can build the structures in the order of atomic numbers. Beginning with hydrogen, and continuing across the periods of the periodic table, we add one proton at a time to the nucleus and one electron to the proper subshell until we have described the electron configurations of all the elements. This procedure is called the aufbau principle, from the German word aufbau (“to build up”). Each added electron occupies the...
72.4K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Retraction Note: sFRP2 in the aged microenvironment drives melanoma metastasis and therapy resistance.

Nature·2025
Same author

Constraints and tunability of antigen-agnostic memory durability.

bioRxiv : the preprint server for biology·2025
Same author

A unified metric of human immune health.

Nature medicine·2024
Same author

Transcriptional changes in the rat brain induced by repetitive transcranial magnetic stimulation.

Frontiers in human neuroscience·2023
Same author

Sequential Early-Life Infections Alter Peripheral Blood Transcriptomics in Aging Female Mice but Not the Response to De Novo Infection with Influenza Virus or M. tuberculosis.

ImmunoHorizons·2023
Same author

NF-κB subunits direct kinetically distinct transcriptional cascades in antigen receptor-activated B cells.

Nature immunology·2023
Same journal

Novel Gene Discovery in the Human Malaria Parasite using Nucleosome Positioning Data.

Computational systems bioinformatics. Computational Systems Bioinformatics Conference·2014
Same journal

Proceedings of Computational Systems Bioinformatics 2008. August 26-29, 2008. Palo Alto, California, USA.

Computational systems bioinformatics. Computational Systems Bioinformatics Conference·2009
Same journal

Graph wavelet alignment kernels for drug virtual screening.

Computational systems bioinformatics. Computational Systems Bioinformatics Conference·2009
Same journal

Fast multisegment alignments for temporal expression profiles.

Computational systems bioinformatics. Computational Systems Bioinformatics Conference·2009
Same journal

Knowledge representation and data mining for biological imaging.

Computational systems bioinformatics. Computational Systems Bioinformatics Conference·2009
Same journal

Efficient haplotype inference from pedigrees with missing data using linear systems with disjoint-set data structures.

Computational systems bioinformatics. Computational Systems Bioinformatics Conference·2009
See all related articles

Related Experiment Video

Updated: Jan 25, 2026

P300-Based Brain-Computer Interface Speller Performance Estimation with Classifier-Based Latency Estimation
06:09

P300-Based Brain-Computer Interface Speller Performance Estimation with Classifier-Based Latency Estimation

Published on: September 8, 2023

937

Rule-based human gene normalization in biomedical text with confidence estimation.

William W Lau1, Calvin A Johnson, Kevin G Becker

  • 1Center for Information Technology, National Institutes of Health, Bethesda, MD 20892-5624, USA.

Computational Systems Bioinformatics. Computational Systems Bioinformatics Conference
|October 24, 2007
PubMed
Summary
This summary is machine-generated.

This study introduces a novel rule-based algorithm for accurately identifying and normalizing gene mentions in text. The gene normalization method achieves high precision and recall, crucial for bioinformatics text mining.

More Related Videos

Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information
09:37

Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information

Published on: August 15, 2019

10.3K
Synthesis of Keratin-based Nanofiber for Biomedical Engineering
14:43

Synthesis of Keratin-based Nanofiber for Biomedical Engineering

Published on: February 7, 2016

15.9K

Related Experiment Videos

Last Updated: Jan 25, 2026

P300-Based Brain-Computer Interface Speller Performance Estimation with Classifier-Based Latency Estimation
06:09

P300-Based Brain-Computer Interface Speller Performance Estimation with Classifier-Based Latency Estimation

Published on: September 8, 2023

937
Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information
09:37

Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information

Published on: August 15, 2019

10.3K
Synthesis of Keratin-based Nanofiber for Biomedical Engineering
14:43

Synthesis of Keratin-based Nanofiber for Biomedical Engineering

Published on: February 7, 2016

15.9K

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • Accurate gene identification and normalization are essential for downstream text mining in bioinformatics.
  • Existing methods may struggle with the complexity of gene symbol and name variations.

Purpose of the Study:

  • To develop and evaluate a robust rule-based algorithm for gene mention identification and normalization.
  • To improve the accuracy of mapping gene mentions to unique identifiers.

Main Methods:

  • A two-step rule-based algorithm combining pattern matching for gene symbols and approximate term searching for gene names.
  • Utilized novel features: uniqueness, inverse distance, and coverage for confidence estimation.
  • Optimized feature weights using the Nealder-Mead simplex method.

Main Results:

  • Achieved an F-score of 0.7622 on the BioCreAtIvE test dataset.
  • Obtained an Area Under the Curve (AUC) of 0.7461 for the recall-precision curve.
  • Demonstrated effective performance using optimized feature weights.

Conclusions:

  • The developed algorithm provides a reliable method for gene normalization in bioinformatics.
  • The novel features contribute to improved confidence estimation in gene mention identification.
  • The approach shows significant potential for enhancing automated literature analysis in genomics.