Jove
Visualize
联系我们

相关概念视频

Data: Types and Distribution01:19

Data: Types and Distribution

704
In biostatistics, data are the observations collected for analysis. There are two main types: parametric and non-parametric. Parametric data, which include continuous (e.g., weight) and discrete numerical data (e.g., number of tablets), assume a particular distribution pattern, often the normal distribution. Non-parametric data do not adhere to a specific distribution and typically comprise nominal (e.g., gender) and ordinal categorical data (e.g., pain scale ratings).
Distributions in...
704
How Data are Classified: Categorical Data01:11

How Data are Classified: Categorical Data

31.9K
A variable, usually notated by capital letters such as X and Y, is a characteristic or measurement that can be determined for each member of a population. Data are the actual values of variables. They may be numbers, or they may be words. Datum is a single value.
Data are classified based on whether they are measurable or not. Categorical data cannot be measured; instead, it can be divided into categories. For example, if Y denotes a person's party affiliation, some examples of Y include...
31.9K
Review and Preview01:13

Review and Preview

8.9K
Data are individual items of information obtained from a population or sample. Data may be classified as qualitative (categorical), quantitative continuous, or quantitative discrete. Because it is not practical to measure the entire population in a study, researchers use samples to represent the population. A random sample is a representative group from the population chosen by using a method that gives each individual in the population an equal chance of being included in the sample. Random...
8.9K
Dimensional Analysis01:23

Dimensional Analysis

853
Dimensional analysis is a powerful tool that is used in physics and engineering to understand and predict the behavior of physical systems. The basic idea behind dimensional analysis is to express physical quantities in terms of fundamental dimensions such as the mass, length, and time. Derived dimensions like the velocity, acceleration, and force are derived from the combinations of these fundamental dimensions.
Dimensional analysis allows us to analyze and compare physical quantities on a...
853
What is Central Tendency?01:14

What is Central Tendency?

14.1K
Descriptive statistics describe or summarize relevant characteristics of a sample and aid in the analysis of data of interest. When analyzing large quantities of data and developing an inference, one needs to identify a value representative of the entire data set. Characteristics such as central tendency, extreme values, range of measurements, or the most repeated value can help better understand the data.
The central tendency is the most conventionally used data characteristic. It is a...
14.1K
How Data are Classified: Numerical Data00:59

How Data are Classified: Numerical Data

27.9K
Data that are countable or measurable in specific units are called numerical or quantitative data. Quantitative data are always numbers. Quantitative data are the result of counting or measuring the attributes of a population. Amount of money, pulse rate, weight, number of people living in a town, and number of students who opt for statistics are examples of quantitative data.
Quantitative data may be either discrete or continuous. All quantitative data that take on only specific numerical...
27.9K

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序
Same author

Impact of electron-phonon interaction on the electronic structure of interfaces between organic molecules and a MoS2 monolayer.

The Journal of chemical physics·2026
Same author

Hybrid Frenkel-Wannier excitons facilitate ultrafast energy transfer at a 2D-organic interface.

Nature physics·2025
Same author

Towards an interoperable perovskite description or how to keep track of 300 perovskite ions.

Nature communications·2025
Same author

Band-Gap Regression with Architecture-Optimized Message-Passing Neural Networks.

Chemistry of materials : a publication of the American Chemical Society·2025
Same author

Decoupling many-body interactions in the CeO<sub>2</sub>(111) oxygen vacancy structure with statistical learning and cluster expansion.

Nanoscale·2025
Same author

Discovering synthesis targets: general discussion.

Faraday discussions·2024
Same journal

Ambient stability and surface adhesion of 2D polyaramid nanofilms.

Faraday discussions·2026
Same journal

Spiers Memorial Lecture: Spin-mediated promotion of magnetic metal catalysts.

Faraday discussions·2026
Same journal

Helium spin-echo as a surface-sensitive probe of vibrational energy dissipation.

Faraday discussions·2026
Same journal

Near-infrared vibrational second harmonic generation: a new nonlinear interfacial vibrational spectroscopy.

Faraday discussions·2026
Same journal

CO on a Rh/Fe<sub>3</sub>O<sub>4</sub> single-atom catalyst: high-resolution infrared spectroscopy and near-ambient-pressure scanning tunnelling microscopy.

Faraday discussions·2026
Same journal

Evolution of size-selected Pt cluster catalysts on prototypical oxide supports.

Faraday discussions·2026
查看所有相关文章
JoVE
x logofacebook logolinkedin logoyoutube logo
关于 JoVE
概览领导团队博客JoVE 帮助中心
作者
出版流程编辑委员会范围与政策同行评审常见问题投稿
图书馆员
用户评价订阅访问资源图书馆顾问委员会常见问题
研究
JoVE JournalMethods CollectionsJoVE Encyclopedia of Experiments存档
教育
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab Manual教师资源中心教师网站
使用条款与条件
隐私政策
政策

相关实验视频

Updated: Jun 12, 2025

Databases to Efficiently Manage Medium Sized, Low Velocity, Multidimensional Data in Tissue Engineering
09:43

Databases to Efficiently Manage Medium Sized, Low Velocity, Multidimensional Data in Tissue Engineering

Published on: November 22, 2019

6.2K

大数据有多大?

Daniel Speckhard1,2, Tim Bechtel1,2, Luca M Ghiringhelli3

  • 1Physics Department and CSMB, Humboldt-Universität zu Berlin, Zum Großen Windkanal 2, 12489 Berlin, Germany. claudia.draxl@physik.hu-berlin.de.

Faraday discussions
|September 24, 2024
PubMed
概括
此摘要是机器生成的。

材料科学中的大数据机器学习提出了超出数量范围的挑战,包括数据质量,真实性和基础设施. 解决这些问题对于在现场推进预测建模至关重要.

更多相关视频

Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications
09:20

Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications

Published on: February 23, 2019

8.7K
Measuring the Structure, Composition, and Change of Underwater Environments with Large-area Imaging
09:19

Measuring the Structure, Composition, and Change of Underwater Environments with Large-area Imaging

Published on: April 18, 2025

369

相关实验视频

Last Updated: Jun 12, 2025

Databases to Efficiently Manage Medium Sized, Low Velocity, Multidimensional Data in Tissue Engineering
09:43

Databases to Efficiently Manage Medium Sized, Low Velocity, Multidimensional Data in Tissue Engineering

Published on: November 22, 2019

6.2K
Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications
09:20

Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications

Published on: February 23, 2019

8.7K
Measuring the Structure, Composition, and Change of Underwater Environments with Large-area Imaging
09:19

Measuring the Structure, Composition, and Change of Underwater Environments with Large-area Imaging

Published on: April 18, 2025

369

科学领域:

  • 材料科学 材料科学 材料科学
  • 计算机科学 计算机科学
  • 数据科学数据科学数据科学

背景情况:

  • 机器学习模型越来越多地用于材料科学中的预测任务.
  • "大数据"在这个领域的定义和含义需要仔细研究.

研究的目的:

  • 在材料科学机器学习的背景下定义"大数据".
  • 调查与数据量,质量,真实性和基础设施相关的挑战.
  • 探索模型概括,数据聚合,特征工程和计算要求.

主要方法:

  • 分析典型的材料科学机器学习问题.
  • 评估跨数据集的模型概括.
  • 从异质来源收集高质量的数据的案例研究.
  • 评估特征集和模型复杂性对表现力的影响.
  • 检查大型数据和模型培训的基础设施需求.

主要成果:

  • 材料科学中的"大数据"涉及数据量,质量和真实性的复杂相互作用.
  • 模型的概括与数据集特征有很大差异.
  • 不同质的数据源的有效聚合是具有挑战性的,但可行的.
  • 特性工程和模型复杂性对于预测准确性至关重要.
  • 需要大量的基础设施来管理和培训大型材料数据集.

结论:

  • 材料科学中的大数据机器学习带来了多方面的挑战.
  • 需要进一步的研究来解决数据质量,基础设施和模型开发问题.
  • 优化数据处理和模型训练对于释放预测潜力的必要.