Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Responses to Drought and Flooding02:41

Responses to Drought and Flooding

10.2K
Water plays a significant role in the life cycle of plants. However, insufficient or excess of water can be detrimental and pose a serious threat to plants.
10.2K
Data Collection by Survey01:07

Data Collection by Survey

7.6K
The systematic method of obtaining and analyzing accurate information of a population is called data collection. A survey is a standard method of data collection that involves collecting information from a target human population about their experience, opinion, or knowledge of a product, service, or process. The responses are recorded and interpreted. The most common survey examples are written questionnaires, face-to-face or telephonic conversations, focus groups, and electronic (e-mail or...
7.6K
Bar Graph01:07

Bar Graph

17.3K
A bar graph is also called a bar chart and consists of bars that are separated from each other. It either uses horizontal or vertical bars to show comparisons among categories. The bars can be rectangles, or they can be rectangular boxes (used in three-dimensional plots). One axis of the graph represents the specific categories being compared, and the other axis shows a discrete value. In this graph, the length of the bar for each category is proportional to the number or percent of individuals...
17.3K
Applications of Normal Distribution01:22

Applications of Normal Distribution

7.7K
The normal distribution is a useful statistical tool. One of its practical applications is determining the door height after considering the normal distribution of heights of persons, such that many can pass through it easily without striking their heads. The normal distribution can also determine the probability of a person having a height less than a specific height.
The heights of 15 to 18-year-old males from Chile from 1984 to 1985 followed a normal distribution. The mean height is 172.36...
7.7K
Data: Types and Distribution01:19

Data: Types and Distribution

2.2K
In biostatistics, data are the observations collected for analysis. There are two main types: parametric and non-parametric. Parametric data, which include continuous (e.g., weight) and discrete numerical data (e.g., number of tablets), assume a particular distribution pattern, often the normal distribution. Non-parametric data do not adhere to a specific distribution and typically comprise nominal (e.g., gender) and ordinal categorical data (e.g., pain scale ratings).
Distributions in...
2.2K
Bulk Density of Aggregate01:22

Bulk Density of Aggregate

1.6K
Bulk density refers to the mass of aggregate particles that would fill a unit volume. The concept of bulk density originates from the inability to pack aggregate particles in a manner that completely eliminates void spaces. Hence, the term bulk refers to the volume that encompasses both the aggregates and the voids. This measurement is crucial when aggregates are batched by volume and is used to convert quantities by mass to volume.
Most natural mineral aggregates, like sand and gravel,...
1.6K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

MedCOD: Enhancing English-to-Spanish Medical Translation of Large Language Models Using Enriched Chain-of-Dictionary Framework.

Findings of ACL. EMNLP. Conference on Empirical Methods in Natural Language Processing·2026
Same author

Freezing of Gait Detection Using Gramian Angular Fields and Federated Learning from Wearable Sensors.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference·2025
Same author

Design and Implementation of a Scalable Clinical Data Warehouse for Resource-Constrained Healthcare Systems.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference·2025
Same author

Autism Spectrum Disorder Detection Using Prominent Connectivity Features from Electroencephalography.

International journal of neural systems·2025
Same author

Climate data dynamics: A high-volume real world structured weather dataset.

Data in brief·2024
Same author

Expert opinion elicitation for assisting deep learning based Lyme disease classifier with patient data.

International journal of medical informatics·2024
Same journal

Data from a public participation GIS survey on the everyday active travel experiences of residents from five European cities.

Data in brief·2026
Same journal

Description of the dataset on alkoxycarbonylation catalyzed by supported palladium phosphide nanoparticles.

Data in brief·2026
Same journal

Solar radiation and surface temperature datasets across urban surface materials of Jakarta megacity.

Data in brief·2026
Same journal

Transcriptome data of oil palm dumpy seedlings treated with different concentration of phosphorus in hydroponic system.

Data in brief·2026
Same journal

Survey data of 71,578 ancient and notable trees across 183 counties in Sichuan Province, China (2016-2023).

Data in brief·2026
Same journal

Data on the stated willingness to accept collective agri-environmental schemes for biodiversity conservation of European grassland farmers.

Data in brief·2026
See all related articles

Related Experiment Video

Updated: May 5, 2026

Performing Data Mining And Integrative Analysis Of Biomarker in Breast Cancer Using Multiple Publicly Accessible Databases
07:41

Performing Data Mining And Integrative Analysis Of Biomarker in Breast Cancer Using Multiple Publicly Accessible Databases

Published on: May 17, 2019

8.9K

Bangla news article dataset.

Asif Mohammed Saad1, Umme Niraj Mahi1, Md Shahidul Salim1

  • 1Khulna University of Engineering & Technology, Khulna 9203, Bangladesh.

Data in Brief
|September 18, 2024
PubMed
Summary
This summary is machine-generated.

This research introduces a comprehensive Bangla news dataset, featuring over 1.9 million articles. This resource supports advancements in Bangla natural language processing and domain-specific large language models.

Keywords:
ClassificationData analysisNatural language processing

More Related Videos

Author Spotlight: Demonstrating Systematic Endobronchial Ultrasound to New Endoscopists
05:22

Author Spotlight: Demonstrating Systematic Endobronchial Ultrasound to New Endoscopists

Published on: August 11, 2023

1.8K
Author Spotlight: AI-Driven Trypanosome Species Detection from Microscopic Images
08:20

Author Spotlight: AI-Driven Trypanosome Species Detection from Microscopic Images

Published on: October 27, 2023

1.4K

Related Experiment Videos

Last Updated: May 5, 2026

Performing Data Mining And Integrative Analysis Of Biomarker in Breast Cancer Using Multiple Publicly Accessible Databases
07:41

Performing Data Mining And Integrative Analysis Of Biomarker in Breast Cancer Using Multiple Publicly Accessible Databases

Published on: May 17, 2019

8.9K
Author Spotlight: Demonstrating Systematic Endobronchial Ultrasound to New Endoscopists
05:22

Author Spotlight: Demonstrating Systematic Endobronchial Ultrasound to New Endoscopists

Published on: August 11, 2023

1.8K
Author Spotlight: AI-Driven Trypanosome Species Detection from Microscopic Images
08:20

Author Spotlight: AI-Driven Trypanosome Species Detection from Microscopic Images

Published on: October 27, 2023

1.4K

Area of Science:

  • Natural Language Processing (NLP)
  • Data Science
  • Computational Linguistics

Background:

  • A new, extensive Bangla news dataset has been compiled from nine major news sources.
  • The dataset comprises over 1.9 million articles across diverse categories like sports, economy, politics, and technology.
  • It includes various attributes such as title, content, publication time, tags, and meta information for each article.

Discussion:

  • The dataset provides a valuable resource for researchers in Bangla NLP.
  • It facilitates the investigation and assessment of theories within Bangla language processing.
  • The availability of such a dataset is crucial for developing robust NLP models.

Key Insights:

  • The dataset enables data scientists to explore Bangla language nuances.
  • It supports the development of domain-specific large language models tailored for Bangladesh.
  • Facilitates the creation of machine learning and deep learning models for Bangla text classification.

Outlook:

  • This dataset is expected to significantly boost research and development in Bangla NLP.
  • It will serve as a foundation for future advancements in artificial intelligence applications for the Bangla language.
  • Potential for creating more accurate and context-aware language models for Bangladesh.