Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Video

Updated: Dec 25, 2025

Author Spotlight: AI-Driven Trypanosome Species Detection from Microscopic Images
08:20

Author Spotlight: AI-Driven Trypanosome Species Detection from Microscopic Images

Published on: October 27, 2023

2.3K

UMUDGA: A dataset for profiling algorithmically generated domain names in botnet detection.

Mattia Zago1, Manuel Gil Pérez1, Gregorio Martínez Pérez1

  • 1Department of Information Engineering and Communications, University of Murcia, Campus Espinardo Murcia 30100 Spain.

Data in Brief
|March 28, 2020
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Opportunities for standardization in emergency scenarios in the European Union.

International journal of medical informatics·2023
Same author

PALANTIR: An NFV-Based Security-as-a-Service Approach for Automating Threat Mitigation.

Sensors (Basel, Switzerland)·2023
Same author

Breaching Subjects' Thoughts Privacy: A Study with Visual Stimuli and Brain-Computer Interfaces.

Journal of healthcare engineering·2021
Same author

S3: An AI-Enabled User Continuous Authentication for Smartphones Based on Sensors, Statistics and Speaker Information.

Sensors (Basel, Switzerland)·2021
Same author

Twitter social bots: The 2019 Spanish general election data.

Data in brief·2020
Same author

BEHACOM - a dataset modelling users' behaviour in computers.

Data in brief·2020
Same journal

A harmonized fast-fashion garment-variant dataset for textile circularity and sustainability assessment.

Data in brief·2026
Same journal

Terahertz reflectivity dataset: Reading text on both sides of the page.

Data in brief·2026
Same journal

High-quality draft genome sequence data of <i>Levilactobacillus brevis</i> 3LB isolated from fermented milk koumiss.

Data in brief·2026
Same journal

Interview dataset: Encouraging the development of industrial symbiosis networks in Slovenia - transition to the circular economy.

Data in brief·2026
Same journal

Timeseries of multispectral and radar data and vegetation indices from Sentinel-1, Sentinel-2 and Landsat-8 at field scale.

Data in brief·2026
Same journal

BACI-VI-Bench: A dataset of variational inequality benchmark instances for multi-agent trade-network equilibrium.

Data in brief·2026
See all related articles

This study introduces a large dataset of over 30 million domain names to improve the detection of botnets using machine learning. The dataset aids researchers in developing advanced cybersecurity solutions against domain generation algorithms (DGAs).

Area of Science:

  • Computer Security
  • Cybersecurity
  • Machine Learning

Background:

  • Botnets pose a significant cyber threat, employing sophisticated techniques like dynamic addressing and domain generation algorithms (DGAs) for concealment.
  • Existing detection methods require enhancement to effectively counter these evolving threats.

Purpose of the Study:

  • To present a comprehensive dataset of manually-labeled, algorithmically generated domain names for machine learning analysis.
  • To facilitate research in developing more effective botnet detection systems.

Main Methods:

  • Generation of over 30 million domain names using DGAs from 50 notorious malware variants in a controlled environment.
  • Extraction of a feature set comprising statistical and natural language processing metrics for each domain.
Keywords:
DataDomain Generation Algorithm (DGA)Machine learningNatural Language Processing (NLP)Network security

More Related Videos

Single Droplet Digital Polymerase Chain Reaction for Comprehensive and Simultaneous Detection of Mutations in Hotspot Regions
08:23

Single Droplet Digital Polymerase Chain Reaction for Comprehensive and Simultaneous Detection of Mutations in Hotspot Regions

Published on: September 25, 2018

13.8K

Related Experiment Videos

Last Updated: Dec 25, 2025

Author Spotlight: AI-Driven Trypanosome Species Detection from Microscopic Images
08:20

Author Spotlight: AI-Driven Trypanosome Species Detection from Microscopic Images

Published on: October 27, 2023

2.3K
Single Droplet Digital Polymerase Chain Reaction for Comprehensive and Simultaneous Detection of Mutations in Hotspot Regions
08:23

Single Droplet Digital Polymerase Chain Reaction for Comprehensive and Simultaneous Detection of Mutations in Hotspot Regions

Published on: September 25, 2018

13.8K
  • Manual labeling of all generated domain names.
  • Main Results:

    • A ready-to-use dataset for machine learning analysis, including both raw domain lists and extracted features.
    • The dataset enables researchers to bypass data collection and pre-processing, focusing on ML-powered detection.

    Conclusions:

    • The presented dataset significantly advances the field by providing a robust resource for DGA-based botnet detection research.
    • It empowers the development of improved network intrusion detection systems through focused ML analysis.