Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Improving Translational Accuracy

Improving Translational Accuracy

Improving Translational Accuracy

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...

Distribution Reliability and Automation

Distribution Reliability and Automation

Distribution reliability in electrical power systems is critical for ensuring an uninterrupted power supply to consumers at minimal cost. According to IEEE Standard Terms, reliability is the probability that a device will function without failure over a specified time period or amount of usage. For electric power distribution, this translates to maintaining continuous power supply and addressing customer concerns over power outages. Several indices, as defined by IEEE Standard 1366-2012, are...

Quantifying Work

Quantifying Work

As a system undergoes a change, its internal energy can change, and energy can be transferred from the system to the surroundings, or from the surroundings to the system.

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

Mechanistic models play a crucial role in algorithms for numerical problem-solving, particularly in nonlinear mixed effects modeling (NMEM). These models aim to minimize specific objective functions by evaluating various parameter estimates, leading to the development of systematic algorithms. In some cases, linearization techniques approximate the model using linear equations.
In individual population analyses, different algorithms are employed, such as Cauchy's method, which uses a...

Laminar Flow: Problem Solving

Laminar Flow: Problem Solving

Laminar flow occurs when a fluid moves smoothly in parallel layers with minimal mixing and turbulence. In fluid mechanics, ensuring laminar flow within a pipe is essential for precise control of flow characteristics, especially in engineering applications. The key factor in determining whether flow remains laminar is the Reynolds number, a dimensionless quantity that depends on the fluid's velocity, density, viscosity, and the pipe's diameter. A Reynolds number of 2100 or lower...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same journal

Integrated multi-assessment and structural performance index framework for stacking-sequence optimisation of natural fibre reinforced laminates.

Scientific reports·2026

Same journal

SuperiorGAT: graph attention networks for sparse LiDAR point cloud reconstruction in autonomous systems.

Scientific reports·2026

Same journal

The effect of stretching the pectoralis major, sternocleidomastoid, and iliopsoas muscles on 800 m swimming performance in master swimmers.

Scientific reports·2026

Same journal

ISNR-PQC: isometry noise resilience post quantum cryptography primitive.

Scientific reports·2026

Same journal

Identification of high-yielding and stable genotypes of barley in the cold climate of Iran using AMMI and GGE biplot models.

Scientific reports·2026

Same journal

Bayesian negative binomial modelling of spatial and temporal patterns of road traffic deaths in Ghana.

Scientific reports·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jan 6, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Generating reliable software project task flows using large language models through prompt engineering and robust

Mohammed Sarim¹, Faraz Masood¹, Manas Maheshwari¹

¹Department of Computer Science, Aligarh Muslim University, Aligarh, Uttar Pradesh, 202002, India.

Scientific Reports

|October 8, 2025

Summary

This summary is machine-generated.

Large Language Models (LLMs) can convert software documentation to task flows. A new metric shows even basic prompts yield reliable results for AI-driven software planning.

More Related Videos

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

Related Experiment Videos

Last Updated: Jan 6, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

Area of Science:

Artificial Intelligence
Software Engineering
Natural Language Processing

Background:

Large Language Models (LLMs) show potential for transforming unstructured software documentation into structured task flows.
However, LLM-generated outputs often lack the procedural reliability essential for software engineering tasks.

Purpose of the Study:

To benchmark leading LLMs (Gemini 2.5 Pro, Grok 3, GPT-Omni, DeepSeek-R1, LLaMA-3) using diverse prompting strategies.
To introduce and validate a novel evaluation metric, the Hybrid Semantic Similarity Metric (HSSM), for assessing procedural reliability.

Main Methods:

Utilized real-world software tutorials from the "Build Your Own X" repository for benchmarking.
Implemented five prompting strategies: Zero-Shot, Chain-of-Thought, and ISO 21502-Guided.
Developed HSSM, combining SentenceTransformer embeddings and context-aware key-term overlap for semantic and procedural evaluation.

Main Results:

HSSM demonstrated superior performance over traditional metrics (BERTScore, SBERT, USE) with lower variance (1.5-2.9% CV) and higher correlation with human judgments.
Even Zero-Shot prompting achieved high alignment (96.33% HSSM) for task flow generation when evaluated with HSSM.
LLMs showed varying performance based on prompting strategies and model architecture.

Conclusions:

The study provides a scalable framework for evaluating LLM-generated task flows in software engineering.
HSSM offers a robust method for assessing procedural coherence, crucial for reliable AI-assisted software planning.
Findings suggest potential for LLMs in AI-driven project management, prompt engineering, and procedural generation tools.