Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Improving Translational Accuracy02:07

Improving Translational Accuracy

3.5K
3.5K
Improving Translational Accuracy02:07

Improving Translational Accuracy

14.0K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
14.0K
Distribution Reliability and Automation01:25

Distribution Reliability and Automation

469
Distribution reliability in electrical power systems is critical for ensuring an uninterrupted power supply to consumers at minimal cost. According to IEEE Standard Terms, reliability is the probability that a device will function without failure over a specified time period or amount of usage. For electric power distribution, this translates to maintaining continuous power supply and addressing customer concerns over power outages. Several indices, as defined by IEEE Standard 1366-2012, are...
469
Quantifying Work02:30

Quantifying Work

23.8K
As a system undergoes a change, its internal energy can change, and energy can be transferred from the system to the surroundings, or from the surroundings to the system.
23.8K
Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving01:29

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

253
Mechanistic models play a crucial role in algorithms for numerical problem-solving, particularly in nonlinear mixed effects modeling (NMEM). These models aim to minimize specific objective functions by evaluating various parameter estimates, leading to the development of systematic algorithms. In some cases, linearization techniques approximate the model using linear equations.
In individual population analyses, different algorithms are employed, such as Cauchy's method, which uses a...
253
Laminar Flow: Problem Solving01:24

Laminar Flow: Problem Solving

476
Laminar flow occurs when a fluid moves smoothly in parallel layers with minimal mixing and turbulence. In fluid mechanics, ensuring laminar flow within a pipe is essential for precise control of flow characteristics, especially in engineering applications. The key factor in determining whether flow remains laminar is the Reynolds number, a dimensionless quantity that depends on the fluid's velocity, density, viscosity, and the pipe's diameter. A Reynolds number of 2100 or lower...
476

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same journal

Integrated multi-assessment and structural performance index framework for stacking-sequence optimisation of natural fibre reinforced laminates.

Scientific reports·2026
Same journal

SuperiorGAT: graph attention networks for sparse LiDAR point cloud reconstruction in autonomous systems.

Scientific reports·2026
Same journal

The effect of stretching the pectoralis major, sternocleidomastoid, and iliopsoas muscles on 800 m swimming performance in master swimmers.

Scientific reports·2026
Same journal

ISNR-PQC: isometry noise resilience post quantum cryptography primitive.

Scientific reports·2026
Same journal

Identification of high-yielding and stable genotypes of barley in the cold climate of Iran using AMMI and GGE biplot models.

Scientific reports·2026
Same journal

Bayesian negative binomial modelling of spatial and temporal patterns of road traffic deaths in Ghana.

Scientific reports·2026
See all related articles

Related Experiment Video

Updated: Jan 6, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

980

Generating reliable software project task flows using large language models through prompt engineering and robust

Mohammed Sarim1, Faraz Masood1, Manas Maheshwari1

  • 1Department of Computer Science, Aligarh Muslim University, Aligarh, Uttar Pradesh, 202002, India.

Scientific Reports
|October 8, 2025
PubMed
Summary
This summary is machine-generated.

Large Language Models (LLMs) can convert software documentation to task flows. A new metric shows even basic prompts yield reliable results for AI-driven software planning.

More Related Videos

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

1.2K

Related Experiment Videos

Last Updated: Jan 6, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

980
Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

1.2K

Area of Science:

  • Artificial Intelligence
  • Software Engineering
  • Natural Language Processing

Background:

  • Large Language Models (LLMs) show potential for transforming unstructured software documentation into structured task flows.
  • However, LLM-generated outputs often lack the procedural reliability essential for software engineering tasks.

Purpose of the Study:

  • To benchmark leading LLMs (Gemini 2.5 Pro, Grok 3, GPT-Omni, DeepSeek-R1, LLaMA-3) using diverse prompting strategies.
  • To introduce and validate a novel evaluation metric, the Hybrid Semantic Similarity Metric (HSSM), for assessing procedural reliability.

Main Methods:

  • Utilized real-world software tutorials from the "Build Your Own X" repository for benchmarking.
  • Implemented five prompting strategies: Zero-Shot, Chain-of-Thought, and ISO 21502-Guided.
  • Developed HSSM, combining SentenceTransformer embeddings and context-aware key-term overlap for semantic and procedural evaluation.

Main Results:

  • HSSM demonstrated superior performance over traditional metrics (BERTScore, SBERT, USE) with lower variance (1.5-2.9% CV) and higher correlation with human judgments.
  • Even Zero-Shot prompting achieved high alignment (96.33% HSSM) for task flow generation when evaluated with HSSM.
  • LLMs showed varying performance based on prompting strategies and model architecture.

Conclusions:

  • The study provides a scalable framework for evaluating LLM-generated task flows in software engineering.
  • HSSM offers a robust method for assessing procedural coherence, crucial for reliable AI-assisted software planning.
  • Findings suggest potential for LLMs in AI-driven project management, prompt engineering, and procedural generation tools.