Evaluating the performance of multilingual models in answer extraction and question generation
Summary
This summary is machine-generated. This study enhances multiple-choice test generation in Spanish by fine-tuning Transformer models like mT5-base for Answer Extraction (AE) and Question Generation (QG). The mT5-base model, trained on a combined Spanish dataset, achieved superior performance, setting a benchmark for future research.
Area Of Science
- Natural Language Processing (NLP)
- Artificial Intelligence (AI)
- Machine Learning (ML)
Background
- Multiple-choice test generation is a complex NLP task, particularly for languages other than English, where prior research is limited.
- Transformer architectures have advanced Answer Extraction (AE) and Question Generation (QG) tasks.
- Existing research often lacks focus on Spanish language NLP challenges for automated test creation.
Purpose Of The Study
- To develop and evaluate improved models for Answer Extraction (AE) and Question Generation (QG) in Spanish.
- To investigate the efficacy of an answer-aware methodology for Spanish NLP tasks.
- To establish a performance benchmark for AE and QG models in Spanish using various evaluation metrics.
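The summary does not say how the answer-aware inputs are formatted. One common convention for text-to-text models is to mark the answer span inside the context with highlight tokens and to prepend a task prefix. The sketch below illustrates that idea only: the `<hl>` token, the `generate question:` and `extract answers:` prefixes, and both function names are assumptions for illustration, not the study's actual format.

```python
def build_qg_input(context: str, answer: str, hl: str = "<hl>") -> str:
    """Build an answer-aware Question Generation input.

    Assumption: the first occurrence of the answer is wrapped in
    highlight tokens and a task prefix is prepended.
    """
    start = context.find(answer)
    if start == -1:
        raise ValueError("answer span not found in context")
    end = start + len(answer)
    return f"generate question: {context[:start]}{hl} {answer} {hl}{context[end:]}"


def build_ae_input(context: str) -> str:
    """Build an Answer Extraction input (hypothetical prefix)."""
    return f"extract answers: {context}"


example = build_qg_input("Madrid es la capital de España.", "Madrid")
print(example)
```

With this format the model sees which span the generated question should target, which is the core of an answer-aware setup.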
Main Methods
- Fine-tuning three multilingual Transformer models: mT5-base, mT0-base, and BLOOMZ-560M.
- Utilizing three datasets: a Spanish translation of SQuAD, the SQAC dataset, and their union (SQuAD + SQAC).
- Evaluating model performance using metrics such as BLEU-1 to BLEU-4, METEOR, ROUGE-L, CIDEr, SARI, GLEU, WER, and cosine similarity.
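To make two of the listed metrics concrete, here is a minimal pure-Python sketch of word error rate (WER) and the clipped n-gram precision that underlies BLEU. It assumes whitespace tokenization and a single reference, and it omits BLEU's brevity penalty and geometric averaging, so its numbers will not match an evaluation toolkit. The function names and the example sentences are illustrative, not taken from the study.

```python
from collections import Counter


def wer(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference length."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    prev = list(range(len(hyp) + 1))  # distances for the empty reference prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1] / len(ref)


def clipped_ngram_precision(reference: str, hypothesis: str, n: int = 1) -> float:
    """Share of hypothesis n-grams found in the reference, with clipped counts."""
    ref, hyp = reference.split(), hypothesis.split()
    ref_counts = Counter(tuple(ref[i:i + n]) for i in range(len(ref) - n + 1))
    hyp_counts = Counter(tuple(hyp[i:i + n]) for i in range(len(hyp) - n + 1))
    total = sum(hyp_counts.values())
    if total == 0:
        return 0.0
    matched = sum(min(count, ref_counts[gram]) for gram, count in hyp_counts.items())
    return matched / total


reference = "cuál es la capital de España"
generated = "cuál es capital de España"
print(wer(reference, generated))
print(clipped_ngram_precision(reference, generated, n=2))
```

Lower WER is better, while higher n-gram precision is better, so the two are read in opposite directions when comparing models.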
Main Results
- The mT5-base model, fine-tuned on the combined SQuAD + SQAC dataset, demonstrated the best performance for AE and QG tasks.
- Models trained solely on the SQAC dataset also yielded competitive results, suggesting that SQAC alone is an effective training resource.
- mT5-base outperformed comparable prior work on standard evaluation metrics such as BLEU, METEOR, and ROUGE-L.
Conclusions
- The mT5-base model, when trained with an answer-aware methodology on a combined Spanish dataset, is highly effective for AE and QG.
- The study provides a valuable benchmark for future research in Spanish NLP for automated test generation.
- Further exploration with newer models and datasets is recommended to advance the field.

