A study on formalizing the knowledge of data curation activities across different fields
View abstract on PubMed
Summary
This summary is machine-generated.This study proposes a universal ontology for data curation processes to enable interdisciplinary research data sharing. It addresses the lack of formalized granularity in data curation tasks, promoting data reusability across diverse scientific fields.
Area Of Science
- Data Science
- Information Science
- Scientific Research
Background
- Open science trends necessitate research data sharing.
- Data curation is crucial for data interpretability and reusability.
- Existing data curation practices are field-specific, hindering interdisciplinary sharing.
Purpose Of The Study
- To survey, analyze, and organize knowledge of data curation across research fields.
- To develop a universal ontology for describing data curation processes.
- To facilitate interdisciplinary data sharing and reuse.
Main Methods
- Collected and compared existing data curation vocabularies and procedures.
- Conducted interviews with data curators from various research fields.
- Developed a data curation process ontology using OWL.
Main Results
- Identified a lack of formalized granularity in data curation tasks and procedures.
- Proposed a universally applicable ontology for data curation processes.
- Demonstrated the ontology's validity, consistency, and ability to represent diverse curation activities.
Conclusions
- The proposed ontology provides a knowledge framework for interdisciplinary understanding of data curation.
- This framework supports the identification of functions for data curation support systems.
- Overcoming granularity gaps is essential for promoting interdisciplinary research data reuse.
Related Concept Videos
Some researchers gain access to large amounts of data without interacting with a single research participant. Instead, they use existing records to answer various research questions. This type of research approach is known as archival research. Archival research relies on looking at past records or data sets to look for interesting patterns or relationships. For example, a researcher might access the academic records of all individuals who enrolled in college within the past ten years and...
Source-oriented records, or SOR, are medical record-keeping organized by the data source. The SOR system was first developed in the mid-1900s to organize the growing patient data in hospitals and other healthcare facilities.
In an SOR, each discipline involved in patient care maintains a separate medical record section. This record-keeping method enables easy tracking of patient progress and ensures healthcare staff have access to up-to-date information.
Key Attributes include the following:
...
Charting by Exception, or CBE, is a method of documentation used in healthcare, particularly in nursing, that focuses on documenting only significant or abnormal findings rather than recording every detail. This approach aims to streamline the documentation process, improve efficiency, and ensure that healthcare providers can quickly identify deviations from normalcy in patient assessments.
In CBE, healthcare professionals establish predefined standards of practice that define what constitutes...
Data collection refers to a systematic way of obtaining, observing, measuring, and analyzing accurate information. Observational studies are one of the most widely used methods of data collection. It involves collecting data by observing the behavior and physical characteristics of a sample without making any modifications to the sample.
An astronomer viewing the motion and brightness of stars in the sky and recording the data is an example of observational data collection. A botanist recording...
Data validation is an essential part of a comprehensive assessment. Validation is confirming or verifying and opening the door to gathering more assessment data as it clarifies vague or unclear data. The process of checking and verifying the collected information is called data validation. The primary purpose of data validation is to ensure data is as free from error, bias, and misinterpretation as possible.
Nursing assessment guides are generally based on holistic models rather than medical...
The case management model is a multidisciplinary approach that involves healthcare professionals from diverse disciplines, such as physicians, nurses, therapists, social workers, and pharmacists, working collaboratively to address the various needs of patients. Each healthcare professional brings unique expertise and perspectives, contributing to a more comprehensive understanding of the patient's condition and tailoring treatment plans accordingly.
For example, a patient with a chronic...

