相关概念视频
Extraction: Advanced Methods
Language Development
The critical period for language acquisition suggests that the ability to acquire language is at its peak early in life. As people age, this proficiency decreases. Language development begins very...
Components of Language
Improving Translational Accuracy
Language and Cognition
Mass Analyzers: Overview
您也可能阅读
相关文章
通过共同作者、期刊和引用图与本文相关的文章。
Engineering CAR-Vδ2 T cells to boost persistence and anti-tumor function.
Demographic shifts, inter-group contact and environmental conditions drive language extinction and diversification.
Time Toxicity of Endocrine-based Oral CDK4/6 Inhibitor Therapies.
A curated global dataset of social contact between diverse language communities.
Enduring constraints on grammar revealed by Bayesian spatiophylogenetic analyses.
A solid base for scaling up: the structure of numeration systems.
Digital science communication for sustainability literacy: A scoping review.
ERGA-BGE reference genome of <i>Gambusia holbrooki</i>, a globally invasive freshwater fish.
Protocols for <i>in situ</i> continuous monitoring of water relations/potential in soil and leaf.
Generative AI in education: Process-aware pedagogy, assessment integrity, and institutional governance.
词汇库2:用于大规模词汇数据的预计算功能.
Frederic Blum1,2, Carlos Barrientos1, Johannes Englisch1
1Department of Linguistic and Cultural Evolution, Max-Planck-Institute for Evolutionary Anthropology, Leipzig, Saxony, 04103, Germany.
标准化词汇数据集 Lexibank 2 现在包括超过 3100 种语言和 150 万个词形. 本资源促进了历史语言学和类型学的自动化跨语言分析.
更多相关视频
03:14Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
Published on: December 6, 2024
08:17A Semantic Priming Event-related Potential ERP Task to Study Lexico-semantic and Visuo-semantic Processing in Autism Spectrum Disorder
Published on: April 12, 2018
科学领域:
- 计算语言学 计算语言学
- 历史语言学 历史语言学
- 语言类型学语言类型学
背景情况:
- 大规模词汇和语法数据集的标准化对于比较语言学至关重要.
- 现有的数据集在标准化方面面临挑战,阻碍了数据的扩展和重用.
- 莱克西银行一直是跨语言数据的关键资源,但需要更新.
研究的目的:
- 为了呈现莱克西银行数据集的更新和扩展版本.
- 加强跨语言词汇数据的标准化,以提高可用性.
- 通过预先计算的功能和标准化的格式来促进自动化语言分析.
主要方法:
- 使用计算机辅助的工作流程,系统地整理已发布的词汇列表数据.
- 遵守跨语言数据格式 (CLDF) 的数据标准化标准.
- 语义和语音特征的预计算用于自动化分析.
主要成果:
- 莱克西银行更新的数据集现在涵盖了3,100多种语言和150万个词形.
- 已经实施了标准化的引用,语义词汇和语音转录.
- 展示了数据库查询,以推断单词相似性,集词化和语义多样性.
结论:
- 莱克西银行2在创造全面,标准化和可访问的语言资源方面取得了重大进展.
- 数据集及其特征使历史语言学和语言类型学的大规模,自动化分析成为可能.
- 标准化格式促进了数据的重复使用和语言社区内的协作.
