Building an Annotated Corpus for Automatic Metadata Extraction from Multilingual Journal Article References

Academic / Journal Article
Data Science for Social Impact
Choi et al.

This article presents the construction of a high-quality annotated corpus to support automatic metadata extraction from multilingual journal article references. Accurate metadata extraction is essential for organizing and retrieving scientific literature across languages. The article discusses the challenges of multilingual metadata extraction and argues that comprehensive annotated corpora improve data processing workflows. With this corpus, researchers can handle multilingual reference data more effectively, which broadens access to global knowledge and supports better research outcomes.

