BioNLPOrium – A Unified, AI-ready Collection of Biomedical Corpora for Advancing Natural Language Processing Research
Presentation Time: 10:00 AM - 10:15 AM
Abstract Keywords: Data Sharing, Natural Language Processing, Knowledge Representation and Information Modeling
Primary Track: Applications
Motivated by the FAIR data principles, we are developing a unified, AI-ready collection of biomedical corpora for advancing biomedical NLP research. We aim to create a unified format for easy access, aggregation, interoperability, and reuse of datasets, reducing engineering efforts. Additionally, we developed a toolkit with several scripts for dataset format conversion and preprocessing. We envision our platform, BioNLPOrium, to be a valuable and continuously expanding platform for the biomedical NLP community.
Speaker(s):
Vipina K. Keloth, PhD
Yale University
Presentation Time: 10:00 AM - 10:15 AM
Abstract Keywords: Data Sharing, Natural Language Processing, Knowledge Representation and Information Modeling
Primary Track: Applications
Motivated by the FAIR data principles, we are developing a unified, AI-ready collection of biomedical corpora for advancing biomedical NLP research. We aim to create a unified format for easy access, aggregation, interoperability, and reuse of datasets, reducing engineering efforts. Additionally, we developed a toolkit with several scripts for dataset format conversion and preprocessing. We envision our platform, BioNLPOrium, to be a valuable and continuously expanding platform for the biomedical NLP community.
Speaker(s):
Vipina K. Keloth, PhD
Yale University
BioNLPOrium – A Unified, AI-ready Collection of Biomedical Corpora for Advancing Natural Language Processing Research
Category
Podium Abstract