Computational Linguistics in Bulgaria (CLIB 2024) took place on 9 – 10 September 2024 in Sofia, Bulgaria. The research team, represented by Madalina Chitez, Ana-Maria Bucur, Andreea Dinca & Roxana Rogobete, presented the paper entitled “Towards a Romanian phrasal academic lexicon”.
Our study proposes the first empirically based Romanian phrasal academic lexicon (ROPAL). The paper presents the methodology and data used for the automatic extraction of the ROPAL. We put forth a methodology for the extraction of the ROPAL adapted for the Romanian language, which uses the newly compiled Corpus of Expert Writing in Romanian and English (EXPRES) built at CODHUS. Our lexicon can be used to support academic writing teaching activities and NLP tasks focusing on Romanian.
ROPAL is freely available here: https://codhus.projects.uvt.ro/wp-content/uploads/ROPAL_lista-de-expresii-academice-in-limba-romana-v.1.0.pdf
The proceedings of CLIB 2024 can be accessed here: https://dcl.bas.bg/clib/wp-content/uploads/2024/09/CLIB2024_PROCEEDINGS_v1.0.pdf