[PDF][PDF] Annotation of Clinical Narratives in Bulgarian language.
BiomedicalNLP@ RANLP, 2017•lml.bas.bg
In this paper we describe annotation process of clinical texts with morphosyntactic and
semantic information. The corpus contains 1,300 discharge letters in Bulgarian language for
patients with Endocrinology and Metabolic disorders. The annotated corpus will be used as
a Gold standard for information extraction evaluation of test corpus of 6,200 discharge
letters. The annotation is performed within Clark system—an XML Based System for Corpora
Development. It provides mechanism for semi-automatic annotation. First a pipeline for …
semantic information. The corpus contains 1,300 discharge letters in Bulgarian language for
patients with Endocrinology and Metabolic disorders. The annotated corpus will be used as
a Gold standard for information extraction evaluation of test corpus of 6,200 discharge
letters. The annotation is performed within Clark system—an XML Based System for Corpora
Development. It provides mechanism for semi-automatic annotation. First a pipeline for …
Abstract
In this paper we describe annotation process of clinical texts with morphosyntactic and semantic information. The corpus contains 1,300 discharge letters in Bulgarian language for patients with Endocrinology and Metabolic disorders. The annotated corpus will be used as a Gold standard for information extraction evaluation of test corpus of 6,200 discharge letters. The annotation is performed within Clark system—an XML Based System for Corpora Development. It provides mechanism for semi-automatic annotation. First a pipeline for Bulgarian morphosyntactic annotation and a cascaded regular grammar for semantic annotation are run, then rules for cleaning of frequent errors are applied. At the end the obtained result is manually checked. Our goal is to adapt the morphosyntactic tagger to the domain of clinical narratives as well.
lml.bas.bg
顯示最佳搜尋結果。 查看所有結果