Statistical Methods for Annotation Analysis 9783031037535 (Paperback)

Category

Computational linguistics

Store

Wordery

Brand

Springer international publish

Statistical Methods for Annotation Analysis : Springer : 9783031037535 : 3031037537 : 13 Jan 2022

Labelling data is one of the most fundamental activities in science, and has underpinned practice, particularly in medicine, for decades, as well as research in corpus linguistics since at least the development of the Brown corpus. With the shift towards Machine Learning in Artificial Intelligence (AI), the creation of datasets to be used for training and evaluating AI systems, also known in AI as corpora, has become a central activity in the field as well. Early AI datasets were created on an ad-hoc basis to tackle specific problems. As larger and more reusable datasets were created, requiring greater investment, the need for a more systematic approach to dataset creation arose to ensure increased quality. A range of statistical methods were adopted, often but not exclusively from the medical sciences, to ensure that the labels used were not subjective, or to choose among different labe

54.99 GBP