Title:
EDLINE MeSH Indexing: Lessons Learned From Machine Learning and Future Directions.
Author(s):
Jimeno-Yepes A, Mork JG, Wilkowski B, Demner-Fushman D, Aronson AR.
Institution(s):
1) National Library of Medicine, Bethesda, MD 20894
2) Technical University of
Denmark
DTU Informatics
Richard Petersens Plads
Source:
IHI. Miami, FL. January 2012.
Abstract:
Due to the large yearly growth of MEDLINE, MeSH indexing
is becoming a more difficult task for a relatively
small group of highly qualified indexing staff at the US National
Library of Medicine (NLM). The Medical Text Indexer
(MTI) is a support tool for assisting indexers; this
tool relies on MetaMap and a k-NN approach called PubMed
Related Citations (PRC). Our motivation is to improve the
quality of MTI based on machine learning. Typical machine
learning approaches fit this indexing task into text categorization.
In this work, we have studied some Medical Subject
Headings (MeSH) recommended by MTI and analyzed the
issues when using standard machine learning algorithms. We
show that in some cases machine learning can improve the
annotations already recommended by MTI, that machine
learning based on low variance methods achieves better performance
and that each MeSH heading presents a different
behavior. In addition, there are several factors which make
this task difficult (e.g. limited access to the full-text of the
citations) which provide direction for future work.
Publication Type: CONFERENCE
More about this article:








