National Library of Medicine
Communications Engineering Branch/MSC 3824
Bldg. 38A, Room 10S1015A
8600 Rockville Pike
Bethesda, MD 20894 USA
Dr. In Cheol Kim is a Senior System Analyst at Lockheed Martin MSD, Inc. and has worked at the Lister Hill National Center for Biomedical Communications (LHNCBC), U.S. National Library of Medicine, Bethesda, Maryland, since 2004. He received a Ph.D. in Information Processing Engineering from Kyungpook National University, South Korea, in 2001. His previous experience includes two years as a postdoctoral researcher at Concordia University, Montreal, Canada, and more than five years as a senior system engineer in an industrial research lab. Dr. Kim’s research interests are Web-based document analysis and processing, pattern recognition and classification, text data mining, neural networks, and statistical learning methods.
WebMARS (Spin-off): WebMARS is a data mining system that automates the extraction of bibliographic data from HTML-formatted online biomedical journals on the Web for NLM’s MEDLINE® database. WebMARS consists of five modules: WebCollection for downloading Web documents, WebLabeling for zoning and labeling the HTML-formatted text, CleanUp for extracting and reformatting citation information, WebReconcile for verifying the processing results, and Upload for saving the citation records for expert indexing. I am responsible for designing and implementing the CleanUp module. I developed the algorithms and user interface for this module on a multi-threaded software architecture and implemented it in Microsoft Visual C++. In addition, I am involved in another research project on the automatic identification of “Comment-on,” a field in MEDLINE citations that lists the previously published articles commented on by a given article. The identification is based on the Support Vector Machine and the “PubMed search” function.