National Library of Medicine, HTTP://www.nlm.nih.gov Communications Engineering Branch Title Lister Hill National Center for Biomedical Communications, HTTP://www.lhncbc.nlm.nih.gov/
 

CEB Home
CEB Projects
Related Image Processing Work
Publications
Repositories
NHANES
Student Internships Site Index
Turning The Pages Online: http://archive.nlm.nih.gov/proj/ttp/intro.htm
Use MyMorph document conversion tool to make PDF files http://docmorph.nlm.nih.gov/docmorph/
Medical Article Records GROUNDTRUTH (MARG): http://marg.nlm.nih.gov/index2.asp
MD on Tap: http://mdot.nlm.nih.gov/proj/mdot/mdot.php
AnatQuest: http://anatquest.nlm.nih.gov/

Jong Woo Kim, Ph.D.

Jongwoo Kim National Library of Medicine
Communications Engineering Branch/ MS 55
Bldg. 38A, Room 10S1011G
8600 Rockville Pike
Bethesda, MD 20894 USA

(301) 435-3227 (voice)
(301) 402-0341 (fax)

Dr. Kim has 6 years of software engineering experience and 16 years experience working with a variety of computer systems. He has a strong research background in pattern recognition, image processing, computer/machine vision, fuzzy set theory, neural networks, probability and robust statistics combined with an extended mathematical and electronics background. Dr. Kim has programmed in C, C++, Visual C++, and SQL in Windows Me/2000 and UNIX environments.

Dr. Kim joined the National Library of Medicine in 1998. He is currently working for the Communications Engineering Branch at the Lister Hill Center. His job is involved in the Medical Article Record System Project to develop Artificial Intelligence Modules for automatic article labeling systems using MS Visual C++, C#, and MS Windows NT environment.

Dr. Kim received his B.S. and M.S. degrees in Electrical and Computer Engineering in 1989 and 1991 at the Kyungpook National University in Taegu, Korea. Also, Dr. Kim received his Ph.D. in Computer Engineering and Computer Science in 1997 at the University of Missouri in Columbia, Missouri.


Current Projects

WebMARS/WebMARS-SpinOff: The CEB has developed an automated system, called Web Based Medical Article Record System (WebMARS), to produce bibliographic records for its MEDLINEâ database from full text versions of HTML (PDF) format online journal articles. The WebMARS employs document image analysis and understanding techniques, and DOM technology to complement existing MARS. I am responsible for developing a labeling module callded Web Labeling Module. The module detects rubric, title, vernacular, author, corporate author, affiliation, abstract, pagination, grant number, e-mail, zip code, databank accession number, and support zones (grant number, databank accession number, and support zones for WebMARS-SpinOff) automatically from HTML-format journal articles using Fuzzy/Crisp rule-based algorithms and statistical information. Visual C++, ADO, and SQL are used to implement this module. I am also responsible for Web Updating module. The module extracts journal specific information of rubric, title, vernacular, author, corporate author, affiliation, abstract, pagination, grant number, e-mail, zip code, databank accession number, and support zones automatically from HTML-format journal articles using string matching algorithms. Visual C++, ADO, and SQL are used to implement these modules.

MARSII: The CEB has developed an automated system to produce bibliographic records for its MEDLINEŽ database from hard-copy medical journals. This system, named Medical Article Record System (MARS), employs document image analysis and understanding techniques and optical character recognition (OCR). The system is composed of eleven modules. I am in charge of developing and maintaining a labeling module callded ZoneCzar. The module labels title, author, affiliation, and abstract zones automatically from scanned journal articles, using rule-based algorithms. Visual C++, Rogue Wave, and SQL are used to implement this module.

MTA Project:The Bibliographic Services Division (BSD), a part of Library Operations at the NLM, asked the Communications Engineering Branch, NLM, to develop an automated system to produce bibliographic records for its database from several AIDS related conference journals. This project, called the Meeting Abstract (MTA), employs document image analysis and understanding techniques and optical character recognition (OCR). The system was currently composed of ten modules. I developed a labeling module for MTA. The module labeled title, author, affiliation, abstract, keyword, rubric, pagination, abstract number, grant number, e-mail, zip code, databank, and corporate zones automatically from scanned articles, using rule-based algorithms. Visual C++, Rogue Wave, and SQL were used for the development.




    Return to top of page

CEB Home | CEB Projects | Related Work | Publications | Repositories | NHANES | Site Index

U.S. National Library of Medicine, 8600 Rockville Pike, Bethesda, MD 20894
National Institutes of Health | U.S. Dept. of Health and Human Services
Copyright information | Privacy policy | NLM Accessibility
USA.gov | Need a plug-in? | RSS

URL: http://archive.nlm.nih.gov/staff/kim.php
Last updated February 11, 2008

Send questions or comments about this site to