Item type |
学術雑誌論文 / Journal Article(1) |
公開日 |
2009-02-06 |
タイトル |
|
|
タイトル |
Document recognition and XML generation of tabular form discharge summaries for analogous case search system |
|
言語 |
en |
言語 |
|
|
言語 |
eng |
キーワード |
|
|
主題Scheme |
Other |
|
主題 |
document structure recognition |
キーワード |
|
|
主題Scheme |
Other |
|
主題 |
analogous case search |
キーワード |
|
|
主題Scheme |
Other |
|
主題 |
tabular form documents |
キーワード |
|
|
主題Scheme |
Other |
|
主題 |
patient discharges |
キーワード |
|
|
主題Scheme |
Other |
|
主題 |
computer-assisted image processing |
資源タイプ |
|
|
資源タイプ識別子 |
http://purl.org/coar/resource_type/c_6501 |
|
資源タイプ |
journal article |
著者 |
Kawanaka, H.
Sumida, T.
Yamamoto, K.
Shinogi, T.
Tsuruoka, S.
|
抄録 |
|
|
内容記述タイプ |
Abstract |
|
内容記述 |
OBJECTIVES: This paper discusses and develops a document image recognition, keyword extraction and automatic XML generation system to search analogous cases from paper-based documents. In this paper, we propose the document structure recognition method and automatic XML generation method for the tabular form discharge summary documents. This paper also develops the prototype system using the proposed method. Evaluation experiments using actual documents are done to discuss the effectiveness of the developed system. METHODS: The developed system consists of the following methods. Paper-based summary documents are scanned by a scanner using 300 dpi first. Noise and tilt of the image are reduced by pre-processing, and the table structures are identified. Characters in the table are recognized and converted to text data by the OCR engine. XML documents are automatically generated using obtained results. RESULTS: In this paper, patient discharge summary documents archived at Mie University Hospital were used. The results show that XML documents can be automatically generated when standard tabular form documents are input into the developed system. In this experiment, it takes about 20 seconds to generate an XML document using the general personal computer. This paper also compares the developed system with a commercial product to discuss the effectiveness of the present system. Experimental results also show that the accuracy of table structure recognition is high and it can be used in a practical situation. CONCLUSIONS: This paper showed the effectiveness of the proposed method to recognize the tabular form document images to generate XML documents. |
書誌情報 |
Methods of information in medicine : journal of methodology in medical research information and documentation
巻 46,
号 6,
p. 700-708,
発行日 2007-01-01
|
ISSN |
|
|
収録物識別子タイプ |
PISSN |
|
収録物識別子 |
0026-1270 |
書誌レコードID |
|
|
収録物識別子タイプ |
NCID |
|
収録物識別子 |
AA00737631 |
PubMed番号 |
|
|
関連タイプ |
isIdenticalTo |
|
|
識別子タイプ |
PMID |
|
|
関連識別子 |
18066422 |
フォーマット |
|
|
内容記述タイプ |
Other |
|
内容記述 |
application/pdf |
著者版フラグ |
|
|
出版タイプ |
VoR |
|
出版タイプResource |
http://purl.org/coar/version/c_970fb48d4fbd8a85 |
日本十進分類法 |
|
|
主題Scheme |
NDC |
|
主題 |
490 |
出版者 |
|
|
出版者 |
Schattauer |
資源タイプ(三重大) |
|
|
値 |
Journal Article / 学術雑誌論文 |