Item type |
紀要論文 / Departmental Bulletin Paper(1) |
公開日 |
2007-07-02 |
タイトル |
|
|
タイトル |
Information Extraction from Electronic Mail |
|
言語 |
en |
言語 |
|
|
言語 |
eng |
キーワード |
|
|
主題Scheme |
Other |
|
主題 |
information extraction |
キーワード |
|
|
主題Scheme |
Other |
|
主題 |
tabular forms |
キーワード |
|
|
主題Scheme |
Other |
|
主題 |
itemizations |
キーワード |
|
|
主題Scheme |
Other |
|
主題 |
document structure |
キーワード |
|
|
主題Scheme |
Other |
|
主題 |
2-dimensional arrangement |
資源タイプ |
|
|
資源タイプ識別子 |
http://purl.org/coar/resource_type/c_6501 |
|
資源タイプ |
departmental bulletin paper |
著者 |
河合, 敦夫
塚本, 雅之
椎野, 努
|
抄録 |
|
|
内容記述タイプ |
Abstract |
|
内容記述 |
The former researchers on information extraction are only from documents with only sentences. But many documents include non-sentence areas, i.e., tabular areas such as itemizations, tabular forms and marsharing of nouns. This paper describes an information extraction method from tabular areas. The new alforithm consists of three steps. First, tabular areas are recognized by using a 2-dimensional arrangement of letters, blank spaces, parts of speech, and semantic features of nouns. Second, the tabular areas are subdivided into blocks. And last, information extraction is done by matching the block against the frame. Words themselves and semantic feature of words are the clues clues to fill in the slots in the frame. Desk simulation for electronic mail of computer sale information shows a precision rate of 83%, 92% and 78% for each step. |
書誌情報 |
Research reports of the Faculty of Engineering, Mie University
巻 19,
p. 191-198,
発行日 1994-12-21
|
ISSN |
|
|
収録物識別子タイプ |
PISSN |
|
収録物識別子 |
0385-6208 |
書誌レコードID |
|
|
収録物識別子タイプ |
NCID |
|
収録物識別子 |
AA00816341 |
フォーマット |
|
|
内容記述タイプ |
Other |
|
内容記述 |
application/pdf |
著者版フラグ |
|
|
出版タイプ |
VoR |
|
出版タイプResource |
http://purl.org/coar/version/c_970fb48d4fbd8a85 |
その他のタイトル |
|
|
言語 |
ja |
|
値 |
電子メール文書からの関係情報の自動抽出 |
出版者 |
|
|
出版者 |
Faculty of Engineering, Mie University |
資源タイプ(三重大) |
|
|
値 |
Departmental Bulletin Paper / 紀要論文 |