TY - GEN
T1 - A proposal of Ontology-based health care information extraction system
T2 - 2007 IEEE International Conference on Research, Innovation and Vision for the Future, RIVF 2007
AU - Dung, Tran Quoc
AU - Kameyama, Wataru
PY - 2007/6/26
Y1 - 2007/6/26
N2 - This paper presents an Ontology-based health care information extraction system - VnHIES. In the system, we develop and use two effective algorithms called "Semantic Elements Extracting Algorithm" and "New Semantic Elements Learning Algorithm" for health care semantic words extraction and ontology enhancement. The former algorithm will extract Concepts (Cs), Descriptions of concepts (Ds), Pairs of Concept and Description(C-D) and Names of diseases (Ns) in health care information domain from web pages. Those extracted semantic elements are used by latter algorithm that will render suggestions in which might contain new semantic elements for later use by domain users to enrich ontology. After extracting semantic elements, a "Document Weighting Algorithm" is applied to get summary information of document with respect to all extracted semantic words and then to be stored in knowledge base which contains ontology and database to be used later in other applications. Our experiment results show that the approach is very optimistic with high accuracy in semantic extracting and efficiency in ontology upgrade. VnHIES can be used in many health care information management systems such as medical document classification, health care information retrieval system. VnHIES is implemented in Vietnamese language.
AB - This paper presents an Ontology-based health care information extraction system - VnHIES. In the system, we develop and use two effective algorithms called "Semantic Elements Extracting Algorithm" and "New Semantic Elements Learning Algorithm" for health care semantic words extraction and ontology enhancement. The former algorithm will extract Concepts (Cs), Descriptions of concepts (Ds), Pairs of Concept and Description(C-D) and Names of diseases (Ns) in health care information domain from web pages. Those extracted semantic elements are used by latter algorithm that will render suggestions in which might contain new semantic elements for later use by domain users to enrich ontology. After extracting semantic elements, a "Document Weighting Algorithm" is applied to get summary information of document with respect to all extracted semantic words and then to be stored in knowledge base which contains ontology and database to be used later in other applications. Our experiment results show that the approach is very optimistic with high accuracy in semantic extracting and efficiency in ontology upgrade. VnHIES can be used in many health care information management systems such as medical document classification, health care information retrieval system. VnHIES is implemented in Vietnamese language.
KW - Information extraction
KW - Ontology
KW - Ontology enhancement
KW - Semantic web
KW - Text mining
UR - http://www.scopus.com/inward/record.url?scp=34250768878&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=34250768878&partnerID=8YFLogxK
U2 - 10.1109/RIVF.2007.369128
DO - 10.1109/RIVF.2007.369128
M3 - Conference contribution
AN - SCOPUS:34250768878
SN - 1424406943
SN - 9781424406944
T3 - 2007 IEEE International Conference on Research, Innovation and Vision for the Future, RIVF 2007
SP - 1
EP - 7
BT - 2007 IEEE International Conference on Research, Innovation and Vision for the Future, RIVF 2007
Y2 - 5 March 2007 through 9 March 2007
ER -