Title | Document classification utilising ontologies and relations between documents |
Publication Type | Conference Paper |
Year of Publication | 2010 |
Authors | Nyberg K, Raiko T, Tiinanen T, Hyvönen E |
Conference Name | Eighth Workshop on Mining and Learning with Graphs |
Publisher | ACM |
ISBN Number | 978-1-4503-0214-2 |
Abstract | Two major types of relational information can be utilized in automatic document classification as background information: relations between terms, such as ontologies, and relations between documents, such as web links or citations in articles. We introduce a model where a traditional bag-of-words type classifier is gradually extended to utilize both of these information types. The experiments with data from the Finnish National Archive show that classification accuracy improves from 70% to 74% when the General Finnish Ontology YSO is used as background information, without using relations between documents. |
URL | http://www.www.yso.fi/publications/2010/nyberg-et-al-mlg-2010.pdf |
DOI | 10.1145/1830252.1830264 |
- Log in or register to post comments
- Google Scholar
- DOI