Document classification utilising ontologies and relations between documents

Title: Document classification utilising ontologies and relations between documents
Publication Type: Conference Paper
Year of Publication: 2010
Authors: Nyberg K, Raiko T, Tiinanen T, Hyvönen E
Conference Name: Eighth Workshop on Mining and Learning with Graphs
Publisher: ACM
ISBN Number: 978-1-4503-0214-2
Abstract

Two major types of relational information can be utilized in automatic document classification as background information: relations between terms, such as ontologies, and relations between documents, such as web links or citations in articles. We introduce a model where a traditional bag-of-words type classifier is gradually extended to utilize both of these information types. The experiments with data from the Finnish National Archive show that classification accuracy improves from 70% to 74% when the General Finnish Ontology YSO is used as background information, without using relations between documents.

URL: http://www.www.yso.fi/publications/2010/nyberg-et-al-mlg-2010.pdf
DOI: 10.1145/1830252.1830264