Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Learning to Classify Structured Data by Graph Propositionalization
Stockholm University, Faculty of Social Sciences, Department of Computer and Systems Sciences.
Stockholm University, Faculty of Social Sciences, Department of Computer and Systems Sciences.
2006 (English)In: Proceedings of the Second IASTED International Conference on Computational Intelligence, 2006Conference paper, Published paper (Refereed)
Abstract [en]

Existing methods for learning from structured data are limited with respect to handling large or isolated substructures and also impose constraints on search depth and induced structure length. An approach to learning from structured data using a graph based propositionalization method, called finger printing, is introduced that addresses the limitations of current methods. The method is implemented in a system called DIFFER, which is demonstrated to compare favorable to existing state-of-art methods on some benchmark data sets. It is shown that further improvements can be obtained by combining the features generated by finger printing with features generated by previous methods.

Place, publisher, year, edition, pages
2006.
Keyword [en]
Machine Learning, Graph, Classification, Structured data
National Category
Computer Science
Research subject
Computer and Systems Sciences
Identifiers
URN: urn:nbn:se:su:diva-38408OAI: oai:DiVA.org:su-38408DiVA: diva2:310286
Conference
The IASTED International Conference on Computational Intelligence, November 20 – 22, 2006, San Francisco, USA
Available from: 2010-04-13 Created: 2010-04-13 Last updated: 2014-02-26Bibliographically approved
In thesis
1. Learning predictive models from graph data using pattern mining
Open this publication in new window or tab >>Learning predictive models from graph data using pattern mining
2014 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Learning from graphs has become a popular research area due to the ubiquity of graph data representing web pages, molecules, social networks, protein interaction networks etc. However, standard graph learning approaches are often challenged by the computational cost involved in the learning process, due to the richness of the representation. Attempts made to improve their efficiency are often associated with the risk of degrading the performance of the predictive models, creating tradeoffs between the efficiency and effectiveness of the learning. Such a situation is analogous to an optimization problem with two objectives, efficiency and effectiveness, where improving one objective without the other objective being worse off is a better solution, called a Pareto improvement. In this thesis, it is investigated how to improve the efficiency and effectiveness of learning from graph data using pattern mining methods. Two objectives are set where one concerns how to improve the efficiency of pattern mining without reducing the predictive performance of the learning models, and the other objective concerns how to improve predictive performance without increasing the complexity of pattern mining. The employed research method mainly follows a design science approach, including the development and evaluation of artifacts. The contributions of this thesis include a data representation language that can be characterized as a form in between sequences and itemsets, where the graph information is embedded within items. Several studies, each of which look for Pareto improvements in efficiency and effectiveness are conducted using sets of small graphs. Summarizing the findings, some of the proposed methods, namely maximal frequent itemset mining and constraint based itemset mining, result in a dramatically increased efficiency of learning, without decreasing the predictive performance of the resulting models. It is also shown that additional background knowledge can be used to enhance the performance of the predictive models, without increasing the complexity of the graphs.

Place, publisher, year, edition, pages
Stockholm: Department of Computer and Systems Sciences, Stockholm University, 2014. 118 p.
Series
Report Series / Department of Computer & Systems Sciences, ISSN 1101-8526 ; 14-003
Keyword
Machine Learning, Graph Data, Pattern Mining, Classification, Regression, Predictive Models
National Category
Computer Science
Research subject
Computer and Systems Sciences
Identifiers
urn:nbn:se:su:diva-100713 (URN)978-91-7447-837-2 (ISBN)
Public defence
2014-03-25, room B, Forum, Isafjordsgatan 39, Kista, 13:00 (English)
Opponent
Supervisors
Available from: 2014-03-03 Created: 2014-02-11 Last updated: 2014-03-04Bibliographically approved

Open Access in DiVA

fulltext(325 kB)462 downloads
File information
File name FULLTEXT01.pdfFile size 325 kBChecksum SHA-512
c165b7ac98be1b5ae5afe7b7754961bb6fcf26965ab62ab4339ce82fca0fff85a2b64bce23667dc9b3b3ada4edc5e98606bb30e21afcdd72cb5afa65d91def1f
Type fulltextMimetype application/pdf

Other links

http://www.actapress.com/Abstract.aspx?paperId=29106

Search in DiVA

By author/editor
Karunaratne, ThashmeeBoström, Henrik
By organisation
Department of Computer and Systems Sciences
Computer Science

Search outside of DiVA

GoogleGoogle Scholar
Total: 462 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 104 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf