Graduate Seminar (2009 Fall)
Title:
Graph-based Data Mining

Nikhil Ketkar, Ph.D
Postdoctoral Research Associate
Department of Computer Science
University of North Carolina at Charlotte
August 28 at 3:00pm
106 Woodward
Abstract:
The fields of machine learning and data mining are currently in the
midst of what has been called a structural revolution, and the new
field of graph mining has emerged. The focus of graph mining is the
development of predictive and descriptive models from structured data
represented as graphs. The distinguishing aspect of these models is
that they incorporate both the entities as well as the relationships
which are fundamental to the structural nature of the data. The
motivation behind considering relationships is that they represent a
rich source of information and this can lead to significant
improvements in the predictive accuracy of the models. While
incorporating structure can lead to potential improvement, this
improvement comes at a high computational cost and a key challenge is
scalability.
This talk will focus on the tasks of frequent subgraph mining, graph
classification, and graph regression. Given a set of graph
transactions, the task of frequent subgraph mining involves
enumerating all subgraphs present in at least a given number of
transactions. The graph classification problem is to induce a model
from a set of graph transactions and associated class values which
predicts the class value of unseen graphs. The graph regression
problem can be seen as the general case of the graph classification
problem where the value to be predicted is a real number. The problems
of frequent subgraph mining, graph classification, and graph
regression have applications in a variety of application domains where
the underlying data cannot be represented as a simple table of features.
The key results presented in this talk include novel sampling
approaches that allow us to mine exceedingly large graph transaction
databases and pruning mechanisms that significantly improve feature
extraction for graph classification and regression.
Bio:
Nikhil Ketkar completed his Ph.D. in Computer Science at Washington
State University in May 2009. He is currently a postdoctoral research
associate in the Department of Computer Science at the University of
North Carolina at Charlotte. His research interests are in machine
learning, data mining, and graph theory. The unifying theme of his
work is investigating how structural information can be leveraged to
develop better predictive and descriptive models and to develop
algorithms and software infrastructure that facilitates the use of
this methodology in various real world domains.
Back
|

|
|
Copyright © 2003
- 2008 College of Computing and Informatics
|
|