SEMINAR

 

DEPARTMENT OF COMPUTER ENGINEERING

 

ABSTRACT

 

 

Ilhan Uysal

M.S. in Computer Engineering

Supervisor: Assoc. Prof.Dr. H.Altay Güvenir

January 24, 2000

 

 

INSTANCE-BASED REGRESSION BY PARTITIONING FEATURE PROJECTIONS

 

A new instance-based learning method is presented forregression problems with high-dimensional data. As an instance-based approach, the conventional K-Nearest Neighbor (KNN) method has been applied to both classification and regression problems. Although KNN performs well for classification tasks, it does not perform similarly for regression problems. We have developed a new instance-based method, called Regression by Partitioning Feature Projections (RPFP), to fill the gap in the literature for a lazy method that achieves higher accuracies for regression problems. It also presents some additional properties and even better performance when compared to famous eager approaches of machine learning and statistics literature such as MARS, rule-based regression, and regression tree induction systems. The most important property of RPFP is that it performs much better than all other eager or lazy approaches on many domains that have missing values. If we consider databases today, where there are generally large number of attributes, such sparse domains are very frequent. RPFP handles such missing values in a very natural way, since it does not require all the attribute values to be present in the data set.

Keywords: Machine learning, instance-based learning, regression.

 

 

 

The Seminar will be on January 24, 2000, at 15:30

in EA502