Interactive prior elicitation of feature similarities for small sample size prediction

Homayun Afrabandpey, Tomi Peltola, Samuel Kaski

July 2017

Source Document

Abstract

Regression under the “small n$, large p” condition, of small sample size n and large number of features p in the learning data set, is a recurring setting in which learning from data is difficult. With prior knowledge about relationships of the features, p can effectively be reduced, but explicating such prior knowledge is difficult for experts. In this paper we introduce a new method for eliciting expert prior knowledge about the similarity of the roles of features in the prediction task. The key idea is to use an interactive multidimensional-scaling (MDS) type scatterplot display of the features to elicit the similarity relationships, and then use the elicited relationships in the prior distribution of prediction parameters.

Type

Conference paper

Publication

International Conference on User Modeling, Adaptation and Personalization (UMAP’17)

Source Themes

Interactive prior elicitation of feature similarities for small sample size prediction

Abstract

Homayun Afrabandpey

Senior Scientist