Book Details
Dataset Shift in Machine Learning
Description
An overview of recent efforts in the machine learning community to deal with dataset and covariate shift, which occurs when test and training inputs and outputs have different distributions.
Dataset shift is a common problem in predictive modeling that occurs when the joint distribution of inputs and outputs differs between training and test stages. Covariate shift, a particular case of dataset shift, occurs when only the input distribution changes. Dataset shift is present in most practical applications, for reasons ranging from the bias introduced by experimental design to the irreproducibility of the testing conditions at training time. (An example is -email spam filtering, which may fail to recognize spam that differs in form from the spam the automatic filter has been built on.) Despite this, and despite the attention given to the apparently similar problems of semi-supervised learning and active learning, dataset shift has received relatively little attention in the machine learning community until recently. This volume offers an overview of current efforts to deal with dataset and covariate shift. The chapters offer a mathematical and philosophical introduction to the problem, place dataset shift in relationship to transfer learning, transduction, local learning, active learning, and semi-supervised learning, provide theoretical views of dataset and covariate shift (including decision theoretic and Bayesian perspectives), and present algorithms for covariate shift.
About Author(s)
Joaquin Quiñonero-Candela is a Researcher in the Online Services and Advertising Group at Microsoft Research Cambridge, U.K.Masashi Sugiyama is Director of the RIKEN Center for Advanced Intelligence Project and Professor of Computer Science at the University of Tokyo.Anton Schwaighofer is an Applied Researcher in the Online Services and Advertising Group at Microsoft Research, Cambridge, U.K.Neil D. Lawrence is Senior Lecturer and Member of the Machine Learning and Optimisation Research Group in the School of Computer Science at the University of Manchester.
Content
Preface
I Introduction to Dataset Shift
1 When Training and Test Sets Are Different: Characterizing Learning Transfer
2 Projection and Projectability
II Theoretical View on Dataset and Covariate Shift
3 Binary Classification under Sample Selection
4 On Bayesian Transduction: Implications for the Covariate Shift Problem
5 On the Training / Test Distributions Gap: A Data Representation Learning Framework
III Algorithms for Covariate Shift
6 Geometry of Covariate Shift with Applications to Active Learning
7 A Conditional Expectation Approach to Model Selection and Active Learning under Covariate Shift
8 Covariate Shift by Kernel Mean Matching
9 Discriminative Learning under Covariate Shift with a Single Optimization Problem
Related Books
ORGANIZATIONAL COGNITION AND LEARNING : BUILDING SYSTEMS FOR THE LEARNING ORGANIZATION
By
BLENDED LEARNING IN PRACTICE: A GUIDE FOR PRACTITIONERS AND RESEARCHERS
By MADDEN, MARGULIEUX, KADEL, GOEL
SOCIAL LEARNING : AN INTRO TO MECHANISMS, METHODS AND MODELS
By HOPPITT, WILLIAM
Books by the Same Author
No other books by the same author. Popular Picks on the Month