RNA interference (RNAi) high-content screening (HCS) enables massively parallel gene
silencing and is increasingly being used to reveal novel connections between genes
and disease-relevant phenotypes. The application of genome-scale RNAi relies on the
development of high-quality HCS assays. The Z' factor statistic provides a way to
evaluate whether or not screening run conditions (reagents, protocols, instrumentation,
kinetics and other conditions not directly related to the test compounds) are optimized.
The Z' factor, introduced by Zhang et al. [1], is a dimensionless value that represents both
the variability and the dynamic range between two sets of sample control data. This
paper describes a new extension of the Z' factor, which integrates multiple readouts
for screening quality assessment. The multivariate Z' factor presented to date is based
on linear projection, which may not be suitable for data with nonlinear structure.
This paper proposes an algorithm that extends the existing one to handle nonlinear
data by using a kernel function. Using kernel methods for projections, multiple
readouts are condensed to a single parameter, based on which the screening run quality
is monitored.
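
To make the idea concrete, the following is a minimal sketch and not the algorithm of this paper. It assumes the standard univariate definition Z' = 1 - 3(sd_pos + sd_neg) / |mean_pos - mean_neg| applied to one-dimensional projected scores, a Fisher linear discriminant as the linear projection, and a kernel Fisher discriminant with an RBF kernel as the nonlinear projection. The function names, the parameters gamma and reg, and the synthetic two-readout control data are all hypothetical choices made for illustration.

```python
import numpy as np

def z_prime(pos, neg):
    """Univariate Z' factor: 1 - 3*(sd_pos + sd_neg) / |mean_pos - mean_neg|."""
    pos, neg = np.asarray(pos, float), np.asarray(neg, float)
    spread = pos.std(ddof=1) + neg.std(ddof=1)
    return 1.0 - 3.0 * spread / abs(pos.mean() - neg.mean())

def linear_projection(pos, neg, reg=1e-6):
    """Project both control groups onto the Fisher linear discriminant direction."""
    sw = np.cov(pos, rowvar=False) + np.cov(neg, rowvar=False)
    w = np.linalg.solve(sw + reg * np.eye(sw.shape[0]), pos.mean(0) - neg.mean(0))
    return pos @ w, neg @ w

def rbf_kernel(a, b, gamma):
    """RBF kernel matrix exp(-gamma * ||a_i - b_j||^2)."""
    sq = (a**2).sum(1)[:, None] + (b**2).sum(1)[None, :] - 2.0 * a @ b.T
    return np.exp(-gamma * np.maximum(sq, 0.0))

def kernel_projection(pos, neg, gamma=0.5, reg=1e-3):
    """Kernel Fisher discriminant scores (assumed stand-in for the kernel projection)."""
    x = np.vstack([pos, neg])
    k = rbf_kernel(x, x, gamma)
    n_pos = len(pos)
    blocks = [k[:, :n_pos], k[:, n_pos:]]           # kernel columns per control group
    m_pos, m_neg = blocks[0].mean(1), blocks[1].mean(1)
    n_mat = reg * np.eye(len(x))                    # regularised within-class scatter
    for kc in blocks:
        centered = kc - kc.mean(1, keepdims=True)
        n_mat += centered @ centered.T
    alpha = np.linalg.solve(n_mat, m_pos - m_neg)   # expansion coefficients
    scores = k @ alpha                              # one condensed value per well
    return scores[:n_pos], scores[n_pos:]

# Synthetic controls with nonlinear structure: positives cluster at the origin,
# negatives lie on a ring around them (illustrative data, not from the paper).
rng = np.random.default_rng(0)
pos = rng.normal(0.0, 0.1, size=(100, 2))
theta = rng.uniform(0.0, 2.0 * np.pi, size=100)
radius = 3.0 + rng.normal(0.0, 0.1, size=100)
neg = np.column_stack([radius * np.cos(theta), radius * np.sin(theta)])

z_lin = z_prime(*linear_projection(pos, neg))
z_ker = z_prime(*kernel_projection(pos, neg))
print(f"linear Z' = {z_lin:.3f}, kernel Z' = {z_ker:.3f}")
```

On data of this shape the two control groups have nearly the same mean in every linear direction, so a linear projection cannot separate them, whereas the kernel projection condenses the two readouts into a single score on which the groups differ. Note that the scores here are computed on the same wells used to fit the projection; a practical quality assessment would likely need held-out controls or cross-validation to avoid an optimistic Z'.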