Statistical methods for analysis of HTS RNAi screening

Comparison of RNAi and small-molecule screens

Library design

Transfection

Kinetics and mechanism of RNAi action

Number and quality of controls

RNAi screening analysis workflow

Step 1: data triage

Step 2: normalization

Fraction or percentage of control

common approach

divide each sample value by the mean of the controls (either positive or negative)

requires a large number of controls to provide adequate estimation of their mean

sensitive to outliers
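The calculation can be sketched in a few lines of Python. This is only an illustration: the function name and the well values are made up for the example, not taken from any screening package.

```python
from statistics import mean

def percent_of_control(samples, controls):
    """Express each sample value as a percentage of the mean control value."""
    control_mean = mean(controls)
    if control_mean == 0:
        raise ValueError("control mean is zero; percent of control is undefined")
    return [100.0 * value / control_mean for value in samples]

# Hypothetical raw readouts from one plate.
negative_controls = [100.0, 110.0, 90.0, 100.0]
sample_wells = [50.0, 100.0, 150.0]

poc = percent_of_control(sample_wells, negative_controls)
print(poc)
```

Passing the plate's own sample wells as the second argument gives the percent-of-samples variant. A single extreme control well shifts the mean and therefore every normalized value, which is the outlier sensitivity noted above.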

Fraction or percent of samples

              common approach

the mean of the samples on the plate may be substituted for the mean of the controls

reduces the need for large numbers of controls

exacerbates the issue of non-robustness

cannot incorporate information on the degree of variation in the sample data

z score and robust z score

              z score: the number of standard deviations from the mean

              frequently used in RNAi screening

incorporate information on the degree of variation in the sample data

              depends on the use of samples as de facto negative controls

z score is sensitive to outliers; the robust z score substitutes the outlier-insensitive median and median absolute deviation (MAD) for the mean and standard deviation in the z-score calculation
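A minimal sketch of both scores, assuming the samples themselves serve as the reference population. The factor 1.4826, which makes the MAD comparable to a standard deviation for normally distributed data, is a common convention and an assumption here; the notes above do not specify it.

```python
from statistics import mean, median, stdev

def z_scores(values):
    """Classical z score: distance from the mean in standard deviations."""
    mu = mean(values)
    sigma = stdev(values)  # sample standard deviation
    return [(v - mu) / sigma for v in values]

def robust_z_scores(values, scale=1.4826):
    """Robust z score: median and scaled MAD replace mean and standard deviation."""
    med = median(values)
    mad = median([abs(v - med) for v in values])
    if mad == 0:
        raise ValueError("MAD is zero; robust z score is undefined")
    return [(v - med) / (scale * mad) for v in values]

# Hypothetical plate values with one strong outlier in the last well.
values = [10.0, 11.0, 9.0, 10.0, 12.0, 8.0, 10.0, 50.0]
plain = z_scores(values)
robust = robust_z_scores(values)
print(round(plain[-1], 2), round(robust[-1], 2))
```

In this toy data the outlier inflates the standard deviation, so its plain z score is far smaller than its robust z score.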

B score

              for within-plate systematic effects

applied to remove row, column or well effects by iterative application of the Tukey median polish algorithm

relatively robust to outliers

essentially uses the samples as negative controls

available in the cellHTS2 package in the Bioconductor bioinformatics software
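To make the idea concrete, here is a rough pure-Python sketch of a two-way median polish followed by scaling of the residuals by the plate MAD. It is not the cellHTS2 implementation; the function names, the iteration limit, the 1.4826 scale factor and the toy plate are all assumptions made for illustration.

```python
from statistics import median

def median_polish(plate, max_iter=10, tol=1e-6):
    """Return the residuals left after repeatedly removing row and column medians."""
    resid = [row[:] for row in plate]
    n_rows, n_cols = len(resid), len(resid[0])
    for _ in range(max_iter):
        change = 0.0
        for i in range(n_rows):  # sweep out row medians
            m = median(resid[i])
            resid[i] = [v - m for v in resid[i]]
            change += abs(m)
        for j in range(n_cols):  # sweep out column medians
            m = median([resid[i][j] for i in range(n_rows)])
            for i in range(n_rows):
                resid[i][j] -= m
            change += abs(m)
        if change < tol:  # nothing left to remove
            break
    return resid

def b_scores(plate, scale=1.4826):
    """Residuals from the median polish divided by the scaled MAD of the plate residuals."""
    resid = median_polish(plate)
    flat = [v for row in resid for v in row]
    med = median(flat)
    mad = median([abs(v - med) for v in flat])
    if mad == 0:
        raise ValueError("MAD of residuals is zero; B score is undefined")
    return [[v / (scale * mad) for v in row] for row in resid]

# Hypothetical 4x4 plate: additive row and column drift, small noise, one hit at [1][2].
row_effect = [0.0, 3.0, 6.0, 9.0]
col_effect = [0.0, 2.0, 4.0, 6.0]
noise = [[0.3, -1.2, 0.7, -0.4],
         [-0.8, 0.5, 1.1, -0.2],
         [0.9, -0.6, -1.0, 0.4],
         [-0.1, 1.3, -0.5, 0.6]]
plate = [[100.0 + row_effect[i] + col_effect[j] + noise[i][j] for j in range(4)]
         for i in range(4)]
plate[1][2] -= 40.0

scores = b_scores(plate)
```

Because the row and column medians are subtracted before scoring, the positional drift does not mask the hit well.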

Step 3: calculation of quality metrics

Z’ or Z factor

              The most common quality metrics for RNAi and small-molecule screens

              Z’ factor is often used during assay optimization (based on controls)

              Z factor may be used to assess performance of the screen on actual samples
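As a sketch, the Z’ factor is usually written as 1 - 3(SD_pos + SD_neg) / |mean_pos - mean_neg|. The control values below are hypothetical.

```python
from statistics import mean, stdev

def z_prime_factor(positive_controls, negative_controls):
    """Z' = 1 - 3 * (sd_pos + sd_neg) / |mean_pos - mean_neg|."""
    separation = abs(mean(positive_controls) - mean(negative_controls))
    if separation == 0:
        raise ValueError("control means are identical; Z' is undefined")
    spread = stdev(positive_controls) + stdev(negative_controls)
    return 1.0 - 3.0 * spread / separation

# Hypothetical control wells from an assay-optimization plate.
positive = [10.0, 12.0, 8.0, 10.0]
negative = [100.0, 104.0, 96.0, 100.0]

zp = z_prime_factor(positive, negative)
print(round(zp, 3))
```

Substituting the sample wells for the negative controls in the same formula gives the Z factor.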

Strictly standardized mean difference (SSMD)

              developed for use with RNAi screening

              more rigorous than Z factor

the ratio between the difference of the means and the standard deviation of the difference between two populations

In one reported case, SSMD accurately captured a clear difference between the high and low populations that was not detected by the Z’ factor
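A minimal sketch of the ratio described above, under the assumption that the two populations are independent, so that the variance of their difference is the sum of the two variances. The function name and data are illustrative only.

```python
from math import sqrt
from statistics import mean, variance

def ssmd(group_a, group_b):
    """Difference of means divided by the standard deviation of the difference.

    Assumes independent groups: var(a - b) = var(a) + var(b).
    """
    return (mean(group_a) - mean(group_b)) / sqrt(variance(group_a) + variance(group_b))

# Hypothetical positive and negative control wells.
positive = [10.0, 12.0, 8.0, 10.0]
negative = [100.0, 104.0, 96.0, 100.0]

ssmd_value = ssmd(positive, negative)
print(round(ssmd_value, 2))
```

The sign shows the direction of the effect and the magnitude shows how well the two populations are separated.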

Receiver operating characteristic curves (ROC)

              used as quality metric in microarray transcriptomics

              provides quick and intuitive understanding of dynamic ranges

multiple thresholds for defining positives and the resulting trade-offs between sensitivity and specificity can easily be investigated by plotting multiple ROC curves

used to compare validation performance of hits generated from differently normalized RNAi data
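The threshold sweep behind an ROC curve can be sketched as follows. The scores and the "validated hit" labels are hypothetical, and higher scores are assumed to indicate stronger hits.

```python
def roc_points(scores, is_true_hit):
    """Return (false positive rate, true positive rate) pairs, one per threshold.

    A well is called positive when its score is >= the threshold.
    """
    n_pos = sum(is_true_hit)
    n_neg = len(is_true_hit) - n_pos
    points = [(0.0, 0.0)]
    for threshold in sorted(set(scores), reverse=True):
        tp = sum(1 for s, hit in zip(scores, is_true_hit) if s >= threshold and hit)
        fp = sum(1 for s, hit in zip(scores, is_true_hit) if s >= threshold and not hit)
        points.append((fp / n_neg, tp / n_pos))
    return points

def area_under_curve(points):
    """Trapezoidal area under the ROC curve."""
    return sum((x1 - x0) * (y0 + y1) / 2
               for (x0, y0), (x1, y1) in zip(points, points[1:]))

# Hypothetical normalized scores and validation outcomes.
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]
validated = [True, True, False, True, False, False]

points = roc_points(scores, validated)
auc = area_under_curve(points)
print(round(auc, 3))
```

Running the same calculation on scores from two normalization methods and comparing the curves or areas is one way to carry out the comparison mentioned above.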

Step 4: hit identification

Median ±k median absolute deviation (MAD)

              an improvement of mean ±k standard deviation approach

              identify both strong and weak hits while controlling false positives

robust to outliers and effective at identifying weak hits in RNAi data

generates fewer false negatives than mean ±k standard deviation

              very easy to calculate
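A short sketch of the thresholding. The choice of k = 3 and the 1.4826 scale factor are common conventions assumed for this example, not values given in these notes.

```python
from statistics import median

def mad_hits(values, k=3.0, scale=1.4826):
    """Return indices of wells outside median +/- k * (scaled MAD)."""
    med = median(values)
    mad = scale * median([abs(v - med) for v in values])
    lower, upper = med - k * mad, med + k * mad
    return [i for i, v in enumerate(values) if v < lower or v > upper]

# Hypothetical normalized values: one strong activator and one weaker inhibitor.
values = [10.0, 11.0, 9.0, 10.0, 12.0, 8.0, 10.0, 50.0, 3.0]

hits = mad_hits(values)
print(hits)
```

In this toy data both the extreme well and the more modest one fall outside the thresholds, because the outlier does not inflate the MAD the way it would inflate a standard deviation.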

Multiple t-tests

simple to implement and understand, but requires three or more replicates of each condition

If a high false positive rate cannot be tolerated, it is imperative to apply multiple-comparison corrections to the resulting P-values of each individual test. The approach is also sensitive to outliers
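The notes do not say which correction to use. As an illustration, the sketch below applies two widely used ones, Bonferroni and Benjamini-Hochberg, to a list of P-values assumed to come from the per-reagent t-tests. The P-values are made up.

```python
def bonferroni(p_values):
    """Multiply each P-value by the number of tests, capped at 1."""
    m = len(p_values)
    return [min(1.0, p * m) for p in p_values]

def benjamini_hochberg(p_values):
    """Benjamini-Hochberg adjusted P-values (controls the false discovery rate)."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted

# Hypothetical raw P-values, one per siRNA.
raw_p = [0.01, 0.04, 0.03, 0.005]

bonf = bonferroni(raw_p)
bh = benjamini_hochberg(raw_p)
print(bonf, bh)
```

Bonferroni is the more conservative of the two; Benjamini-Hochberg keeps more hits at the cost of tolerating a controlled fraction of false discoveries.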

Quartile-based selection

suited to asymmetric data distributions

sets upper and lower hit-selection thresholds based on the number of interquartile ranges below the first quartile or above the third quartile of the data

              identify both strong and weak hits while controlling false positives

easy to calculate, but not widely used in RNAi screening (MAD is more common)
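A sketch of the thresholds as described in these notes, using `statistics.quantiles` from the Python standard library (Python 3.8+). The multiplier c = 1.5 and the "inclusive" quantile method are assumptions for the example.

```python
from statistics import quantiles

def quartile_hits(values, c=1.5):
    """Return (low_hits, high_hits): indices below Q1 - c*IQR or above Q3 + c*IQR."""
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    lower, upper = q1 - c * iqr, q3 + c * iqr
    low = [i for i, v in enumerate(values) if v < lower]
    high = [i for i, v in enumerate(values) if v > upper]
    return low, high

# Hypothetical skewed readouts with one extreme well.
values = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0, 9.0, 100.0]

low_hits, high_hits = quartile_hits(values)
print(low_hits, high_hits)
```

Because the two thresholds are anchored at different quartiles, they need not be symmetric about the median.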

SSMD for hit identification

calculating the SSMD limits for hit selection based on the desired false positive rate, false negative rate or both

requires many negative controls

not calculated by standard analysis packages

Redundant siRNA activity (RSA)

              integrate information about multiple RNAi reagents tested for each gene

ranks silencing reagents according to experimental effect and assigns a P-value to all reagents for a single gene, based on whether the reagents for that gene are distributed significantly higher in the rankings than would be expected by chance

provides P-values for gene hits without sacrificing robustness

hits identified by RSA have higher rates of reconfirmation than those identified with conventional methods

Although RSA is not included in common analysis software packages, its developers have made available implementations in C# (for Windows), R and Perl
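The ranking idea can be illustrated with a simplified sketch based on the hypergeometric distribution. This is not the developers' implementation and leaves out details of the published method; the function names and the toy data are hypothetical, and lower scores are assumed to mean stronger effects.

```python
from math import comb

def hypergeom_tail(k, total, n_gene, drawn):
    """P(X >= k) when `drawn` wells are taken from `total`, `n_gene` of which target the gene."""
    upper = min(n_gene, drawn)
    favourable = sum(comb(n_gene, x) * comb(total - n_gene, drawn - x)
                     for x in range(k, upper + 1))
    return favourable / comb(total, drawn)

def rsa_like_p_values(scored_wells):
    """scored_wells: list of (gene, score) pairs. Returns {gene: smallest tail probability}."""
    ranked = sorted(scored_wells, key=lambda gs: gs[1])
    total = len(ranked)
    gene_ranks = {}
    for rank, (gene, _) in enumerate(ranked, start=1):
        gene_ranks.setdefault(gene, []).append(rank)
    result = {}
    for gene, ranks in gene_ranks.items():
        # For the k-th best reagent at rank r: chance of seeing >= k reagents in the top r.
        result[gene] = min(hypergeom_tail(k, total, len(ranks), r)
                           for k, r in enumerate(ranks, start=1))
    return result

# Hypothetical screen: three genes, three siRNAs each.
wells = [("A", 0.10), ("A", 0.15), ("A", 0.20),
         ("B", 0.50), ("C", 0.55), ("B", 0.60),
         ("C", 0.70), ("B", 0.80), ("C", 0.90)]

p_by_gene = rsa_like_p_values(wells)
print(p_by_gene)
```

Gene A, whose three reagents all rank at the top, receives a much smaller P-value than genes whose reagents are scattered through the ranking.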

Rank product

              originally developed for use with microarray data

The premise of the rank-product approach is that a consistent hit should be highly ranked in each independent biological replicate set.

provides P-values for potential hits without requiring the assumption of an underlying probability distribution

              requires substantial computation and several replicates per screen to work

similar to RSA, but it does not depend on the use of multiple different RNAi reagents per gene

available as part of the RNAither package in Bioconductor
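A rough sketch of the rank product with a permutation-based P-value. It is not the RNAither code; the function names, the number of permutations, the seed and the toy replicate data are assumptions, and lower scores are taken to mean stronger effects.

```python
import random
from math import prod

def ranks(values):
    """Rank 1 = smallest value."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, i in enumerate(order, start=1):
        result[i] = rank
    return result

def rank_products(replicates):
    """Geometric mean of each reagent's rank across replicates."""
    rank_lists = [ranks(rep) for rep in replicates]
    k = len(rank_lists)
    return [prod(rs) ** (1.0 / k) for rs in zip(*rank_lists)]

def permutation_p_values(replicates, n_perm=1000, seed=0):
    """Fraction of rank products from shuffled replicates that are <= the observed one."""
    rng = random.Random(seed)
    observed = rank_products(replicates)
    counts = [0] * len(observed)
    for _ in range(n_perm):
        shuffled = [rng.sample(rep, len(rep)) for rep in replicates]
        null = rank_products(shuffled)
        for i, rp in enumerate(observed):
            counts[i] += sum(1 for v in null if v <= rp)
    n_null = n_perm * len(observed)
    return [(c + 1) / (n_null + 1) for c in counts]

# Hypothetical data: three replicates, five reagents in the same order.
replicates = [[0.20, 0.90, 1.00, 1.10, 0.95],
              [0.30, 1.00, 0.90, 1.20, 1.05],
              [0.25, 1.10, 0.95, 0.90, 1.00]]

rp = rank_products(replicates)
p_values = permutation_p_values(replicates)
print(rp, p_values)
```

The permutation loop is where the computational cost mentioned above comes from: each P-value rests on many reshuffled copies of the screen.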

Bayesian models

seek explicit estimated probabilities that a given siRNA has no effect, an inhibition effect or an activation effect

two models were reported in 2008

simpler model: uses only the negative controls to describe the posterior distribution of the true mean value for a sample given the observed data value

more complex model: uses a posterior distribution that assumes the availability of data from both positive-inhibition and positive-activation controls as well as negative controls

Both models provide the means to calculate the false discovery rate associated with any given hit threshold, but are usable only on screens without replicates

incorporates both plate-wide and experiment-wide information as well as information from both negative controls and the assumed de facto negative samples

simpler Bayesian model is followed by plate-wide MAD

not yet available in common software
