Hello,
I am a bit hopeless at understanding the various reference gene selection algorithms..
Thought I would ask a general (but wordy) question while I chew through the information available on this forum
I am trying to find an appropriate set of reference genes for analysing blood from melanoma patients. As blood is typically a difficult tissue I decided to test quite a few markers with 10 samples (5 patient, 5 control) and then go from there...the patient samples were chosen with a range of our putative target expression values and the controls were chosen to roughly match the age and gender of the patient samples.
I have made up a total of 20 primer sets for blood markers (ie CD14, HBB etc), common reference genes (ACTB, GAPDH, HPRT1, HMBS etc) as well of some markers that have come out of microarray analyses of normal and diseased blood (not melanoma) (PPIB, FPGS, DECR1 etc) trying to get a nice mix of message abundance as I am primarily intersted in rare messages but would like to have candidate reference genes with a range of abundance levels
I ran the samples in triplicate with the max and min range of all replicates less than a cycle so I think my replicates are reasonable although more blood sample numbers would be desirable I thought I would drive up my sample numbers once I had my reference genes narrowed down to ~3-4 reasonable ones
I am running a demo of GenEx while I am trying to talk my boss into buying a licence, as well as the excel add ons for GenNorm, Normfinder and Bestkeeper
I have Ct values directly from my qPCR machine (BioRad IQ5) as well as from linreg edit after importing my background subtracted (not baseline corrected) optical values, presumably this program calculates the Ct taking into account individual run efficiencies all runs seemed to have behaved nicely (reasonably reaction efficiencies, nice melt curves) so any wobble is likely due to the nature of blood samples as opposed to poorly designed primers, unclean preps and undo inter run variation
so I input either the Ct values directly (Bestkeeper, GenEx GeNorm, GenEx Normfinder) or linearise (using 2E-Ct) the Ct values, then normalise to the highest expressing sample for each marker and plug those values into the Excel and GenEx versions of GeNorm and Normfinder
Although GeNorm and Normfinder generally agree I get slightly different values and best marker suggestions depending on what inputs I have I guess not surprisingly as all markers seem relatively okay ie my individual Genorm M values and NormFinder Sd values are not all that bad but not fantastic and my accumulated SD curve is very shallow ie the minimum does not occur until ~10 markers even though the top three markers vary depending on how you pre-process your data. Bestkeeper chooses quite different markers and seems to favour markers of higher abundance ???? I have no comprehension of the statistics involved between the programs so I am trying to input my data in as many ways as possible and see how it effects the curves I get and try to get a visual consensus of what is going on without really understanding it !!!
I notice that GenEx allows you several options for linearising and normalising Ct data and these transformations give me different results than if I let GenEx use the straight Ct values and different again from when I plug these values into the ExCel add on versions of GeNorm and NormFinder
So is it a matter of picking one approach or are there any suggestions of what approach might be more valid?
Looking at the Normfinder accumulated SD curve it seems to me that the steep front end (best 1-4) then long flat U shape (5-17) suggests that my top 10 to 15 markers are all reasonable and it doesn't matter which I use but I should try to use 3-4 reference genes
Even if slightly different markers are chosen for the top 2 or 3 the accumulated SD curve looks roughly the same
Any thoughts or suggestions if I am not being too vague???
Cheers
Mark in Perth Western Australia











