rtransposepcaprincomp

how to use princomp() or prcomp() functions in R with large datasets, without trasposing the data?


I have just started knowing PCA and i wish to use it for a huge microarray dataset with more than 4,00,000 rows. I have my columns in the form of samples, and rows in the form of genes/locus. I did go through some tutorials on using PCA and came across princomp() and prcomp() and a few others.

Now, as i learn here that, in order to plot ¨samples¨ in the biplot, i would need to have them in the rows, and genes/locus in the columns, and hence i will have to transpose my data before using it for PCA.

However, since the rows are more than 4,00,000, i am not really able to transpose them into columns, because the columns are limited. So my question is that, is there any way to perform a PCA on my data, without transposing it, using these R functions ? If not, can anyone of you suggest me any other way or method to do so ?


Solution

  • Why do you hate to transpose your data? It's easy!

    If you read your data into R (for example as the matrix microarray.data) you can transpose them with just a command:

    transposed.microarray.data<-t(microarray.data)