Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How should I choose all markers? #1

Closed
zorrodong opened this issue Jul 6, 2019 · 3 comments
Closed

How should I choose all markers? #1

zorrodong opened this issue Jul 6, 2019 · 3 comments
Labels
good first issue Good for newcomers

Comments

@zorrodong
Copy link

Hi,

Thanks very much for such a wonderful tool to quickly find the DEGs.

However, if I want to extract all the DEGs of a cluster or all cluster, what cutoff of specScore should I use, or I have to use the adj_pvalue to measure? And if I use the pvalue, I have to extract the DEGs by myself, kind of not convenient, will you consider adding this function?

Looking forward to your reply!

Zorro
20190706

@mahmoudibrahim
Copy link
Owner

mahmoudibrahim commented Jul 7, 2019

hi
thank you so much for the suggestion! I agree with you that a function that organizes the output of sortGenes and getPValues together in a single tidy table is a nice idea and I will update this issue as soon as I have written it.

Until then, please see below and feel free to post any further questions!

the specScore matrix can be used to rank genes in each clusters, to estimate a cutoff to get all variable (differentially expressed genes, DEGs) I would go for the "adjpval" matrix from getPValues.

if "sg" is the output of "sortGenes()", this will give you the top 10 genes in each cluster:
plotTopMarkerHeat(sg, top_n = 10, outs = TRUE, plotheat = FALSE)

to get a small set of marker genes:
getMarkers(sg)

To get all variable genes:
pp=getPValues(sg)
names(which(apply(pp$adjpval, 1, function(x) any(x < 0.05))))

to get all marker genes for say cluster 1:
rownames(pp$adjpval[which(pp$adjpval[,1] < 0.05),])

@mahmoudibrahim mahmoudibrahim added the enhancement New feature or request label Jul 7, 2019
@zorrodong
Copy link
Author

Thanks for the prompt reply, actually, I have already done what you said.
Again, It is really a fast method, I really like it!

Best wishes,

Zorro

@mahmoudibrahim
Copy link
Owner

hi

there is now a new function in the latest release called getTable to summarize variable genes and fold change values. I would recommend now to run something like this to get all variable genes:

gs = sortGenes(kidneyTabulaMuris$exp, kidneyTabulaMuris$cellType)
pp = getPValues(gs)
tab = getTable(gs, pp)

to get a small list of marker genes:
mm = getMarkers(gs)

I'm closing the issue now, however feel free to reopen or open a new one if you have any further questions.

best wishes
Mahmoud

@mahmoudibrahim mahmoudibrahim added good first issue Good for newcomers and removed enhancement New feature or request labels Aug 10, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
good first issue Good for newcomers
Projects
None yet
Development

No branches or pull requests

2 participants