Question

Why it is not use UMIs for RNA-seq?

1

Entering edit mode

4.0 years ago

Rafael Soler ★ 1.3k

Hello,

This is a theoretical question. Why scRNA seq data have their UMIs to later on eliminate the PCR bias, but in bulk RNA seq it has no UMIs?? I know that in scRNAseq the PCR contamination is bigger, and I know that you can use Picard, but doing it only by the coordinates I find it very risky, and losing biological information. What do you think?

Thanks

RNA-Seq UMIs scRNA-seq • 2.9k views

ADD COMMENT • link updated 4 months ago by microbe77 ▴ 30 • written 4.0 years ago by Rafael Soler ★ 1.3k

1

Entering edit mode

UMI's can indeed be used for any type of sequencing. They add complexity (and cost) and can add a significant amount of informatics overhead as well. There are primers with UMI available from IDT that can be used for a wide range of applications.

ADD REPLY • link 4.0 years ago by GenoMax 146k

0

Entering edit mode

I always imagined (never thought it through in detail so far) that the limitation was that you would need to PCR amplify DNA to attach UMIs. In single-cell RNAseq there is sufficient material before amplification, but for single-cell DNA there wouldn't be. Thus one would not benefit that greatly from using UMIs. Perhaps someone here can settle whether this is indeed a factor.

ADD REPLY • link 4.0 years ago by Istvan Albert 101k

1

Entering edit mode

There are illumina compatible adapters that have UMI's built-in so they get directly attached to single molecules. xGEN Prism from IDT is one example I linked above.

ADD REPLY • link 4.0 years ago by GenoMax 146k

0

Entering edit mode

They've done that since the beginning (of NGS), they just call them "barcodes".

The Illumina implementation is called unique dual indexes (UDIs).

ADD REPLY • link 4.0 years ago by Jeremy Leipzig 22k

3

Entering edit mode

Barcodes are molecular indexes for samples. Indexes is likely more appropriate term for Illumina ones since they are never part of actual reads. barcodes are better to indicate an inline implementation and they are thus part of actual read.

UMI are molecular/oligo indexes for individual molecules of DNA/RNA.

ADD REPLY • link 4.0 years ago by GenoMax 146k

0

Entering edit mode

for bulk RNA-seq, the amount of UMIs for 8 random nucleodtes will only give you ~ 56k unique reads 4^8. So if you are using high input sample for RNA-seq, you won't capture the uniqueness of the transcripts because they exceed the number of unique UMIs. However in single cells, the input is very low and it is feasible then

ADD REPLY • link 4 months ago by microbe77 ▴ 30

2

Entering edit mode

Well, for bulk RNA-seq, you generally would deduplicate based on the locus a read mapped to in addition to the UMI.

ADD REPLY • link 4 months ago by dsull ★ 6.9k

score 2 · Answer 1 · 2020-11-10

2

Entering edit mode

4.0 years ago

jperezboza ▴ 30

UMIs are necessary for scRNA because of the low input. Amplification bias are more significant when the amount of molecules is low and can lead to high duplicated reads associated to technical problems (as is the case, for example, of miR-486 for illumina). By using the UMIs you compensate these bias.

When you are doing RNAseq, there will be enough starting material that you won't have such strong bias, and so, you will need to compensate less --> UMIs are less necessary.

ADD COMMENT • link 4.0 years ago by jperezboza ▴ 30

1

Entering edit mode

UMIs are necessary for scRNA

They are preferable, but not necessary. Some scRNA-seq methods do not use UMIs.

ADD REPLY • link 4.0 years ago by igor 13k