Notes on rRNA annotation
SSU rRNA and LSU rRNA
- SSU rRNA (small subunit rRNA)
- 16S rRNA in bacteria, archea and plastid
- 18S rRNA in eukaryotes
- 12S rRNA in mitochondria
- LSU rRNA (large subunit rRNA)
- 23S rRNA in bacteria, archea and plastid
- 5.8S & 28S rRNA in eukaryotes. The 5.8S rRNA is homologous to the 5’ end of non-eukaryotic LSU rRNA. In eukaryotes, the insertion of ITS2 breaks LSU rRNA into 5.8S and 28S rRNAs.
- 16S in Mitochondria
- 5S rRNA
- Biogenesis
- In bacteria, 16S rRNA, 23S rRNA, and 5S rRNA are typically organized as a co-transcribed operon.
- In eukaryotes, the 28S, 5.8S, and 18S rRNAs are encoded by a single transcription unit (45S) separated by 2 internally transcribed spacers, and transcribed by RNA polymerase I. 5S rRNA is transcribed independetly by RNA polymerase III.
Where to get rRNA data
- barrnap is widely used tool for rRNA annotation, SortMeRNA is a tool to sort out rRNA from NGS data. Here list data resources used by these two tools
- We can see two major sources of rRNA information come from:
- The silva database. Provide sequence alignments, focus on metagenomics (16S rRNA is the most wildely used marker gene in metagenomics).
- Also see
- greengenes for 16S rRNA
- RDP Bacterial and Archaeal 16S rRNA sequences, and Fungal 28S rRNA
- The Rfam database. Provide sequence alignments, reference structure and covariance model.
rRNA sequence in reference genome assembly
-
rRNA is generally recognized as repeat sequence in eukaryotic genomes, and abscent from some genome assemblies.
- Human genome
- 5S, 5.8S, 18S, 28S rRNAs are all presents in hg38 assembly.
- gencode v38 annotation contains two types of genomic rRNA (5S, 5.8S)and two types of mitochondrial rRNA (16S, 12S), but not genomic 16S and 28S rRNA.
- refseq provides annotations for all 4 types of rRNAs, see this link.
- See discussion in here: http://seqanswers.com/forums/showthread.php?t=41868
- rRNAs have considerable variation among populations
- See 2018, Science Advances, Variant ribosomal RNA alleles are conserved and exhibit tissue-specific expression
- Mouse rRNA prototypes
- NR_003278.3
- NR_003279.1
- NR_003280.2
- NR_030686.1
- Mouse rDNA prototypes
- BK000964.3
- chromosome 8 [123538334,123539354]
- Human rRNA prototypes
- NR_003285.2
- NR_003286.2
- NR_003287.2
- NR_023379.1
- Human rDNA prototypes
- U13369.1
- X12811.1
- Mouse rRNA prototypes
- See 2018, Science Advances, Variant ribosomal RNA alleles are conserved and exhibit tissue-specific expression