Notes on rRNA annotation
SSU rRNA and LSU rRNA
- SSU rRNA (small subunit rRNA)
    
- 16S rRNA in bacteria, archea and plastid
 - 18S rRNA in eukaryotes
 - 12S rRNA in mitochondria
 
 - LSU rRNA (large subunit rRNA)
    
- 23S rRNA in bacteria, archea and plastid
 - 5.8S & 28S rRNA in eukaryotes. The 5.8S rRNA is homologous to the 5’ end of non-eukaryotic LSU rRNA. In eukaryotes, the insertion of ITS2 breaks LSU rRNA into 5.8S and 28S rRNAs.
 - 16S in Mitochondria
 - 5S rRNA
 
 - Biogenesis
    
- In bacteria, 16S rRNA, 23S rRNA, and 5S rRNA are typically organized as a co-transcribed operon.
 - In eukaryotes, the 28S, 5.8S, and 18S rRNAs are encoded by a single transcription unit (45S) separated by 2 internally transcribed spacers, and transcribed by RNA polymerase I. 5S rRNA is transcribed independetly by RNA polymerase III.
 
 
Where to get rRNA data
- barrnap is widely used tool for rRNA annotation, SortMeRNA is a tool to sort out rRNA from NGS data. Here list data resources used by these two tools
 - We can see two major sources of rRNA information come from:
    
- The silva database. Provide sequence alignments, focus on metagenomics (16S rRNA is the most wildely used marker gene in metagenomics).
 - Also see
        
- greengenes for 16S rRNA
 - RDP Bacterial and Archaeal 16S rRNA sequences, and Fungal 28S rRNA
 
 - The Rfam database. Provide sequence alignments, reference structure and covariance model.
 
 
rRNA sequence in reference genome assembly
- 
    
rRNA is generally recognized as repeat sequence in eukaryotic genomes, and abscent from some genome assemblies.
 - Human genome
    
- 5S, 5.8S, 18S, 28S rRNAs are all presents in hg38 assembly.
 - gencode v38 annotation contains two types of genomic rRNA (5S, 5.8S)and two types of mitochondrial rRNA (16S, 12S), but not genomic 16S and 28S rRNA.
 - refseq provides annotations for all 4 types of rRNAs, see this link.
 - See discussion in here: http://seqanswers.com/forums/showthread.php?t=41868
 
 - rRNAs have considerable variation among populations
    
- See 2018, Science Advances, Variant ribosomal RNA alleles are conserved and exhibit tissue-specific expression
        
- Mouse rRNA prototypes
            
- NR_003278.3
 - NR_003279.1
 - NR_003280.2
 - NR_030686.1
 
 - Mouse rDNA prototypes
            
- BK000964.3
 - chromosome 8 [123538334,123539354]
 
 - Human rRNA prototypes
            
- NR_003285.2
 - NR_003286.2
 - NR_003287.2
 - NR_023379.1
 
 - Human rDNA prototypes
            
- U13369.1
 - X12811.1
 
 
 - Mouse rRNA prototypes
            
 
 - See 2018, Science Advances, Variant ribosomal RNA alleles are conserved and exhibit tissue-specific expression