The main website where RiboGrove is hosted may be unavailable outside Belarus due to technical troubles and the overall disaster.
Hence this mirror has been created, and RiboGrove files are available through Dropbox (links are below).




Home / Release archive / 4.210

Contents


Downloads

RiboGrove release 4.210 (2022-01-12)

The release is based on RefSeq release 210.

The fasta files are compressed with gzip, and the metadata file is a zip archive. To uncompress them, Linux and Mac OS users may use gzip and zip programs, they should be built-in. For Windows users, the free and open-source (de)compression program 7-Zip is available.

RiboGrove release archive

You can find all releases in the RiboGrove release archive.


Statistical summary

RiboGrove size
BacteriaArchaeaTotal
Number of gene sequences 128,152 699 128,851
Number of unique gene sequences 36,349 504 36,853
Number of species 6,743 321 7,064
Number of genomes 24,505 410 24,915
Number of genomes of category 1 16,008 122 16,130
Number of genomes of category 2 8,369 288 8,657
Number of genomes of category 3 128 0 128
16S rRNA gene lengths
BacteriaArchaea
Minimum (bp) 1,448.00 1,439.00
25th percentile (bp) * 1,517.25 1,472.00
Median (bp) * 1,532.00 1,475.00
75th percentile (bp) * 1,543.00 1,489.00
Average (bp) * 1,528.38 1,500.67
Mode (bp) * 1,537.00 1,473.00
Maximum (bp) 2,438.00 3,604.00
Standard deviation (bp) * 26.04 149.64

* Metrics marked with this sign were calculated with preliminary normalization, i.e. median within-species gene length was used for the summary.

16S rRNA gene copy number
(Bacteria and Archaea)
Copy
number *
Number of speciesPer cent of species (%)
1 982 13.90
2 1,354 19.17
3 1,088 15.40
4 897 12.70
5 560 7.93
6 698 9.88
7 565 8.00
8 370 5.24
9 178 2.52
10 152 2.15
11 71 1.01
12 49 0.69
13 34 0.48
14 40 0.57
15 7 0.10
16 6 0.08
17 4 0.06
18 4 0.06
20 3 0.04
27 1 0.01
37 1 0.01

* These are median within-species copy numbers.

Top-10 longest 16S rRNA genes
Assembly IDGene length
(bp)
RiboGrove Sequence ID (SeqID)Organism
Bacteria
10898951 2,438 NZ_AP024929.1:249100-251537_minus Thermus thermophilus strain AA2-2
11277031 1,887 NZ_CP045876.1:290071-291957_minus Candidatus Annandia pinicola isolate Ad13-065
8028891 1,806 NZ_AP022826.1:1258017-1259822_minus
NZ_AP022826.1:1532588-1534393_minus
NZ_AP022826.1:1939914-1941719_minus
Nitrosophilus labii strain HRV44
10731991 1,788 NZ_AP024619.1:2016182-2017969_minus Gelria sp. Kuro-4
282748 1,781 NC_014964.1:2252888-2254668_minus Thermoanaerobacter brockii strain Ako-1
40148 1,781 NC_010321.1:2265744-2267524_minus Thermoanaerobacter pseudethanolicus strain ATCC 33223
4499991 1,744 NZ_CP043427.1:609141-610884_plus
NZ_CP043427.1:930699-932442_minus
NZ_CP043427.1:1503078-1504821_minus
Campylobacter sputorum strain LMG 7795
1153941 1,744 NZ_CP019682.1:607981-609724_plus
NZ_CP019682.1:929565-931308_minus
NZ_CP019682.1:1501945-1503688_minus
Campylobacter sputorum strain RM3237
1153911 1,743 NZ_CP019683.1:606847-608589_plus
NZ_CP019683.1:935163-936905_minus
NZ_CP019683.1:1558189-1559931_minus
Campylobacter sputorum CCUG 20703
1101781 1,742 NZ_CP018789.1:273370-275111_plus
NZ_CP018789.1:1545743-1547484_minus
Campylobacter sp. RM6137
1153931 1,742 NZ_CP019685.1:577810-579551_plus
NZ_CP019685.1:891862-893603_minus
NZ_CP019685.1:1479764-1481505_minus
Campylobacter sputorum strain RM8705
7294871 1,742 NZ_CP053828.1:357136-358877_plus
NZ_CP053828.1:1667816-1669557_minus
Campylobacter hyointestinalis strain CHY5
Archaea
351728 3,604 NC_016645.1:127214-130817_plus Pyrobaculum ferrireducens strain 1860
28808 2,213 NC_003364.1:1089640-1091852_plus Pyrobaculum aerophilum strain IM2
37488 2,212 NC_009376.1:623323-625534_minus Pyrobaculum arsenaticum strain DSM 13514
32288 2,202 NC_000854.2:1218712-1220913_minus Aeropyrum pernix strain K1
40848 2,197 NC_010525.1:690419-692615_plus Pyrobaculum neutrophilum strain V24Sta
1145431 2,008 NZ_CP019964.1:751297-753304_minus Candidatus Mancarchaeum acidiphilum strain Mia14
9220081 2,003 NZ_CP060530.1:203642-205644_minus Candidatus Micrarchaeum sp. A_DKE
39388 1,679 NC_009954.1:129150-130828_minus Caldivirga maquilingensis strain IC-167
127981 1,650 NC_022521.1:1165168-1166817_minus Aeropyrum camini strain SY1
304318 1,576 NC_015931.1:84671-86246_minus Pyrolobus fumarii strain 1A
Top-10 shortest 16S rRNA genes
Assembly IDGene length
(bp)
RiboGrove Sequence ID (SeqID)Organism
Bacteria
44428 1,448 NC_012982.1:2336679-2338126_minus Hirschia baltica strain ATCC 49814
1460951 1,449 NZ_CP021913.1:2386837-2388285_plus
NZ_CP021913.1:3597920-3599368_plus
Sagittula sp. P11
860061 1,451 NZ_CP017718.1:2304269-2305719_minus Hyphomonas sp. Mor2
1163161 1,453 NZ_CP022540.1:1369380-1370832_plus
NZ_CP022540.1:2482480-2483932_plus
Antarctobacter heliothermus strain SMS3
6279751 1,454 NZ_AP022337.1:267139-268592_plus
NZ_AP022337.1:1420942-1422395_plus
NZ_AP022337.1:3191208-3192661_minus
Mameliella alba strain KU6B
9503471 1,455 NZ_CP022271.1:2407999-2409453_minus Hyphomonas sp. KY3
34128 1,455 NC_008358.1:2818466-2819920_minus Hyphomonas neptunium strain ATCC 15444
10854641 1,458 NZ_CP065359.1:3157837-3159294_minus Ruegeria sp. SCSIO 43209
8694041 1,458 NZ_CP060436.1:2482207-2483664_minus Pseudooceanicola algae strain Lw-13e
1078381 1,459 NZ_CP020612.1:582021-583479_minus
NZ_CP020612.1:1166317-1167775_minus
Paracoccus contaminans strain RKI 16-01929
9217271 1,459 NZ_CP069004.1:3093411-3094869_plus Sulfitobacter mediterraneus strain SC1-11
8738751 1,459 NZ_CP065429.1:477373-478831_plus Sulfitobacter sp. B30-2
8872011 1,459 NZ_CP065915.1:2729819-2731277_minus
NZ_CP065915.1:3593071-3594529_minus
Pelagovum pacificum strain SM1903
Archaea
39048 1,439 NC_009776.1:728362-729800_plus Ignicoccus hospitalis strain KIN4/I
10131521 1,457 NZ_LR792632.1:542755-544211_plus Methanocaldococcus sp. SG7
2220501 1,463 NZ_CP034692.1:397753-399215_minus Halorubrum sp. BOL3-1
7330651 1,466 NZ_CP058334.1:1530622-1532087_minus Natronomonas halophila strain C90
34548 1,466 NC_007796.1:39814-41279_plus
NC_007796.1:1301079-1302544_minus
NC_007796.1:3501525-3502990_minus
NC_007796.1:3507609-3509074_minus
Methanospirillum hungatei strain JF-1
599268 1,466 NC_020913.1:283607-285072_plus Candidatus Methanomethylophilus alvus strain Mx1201
10123301 1,466 NZ_CP075546.1:133354-134819_plus
NZ_CP075546.1:825954-827419_plus
NZ_CP075546.1:872641-874106_plus
NZ_CP075546.1:1727419-1728884_plus
Methanospirillum sp. J.3.6.1-F.2.7.3
10519241 1,466 NZ_CP077107.1:4649-6114_plus
NZ_CP077107.1:1359562-1361027_minus
NZ_CP077107.1:1365502-1366967_minus
NZ_CP077107.1:1986020-1987485_minus
Methanospirillum hungatei strain GP1
4352521 1,466 NZ_LR699000.1:283607-285072_plus Candidatus Methanomethylophilus alvus strain MGYG-HGUT-02456
2068141 1,466 NZ_CP017686.1:283608-285073_plus Candidatus Methanomethylophilus alvus strain Mx-05
Top-10 genomes with the largest 16S rRNA copy numbers
Assembly IDCopy
number
Organism
Bacteria
1115491 37 Tumebacillus avium stain AR23208
1166771 27 Tumebacillus algifaecis strain THMBR28
6720751 21 Priestia megaterium strain S2
9972241 20 Moritella sp. 36
9972251 20 Moritella sp. 28
9972261 20 Moritella sp. 5
2023811 19 Metabacillus litoralis strain Bac94
9513491 18 Photobacterium ganghwense strain C2.2
9972131 18 Moritella sp. 24
2201351 18 Peribacillus asahii strain OM18
Archaea
5073821 5 Natronorubrum aibiense strain 7-3
2580821 5 Natronorubrum bangense strain JCM 10635
11622961 5 Methanococcoides sp. LMO-1
10842511 5 Natrinema sp. SYSU A 869
521038 4 Natronococcus occultus strain SP4
11057071 4 Halosiccatus urmianus strain IBRC-M: 10911
4349641 4 Methanosphaera stadtmanae strain MGYG-HGUT-02164
33648 4 Methanosphaera stadtmanae strain DSM 3091
11151391 4 Halosiccatus sp. LT50
34548 4 Methanospirillum hungatei strain JF-1
11151361 4 Halosiccatus sp. LT50
38268 4 Methanococcus vannielii strain SB
10519241 4 Methanospirillum hungatei strain GP1
9298621 4 Haloterrigena salifodinae strain BOL5-1
9962651 4 Haloarcula sinaiiensis strain ATCC 33800
10123301 4 Methanospirillum sp. J.3.6.1-F.2.7.3
Top-10 genomes with the highest intragenomic variability of 16S rRNA genes
Assembly IDSum of entropy *
(bits)
Mean entropy *
(bits)
Number
of variable
positions
Gene
copy
number
Organism
Bacteria
4155511 226.25 0.13 247 12 Sporomusa termitida strain DSM 4440
7294871 217.64 0.12 237 3 Campylobacter hyointestinalis strain CHY5
1101781 211.21 0.12 230 3 Campylobacter sp. RM6137
9010851 184.58 0.12 201 3 Sinorhizobium meliloti strain AK76
8227731 167.21 0.10 216 5 Klebsiella pneumoniae strain GZ-1
1469691 145.25 0.10 156 3 Olleya sp. Bg11-27
7292581 136.25 0.09 145 4 Microbulbifer sp. YPW1
638441 135.84 0.08 138 4 Selenomonas sp. F0591
9428571 122.06 0.08 123 4 Thermobispora bispora strain ZC4
9307231 117.00 0.08 117 2 Thermoleptolyngbya sp. PKUAC-SCTB121
Archaea
4982121 137.00 0.09 137 2 Halomicrobium sp. ZPS1 **
11057071 131.55 0.09 146 4 Halosiccatus urmianus strain IBRC-M: 10911
9390741 128.00 0.09 128 2 Halapricum desulfuricans strain HSR12-2
11151391 127.74 0.09 145 4 Halosiccatus sp. LT50
9390521 127.00 0.09 127 2 Halapricum desulfuricans strain HSR-Bgl
2582391 125.81 0.09 137 3 Halomicrobium mukohataei strain JP60
11151361 123.31 0.08 140 4 Halosiccatus sp. LT50
9390681 111.00 0.08 111 2 Halapricum desulfuricans strain HSR-Est
9390731 109.00 0.07 109 2 Halapricum desulfuricans strain HSR12-1
999901 98.00 0.07 98 2 Halorientalis sp. IM1011

* Entropy is Shannon entropy calculated for each column of the multiple sequence alignment (MSA) of all full-length 16S rRNA genes of a genome. Entropy is then summed up (column "Sum of entropy") and averaged (column "Mean entropy").

** Halomicrobium sp. ZPS1 is a quite remarkable case. This genome harbours two 16S rRNA genes, therefore entropy is equal to the number of mismatching nucleotides between sequences of the genes. Respectively, per cent of identity between these two gene sequences is 90.70%! This is remarkable because the usual (however arbitrary) genus demarcation threshold of per cent of identity is 95%.


RiboGrove, 2023-05-19