The main website where RiboGrove is hosted may be unavailable outside Belarus due to technical troubles and the overall disaster.
Hence this mirror has been created, and RiboGrove files are available through Dropbox (links are below).




Contents


Downloads

RiboGrove release 23.229 (2025-03-13)

The release is based on RefSeq release 229.

The fasta file is compressed with gzip, and the metadata file is a zip archive. To uncompress them, Linux and Mac OS users may use gzip and zip programs, they should be built-in. For Windows users, the free and open-source (de)compression program 7-Zip is available.

RiboGrove release archive

You can find all releases in the RiboGrove release archive.

Release notes

Coverage estimation for primer pairs for different V-regions: more archaeal primers added.

You can find notes to all RiboGrove releases on the release notes page.


Statistical summary

RiboGrove size
BacteriaArchaeaTotal
Number of gene sequences 261,001 1,063 262,064
Number of unique gene sequences 63,099 752 63,851
Number of species 12,015 488 12,503
Number of genomes 47,285 611 47,896
Number of genomes of category 1 31,731 248 31,979
Number of genomes of category 2 15,294 363 15,657
Number of genomes of category 3 260 0 260
16S rRNA gene lengths
BacteriaArchaea
Minimum (bp) 1,401.00 1,439.00
25th percentile (bp) * 1,517.00 1,471.00
Median (bp) * 1,529.00 1,473.50
75th percentile (bp) * 1,542.00 1,483.00
Average (bp) * 1,526.95 1,491.44
Mode (bp) * 1,537.00 1,472.00
Maximum (bp) 2,438.00 3,604.00
Standard deviation (bp) * 25.22 122.03

* Metrics marked with an asterisk were calculated with preliminary normalization, i.e. median within-species gene length was used for the summary.

16S rRNA gene copy number
Copy number *BacteriaArchaea
Number of speciesPercent of species (%)Number of speciesPercent of species (%)
1 1,502 12.50 246 50.41
2 2,016 16.78 144 29.51
3 1,668 13.88 73 14.96
4 1,460 12.15 19 3.89
5 903 7.52 6 1.23
6 1,579 13.14 0 0.00
7 1,119 9.31 0 0.00
8 638 5.31 0 0.00
9 326 2.71 0 0.00
10 310 2.58 0 0.00
11 151 1.26 0 0.00
12 135 1.12 0 0.00
13 54 0.45 0 0.00
14 85 0.71 0 0.00
15 25 0.21 0 0.00
16 11 0.09 0 0.00
17 11 0.09 0 0.00
18 6 0.05 0 0.00
19 2 0.02 0 0.00
20 8 0.07 0 0.00
21 1 0.01 0 0.00
22 1 0.01 0 0.00
24 1 0.01 0 0.00
25 1 0.01 0 0.00
27 1 0.01 0 0.00
37 1 0.01 0 0.00

* These are median within-species copy numbers.

Top-10 longest 16S rRNA genes
OrganismGene length (bp)RiboGrove Sequence ID(s)Assembly
accession
Bacteria
Thermus thermophilus strain AA2-2 2,438 GCF_019974355.1:NZ_AP024929.1:249100-251537:minus GCF_019974355.1
Ca. Annandia pinicola strain Ad13-065 1,887 GCF_020541245.1:NZ_CP045876.1:290071-291957:minus GCF_020541245.1
Thermoanaerobacter ethanolicus strain JW 200 1,812 GCF_003722315.1:NZ_CP033580.1:456062-457873:plus GCF_003722315.1
Nitrosophilus labii strain HRV44 1,806 GCF_014466985.1:NZ_AP022826.1:1258017-1259822:minus
GCF_014466985.1:NZ_AP022826.1:1532588-1534393:minus
GCF_014466985.1:NZ_AP022826.1:1939914-1941719:minus
GCF_014466985.1
Sporomusa rhizae strain DSM 16652 1,802 GCF_041428845.1:NZ_CP156925.1:3123180-3124981:minus GCF_041428845.1
Gelria sp. Kuro-4 1,788 GCF_019668485.1:NZ_AP024619.1:2016182-2017969:minus GCF_019668485.1
Helicobacter mastomyrinus strain Hm-17 1,785 GCF_039555295.1:NZ_CP145316.1:765140-766924:minus GCF_039555295.1
Thermoanaerobacter brockii strain Ako-1 1,781 GCF_000175295.2:NC_014964.1:2252888-2254668:minus GCF_000175295.2
Thermoanaerobacter pseudethanolicus
strain ATCC 33223
1,781 GCF_000019085.1:NC_010321.1:2265744-2267524:minus GCF_000019085.1
Thermoanaerobacter sp. RKWS2 1,754 GCF_026240795.1:NZ_CP110888.1:94012-95765:plus GCF_026240795.1
Archaea
Pyrobaculum ferrireducens strain 1860 3,604 GCF_000234805.1:NC_016645.1:127214-130817:plus GCF_000234805.1
Pyrobaculum aerophilum strain IM2 2,213 GCF_000007225.1:NC_003364.1:1089640-1091852:plus GCF_000007225.1
Pyrobaculum arsenaticum strain DSM 13514 2,212 GCF_000016385.1:NC_009376.1:623323-625534:minus GCF_000016385.1
Aeropyrum pernix strain K1 2,202 GCF_000011125.1:NC_000854.2:1218712-1220913:minus GCF_000011125.1
Pyrobaculum neutrophilum strain V24Sta 2,197 GCF_000019805.1:NC_010525.1:690419-692615:plus GCF_000019805.1
Ca. Mancarchaeum acidiphilum strain Mia14 2,008 GCF_002214165.1:NZ_CP019964.1:751297-753304:minus GCF_002214165.1
Ca. Micrarchaeum sp. A_DKE 2,003 GCF_016806735.1:NZ_CP060530.1:203642-205644:minus GCF_016806735.1
Caldivirga maquilingensis strain IC-167 1,679 GCF_000018305.1:NC_009954.1:129150-130828:minus GCF_000018305.1
Aeropyrum camini strain SY1 1,650 GCF_000591035.1:NC_022521.1:1165168-1166817:minus GCF_000591035.1
Pyrolobus fumarii strain 1A 1,576 GCF_000223395.1:NC_015931.1:84671-86246:minus GCF_000223395.1
Top-10 shortest 16S rRNA genes
OrganismGene length (bp)RiboGrove Sequence ID(s)Assembly
accession
Bacteria
Anabaena sp. YBS01 1,401 GCF_009498015.1:NZ_CP034058.1:6920299-6921699:minus GCF_009498015.1
Clostridioides difficile strain TW11 1,426 GCF_009362915.1:NZ_CP045224.1:4068440-4069865:minus GCF_009362915.1
Staphylococcus warneri strain TWSL_1 1,440 GCF_032147125.1:NZ_CP135051.1:2625669-2627108:plus GCF_032147125.1
Roseicitreum antarcticum strain ZS2-28 1,447 GCF_014681765.1:NZ_CP061498.1:3436150-3437596:plus GCF_014681765.1
Hirschia baltica strain ATCC 49814 1,448 GCF_000023785.1:NC_012982.1:2336679-2338126:minus GCF_000023785.1
Sagittula sp. MA-2 1,449 GCF_030126985.1:NZ_CP126145.1:439-1887:plus
GCF_030126985.1:NZ_CP126145.1:2907211-2908659:minus
GCF_030126985.1
Mameliella alba strain KU6B 1,449 GCF_011405015.1:NZ_AP022337.1:1420943-1422391:plus
GCF_011405015.1:NZ_AP022337.1:3191212-3192660:minus
GCF_011405015.1:NZ_AP022337.1:267140-268588:plus
GCF_011405015.1
Sagittula stellata strain E-37 1,449 GCF_039724765.1:NZ_CP155729.1:664616-666064:plus
GCF_039724765.1:NZ_CP155729.1:1804792-1806240:plus
GCF_039724765.1
Sagittula sp. P11 1,449 GCF_002814095.1:NZ_CP021913.1:3597920-3599368:plus
GCF_002814095.1:NZ_CP021913.1:2386837-2388285:plus
GCF_002814095.1
Clostridioides difficile strain Cd18 1,450 GCF_018884705.1:NZ_CP037806.1:136016-137465:plus GCF_018884705.1
Archaea
Ignicoccus hospitalis strain KIN4/I 1,439 GCF_000017945.1:NC_009776.1:728362-729800:plus GCF_000017945.1
Methanocaldococcus lauensis strain SG7 1,457 GCF_902827225.1:NZ_LR792632.1:542755-544211:plus GCF_902827225.1
Halorubrum sp. BOL3-1 1,463 GCF_004114375.1:NZ_CP034692.1:397753-399215:minus GCF_004114375.1
Salinirubellus litoreus strain SYNS196 1,466 GCF_037335815.1:NZ_CP147841.1:597195-598660:minus GCF_037335815.1
Natronomonas halophila strain C90 1,466 GCF_013391085.1:NZ_CP058334.1:1530622-1532087:minus GCF_013391085.1
Methanospirillum purgamenti strain GP1 1,466 GCF_019263745.1:NZ_CP077107.1:4649-6114:plus
GCF_019263745.1:NZ_CP077107.1:1359562-1361027:minus
GCF_019263745.1:NZ_CP077107.1:1365502-1366967:minus
GCF_019263745.1:NZ_CP077107.1:1986020-1987485:minus
GCF_019263745.1
Natronomonas gomsonensis strain KCTC 4088 1,466 GCF_024300825.1:NZ_CP101323.1:2500564-2502029:plus GCF_024300825.1
Methanomethylophilus alvi strain Mx-05 1,466 GCF_003711245.1:NZ_CP017686.1:283608-285073:plus GCF_003711245.1
Ca. Methanomethylophilus alvi strain Mx1201 1,466 GCF_000300255.2:NC_020913.1:283607-285072:plus GCF_000300255.2
Methanospirillum hungatei strain JF-1 1,466 GCF_000013445.1:NC_007796.1:39814-41279:plus
GCF_000013445.1:NC_007796.1:1301079-1302544:minus
GCF_000013445.1:NC_007796.1:3501525-3502990:minus
GCF_000013445.1:NC_007796.1:3507609-3509074:minus
GCF_000013445.1
Salinirubellus salinus strain ZS-35-S2 1,466 GCF_025231485.1:NZ_CP104003.1:3070232-3071697:plus GCF_025231485.1
Natronomonas marina strain ZY43 1,466 GCF_024298905.1:NZ_CP101154.1:18680-20145:plus GCF_024298905.1
Methanomethylophilus alvi strain
MGYG-HGUT-02456
1,466 GCF_902387285.1:NZ_LR699000.1:283607-285072:plus GCF_902387285.1
Methanospirillum stamsii strain Pt1 1,466 GCF_046244385.1:NZ_CP176366.1:1311724-1313189:plus
GCF_046244385.1:NZ_CP176366.1:2035802-2037267:plus
GCF_046244385.1:NZ_CP176366.1:2042927-2044392:plus
GCF_046244385.1:NZ_CP176366.1:3625347-3626812:minus
GCF_046244385.1
Methanospirillum purgamenti strain
J.3.6.1-F.2.7.3
1,466 GCF_018502485.1:NZ_CP075546.1:133354-134819:plus
GCF_018502485.1:NZ_CP075546.1:825954-827419:plus
GCF_018502485.1:NZ_CP075546.1:872641-874106:plus
GCF_018502485.1:NZ_CP075546.1:1727419-1728884:plus
GCF_018502485.1
Top-10 genomes with the largest 16S rRNA copy numbers
OrganismCopy numberAssembly
accession
Bacteria
Tumebacillus avium strain AR23208 37 GCF_002162355.1
Tumebacillus algifaecis strain THMBR28 27 GCF_002243515.1
Photobacterium piscicola strain WVL24019 25 GCF_046058925.1
Photobacterium phosphoreum strain MIP2473 24 GCF_949787665.1
Mesobacillus maritimus strain ADH-29 22 GCF_044803185.1
Photobacterium damselae strain Phdp Wu-1 21 GCF_003130755.1
Peribacillus asahii strain KF4 21 GCF_023823975.1
Photobacterium damselae strain Pdd1411 21 GCF_030168855.1
Aneurinibacillus sp. Ricciae_BoGa-3 21 GCF_028421645.1
Photobacterium damselae strain AS-15-3942-9 20 GCF_021768365.1
Neobacillus drentensis strain JC05 20 GCF_021560175.1
Clostridium tagluense strain CM008 20 GCF_030585445.1
Photobacterium damselae strain RM-71 20 GCF_001708035.2
Photobacterium damselae strain AS-15-0759-2 20 GCF_021768425.1
Photobacterium damselae strain AS-15-3942-7 20 GCF_021768405.1
Photobacterium toruni strain WD2103 20 GCF_024494545.1
Photobacterium damselae strain AS-16-0963-1 20 GCF_021768345.1
Domibacillus sp.
DTU_2020_1001157_1_SI_ALB_TIR_016
20 GCF_032341995.1
Photobacterium damselae strain 04Ya311 20 GCF_026001825.1
Moritella sp. 5 20 GCF_018219455.1
Peribacillus sp. SCS-155 20 GCF_046237635.1
Photobacterium damselae strain WMD-P1 20 GCF_038086615.1
Photobacterium damselae strain WMD-P2 20 GCF_038086725.1
Photobacterium damselae strain WMD-P3 20 GCF_038086915.1
Moritella sp. 28 20 GCF_018219435.1
Photobacterium damselae strain XP-11 20 GCF_023973125.1
Moritella sp. 36 20 GCF_018219415.1
Photobacterium damselae strain 9046-81 20 GCF_009763125.1
Photobacterium damselae strain CSP DAM1 20 GCF_021766015.1
Neobacillus sp. SuZ13 20 GCF_030123365.1
Photobacterium damselae strain CSP DAM2 20 GCF_021765875.1
Archaea
Natronorubrum bangense strain JCM 10635 5 GCF_004799645.1
Methanolobus sp. ZRKC3 5 GCF_045291275.1
Methanococcoides orientis strain LMO-1 5 GCF_021184045.1
Natrinema sp. SYSU A 869 5 GCF_019879105.1
Natronorubrum aibiense strain 7-3 5 GCF_009392895.1
Methanoplanus endosymbiosus strain DSM 3599 5 GCF_024662215.1
Methanolobus mangrovi strain FTZ2 4 GCF_031312535.1
Methanolobus sp. WCC4 4 GCF_038022665.1
Natrinema thermotolerans strain A29 4 GCF_031165565.1
Haloterrigena salifodinae strain BOL5-1 4 GCF_016906025.1
Methanochimaera problematica strain FWC-SCC4 4 GCF_032878975.1
Methanospirillum purgamenti strain GP1 4 GCF_019263745.1
Uncultured Methanospirillum sp. 4 GCF_963668475.1
Methanolobus sediminis strain FTZ6 4 GCF_031312595.1
Halomicrobium salinisoli strain TH30 4 GCF_020405245.1
Haloarcula sinaiiensis strain ATCC 33800 4 GCF_018200015.1
Methanospirillum purgamenti strain
J.3.6.1-F.2.7.3
4 GCF_018502485.1
Methanogenium sp. S4BF 4 GCF_029633965.1
Halomicrobium urmianum strain IBRC-M: 10911 4 GCF_020217425.1
Methanospirillum hungatei strain JF-1 4 GCF_000013445.1
Methanospirillum lacunae strain Ki8-1 4 GCF_046195335.1
Methanosphaera stadtmanae strain DSM 3091 4 GCF_000012545.1
Methanogenium organophilum strain DSM 3596 4 GCF_026684035.1
Methanospirillum stamsii strain Pt1 4 GCF_046244385.1
Natronococcus occultus strain SP4 4 GCF_000328685.1
Halomicrobium salinisoli strain LT50 4 GCF_020405185.1
Methanococcus vannielii strain SB 4 GCF_000017165.1
Methanosphaera stadtmanae strain
MGYG-HGUT-02164
4 GCF_902384015.1
Top-10 genomes with the highest intragenomic variability of 16S rRNA genes
OrganismSum of entropy * (bits)Mean entropy * (bits)Number of variable positionsGene copy numberAssembly
accession
Bacteria
Clostridium perfringens strain A SNU21005 780.95 0.41 1,171 9 GCF_047150065.1
Escherichia coli strain P276M 433.81 0.26 569 6 GCF_009762385.1
Listeria monocytogenes strain 10-092876-1155 LM6 357.10 0.20 370 3 GCF_001999045.1
Klebsiella pneumoniae strain GZ-1 304.27 0.18 464 8 GCF_014854815.1
Streptococcus infantis strain SO 291.50 0.18 308 3 GCF_021497965.1
Synechococcus sp. NB0720_010 243.35 0.16 265 3 GCF_023078835.1
Streptomyces griseorubiginosus strain NBC_00586 231.55 0.15 342 6 GCF_036345135.1
Caminibacter mediatlanticus strain TB-2 228.78 0.15 282 4 GCF_005843985.1
Xanthomonas oryzae strain YNCX 227.74 0.15 248 3 GCF_024499285.1
Sporomusa termitida strain DSM 4440 226.25 0.13 247 12 GCF_007641255.1
Archaea
Halomicrobium sp. ZPS1 ** 137.00 0.09 137 2 GCF_009217585.1
Halomicrobium urmianum strain IBRC-M: 10911 131.55 0.09 146 4 GCF_020217425.1
Halapricum desulfuricans strain HSR12-2 128.00 0.09 128 2 GCF_017094525.1
Halomicrobium salinisoli strain TH30 127.74 0.09 145 4 GCF_020405245.1
Halapricum desulfuricans strain HSR-Bgl 127.00 0.09 127 2 GCF_017094445.1
Halomicrobium mukohataei strain JP60 125.81 0.09 137 3 GCF_004803735.1
Halomicrobium sp. HM KBTZ05 124.38 0.08 134 3 GCF_041530035.1
Halomicrobium salinisoli strain LT50 123.31 0.08 140 4 GCF_020405185.1
Halapricum desulfuricans strain HSR-Est 111.00 0.08 111 2 GCF_017094465.1
Halapricum desulfuricans strain HSR12-1 109.00 0.07 109 2 GCF_017094505.1

* Entropy is Shannon entropy calculated for each column of the multiple sequence alignment (MSA) of all full-length 16S rRNA genes of a genome. Entropy is then summed up (column “Sum of entropy”) and averaged (column “Mean entropy”).

** Halomicrobium sp. ZPS1 is a quite remarkable case. This genome harbours two 16S rRNA genes, therefore entropy is equal to the number of mismatching nucleotides between sequences of the genes. Respectively, percent of identity between these two gene sequences is 90.70%! This is remarkable because the usual (however arbitrary) genus demarcation threshold of percent of identity is 95%.

Coverage* of primer pairs for different V-regions of 16S rRNA genes

* Coverage of a primer pair is the percent of genomes having at least one 16S rRNA gene which can be amplified by PCR using this primer pair. For details, see our paper about RiboGrove.

In the tables below, you can find coverage of primer pairs that are being commonly used to amplify bacterial and archaeal genes (“bacterial” and “archaeal” primers).

You can find a more detailed table in the file primer_pair_genomic_coverage.tsv in the metadata. That table contains coverage not just for phyla, but also for each class, order, family, genus, and species. Moreover, that table contains coverage values for additional primer pairs, namely 1115F-1492R, 349f-519r, 1106F-Ar1378R, 1106F-SSU1492Rngs, SSU1ArF-SSU468R, SSU1ArF-SSU520R. In the tables below, they are omitted for brevity.

Bacterial genes, “bacterial” primers
Phylum Number
of genomes
Full gene V1–V2 V1–V3 V3–V4 V3–V5 V4 V4–V5 V4–V6 V5–V6 V5–V7 V6–V7 V6–V8
27F–
1492R
(%)
27F–
338R
(%)
27F–
534R
(%)
341F–
785R
(%)
341F–
944R
(%)
515F–
806R
(%)
515F–
944R
(%)
515F–
1100R
(%)
784F–
1100R
(%)
784F–
1193R
(%)
939F–
1193R
(%)
939F–
1378R
(%)
Pseudomonadota 25,842 99.71 99.50 99.69 99.93 84.00 99.89 84.07 88.89 88.57 93.45 92.52 96.44
Bacillota 10,928 99.83 99.74 99.78 99.92 95.25 99.97 95.13 99.45 98.08 97.55 98.67 99.38
Actinomycetota 4,858 99.90 99.20 99.71 94.75 67.87 94.61 67.68 96.93 99.75 99.84 99.84 96.89
Bacteroidota 1,620 96.36 95.99 96.42 99.94 64.01 99.38 63.58 37.90 38.02 93.95 92.59 95.43
Campylobacterota 1,282 100.00 100.00 100.00 100.00 100.00 99.92 99.92 99.92 99.53 99.53 99.69 99.53
Mycoplasmatota 746 89.95 83.78 71.85 98.53 90.75 98.66 91.15 74.66 48.53 42.90 75.87 0.54
Spirochaetota 365 59.18 59.73 59.73 92.60 99.73 92.60 99.73 99.73 73.15 73.15 89.32 47.40
Cyanobacteriota 363 99.72 99.72 99.72 100.00 3.58 100.00 3.58 100.00 1.10 1.10 100.00 99.72
Chlamydiota 232 0.00 0.00 0.00 100.00 100.00 0.00 0.00 0.00 100.00 100.00 100.00 94.40
Fusobacteriota 225 100.00 98.67 99.56 99.56 99.56 99.56 99.56 99.56 99.56 99.56 100.00 0.00
Verrucomicrobiota 142 99.30 0.00 99.30 100.00 12.68 100.00 12.68 100.00 1.41 1.41 98.59 98.59
Thermodesulfobacteriota 141 100.00 99.29 100.00 100.00 42.55 100.00 42.55 100.00 94.33 90.07 95.04 98.58
Deinococcota 95 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 0.00 0.00 53.68 100.00
Myxococcota 65 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00
Planctomycetota 64 100.00 18.75 100.00 100.00 62.50 100.00 62.50 0.00 0.00 0.00 3.12 0.00
Chloroflexota 51 100.00 92.16 100.00 43.14 0.00 94.12 0.00 90.20 11.76 11.76 94.12 27.45
Bdellovibrionota 44 100.00 100.00 100.00 100.00 77.27 100.00 77.27 100.00 100.00 100.00 100.00 100.00
Thermotogota 43 100.00 97.67 100.00 100.00 9.30 100.00 9.30 100.00 0.00 0.00 58.14 97.67
Acidobacteriota 41 97.56 97.56 97.56 100.00 100.00 100.00 100.00 100.00 70.73 58.54 87.80 100.00
Aquificota 18 100.00 16.67 100.00 100.00 16.67 100.00 16.67 100.00 0.00 0.00 0.00 16.67
Chlorobiota 15 100.00 100.00 100.00 100.00 0.00 0.00 0.00 0.00 100.00 93.33 86.67 6.67
Nitrospirota 15 100.00 100.00 100.00 100.00 73.33 100.00 73.33 100.00 100.00 73.33 73.33 100.00
Rhodothermota 15 40.00 40.00 40.00 100.00 100.00 100.00 100.00 86.67 86.67 100.00 100.00 100.00
Ca. Saccharibacteria 12 100.00 100.00 100.00 100.00 8.33 8.33 8.33 8.33 0.00 0.00 100.00 100.00
Synergistota 9 100.00 100.00 100.00 100.00 0.00 100.00 0.00 100.00 0.00 0.00 100.00 100.00
Gemmatimonadota 7 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00
Deferribacterota 6 100.00 100.00 100.00 100.00 0.00 100.00 0.00 100.00 100.00 100.00 100.00 100.00
Elusimicrobiota 4 100.00 50.00 100.00 100.00 0.00 100.00 0.00 100.00 75.00 75.00 100.00 100.00
Atribacterota 3 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 0.00 0.00 100.00 100.00
Ignavibacteriota 3 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00
Armatimonadota 2 100.00 50.00 100.00 50.00 50.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00
Thermodesulfobiota 2 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 0.00 0.00 100.00
Thermomicrobiota 2 100.00 100.00 100.00 100.00 0.00 100.00 0.00 100.00 0.00 0.00 50.00 50.00
Balneolota 2 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00
Chrysiogenota 2 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00
Dictyoglomota 2 100.00 100.00 100.00 100.00 0.00 100.00 0.00 100.00 0.00 0.00 100.00 0.00
Fibrobacterota 2 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00
Kiritimatiellota 2 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 0.00 0.00 100.00 100.00
Ca. Fervidibacterota 1 100.00 0.00 100.00 100.00 100.00 100.00 100.00 100.00 0.00 0.00 100.00 100.00
Ca. Cloacimonadota 1 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00
Ca. Bipolaricaulota 1 0.00 0.00 0.00 100.00 100.00 100.00 100.00 0.00 0.00 0.00 0.00 0.00
Ca. Absconditibacteriota 1 100.00 0.00 100.00 100.00 0.00 100.00 0.00 0.00 0.00 100.00 0.00 0.00
Calditrichota 1 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00
Caldisericota 1 100.00 100.00 100.00 100.00 0.00 0.00 0.00 0.00 0.00 100.00 100.00 100.00
Ca. Omnitrophota 1 100.00 100.00 100.00 100.00 0.00 100.00 0.00 100.00 100.00 100.00 100.00 100.00
Ca. Paceibacterota 1 0.00 0.00 0.00 100.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
Vulcanimicrobiota 1 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 0.00 0.00 100.00 100.00
Thermosulfidibacterota 1 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 0.00 0.00 100.00 100.00
Nitrospinota 1 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00 0.00 0.00 100.00 100.00
Lentisphaerota 1 100.00 0.00 100.00 100.00 100.00 100.00 100.00 100.00 0.00 0.00 100.00 100.00
Fidelibacterota 1 100.00 100.00 100.00 100.00 0.00 100.00 0.00 100.00 100.00 100.00 100.00 100.00
Coprothermobacterota 1 0.00 0.00 0.00 100.00 100.00 100.00 100.00 0.00 0.00 0.00 100.00 0.00
Archaeal genes, “archaeal” primers
Phylum Number
of genomes
Full gene V1–V2 V1–V3 V1–V3 V3–V4 V3–V4 V3–V4 V3–V5 V3–V5 V4 V4–V5 V5–V7
SSU1ArF–
SSU1492Rngs
(%)
SSU1ArF–
SSU280ArR
(%)
SSU1ArF–
SSU470R
(%)
SSU1ArF–
A519R
(%)
349f–
SSU666ArR
(%)
340f–
SSU666ArR
(%)
340f–
806rB
(%)
349f–
SSU1000ArR
(%)
340f–
SSU1000ArR
(%)
515fB–
806rB
(%)
Parch519f–
Arch915r
(%)
A751F–
UA1204R
(%)
Methanobacteriota 447 89.04 86.80 89.26 89.04 52.35 51.45 100.00 99.11 100.00 100.00 99.55 89.71
Thermoproteota 107 96.26 98.13 100.00 100.00 72.90 98.13 100.00 69.16 93.46 100.00 99.07 98.13
Nitrososphaerota 30 96.67 96.67 96.67 96.67 100.00 100.00 100.00 100.00 100.00 100.00 100.00 100.00
Thermoplasmatota 19 84.21 68.42 100.00 100.00 42.11 42.11 100.00 63.16 84.21 100.00 100.00 52.63
Ca. Nanohalarchaeota 4 0.00 25.00 0.00 100.00 0.00 0.00 100.00 50.00 100.00 100.00 100.00 0.00
Ca. Micrarchaeota 2 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 100.00 0.00 0.00 0.00
Nanobdellota 1 100.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 100.00 0.00 0.00 0.00
Promethearchaeota 1 100.00 100.00 100.00 100.00 100.00 100.00 100.00 0.00 0.00 100.00 100.00 100.00

Bacterial genes, “archaeal” primers
Bacterial genes, “archaeal” primers
Phylum Number
of genomes
Full gene V1–V2 V1–V3 V1–V3 V3–V4 V3–V4 V3–V4 V3–V5 V3–V5 V4 V4–V5 V5–V7
SSU1ArF–
SSU1492Rngs
(%)
SSU1ArF–
SSU280ArR
(%)
SSU1ArF–
SSU470R
(%)
SSU1ArF–
A519R
(%)
349f–
SSU666ArR
(%)
340f–
SSU666ArR
(%)
340f–
806rB
(%)
349f–
SSU1000ArR
(%)
340f–
SSU1000ArR
(%)
515fB–
806rB
(%)
Parch519f–
Arch915r
(%)
A751F–
UA1204R
(%)
Pseudomonadota 25,842 1.24 0.03 0.56 0.59 0.00 0.00 0.09 0.00 0.00 99.89 28.03 0.00
Bacillota 10,928 2.58 0.05 0.13 1.46 0.02 0.00 0.06 0.01 0.00 99.97 98.43 0.00
Actinomycetota 4,858 0.86 0.23 0.74 1.15 0.00 0.00 0.04 0.00 0.00 94.61 88.37 0.00
Bacteroidota 1,620 1.91 0.00 1.79 1.91 0.00 0.00 0.19 0.00 0.00 99.38 99.26 0.00
Campylobacterota 1,282 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 99.92 0.16 0.00
Mycoplasmatota 746 2.01 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 98.66 77.61 0.00
Spirochaetota 365 0.55 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 92.60 92.33 0.00
Cyanobacteriota 363 3.03 0.00 0.28 0.28 0.00 0.00 0.00 0.00 0.00 100.00 100.00 0.00
Chlamydiota 232 1.72 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
Fusobacteriota 225 0.44 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 99.56 99.56 0.00
Verrucomicrobiota 142 5.63 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 100.00 9.86 0.70
Thermodesulfobacteriota 141 5.67 0.71 1.42 1.42 0.00 0.00 0.00 0.00 0.00 100.00 74.47 0.00
Deinococcota 95 40.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 100.00 96.84 0.00
Myxococcota 65 13.85 7.69 6.15 6.15 0.00 0.00 0.00 0.00 0.00 100.00 100.00 0.00
Planctomycetota 64 1.56 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 100.00 79.69 0.00
Chloroflexota 51 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 94.12 100.00 0.00
Bdellovibrionota 44 0.00 0.00 0.00 0.00 0.00 0.00 0.00 4.55 0.00 100.00 27.27 0.00
Thermotogota 43 41.86 0.00 30.23 30.23 0.00 0.00 2.33 0.00 0.00 100.00 100.00 0.00
Acidobacteriota 41 9.76 0.00 0.00 4.88 0.00 0.00 0.00 0.00 0.00 100.00 100.00 0.00
Aquificota 18 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 100.00 83.33 44.44
Chlorobiota 15 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
Nitrospirota 15 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 100.00 100.00 0.00
Rhodothermota 15 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 100.00 100.00 0.00
Ca. Saccharibacteria 12 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 8.33 8.33 0.00
Synergistota 9 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 100.00 100.00 0.00
Gemmatimonadota 7 0.00 14.29 0.00 0.00 0.00 0.00 0.00 0.00 0.00 100.00 100.00 0.00
Deferribacterota 6 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 100.00 100.00 0.00
Elusimicrobiota 4 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 100.00 100.00 0.00
Atribacterota 3 33.33 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 100.00 100.00 0.00
Ignavibacteriota 3 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 100.00 100.00 0.00
Armatimonadota 2 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 100.00 50.00 0.00
Thermodesulfobiota 2 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 100.00 100.00 0.00
Thermomicrobiota 2 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 100.00 100.00 0.00
Balneolota 2 50.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 100.00 100.00 0.00
Chrysiogenota 2 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 100.00 100.00 0.00
Dictyoglomota 2 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 100.00 100.00 0.00
Fibrobacterota 2 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 100.00 100.00 0.00
Kiritimatiellota 2 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 100.00 100.00 0.00
Ca. Fervidibacterota 1 100.00 0.00 0.00 100.00 0.00 0.00 0.00 0.00 0.00 100.00 0.00 0.00
Ca. Cloacimonadota 1 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 100.00 100.00 0.00
Ca. Bipolaricaulota 1 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 100.00 0.00 0.00
Ca. Absconditibacteriota 1 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 100.00 0.00 0.00
Calditrichota 1 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 100.00 100.00 0.00
Caldisericota 1 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
Ca. Omnitrophota 1 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 100.00 100.00 0.00
Ca. Paceibacterota 1 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
Vulcanimicrobiota 1 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 100.00 100.00 0.00
Thermosulfidibacterota 1 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 100.00 100.00 0.00
Nitrospinota 1 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 100.00 100.00 0.00
Lentisphaerota 1 100.00 0.00 100.00 100.00 0.00 0.00 0.00 0.00 0.00 100.00 100.00 0.00
Fidelibacterota 1 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 100.00 100.00 0.00
Coprothermobacterota 1 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 100.00 100.00 0.00

Archaeal genes, “bacterial” primers
Archaeal genes, “bacterial” primers
Phylum Number
of genomes
Full gene V1–V2 V1–V3 V3–V4 V3–V5 V4 V4–V5 V4–V6 V5–V6 V5–V7 V6–V7 V6–V8
27F–
1492R
(%)
27F–
338R
(%)
27F–
534R
(%)
341F–
785R
(%)
341F–
944R
(%)
515F–
806R
(%)
515F–
944R
(%)
515F–
1100R
(%)
784F–
1100R
(%)
784F–
1193R
(%)
939F–
1193R
(%)
939F–
1378R
(%)
Methanobacteriota 447 0.22 0.22 0.22 0.22 0.00 100.00 0.00 82.33 0.22 0.22 0.22 0.22
Thermoproteota 107 0.93 0.00 0.00 0.00 0.00 100.00 0.00 88.79 0.00 0.00 0.00 0.00
Nitrososphaerota 30 0.00 0.00 0.00 0.00 0.00 100.00 0.00 0.00 0.00 0.00 0.00 0.00
Thermoplasmatota 19 0.00 0.00 0.00 0.00 0.00 100.00 0.00 0.00 0.00 0.00 0.00 0.00
Ca. Nanohalarchaeota 4 0.00 0.00 0.00 0.00 0.00 100.00 0.00 0.00 0.00 0.00 0.00 0.00
Ca. Micrarchaeota 2 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
Nanobdellota 1 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
Promethearchaeota 1 0.00 0.00 0.00 0.00 0.00 100.00 0.00 0.00 0.00 0.00 0.00 0.00

Primers used for coverage estimation
Primers used for coverage estimation
Primer nameSequenceReference
27FAGAGTTTGATYMTGGCTCAGFrank et al., 2008
338RGCTGCCTCCCGTAGGAGTSuzuki et al., 1996
341F *CCTACGGGNGGCWGCAGKlindworth et al., 2013
515FGTGCCAGCMGCCGCGGTAATurner et al., 1999
534RATTACCGCGGCTGCTGGWalker et al., 2015
784FAGGATTAGATACCCTGGTAAndersson et al., 2008
785R *GACTACHVGGGTATCTAATCCKlindworth et al., 2013
806RGGACTACHVGGGTWTCTAATCaporaso et al., 2010
939FGAATTGACGGGGGCCCGCACAAGLebuhn et al., 2014
944RGAATTAAACCACATGCTCFuks et al., 2018
1100RAGGGTTGCGCTCGTTGTurner et al., 1999
1193RACGTCATCCCCACCTTCCBodenhausen et al, 2013
1378RCGGTGTGTACAAGGCCCGGGAACGLebuhn et al., 2014
1492RTACCTTGTTACGACTTFrank et al., 2008
SSU1ArFTCCGGTTGATCCYGCBRGBahram et al., 2018
SSU520RGCTACGRRYGYTTTARRCBahram et al., 2018
340fCCCTAYGGGGYGCASCAGGantner et al., 2011
806rBGGACTACNVGGGTWTCTAATAppril et al., 2015
349fGYGCASCAGKCGMGAAWTakai and Horikoshi, 2000
519rTTACCGCGGCKGCTGKlindworth et al., 2013
515fBGTGYCAGCMGCCGCGGTAAParada et al., 2015
Parch519fCAGCCGCCGCGGTAAOvreås et al., 1997
Arch915rGTGCTCCCCCGCCAATTCCTRaskin et al., 1994
1106FTTWAGTCAGGCAACGAGCWatanabe et al., 2007
Ar1378R **TGTGCAAGGAGCAGGGACWatanabe et al., 2007
A751FCCGACGGTGAGRGRYGAABaker et al., 2003
SSU1492RngsCGGNTACCTTGTKACGACBahram et al., 2018
SSU280ArRTCAGWNYCCNWCTCSRGGBahram et al., 2018
SSU470RDCNGCNGGTDTTACCGCGBahram et al., 2018
SSU468RGNDCNGCNGGTDTTACCGBahram et al., 2018
A519RGGTDTTACCGCGGCKGCTGWang and Qian, 2009
SSU666ArRHGCYTTCGCCACHGGTRGBahram et al., 2018
SSU1000ArRGGCCATGCAMYWCCTCTCBahram et al., 2018
UA1204RTTMGGGGCATRCIKACCTBaker et al., 2003

* Primers 341F and 785R are used in the protocol for library preparation for sequencing of V3–V4 region of 16S rRNA genes on Illumina MiSeq.

** Ar1378R is originally named 1378R. We use amended name to avoid confusion.



RiboGrove, 2025-05-08