|Data set (reference[s])||Read length (nt)||No. of samples||Total no. of sequences||No. of unique sequences||No. of distances||No. of OTUs|
|Even (34, 36)||NA||NA||1,155,800||11,558||29,694||7,651|
|Staggered (34, 36)||NA||NA||1,156,550||11,558||29,694||7,653|
↵a Each data set contains sequences from the V4 region of the 16S rRNA gene. The number of distances for each data set indicates those that were less than or equal to 0.03. The number of OTUs was determined using the OptiClust algorithm. The even and staggered data sets were generated by extracting the V4 region from full-length reference sequences, and the data sets from the natural communities were generated by sequencing the V4 region using an Illumina MiSeq with paired reads of either 150 or 250 nt. NA, not applicable.