OTU benchmark (MOCK) |
MOCK
results
Ref seqs is the number of known reference sequences. The Titanium reference set should be complete. The Even and Uneven sets reference sets are probably incomplete due to missing paralogs. OTUs otupipe is the number of OTUs found by the otupipe. Ref seqs 97% is the number of reference sequences after clustering using UCLUST at 97%. This is a way to estimate a lower bound on the number of OTUs that should be found. Since an OTU pipeline cannot group paralogs from a single species if the paralogs are diverged more than 97%, we might expect more OTUs than species even if the algorithm is performing perfectly. OTUs uclust+uchime Number of clusters found by a naive method for comparison. I did a standard UCLUST clustering at 97% followed by UCHIME in reference database mode using the known reference sequences. Reads number of reads. Exact is the number of reference sequences that were recovered exactly by the pipeline. Download References |