Home Software Services About Contact     
 
USEARCH v11
 New in v11 

otutab_rare command

Subsamples ("rarefies") an OTU table to a fixed number of reads per sample using random subsampling without replacement. I now believe this is the best strategy because preserves the shape of the abundance distribution in each sample more accurately than systematic rounding as used in the obsolete otutab_norm command.

I recommend using this command to normalize samples to the same number of reads so that they are comparable to each other.

The output filename is specified by the output option.

Input and output are in QIIME classic format.

The number of reads per sample is specified by the -sample_size option. This option must be specified; there is no default value.

Samples which have < sample_size reads are discarded.

The -randseed option specifies a random number seed. The value must be a non-negative integer. By default, the seed is randomized using the operating system clock and process id so it will be different each time the command is executed. This option can be used to get reproducible results, e.g. -randseed 1.

Example

usearch -otutab_rare otutab.txt -sample_size 5000 -output otutab_5k.txt