abundance skew

Chimeric sequences form during PCR amplification.

A chimeric template is created during one round, then amplified by subsequent rounds to produce chimeric amplicons that are difficult to distinguish from amplicons derived from a single biological sequence.

In each round, the number of amplicons of a given sequence approximately doubles (subject to biases due to sequence affinity and secondary structure), with the result that chimeric amplicons should almost always be less abundant than their parent sequences.

This observation is exploited by the de novo mode of UCHIME. Given a set of amplicon sequences, it is straightforward to find chimeric alignments by using the reference database mode of UCHIME where the same sequences are used for both database and query. The difficulty is to determine which of the three sequences in the alignment is the chimera and which is the parent. Amplicon abundances can be used to discriminate: the parents should be at least 2x as abundant as their chimeras. The ratio between the parent abundance and the putative chimera abundance is called the abundance skew. The minimum value required to classify a sequence as chimeric is set by the ‑abskew parameter of uchime_denovo (default value 2.0).