See also
FASTQ files
FASTQ commands
Quality scores
Wikipedia article
on FASTQ
Cock et ail
(2010) paper describing FASTQ
ASCII coding of Q scores
The Q value is coded as a printable
ASCII character using Q =
ASCII_CODE - ASCII_BASE. Here, ASCII_CODE is the ASCII code for the character
and ASCII_BASE is a constant. The original Sanger FASTQ format used ASCII_BASE =
33 so for example if the quality score is coded as 'C' then Q = ASCII_CODE('C')
- 33 = 67 - 32 = 35. See here for tables mapping
ASCII characters to Q scores for common variants of FASTQ.
FASTQ format options
Unfortunately, the FASTQ format is not standardized. There are several variants
in common use, and it is not possible to distinguish them automatically with
high reliability. The main parameters are the minimum and maximum Q scores and
the ASCII_BASE constant. These are set by the command-line options shown in the
following table. The fastq_chars command can be
used to guess the format of a FASTQ file.
Option |
Default |
Description |
-fastq_ascii |
33 |
ASCII_BASE constant described above. |
-fastq_qmin |
0 |
Minimum Q score. |
-fastq_qmax |
41 |
Maximum Q score (input files). |
-fastq_qmaxout |
41 |
Maximum Q score (output files). |
|