Metrics
Metrics generated by CollectHsMetrics for the analysis of target-capture sequencing experiments. The metrics in this class fall broadly into three categories:
This table summarizes the values that are specific to this metric.
| Metric | Summary |
|---|---|
| BAIT_SET | The name of the bait set used in the hybrid selection. |
| BAIT_TERRITORY | The number of bases which are localized to one or more baits. |
| BAIT_DESIGN_EFFICIENCY | The ratio of TARGET_TERRITORY/BAIT_TERRITORY. A value of 1 indicates a perfect design efficiency, while a valud of 0.5 indicates that half of bases within the bait region are not within the target region. |
| ON_BAIT_BASES | The number of PF_BASES_ALIGNED that are mapped to the baited regions of the genome. |
| NEAR_BAIT_BASES | The number of PF_BASES_ALIGNED that are mapped to within a fixed interval containing a baited region, but not within the baited section per se. |
| OFF_BAIT_BASES | The number of PF_BASES_ALIGNED that are mapped away from any baited region. |
| PCT_SELECTED_BASES | The fraction of PF_BASES_ALIGNED located on or near a baited region (ON_BAIT_BASES + NEAR_BAIT_BASES)/PF_BASES_ALIGNED. |
| PCT_OFF_BAIT | The fraction of PF_BASES_ALIGNED that are mapped away from any baited region, OFF_BAIT_BASES/PF_BASES_ALIGNED. |
| ON_BAIT_VS_SELECTED | The fraction of bases on or near baits that are covered by baits, ON_BAIT_BASES/(ON_BAIT_BASES + NEAR_BAIT_BASES). |
| MEAN_BAIT_COVERAGE | The mean coverage of all baits in the experiment. |
| PCT_USABLE_BASES_ON_BAIT | The fraction of aligned, on-bait bases out of the PF bases available. (NOTE: This uses duplicate reads for both numerator and denominator) |
| PCT_USABLE_BASES_ON_TARGET | The fraction of aligned, de-duped, on-target bases out of all the PF bases available. |
| FOLD_ENRICHMENT | The fold by which the baited region has been amplified above genomic background. |
| HS_LIBRARY_SIZE | The estimated number of unique molecules in the selected part of the library. |
| HS_PENALTY_10X | The "hybrid selection penalty" incurred to get 80% of target bases to 10X. This metric should be interpreted as: if I have a design with 10 megabases of target, and want to get 10X coverage I need to sequence until PF_ALIGNED_BASES = 10^7 * 10 * HS_PENALTY_10X. |
| HS_PENALTY_20X | The "hybrid selection penalty" incurred to get 80% of target bases to 20X. This metric should be interpreted as: if I have a design with 10 megabases of target, and want to get 20X coverage I need to sequence until PF_ALIGNED_BASES = 10^7 * 20 * HS_PENALTY_20X. |
| HS_PENALTY_30X | The "hybrid selection penalty" incurred to get 80% of target bases to 30X. This metric should be interpreted as: if I have a design with 10 megabases of target, and want to get 30X coverage I need to sequence until PF_ALIGNED_BASES = 10^7 * 30 * HS_PENALTY_30X. |
| HS_PENALTY_40X | The "hybrid selection penalty" incurred to get 80% of target bases to 40X. This metric should be interpreted as: if I have a design with 10 megabases of target, and want to get 40X coverage I need to sequence until PF_ALIGNED_BASES = 10^7 * 40 * HS_PENALTY_40X. |
| HS_PENALTY_50X | The "hybrid selection penalty" incurred to get 80% of target bases to 50X. This metric should be interpreted as: if I have a design with 10 megabases of target, and want to get 50X coverage I need to sequence until PF_ALIGNED_BASES = 10^7 * 50 * HS_PENALTY_50X. |
| HS_PENALTY_100X | The "hybrid selection penalty" incurred to get 80% of target bases to 100X. This metric should be interpreted as: if I have a design with 10 megabases of target, and want to get 100X coverage I need to sequence until PF_ALIGNED_BASES = 10^7 * 100 * HS_PENALTY_100X. |
See also General Documentation | Tool Docs Index Tool Documentation Index | Support Forum
GATK version 4.6.2.0 built at Sun, 13 Apr 2025 13:21:43 -0400.