UmiMetrics

Category: Metrics


Overview

Metrics calculated while marking duplicates within a stream of SAMRecords using the UmiAwareDuplicateSetIterator.

This table summarizes the values that are specific to this metric.

Metric Summary
LIBRARY
Library that was used to generate UMI data.
MEAN_UMI_LENGTH
Mean number of bases in each UMI
OBSERVED_UNIQUE_UMIS
Number of different UMI sequences observed
INFERRED_UNIQUE_UMIS
Number of different inferred UMI sequences
OBSERVED_BASE_ERRORS
Number of errors inferred by comparing the observed and inferred UMIs
DUPLICATE_SETS_IGNORING_UMI
Number of duplicate sets found before taking UMIs into account
DUPLICATE_SETS_WITH_UMI
Number of duplicate sets found after taking UMIs into account
OBSERVED_UMI_ENTROPY
Entropy (in base 4) of the observed UMI sequences, indicating the effective number of bases in the UMIs. If this is significantly smaller than MEAN_UMI_LENGTH, it indicates that the UMIs are not distributed uniformly.
INFERRED_UMI_ENTROPY
Entropy (in base 4) of the inferred UMI sequences, indicating the effective number of bases in the inferred UMIs. If this is significantly smaller than MEAN_UMI_LENGTH, it indicates that the UMIs are not distributed uniformly.
UMI_BASE_QUALITIES
Estimated Phred-scaled quality score for UMIs
PCT_UMI_WITH_N
Percentage of reads whose UMI contains at least one N
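
To make the entropy metrics more concrete, the following is a minimal Python sketch of a base-4 Shannon entropy over a list of UMI sequences. It is an illustration only, not the tool's own code: the function name umi_entropy_base4 and the example UMI lists are assumptions made up for this sketch. It shows why a uniform UMI distribution yields an entropy close to the UMI length, while a skewed distribution yields a much smaller value.

```python
import math
from collections import Counter


def umi_entropy_base4(umis):
    """Shannon entropy, in base 4, of the empirical distribution of UMI sequences.

    Hypothetical helper for illustration; not part of Picard or GATK.
    """
    counts = Counter(umis)
    total = sum(counts.values())
    entropy = 0.0
    for count in counts.values():
        p = count / total
        # Each distinct UMI contributes -p * log4(p) to the entropy.
        entropy -= p * math.log(p, 4)
    return entropy


# Made-up data: all 16 possible 2-base UMIs, each observed once.
uniform = [a + b for a in "ACGT" for b in "ACGT"]
uniform_entropy = umi_entropy_base4(uniform)

# Made-up data: same UMI length, but one sequence dominates.
skewed = ["AA"] * 97 + ["AC", "GT", "TT"]
skewed_entropy = umi_entropy_base4(skewed)

print(f"uniform: {uniform_entropy:.3f}")
print(f"skewed:  {skewed_entropy:.3f}")
```

In the uniform case the entropy equals the UMI length of 2 (up to floating-point rounding), so every base is "effective". In the skewed case the entropy falls far below 2, which is the situation the OBSERVED_UMI_ENTROPY and INFERRED_UMI_ENTROPY descriptions flag as a non-uniform UMI distribution.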




GATK version 4.6.2.0 built at Sun, 13 Apr 2025 13:21:43 -0400.