IndependentReplicateMetric

Metrics

Category Metrics


Overview

A class to store information relevant for biological rate estimation @author Yossi Farjoun

This table summarizes the values that are specific to this metric.

Metric Summary
nSites
The count of sites used.
nThreeAllelesSites
The count of sites in which a third allele was found.
nTotalReads
The total number of reads over the het sites.
nDuplicateSets
The number of duplicate sets examined.
nExactlyTriple
The number of sets of size exactly 3 found.
nExactlyDouble
The number of sets of size exactly 2 found.
nReadsInBigSets
The number of reads in duplicate of sizes greater than 3.
nDifferentAllelesBiDups
The number of doubletons where the two reads had different bases in the locus.
nReferenceAllelesBiDups
The number of doubletons where the two reads matched the reference.
nAlternateAllelesBiDups
The number of doubletons where the two reads matched the alternate.
nDifferentAllelesTriDups
The number of tripletons where at least one of the reads didn't match either allele of the het site.
nMismatchingAllelesBiDups
The number of tripletons where the two reads had different bases in the locus.
nReferenceAllelesTriDups
The number of tripletons where the two reads matched the reference.
nAlternateAllelesTriDups
The number of tripletons where the two reads matched the alternate.
nMismatchingAllelesTriDups
The number of tripletons where at least one of the reads didn't match either allele of the het site.
nReferenceReads
The number of reference alleles in the reads.
nAlternateReads
The number of alternate alleles in the reads.
nMismatchingUMIsInDiffBiDups
The number of UMIs that are different within Bi-sets that come from different alleles.
nMatchingUMIsInDiffBiDups
The number of UMIs that are match within Bi-sets that come from different alleles.
nMismatchingUMIsInSameBiDups
The number of UMIs that are different within Bi-sets that come from the same alleles.
nMatchingUMIsInSameBiDups
The number of UMIs that are match within Bi-sets that come from the same alleles.
nMismatchingUMIsInCoOrientedBiDups
The number of bi-sets with mismatching UMIs and same orientation.
nMismatchingUMIsInContraOrientedBiDups
The number of bi-sets with mismatching UMIs and opposite orientation.
nBadBarcodes
The number of sets where the UMIs had poor quality bases and were not used for any comparisons.
nGoodBarcodes
the number of sets where the UMIs had good quality bases and were used for any comparisons.
biSiteHeterogeneityRate
the rate of heterogeneity within doubleton sets.
triSiteHeterogeneityRate
the rate of heterogeneity within tripleton sets
biSiteHomogeneityRate
the rate of homogeneity within doubleton sets.
triSiteHomogeneityRate
the rate of homogeneity within tripleton sets.
independentReplicationRateFromBiDups
independentReplicationRateFromTriDups
The biological duplication rate (as a fraction of the duplicates sets) calculated from tripleton sets.
pSameUmiInIndependentBiDup
When the alleles are different, we know that this is a biological duplication, thus we expect nearly all the UMIs to be different (allowing for equality due to chance). So we expect this to be near 1.
pSameAlleleWhenMismatchingUmi
When the UMIs mismatch, we expect about the same number of different alleles as the same (assuming that different UMI implies biological duplicate) thus, this value should be near 0.5
independentReplicationRateFromUmi
Given the UMIs one can estimate the rate of biological duplication directly, as this would be the rate of having different UMIs in all duplicate sets. This is only a good estimate if the assumptions hold, for example if pSameUmiInIndependentBiDup is near 1.
replicationRateFromReplicateSets
An estimate of the duplication rate that is based on the duplicate sets we observed.

Return to top


See also General Documentation | Tool Docs Index Tool Documentation Index | Support Forum

GATK version 4.6.2.0 built at Sun, 13 Apr 2025 13:21:43 -0400.