Skip to content

bamtools random reads are highly skewed towards small chromosomes #230

@benbfly

Description

@benbfly

I believe the way bamtools random works is to first randomly pick a reference, and then randomly pick a position within the reference.

For instance, in human GRCh38 , there are lots of tiny reference chromosomes, like chrUn_**** and chr1_****_random. Bamtools random will pick a large fraction of reads from these chromosomes, even if they are proportionally at tiny fraction of all reads. This is undesirable behavior, especially since these chromosomes are abnormal sequence which is mostly high copy and low complexity repeats.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions