Skip to content

Picard ValidateSamFile halts for 100 Mate CIGAR String (MC Attribute) present for a read whose mate is unmapped Error after lifting from chm13v2 to hg38 #71

@yunkaig

Description

@yunkaig

Hi there,

I have been trying to adapt leviosam2 to out pipeline. This is the process I ran through:

  1. bwa mem alignment to chm13v2
  2. mark duplicates
  3. sort bam & validate bam file was OK then
  4. lift over to hg38 with leviosam2
  5. bqsr & validate bam file throws the error and stops

These are the parameters I used to run leviosam2:

export LVS_SIF="$sw/leviosam2/leviosam2_latest.sif"
export LVS_PY="$sw/leviosam2/leviosam2/workflow/leviosam2.py"
export LVS_CMD="singularity exec ${LVS_SIF} python3 ${LVS_PY}"

${LVS_CMD} \
    -t "${n_threads}" \
    -i "${infile}" # infile is the input bam aligned to chm13v2\
    -o "${outfile_prefix}" \
    -O bam \
    -C "${clft_file}" \
    -f "${ref_fasta}" # ref_fasta is hg38.noalt.decoy.bwa/GCA_000001405.15_GRCh38_no_alt_plus_hs38d1_analysis_set.fna, used in previous pipeline as reference \
    -fi "${ref_fasta}"\
    -s ilmn_pe \
    -a bwamem \
    --use_preset True \
    --forcerun

And the error messages from ValidateSamFile:

ERROR::MATE_CIGAR_STRING_INVALID_PRESENCE:Record 6, Read name ST-E00106:225:HHM22CCXX:7:2118:6238:47668, Mate CIGAR String (MC Attribute) present for a read whose mate is unmapped
ERROR::MATE_CIGAR_STRING_INVALID_PRESENCE:Record 47, Read name ST-E00106:225:HHM22CCXX:7:1210:27732:7866, Mate CIGAR String (MC Attribute) present for a read whose mate is unmapped
ERROR::MATE_CIGAR_STRING_INVALID_PRESENCE:Record 169, Read name ST-E00106:225:HHM22CCXX:7:1111:29630:48705, Mate CIGAR String (MC Attribute) present for a read whose mate is unmapped
ERROR::MATE_CIGAR_STRING_INVALID_PRESENCE:Record 172, Read name ST-E00106:225:HHM22CCXX:7:1211:12763:57284, Mate CIGAR String (MC Attribute) present for a read whose mate is unmapped
ERROR::MATE_CIGAR_STRING_INVALID_PRESENCE:Record 177, Read name ST-E00106:225:HHM22CCXX:7:1207:10439:33920, Mate CIGAR String (MC Attribute) present for a read whose mate is unmapped
ERROR::MATE_CIGAR_STRING_INVALID_PRESENCE:Record 415, Read name ST-E00106:225:HHM22CCXX:7:2218:24515:6267, Mate CIGAR String (MC Attribute) present for a read whose mate is unmapped
ERROR::MISMATCH_MATE_ALIGNMENT_START:Record 415, Read name ST-E00106:225:HHM22CCXX:7:2218:24515:6267, Mate alignment does not match alignment start of mate

... (more like this)

Maximum output of [100] errors reached.

It seems to me that no one has brought up that issue, but I did found a commit to remove MC:z tags for unmapped reads.

Is there anything I have missed?

Thanks in advance,
Yunkai

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions