Skip to content

Experimental ancient DNA modifications #523

@teepean

Description

@teepean

I decided to use Claude Code to automate testing strobealign's options with ancient DNA and gave it bwa and SHRiMP's code alongside with other references and it started modifying code and came up with this:

https://github.com/teepean/strobealign/tree/aDNA

Here's some results with one test case:

https://github.com/teepean/strobealign/blob/aDNA/FINAL_COMPARISON_ANALYSIS.md

This should explain the idea that it got to feed unaligned reads from K=16 and map those with k=12. Read length of 30 was my idea as the data is preprocessed with adaterremoval and reads below 30 are discarded along with removing adapters and merging reads.

echo "Multi-k fallback strategy for ancient DNA:"
echo "  1. Align with k=16 (--ancient-dna --mcs=always -S 0.95)"
echo "  2. Extract unmapped reads"
echo "  3. Realign unmapped with k=12"
echo "  4. Merge and output final BAM"
echo ""
echo "Parameters:"
echo "  threads: number of threads (default: 16)"
echo "  read_length: -r parameter for strobealign (default: 30, recommended for aDNA)"
echo ""

I would be thankful if anyone would like to verify the results as they seem to be too good to be true even after using several methods to check the results.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions