-
Notifications
You must be signed in to change notification settings - Fork 24
Description
I decided to use Claude Code to automate testing strobealign's options with ancient DNA and gave it bwa and SHRiMP's code alongside with other references and it started modifying code and came up with this:
https://github.com/teepean/strobealign/tree/aDNA
Here's some results with one test case:
https://github.com/teepean/strobealign/blob/aDNA/FINAL_COMPARISON_ANALYSIS.md
This should explain the idea that it got to feed unaligned reads from K=16 and map those with k=12. Read length of 30 was my idea as the data is preprocessed with adaterremoval and reads below 30 are discarded along with removing adapters and merging reads.
echo "Multi-k fallback strategy for ancient DNA:"
echo " 1. Align with k=16 (--ancient-dna --mcs=always -S 0.95)"
echo " 2. Extract unmapped reads"
echo " 3. Realign unmapped with k=12"
echo " 4. Merge and output final BAM"
echo ""
echo "Parameters:"
echo " threads: number of threads (default: 16)"
echo " read_length: -r parameter for strobealign (default: 30, recommended for aDNA)"
echo ""
I would be thankful if anyone would like to verify the results as they seem to be too good to be true even after using several methods to check the results.