Skip to content

Enable GPU-accelerated translated search#1082

Open
KimBioInfoStudio wants to merge 1 commit intosoedinglab:masterfrom
KimBioInfoStudio:gpu-translated-search
Open

Enable GPU-accelerated translated search#1082
KimBioInfoStudio wants to merge 1 commit intosoedinglab:masterfrom
KimBioInfoStudio:gpu-translated-search

Conversation

@KimBioInfoStudio
Copy link

Summary

  • Pass GPU and MAKEPADDEDSEQDB_PAR environment variables from Search.cpp to the translated search workflow
  • When GPU is enabled and the target database is nucleotide, automatically run makepaddedseqdb on the translated ORF database before the inner protein search
  • Clean up the temporary padded database when REMOVE_TMP is set

This enables GPU acceleration for all three translated search modes: blastx (nucleotide query → protein target), tblastn (protein query → nucleotide target), and tblastx (nucleotide query → nucleotide target).

Benchmark (1,000 queries × 10,000 targets, DGX Spark)

Mode GPU CPU (ungapped) Speedup
tblastx 2.2s 34.4s 15.5×
blastx 3.7s 64.2s 17.4×

GPU and CPU produce identical results in all tested cases.

Test plan

  • blastx (nucl→prot) with --gpu 1: verified GPU/CPU results identical
  • tblastn (prot→nucl) with --gpu 1 --mask 0: verified GPU/CPU results identical
  • tblastx (nucl→nucl) with --gpu 1 --mask 0: verified GPU/CPU results identical (6/6 hits match)
  • Large-scale benchmark: 1,000 × 10,000 sequences, hit counts match between GPU and CPU

🤖 Generated with Claude Code

Pass GPU and MAKEPADDEDSEQDB_PAR variables to the translated search
workflow. When GPU is enabled and the target is nucleotide, automatically
create a padded database from the translated ORFs before the inner
protein search.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant