A port of the BIGSI database to the Seq language.
The COVID sequence used for benchmarking comes from https://www.ncbi.nlm.nih.gov/sra/ERX5420434[accn]
Team Members: Gabe Ramirez, Kaveri Nadhamuni, Charvi Gopal
- Clone the repo.
- Install Seq, ensuring that it is in your `$PATH` and that `$LD_LIBRARY_PATH` contains `{seq_installation_directory}/lib/seq`:
  `/bin/bash -c "$(curl -fsSL https://seq-lang.org/install.sh)"`
- Install BIGSI's required libraries. See their docs.
- Compile the `bigseq` binary:
  `cd bigseq`
  `seqc build -release bigseq.seq -o=bigseq`
- Set `SEQ_PYTHON` to the output of `python3 find.py`.
- Run `bigseq`.
  - `bloom <ctx_file> <out_file>`: constructs a bloom filter
  - `build -b <bloom_file> -s <sample_name> ...`: builds an index
  - `search <seq>`: searches for a matching sample
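
The `bloom`, `build`, and `search` commands follow the general BIGSI idea: each sample's k-mers are hashed into a Bloom filter, the filters are stacked as columns of a bit matrix, and a query is answered by ANDing the rows that its k-mers hash to. The sketch below illustrates that idea in plain Python (which Seq's syntax closely resembles). It is not the code in `bigseq.seq`; the function names, the parameters `K`, `M`, and `NUM_HASHES`, the SHA-256 hashing scheme, and the sample sequences are all assumptions made up for this example.

```python
import hashlib

# Illustrative parameters only; real indexes use a much larger k and filter size.
K = 5            # k-mer length
M = 256          # bits per Bloom filter
NUM_HASHES = 3   # hash functions applied to each k-mer


def kmers(seq, k=K):
    """Return every length-k substring of seq."""
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def positions(kmer, m=M, num_hashes=NUM_HASHES):
    """Map a k-mer to num_hashes bit positions using salted SHA-256."""
    out = []
    for salt in range(num_hashes):
        digest = hashlib.sha256(f"{salt}:{kmer}".encode()).digest()
        out.append(int.from_bytes(digest[:8], "big") % m)
    return out


def bloom(seq):
    """Build one sample's Bloom filter, stored as an M-bit Python int."""
    bits = 0
    for kmer in kmers(seq):
        for pos in positions(kmer):
            bits |= 1 << pos
    return bits


def build(blooms):
    """Bit-slice the filters: row r holds bit r of every sample's filter."""
    names = list(blooms)
    rows = [0] * M
    for col, name in enumerate(names):
        for r in range(M):
            if (blooms[name] >> r) & 1:
                rows[r] |= 1 << col
    return names, rows


def search(index, query):
    """Return the samples whose filters contain every k-mer of query."""
    names, rows = index
    hits = (1 << len(names)) - 1  # start with every sample as a candidate
    for kmer in kmers(query):
        for pos in positions(kmer):
            hits &= rows[pos]
    return [name for col, name in enumerate(names) if (hits >> col) & 1]


# Hypothetical toy samples, not real sequencing data.
samples = {
    "sample_a": "ACGTACGTTAGCCGATAGCTAGGA",
    "sample_b": "TTGACCGGTTAACCGGTTGACATG",
}
index = build({name: bloom(seq) for name, seq in samples.items()})
print(search(index, "GTTAGCCGATAG"))
```

Because Bloom filters have no false negatives, a sample that really contains the query is always reported; false positives are possible, and become more likely as the filters fill up.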
See licenses directory and report.