Handle low complexity sequence

Method

Tools

# example usage of dustmasker (part of NCBI BLAST+)
dustmasker -outfmt fasta -parse_seqids -in {input.fasta} -out {masked.fasta}
# low-complexity sequence is soft-masked (set to lower case); to hard-mask it as N instead, run
sed '/^>/! s/[[:lower:]]/N/g' {masked.fasta} > {hardmasked.fasta} # see https://www.biostars.org/p/13677/s
  • dust, a program shipped with the MEME Suite
dust sequences.fasta {cutoff} > sequences.masked.fasta
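
The soft-mask to hard-mask conversion used above on dustmasker output can be tried on a toy file before running it on real data. This is a minimal sketch, not part of the original notes: the temporary directory, the file names and the two toy records are made up for illustration, and the awk summary of masked bases per record is an added convenience that assumes any lower-case base in a sequence line is a masked base.

```shell
# Toy soft-masked FASTA standing in for dustmasker output (hypothetical data).
tmpdir=$(mktemp -d)
printf '>seq1 demo\nACGTacgtACGT\n>seq2\nttttGGCC\n' > "$tmpdir/masked.fasta"

# Hard-mask: replace lower-case bases with N on sequence lines only;
# header lines (starting with ">") are left untouched.
sed '/^>/! s/[[:lower:]]/N/g' "$tmpdir/masked.fasta" > "$tmpdir/hardmasked.fasta"
hard=$(cat "$tmpdir/hardmasked.fasta")

# Summarise the soft-masked file: record name, masked bases, total bases.
# gsub() returns the number of replacements, so replacing each lower-case
# letter with itself ("&") counts them without changing the line.
stats=$(awk '
  /^>/ { if (name != "") print name "\t" masked "\t" total
         name = substr($1, 2); masked = 0; total = 0; next }
       { masked += gsub(/[a-z]/, "&"); total += length($0) }
  END  { if (name != "") print name "\t" masked "\t" total }
' "$tmpdir/masked.fasta")

printf '%s\n' "$hard"
printf '%s\n' "$stats"
rm -r "$tmpdir"
```

On the toy input the lower-case runs become NNNN while the header ">seq1 demo" keeps its lower-case letters, and the summary reports 4 of 12 bases masked for seq1 and 4 of 8 for seq2. A quick count like this can help when choosing a cutoff, since it shows how much of each sequence a given setting masks.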