I have LS454's SFF files. Fastq file only have 133,202 reads(Step 2), but my SSAHA2 result(BAM file) have 188,214 reads(Step 5). Why different?? 1000 genomes data exactly same but my result are different.

  1. Run sff_extract(http://bioinf.comav.upv.es/sff_extract/usage.html), I receive the KOBB0909_190.fastq file.

sff_extract -o KOBB0909_190_hong -Q KOBB0909_190.sff

  1. FASTQ file have total 133,202( 532808/4)reads.

cat KOBB0909_190_hong.fastq |wc -l 532808

  1. Running SSAHA2 like this

ssaha2 -454 -output sam -outfile KOBB0909_190.sam -diff 10 -best 1 -save ref.fa KOBB0909_190_hong.fastq

  1. Recevie the KOBB0909_190.sam file, I convert SAM to BAM

  2. Run SAMTOOLS

samtools flagstat KOBB0909_190.bam

188214 in total

0 QC failure

0 duplicates

188214 mapped (100.00%)

0 paired in sequencing

0 read1

0 read2

0 properly paired (nan%)

0 with itself and mate mapped

0 singletons (nan%)

0 with mate mapped to a different chr

0 with mate mapped to a different chr (mapQ>=5)

asked 29 Jun '10, 07:32

hongiiv's gravatar image

hongiiv
1261
accept rate: 0%


Confirmed with ssaha2 help:

-best If set to 1, only report the best alignment for each match, if multiple best scores report all (0).

link

answered 02 Jul '10, 17:23

lh3's gravatar image

lh3
22191
accept rate: 10%

If there are multiple mappings having the same S-W score, I think SSAHA2 will report them all.

link

answered 02 Jul '10, 13:51

Adrian_H's gravatar image

Adrian_H
1
accept rate: 0%

Your answer
toggle preview

Follow this question

By Email:

Once you sign in you will be able to subscribe for any updates here

By RSS:

Answers

Answers and Comments

Markdown Basics

  • *italic* or _italic_
  • **bold** or __bold__
  • link:[text](http://url.com/ "title")
  • image?![alt text](/path/img.jpg "title")
  • numbered list: 1. Foo 2. Bar
  • to add a line break simply add two spaces to where you would like the new line to be.
  • basic HTML tags are also supported

Tags:

×13
×2
×1
×1

Asked: 29 Jun '10, 07:32

Seen: 937 times

Last updated: 02 Jul '10, 17:23

powered by OSQA