Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

FASTA file error message #56

Open
fp215 opened this issue Feb 8, 2018 · 5 comments
Open

FASTA file error message #56

fp215 opened this issue Feb 8, 2018 · 5 comments

Comments

@fp215
Copy link

fp215 commented Feb 8, 2018

Hi

I've just tried running the java version of QoRTs using the current Ensembl primary assembly (GRCh38, v91, downloaded from the Ensembl ftp) fasta file which I also used to align my test BAM file (using STAR - no problems), however, QoRTs falls over as it says it can't find chromosome 1 in the fasta file, even though I can see that it is present. Could this be an Ensembl format compatibility issue?:

Head of the fasta reference file:

1 dna:chromosome chromosome:GRCh38:1:1:248956422:1 REF
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN

QoRTs error message:

<====== FATAL ERROR! ======>

 Error message: "FATAL ERROR: Cannot find chromosome "1" in genome FASTA file!"
 Stack Trace:
    java.lang.Thread.getStackTrace(Thread.java:1552)
    internalUtils.Reporter$.error(Reporter.scala:294)
    internalUtils.genomicAnnoUtils$EfficientGenomeSeqContainer_MFA.switchToChrom(genomicAnnoUtils.scala:170)
    internalUtils.genomicAnnoUtils$EfficientGenomeSeqContainer.shiftBufferTo(genomicAnnoUtils.scala:111)
    qcUtils.qcOverlapMatch.runOnReadPair(qcOverlapMatch.scala:218)
    qcUtils.qcOverlapMatch.runOnReadPair(qcOverlapMatch.scala:83)
    qcUtils.runAllQC$.$anonfun$runOnSeqFile$7(runAllQC.scala:1312)
    qcUtils.runAllQC$.$anonfun$runOnSeqFile$7$adapted(runAllQC.scala:1285)
    qcUtils.runAllQC$$$Lambda$188/576936864.apply(Unknown Source)
    scala.collection.Iterator.foreach(Iterator.scala:929)
    scala.collection.Iterator.foreach$(Iterator.scala:929)
    internalUtils.stdUtils$IteratorProgressReporter$$anon$5.foreach(stdUtils.scala:487)
    qcUtils.runAllQC$.runOnSeqFile(runAllQC.scala:1285)
    qcUtils.runAllQC$.run(runAllQC.scala:960)
    qcUtils.runAllQC$allQC_runner.run(runAllQC.scala:672)
    runner.runner$.main(runner.scala:97)
    runner.runner.main(runner.scala)

<==========================>
Exception in thread "main" java.lang.Exception: FATAL ERROR: Cannot find chromosome "1" in genome FASTA file!
at internalUtils.Reporter$.error(Reporter.scala:299)
at internalUtils.genomicAnnoUtils$EfficientGenomeSeqContainer_MFA.switchToChrom(genomicAnnoUtils.scala:170)
at internalUtils.genomicAnnoUtils$EfficientGenomeSeqContainer.shiftBufferTo(genomicAnnoUtils.scala:111)
at qcUtils.qcOverlapMatch.runOnReadPair(qcOverlapMatch.scala:218)
at qcUtils.qcOverlapMatch.runOnReadPair(qcOverlapMatch.scala:83)
at qcUtils.runAllQC$.$anonfun$runOnSeqFile$7(runAllQC.scala:1312)
at qcUtils.runAllQC$.$anonfun$runOnSeqFile$7$adapted(runAllQC.scala:1285)
at qcUtils.runAllQC$$$Lambda$188/576936864.apply(Unknown Source)
at scala.collection.Iterator.foreach(Iterator.scala:929)
at scala.collection.Iterator.foreach$(Iterator.scala:929)
at internalUtils.stdUtils$IteratorProgressReporter$$anon$5.foreach(stdUtils.scala:487)
at qcUtils.runAllQC$.runOnSeqFile(runAllQC.scala:1285)
at qcUtils.runAllQC$.run(runAllQC.scala:960)
at qcUtils.runAllQC$allQC_runner.run(runAllQC.scala:672)
at runner.runner$.main(runner.scala:97)
at runner.runner.main(runner.scala)

Thanks for your help

@hartleys
Copy link
Owner

hartleys commented Feb 8, 2018 via email

@fp215
Copy link
Author

fp215 commented Feb 8, 2018

Thanks for the swift response!

The fasta file does actually start with ">", I must have accidentally dropped it when I pasted it in (sorry!)...

The chromosomes are named 1,2,3 without the "chr" prefix - but as I aligned the BAM using the same fasta file, this matches up with the BAM - or do you mean that QoRTs is specifically looking for the "chr" prefix?

If this is the case, is there any option to edit this in QoRTs, or would I need to return to my fasta & BAM files to edit them (which seems a bit of a faff)?

Thanks!

@hartleys
Copy link
Owner

hartleys commented Feb 8, 2018 via email

@fp215
Copy link
Author

fp215 commented Feb 8, 2018

Ah OK, thanks - will give that a go while I wait for the next release!

@fbearoff
Copy link

fbearoff commented Apr 4, 2018

Any update to this? I am experiencing the same issue with the Ensembl genome FASTA files. Thanks.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants