Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

# No output in filter_P_prediction file #11

Open
baljinder-y opened this issue Sep 12, 2024 · 3 comments
Open

# No output in filter_P_prediction file #11

baljinder-y opened this issue Sep 12, 2024 · 3 comments

Comments

@baljinder-y
Copy link

baljinder-y commented Sep 12, 2024

I am trying to run bash script on test dataset using default command provided in the manual. I am getting no output in filter_P_prediction file. File is blank.
Moreover, the script_err file displays very low mapping percentage (0.05%) of sequencing data against mature miRNA sequences.
Kindly provide some solution

@alanlamsiu
Copy link
Collaborator

Hi @baljinder-y. Could you provide more details on which dataset and what commands did you use for testing?

@baljinder-y
Copy link
Author

baljinder-y commented Sep 18, 2024

Hi @alanlamsiu,
Thank you for your time. I was trying to work with the test dataset i.e., GSM2094927.fa and TAIR10_genome.fa to test the pipeline.
First, I tried to run simple bowtie command to check alignment of reads against index of mature miRNAs provided with the tool.

This is the command that i used which is available in the bash script:
"bowtie -v 2 mature_index -f GSM2094927.fa > mir.aln"

This is the output:
reads processed: 968888
reads with at least one reported alignment: 1343 (0.14%)
reads that failed to align: 967545 (99.86%)
Reported 1343 alignments to 1 output stream(s)

Also i tried running bash script directly. This is the command that i was running.
./miRDP2-v1.1.4_pipeline.bash -g TestData/TAIR10_genome.fa -x TestData/TAIR10.genome -f -i TestData/GSM2094927.fa -o output/ -p 6

The command generates all the necessary files in the output folder. However, "GSM2094927_filter_P_prediction file" is blank.
I think issue is related to the low alignment percentage of reads with miRNA sequences.

@alanlamsiu
Copy link
Collaborator

Hi @baljinder-y,

The alignment rate of your read data to the reference genome is very low, which suggests either the reference genome does not match the read data, or there are some issues in the read data. It is necessary to inspect whether the read data is clean, e.g. free of adapter sequences. Or try to blast the reads against the reference genome and check whether they are the same species. I also notice that you use mature_index in your bowtie command. miRDP2 does require to align the read data to the genome sequence instead of mature miRNA sequence. You shouldn't proceed with miRDP2 before improving the alignment rate.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants