Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Regarding the Use of Incomplete Plant Mitochondrial Genome Assembly Outputs for Subsequent Analysis #317

Open
twdawd opened this issue Mar 14, 2024 · 2 comments

Comments

@twdawd
Copy link

twdawd commented Mar 14, 2024

Thank you very much for making this software. I have a question: When I assembled the plant mitochondrial genome, I got a file named embplant_mt.K105.scaffolds.graph1.1.path_sequence.fasta. Can I use the generated file(fasfa file) in the next step of analysis such as gene annotation and phylogeny?
Looking forward to your reply

@JianjunJin
Copy link
Collaborator

The assembly quality of incomplete organelle genomes (scaffolds) cannot be guaranteed in the current version of GetOrganelle (1.7.7.0 or earlier). However, enhancements were made in version 1.8.0 (although not officially released yet, see https://github.com/Kinggerm/GetOrganelle/tree/intermediate_graph) for chloroplast and animal mitochondrial genomes.

The primary concern of directly using the plant mitochondrial assembly lies in the potential inclusion of chloroplast loci, due to the similarity between pt and mt and the false hits of blast. This becomes severe when the assembly is incomplete because depth and graph information can barely be utilized to help identify contigs. Therefore, it is advisable to run get_organelle_from_assembly.py using the current output, implementing stricter contig filtering measures, which can be quick and easy. The following command is an example proposed for this purpose. Still, you are encouraged to carefully assess the possibility of chloroplast inclusion in mitochondrial assemblies even after this.

 get_organelle_from_assembly.py -F embplant_mt --slim-options "--perc-identity 99.0 -e 1e-250" -g your_old_output/filtered_K105.assembly_graph.fastg.extend-embplant_mt-embplant_pt.fastg -o further_purified_result

@JianjunJin JianjunJin changed the title About embplant_mt genome Regarding the Use of Incomplete Plant Mitochondrial Genome Assembly Outputs for Subsequent Analysis Mar 14, 2024
@twdawd
Copy link
Author

twdawd commented Mar 15, 2024

I understand !!! Thank you !!!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants