Skip to content

RepeatMasker is a program that screens DNA sequences for interspersed repeats and low complexity DNA sequences.

License

Notifications You must be signed in to change notification settings

sapojnik/RepeatMasker

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

89 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

RepeatMasker
Developed by Arian Smit and Robert Hubley
Please refer to: Smit, AFA, Hubley, R. & Green, P "RepeatMasker" at
http://www.repeatmasker.org

IMPORTANT:
The github 'master' branch only contains the source code files for
the latest release of RepeatMasker.  A complete distribution including
the source files, a copy of the Dfam database, and the required taxonomy
data file may be found at the RepeatMasker website:
http://www.repeatmasker.org/RMDownload.html

RepeatMasker

RepeatMasker is a program that screens DNA sequences for interspersed repeats and low complexity DNA sequences. The output of the program is a detailed annotation of the repeats that are present in the query sequence as well as a modified version of the query sequence in which all the annotated repeats have been masked (default: replaced by Ns). Sequence comparisons in RepeatMasker are performed by the program cross_match, an efficient implementation of the Smith-Waterman-Gotoh algorithm developed by Phil Green, or by WU-Blast developed by Warren Gish.

See "INSTALL" for instructions on how to install RepeatMasker. See "repeatmaker.help" for a detailed program manual.

Libraries Overview

Updates of the RepeatMasker program are distributed with a copy of the Dfam database ( www.dfam.org ). Dfam is a small but growing "open" databases of Transposable Element seed alignments, profile Hidden Markov Models and consensus sequences.

RepeatMasker is also compatible with the RepBase database managed by the Genetic Information Research Institute and requires a license to use. Up until 2019 we maintained the "Repbase RepeatMasker Edition" libraries as co-editor of RepBase Update. For newer versions of RepBase users will need to use the sequences in FASTA format with RepeatMasker's "-lib" option.

RepeatMasker "open-4.0" and later versions are distributed under the Open Source License. Please read LICENSE file for more information.

About

RepeatMasker is a program that screens DNA sequences for interspersed repeats and low complexity DNA sequences.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Perl 94.4%
  • Python 5.1%
  • Other 0.5%