bedbuncher

Archived. This functionality was moved to bedboss in 2023. Please see https://github.com/databio/bedbase for more information

bedbuncher

Pipeline designed to create bedsets (sets of BED files) that will be retrieved from bedbase.

Example bedsets:

Bed files from the AML database.
Bed files from the EWS database.
Bed files from ChiP-seq experiments.

Before running the pipeline

Required: A PEP will with an attribute specifying a path to a JSON file that contains the query to create the bedset (see the tests directory for reference).

To run the pipeline

Clone the repository
Install required python packages via

pip install -r requirements/requirements.txt --user

Submit the pipeline with looper

looper run project/cfg.yaml

Pipeline outputs

bedbuncher generates the following files as well as links within the Elastic Search bedset index from which they can be retrieved:

TAR ball containing the BED files that match the query criteria.
Dataframe where rows represent individual BED files and columns show statistics generated by GenomicDistributions (for ease of user-needed calculations).
iGD database created from the bedset.
Bedset statistics (currenty means and standard deviations).
PEP for a specific Bedset created using the pipeline (currently under development).

Additional CML dependencies

iGD builds a database that integrates genomic sets from one or more data sources and minimizes the search space for a specific query. Visit the iGD repository for more information and installation details.

Name		Name	Last commit message	Last commit date
Latest commit History 105 Commits
pipelines		pipelines
project		project
requirements		requirements
tests		tests
tools		tools
.gitignore		.gitignore
README.md		README.md
bbconfig.yaml		bbconfig.yaml
pipeline_interface.yaml		pipeline_interface.yaml
resources.tsv		resources.tsv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

bedbuncher

Before running the pipeline

To run the pipeline

Pipeline outputs

Additional CML dependencies

About

Releases 1

Packages

Contributors 4

Languages

databio/bedbuncher

Folders and files

Latest commit

History

Repository files navigation

bedbuncher

Before running the pipeline

To run the pipeline

Pipeline outputs

Additional CML dependencies

About

Resources

Stars

Watchers

Forks

Releases 1

Packages 0

Contributors 4

Languages

Packages