-
Notifications
You must be signed in to change notification settings - Fork 18
/
Copy pathpackage-submission.Rmd
382 lines (305 loc) · 18.8 KB
/
package-submission.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
# (PART) Package Submissions {-}
# Overview {.unnumbered #submission-overview}
The following page gives an overview of the submission process along with key
principles to follow. See also [Package Guidelines][guidelines] for
package specific guidelines and requirement and the
[_Bioconductor_ new package submission tracker][tracker].
# Bioconductor Package Submissions
- [Introduction](#subintro)
- [Types of Packages](#type)
- [Package Naming Policy](#naming)
- [Author/Maintainer Expectations](#author)
- [Submission](#submission)
- [Experiment data package](#experPackage)
- [Annotation package](#annPackage)
- [Workflow package](#workPackage)
- [Review Process](#whattoexpect)
- [Following Acceptance](#afteraccept)
- [Additional Support](#support)
## Introduction {#subintro}
To submit a package to _Bioconductor_ the package should:
* Address areas of high-throughput genomic analysis, e.g., sequencing,
expression and other microarrays, flow cytometry, mass spectrometry,
image analysis; see [biocViews][].
* Interoperate with other _Bioconductor_ packages by _re-using common data
structures_ (see [Common Bioconductor Methods and Classes][bioc-common]) and
existing infrastructure (e.g., `rtracklayer::import()` for input of common
genomic files).
* Adopt software best practices that enable reproducible research and
use, such as full documentation and vignettes (including fully
evaluated code) as well as commitment to long-term user support
through the _Bioconductor_ [support site][].
* Not exist on CRAN. A package can only be submitted to one or the other.
* Comply with [Package Guidelines][guidelines].
* Your package cannot depend on any package (or version of a package)
that is not (yet) available on CRAN or _Bioconductor_. The package should work
with whatever current version of the package is publically available.
## Types of Packages {#type}
_Bioconductor_ packages are broadly defined by four main package types:
[**Software**][software-pkgs], [**Experiment Data**][exptdata-pkgs],
[**Annotation**][annotation-pkgs] and [**Workflow**][workflow-pkgs].
* Software Packages. Most packages contributed by users are software
packages. Software packages provide implementation of algorithms
(e.g. statistical analysis), access to resources (e.g. biomart, or NCBI) or
visualizations (e.g. volcano plots, pathways plots). Instructions for creating
Software packages can be found here: [Package guidelines][guidelines].
* [Annotation packages](#annPackage) are database-like packages that provide
information linking identifiers (e.g., Entrez gene names or Affymetrix
probe ids) to other information (e.g., chromosomal location, Gene
Ontology category). It is also encouraged to utilize AnnotationHub for
storage and access to large raw data files and their conversion to
standard R formats. Instructions for adding data to AnnotationHub and
designing a annotation package to use AnnotationHub can be found here:
[Create A Hub Package][].
* [Experiment data packages](#experPackage) provide data sets that are used,
often by software packages, to illustrate particular analyses. These packages
contain curated data from an experiment, teaching course or publication and
in most cases contain a single data set. It is also encouraged to utilize
ExperimentHub for storage and access to larger data files. ExperimentHub is
also particularly useful for hosting collections of related data sets.
Instructions for adding data to ExperimentHub and designing an experiment data
package to use ExperimentHub can be found here:
[Create A Hub Package][].
* [Workflow packages](#workPackage) contain vignettes that describe a
bioinformatics workflow that involves multiple Bioconductor packages. These
vignettes are usually more extensive than the vignettes that accompany
software packages. These packages do not need man/ or R/ directories nor a
data/ directory as ideally workflows make use of existing data in a
Bioconductor package. See development section on [Workflows][] for more
details.
See [Package Guidelines][guidelines] for details on package format and syntax.
## Package Naming Policy {#naming}
Package naming:
i) Ownership of package name. Bioconductor follows [CRAN's
policy][] in requiring that contributors give the right to use the package name
to Bioconductor at time of submission, so that the Bioconductor team can orphan
the package and allow another maintainer to take it over in the event that the
package contributor discontinues package maintenance. See Bioconductor's package
[end-of-life policy][] for more details.
ii) Uniqueness of package name. Packages should be named in a way that does not
conflict (irrespective of case) with any current or past BIOCONDUCTOR package,
nor any current or past CRAN package.
See [Package naming guidelines](#package-name) for more guidelines.
## Author / Maintainer Expectations {#author}
Acceptance of packages into _Bioconductor_ brings with it ongoing
package maintenance responsibilities. Package authors are expected to:
* Follow Bioconductor guidelines
These include standard [guidelines][guidelines], [version numbering][versioning],
[coding style](#r-code), [code performance
requirements](#correctness-space-and-time), [memory usage](#memory), [using
existing data classes][bioc-common], and the other requirements described
below.
* Follow the release cycle of Bioconductor
There are two releases each year, around April and October. The
[release schedule][release-schedule] will indicate the timetables and
deadlines for each release. A release cycle typically produces two
versions of packages, ‘devel’ and ‘release’. It is important to be familiar
with these branch concepts. Once your package has been accepted, it will
initially be in the ‘devel’ branch. The current devel branch becomes the
next release. Most users are expected to use the release branch, so they will
not immediately have access to your package until the next release.
Bug fixes can be fixed in both branches, while new features should only
be added to the ‘devel’ branch.
* Maintain the package using version control
Realize that _Bioconductor_, unlike CRAN, maintains all package source code
under git version control. This means that you make changes to your
package using [git][gitver]. If your package is accepted, you will receive
instructions with typical git operations (see the after
[acceptance section](#afteraccept)). Package maintenance through software
release cycles, including prompt updates to software and documentation,
is needed due to possible underlying changes in R and/or other package
dependencies.
* Check Build Reports and Fix Issues Promptly
_Bioconductor_ has weekly to daily [build reports][] for all package types. It is
the maintainers responsibility to check the build reports and respond to any
ERROR/Warning/Notes produced by the install, build, or check process of a
package. The maintainer will receive automatic build notifications when the
package starts to fail on linux from `[email protected]`;
maintainers should check that emails from this email address can be delivered.
* Subscribe to the [bioc-devel][bioc-devel-mail] mailing list
The Bioconductor team communicates with developers through this list.
It is also a good channel to communicate changes to other developers.
Addressing Bioconductor team requests in a timely manner guarantees that
your package remains available through Bioconductor.
* [Register][support-register] on and use the [support site][]
The support site is the official support channel for users. Users and even
developers may ask questions regarding your package on this platform.
Be sure to include all the packages that you maintain in the "Watched Tags"
section of your support site profile. This will notify you of any questions
posted regarding your package(s).
It is important to promptly respond to bug reports and questions either on
the [_Bioconductor_ support site][support site] post or directly to developers.
Some maintainers prefer to indicate a `BugReports:` field in their package's
DESCRIPTION file. This field indicates a particular web page for submitting
bug reports and questions.
* Ensure the maintainer email in the DESCRIPTIONS stays accurate and
reachable. The maintainer email in the DESCRIPTION is the definitive contact
_Bioconductor_ will use. It is important to keep this email up-to-date to
ensure we can contact you if your package is failing or to notify you of any
important upcoming impactful events. If the email is not reachable your
package would be in jeopardy of removal.
## Submission
* Read and follow the full [Contributor Guidelines][guidelines] section.
* Submit by opening a new issue in the _Bioconductor_
[Contributions][] repository, following the [guidelines][tracker] of
the `README.md` file.
Assuming that your package is in a [GitHub Repository][git-repo-create] and
under the default branch, add the link to your repository to
the issue you are opening. You cannot specify any alternative branches; the
default branch is utilized. The default branch must contain only package
code. Any files or directories for other applications (Github Actions,
devtool, etc) should be in a different branch. If you are submitting two
highly related packages or circular dependent packages please see
[here][cirdep]. The lighter dependent or package that can be installed without
a dependency should be submitted first; this is generally the associated data
package to a software package.
## Experiment Data Packages {#experPackage}
Experimental data packages contain data specific to a particular
analysis or experiment. They often accompany a software package for
use in the examples and vignettes and in general are not updated
regularly. If you need a general subset of data for workflows or
examples first check the AnnotationHub resource for available files
(e.g., BAM, FASTA, BigWig, etc.) or ExperimentHub for available processed
example data set already included in _Bioconductor_. If no current files or data
sets are appropriate consider an associated Experiment Data Package that
utilizes `r BiocStyle::Biocpkg("ExperimentHub")`.
If you have an associated data package for your software package,
please do *NOT* create a separate issue in the our tracker repository
for that. Instead, please add the data package repository to the same
issue as the software package. The process for doing this is
documented [here][cirdep]. Generally the data package should be submitted first.
The `r BiocStyle::Biocpkg("HubPub")` package is helpful for creating a template
for a hub package. The vignette [Create A Hub Package][] provides full details.
## Annotation Packages {#annPackage}
Annotation packages contain lightly or non-curated data from a public
source and are updated with each _Bioconductor_ release (every 6
months). They are a source of general annotation for one or many
organisms and are not specific to a particular experiment. When
possible, they should support the `select()` interface from
AnnotationDbi.
Annotation packages should *NOT* be posted to the tracker repository.
Instead send an email to <[email protected]> with a
description of the proposed annotation package and further instructions
of where to send the package will be provided. Whenever possible Annotation
Packages should use the `r BiocStyle::Biocpkg("AnnotationHub")` for managing
files.
The `r BiocStyle::Biocpkg("HubPub")` package is helpful for creating a template
for a hub package. The vignette [Create A Hub Package][] provides full details.
<a name="workPackage"></a>
## Workflow Packages {#workPackage}
Workflow packages contain vignettes that describe a bioinformatics workflow that
involves multiple Bioconductor packages. These vignettes are usually more
extensive than the vignettes that accompany software packages. These packages do
not need man/ or R/ directories nor a data/ directory as ideally workflows make
use of existing data in a Bioconductor package. See development section on
[Workflows][] for more details
<a name="whattoexpect"></a>
## Review Process {#whattoexpect}
* A new package is initially labeled as `1. awaiting moderation`.
A _Bioconductor_ team member will take a very brief look at your
package, to ensure that it is not doing anything malicious or
inappropriate. Packages that pass this stage will be labelled
`pre-check passed`.
* The moderator will add your package as a repository to the
git.bioconductor.org git server, copy the SSH keys from your github
account to the [BiocCredentials application][] application and the issue labelled
`pre-review`.
ALL CHANGES TO YOUR PACKAGE must be pushed to the
git.bioconductor.org repository created in this step. See the [New package
workflow][] for instructions on pushing changes to your git.bioconductor.org
repository.
* The package will be submitted to the _Bioconductor_ build
system (BBS). The system will check out your package from GitHub and move it
to our git.bioconductor.org git server. Please familiarize yourself with
[git][gitver] as the git.bioconductor.org is the versions the (BBS) will
always use. It will then run `R CMD build` to create a 'tarball' of your source code,
vignettes, and man pages. It will run `R CMD check` on the tarball,
to ensure that the package conforms to standard _R_ programming best
practices. _Bioconductor_ has chosen to utilize a custom `R CMD check`
environment; See [R CMD check environment][] for more details. Finally, the
build system will run `BiocCheckGitClone()` and `BiocCheck('new-package'=TRUE)` to ensure that
the package conforms to _Bioconductor_ `r BiocStyle::Biocpkg("BiocCheck")`
standards. The system will perform these steps using the ['devel'
version][] of _Bioconductor_, on three platforms
(Linux, Mac OS X, and Windows). After these steps are complete, a link to a
build report will be appended to the new package issue. Avoid surprises by
running these checks on your own computer, under the 'devel' version
of _Bioconductor_, before submitting your package.
* If the build report indicates problems, modify your package and
commit changes to the git.bioconductor.org version of your package as
described in the [new package git workflow](#new-package-workflow). If
there are problems that you do not understand, seek help on the
[bioc-devel][bioc-devel-mail] mailing list.
* To trigger a new build, include a version bump in your commit, e.g.,
from `Version: 0.99.0` to `Version: 0.99.1`. Pre-release versions utilize the
`0.99.z` format. When accepted and released, your package's version number
will be automatically incremented to 1.0.0.
* If in the pre-review process there are identified larger issues with the
package, a label `3e. pending pre-review changes` or a more specific flag of
package issue will be assigned. Please address any pre-review identified
issues and comment back for the package administrators to re-evaluate.
* Once your package builds and checks without errors or (avoidable)
warnings, the package is assigned a reviewer. The package will be
labelled `2. Review in progress`.
* The reviewer will provide a technical review of your package. Other
_Bioconductor_ developers and users with domain expertise are encouraged to
provide additional community commentary. Reviewers will add comments to the
issue you created.
* Please be courteous with your package reviewers and always follow the
_Bioconductor_ [Code Of Conduct][] with correspondence. Please allow 2-3 weeks
for reviewers to assess your package.
* Respond to the issues raised by the reviewers. You _must_ respond to
the primary reviewer, and are strongly encouraged to consider
community commentary. Typically your response will involve code
modifications; commit these to the default branch of git.bioconductor.org. When
you have addressed all concerns, add a comment to the issue created in step 2
to explain your response.
* The reviewer will assess your responses, perhaps suggesting further
modifications or clarification. The reviewer will then accept your
package for inclusion in _Bioconductor_, or decline it. The label
`2. review in progress` will be replaced by `3a. accepted` or
`3b. declined`.
* If your package is accepted, it will be added to _Bioconductor_'s
nightly 'devel' builds. All packages in the 'devel' branch of the repository
are 'released' to the user community once every six months, in
approximately April and October.
* Once the review process is complete, the issue you created will be
closed. All updates to your package will be through the
[_Bioconductor_ Git Server][gitver].
Please be mindful that reviewers are volunteers and package reviews are not
the only responsibility of _Bioconductor_ team members. We like to see the
review process progress by updates from the submitter or by comments from the
reviewer within 2-3 weeks. The entire review process typically takes between 2
and 6 weeks. If there is no response after 3 to 4 weeks, package reviewers may
close the issue until further updates, changes, and/or commentary are received.
## Following Acceptance {#afteraccept}
Following acceptance of a package:
* Packages accepted on the tracker repository are added to the 'devel'
branch of the _Bioconductor_ GIT repository, with the current version
number of the accepted package.
* Packages are then built by the _Bioconductor_ nightly build
process. On-demand builds of accepted packages do not occur. Please see the
[build reports][] for how often the different package types are built. The
changes pushed to a 'devel' version of a package can take up to 24-28 hours to
appear. See [build timings][]. If the build is successful, the package has its own
'landing page' created, and the package is made available to users
of the 'devel' branch of _Bioconductor_ via `BiocManager::install()`.
* Changes to their package (if any), should be done to version in the
[_Bioconductor_ git server][gitver].
* Developers should bump the `z` portion of their version number every
time they commit changes to their package, following the
[Version numbering][versioning] guidelines. If developers don't
bump the version, the changes made to their package *do not propagate*
to the _Bioconductor_ web site and package repository.
* Devlopers should make sure the maintainer email in the DESCRIPTION stays
accurate and reachable.
## Additional Support {#support}
We are eager to enhance the quality and interoperability of
_Bioconductor_ software and will provide additional support when
requested by package developers. Example areas of assistance include
use of appropriate S4 structures, specific guidance on efficient
implementation, guidance on code structure, and critical assessment of
package documentation and structure. Use the
[bioc-devel][bioc-devel-mail] mailing list or email
<[email protected]> to obtain additional support.
* Support Email: <[email protected]>