forked from stefpeschel/NetCoMi
-
Notifications
You must be signed in to change notification settings - Fork 0
/
NEWS.Rmd
503 lines (384 loc) · 20.5 KB
/
NEWS.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
---
output: github_document
---
## NetCoMi 1.1.0 <img src="man/figures/NetCoMi_logo_800x400_300dpi.png" align="right" width="200" />
### New features
* **renameTaxa()**: New function for renaming taxa in a taxonomic table.
It comes with functionality for
making unknown and unclassified taxa unique and substituting them by the next
higher known taxonomic level. E.g., an unknown genus "g__", where family is the
next higher known level, can automatically be renamed to
"1_Streptococcaceae(F)". User-defined patterns determine the format
of known and substituted names. Unknown names (e.g., NAs) and unclassified taxa
can be handled separately. Duplicated names within one or more chosen ranks can
also be made unique by numbering them consecutively.
* **editLabels()**: New function for editing node labels, i.e., shortening to a
certain length and removing unwanted characters. It is used by NetCoMi's plot
functions plot.microNetProps() and plot.diffnet().
* In `netCompare()`: The **adjusted Rand index** is also computed for the
largest connected component (LCC). The summary method has been adapted.
* Argument **"testRand"** added to `netCompare()`. Performing a permutation
test for the adjusted Rand index can now be disabled to save run time.
* **Graphlet-based network measures** implemented. NetCoMi contains two new
exported functions **`calcGCM()`** and **`calcGCD()`** to compute the Graphlet
Correlation Matrix (GCM) of a network and the Graphlet Correlation Distance
(GCD) between two networks.
**Orbits** for graphlets with up to four nodes are considered.
Furthermore, the GCM is computed with `netAnalyze()` and the
GCD with `netCompare()` (for the whole network and the largest connected
component, respectively). Also the orbit counts are returned. The GCD is added
to the summary for class `microNetComp` objects returned by `netCompare()`.
* **Significance test for the GCD**: If permutation tests are conducted with
`netCompare()`, the GCD is tested for being significantly different from zero.
* New function **`testGCM()`** to **test graphlet-based measures** for
significance. For a single GCM, the correlations are tested for being
significantly different from zero.
If two GCMs are given, it is tested if the correlations are
significantly different between the two groups, that is, the absolute
differences between correlations ( $|gc1_{ij}-gc2_{ij}|$ ) are tested
for being different from zero.
* New function **`plotHeat()`** for plotting a mixed heatmap where, for
instance, values are shown in the upper triangle and corresponding p-values or
significance codes in the lower triangle. The function is used for plotting
heatmaps of the GCMs, but could also be used for association matrices.
* `netAnalyze()` now by default returns a **heatmap of the GCM(s)** with
graphlet correlations in the upper triangle and significance codes in the lower
triangle.
* Argument **"doPlot"** added to `plot.microNetProps()` to suppress the plot if
only the return value is of interest.
* New **"show"** arguments are added to the summary methods for class
`microNetProps` and `microNetComp` objects. They specify which network
properties should be printed in the summary. See the help pages of
`summary.microNetProps` and `summary.microNetComp()` for details.
* New **zero replacement** method **"pseudoZO"** available in `netConstruct()`.
Instead of adding the desired pseudo count to the whole count matrix, it is
added to zero counts only if `pseudoZO` is chosen. The behavior of "pseudo"
(a further available method where a pseudo count is added to all counts) has not
changed. Adding a pseudo count only to zeros preserves the ratios between
non-zero counts, which is desirable.
* `createAssoPerm()` now accepts objects of class `microNet` as input (in
addition to objects of class `microNetProps`).
* **`SPRING's`** fast version of latent correlation computation (implemented in
[mixedCCA](https://github.com/irinagain/mixedCCA)) is available again.
It can be used by setting the `netConstruct()` parameter `measurePar$Rmethod`
to "approx", which is now the default again.
* The function **`multAdjust()`** now has an argument `pTrueNull` to pre-define
the proportion of true null hypotheses for the adaptive BH method.
* `netConstruct()` has a new argument **`assoBoot`**, which enables the
computation of bootstrap association matrices outside netConstruct() if
**bootstrapping** is used for sparsification. An example has been added to the
help page `?netConstruct`. This feature might be useful for very large
association matrices (for which the working memory might reach its limit).
### Bug fixes
* In `netConstruct()`:
+ Using **"bootstrap"** as sparsification method in
combination with one of the association methods "bicor", "cclasso", "ccrepe", or
"gcoda" led to the error: `argument "verbose" is missing, with no default`,
which has been fixed.
+ The **"signedPos"** transformation did not work properly.
Dissimilarities corresponding to negative correlations were set to zero instead
of infinity.
* In `editLabels()`: The function (and thus also `plot.microNetProps`)
threw an error if taxa have been renamed with
`renameTaxa` and the data contain more than 9 taxa with equal names, so that
double-digit numbers were added to avoid duplicates.
* Issues in network analysis and plotting if association matrices are used
for network construction, but **row and/or column names are missing**.
(issue [#65](https://github.com/stefpeschel/NetCoMi/issues/65))
* `diffnet()` threw an error if association matrices are used for network
construction instead of count matrices.
(issue [#66](https://github.com/stefpeschel/NetCoMi/issues/66))
* In `plot.microNetProps()`:
+ The function now directly returns an error if `x` has not the
expected class.
+ The `cut` parameter could not be changed.
* In **`cclasso()`**: In rare cases, the function produced complex numbers,
which led to an error.
### Further changes
* In **permutation tests**: The permuted group labels must now be different from
the original group vector. In other words, the original group vector is strictly
avoided in the matrix with permuted group labels. So far, only duplicates were
avoided. Only in exact permutation tests (if `nPerm` equals the possible number
of permutations), the original group vector is still included in the permutation
matrix. The calculation of p-values has been adapted to the new behavior:
*p=B/N* for exact p-values and *p=(B+1)/(N+1)* for approximated p-values, where
*B* is the number of permutation test statistics being larger than or equal to
the observed one, and *N* is the number of permutations. So far, *p=(B+1)/(N+1)*
has been used in all cases.
* In `plot.microNetProps()`:
+ The default of `shortenLabels` is now "none", i.e. the **labels are not
shortened by default**, to avoid confusion about the node labels.
+ The **edge filter** (specified via `edgeFilter` and `edgeInvisFilter`) now
refers to the estimated association/dissimilarities instead of edge weights.
E.g., setting the threshold to 0.3 for an association network hides edges
with a corresponding absolute association below 0.3 even though the edge
weight might be different (depending on the transformation used for network
construction). (issue [#26](https://github.com/stefpeschel/NetCoMi/issues/26))
+ If two networks are constructed and the **`cut`** parameter is not
user-defined, the mean of the two determined cut parameters is now used for
both networks so that edge thicknesses are comparable.
* More expressive messages and errors in `diffnet` and `plot.diffnet` if no
**differential associations** are detected.
* New function **`.suppress_warnings()`** to suppress certain warnings returned
by external functions.
* In `netConstruct` if **"multRepl"** is used for zero handling:
The warning about the proportion of zeros is suppressed by setting the
`multRepl()` parameter "z.warning" to 1.
* The functions **`makeCluster`** and **`stopCluster`** from `parallel` package
are now used for parallel computation because those from `snow` package
sometimes led to problems on Unix machines.
### Style
* The whole R code has been reformatted to follow general conventions.
* The element `"clustering_lcc"` as part of the `netAnalyze` output has changed
to `"clusteringLCC"` to be in line with the remaining output.
* Input argument checking of exported function has been revised. New functions
`.checkArgsXxx()` are added to perform argument checking outside the main
functions.
* Non-exported functions have been renamed to follow general naming conventions,
i.e. that of [Bioconductor](https://contributions.bioconductor.org/r-code.html):
+ Use camelCase for all functions.
+ Non-exported functions have prefix "."
+ The following functions have been renamed:
```{r, echo=FALSE, results='asis'}
library(knitr)
oldnames <- c(
"boottest",
"calc_association",
"calc_diff_props",
"calc_jaccard",
"calc_props",
"diff_connect_pairs",
"diff_connect_variables",
"diff_connect_network",
"filter_edges",
"filter_nodes",
"filter_samples",
"filter_taxa",
"first_unequal_element",
"get_clust_cols",
"get_node_size",
"get_perm_group_mat",
"get_vec_names",
"norm_counts",
"permtest_diff_asso",
"scale_diss",
"sparsify",
"trans_to_diss",
"trans_to_sim",
"trans_to_adja",
"zero_treat"
)
newnames <- c(
".boottest",
".calcAssociation",
".calcDiffProps",
".calcJaccard",
".calcProps",
".diffConnectPairs",
".diffConnectVariables",
".diffConnectNetwork",
".filterEdges",
".filterNodes",
".filterSamples",
".filterTaxa",
".firstUnequalElement",
".getClustCols",
".getNodeSize",
".getPermGroupMat",
".getVecNames",
".normCounts",
".permTestDiffAsso",
".scaleDiss",
".sparsify",
".transToDiss",
".transToSim",
".transToAdja",
".zeroTreat"
)
namemat <- cbind(oldnames, newnames)
colnames(namemat) <- c("Old names", "New names")
kable(namemat)
```
## NetCoMi 1.0.3
This is a minor release with some bug fixes and changes in the documentation.
### Bug fixes
* `netConstruct()` threw an error if the data had no row and/or column names,
which is fixed.
* An edge list is added to the output of `netConstruct()` (issue [#41](https://github.com/stefpeschel/NetCoMi/issues/41)). See the
help page for details.
* `SPRING`'s fast version of latent correlation computation (implemented in
[mixedCCA](https://github.com/irinagain/mixedCCA)) is currently not available
due to deprecation of the R package `chebpol`. The issue is fixed by setting
the `netConstruct()` parameter `measurePar$Rmethod` internally to "original" if
SPRING is used for association estimation.
* In `plot.microNetProps()`: The `xpd` parameter is changed to `NA` so that
plotting outside the plot region is possible (useful for legends or additional
text).
* Labels in the network plot can now be suppressed by setting `labels = FALSE`
(issue [#43](https://github.com/stefpeschel/NetCoMi/issues/43))
* The `netCompare()` function threw an error if one of the permutation networks
was empty, i.e. had no edges with weight different from zero (issue [#38](https://github.com/stefpeschel/NetCoMi/issues/38)),
which is now fixed.
* Fix issues [#29](https://github.com/stefpeschel/NetCoMi/issues/29) and [#40](https://github.com/stefpeschel/NetCoMi/issues/40), where permutation tests
did not terminate for small sample sizes.
Now, if the possible number of permutations (resulting from the sample
size) is smaller than that defined by the user, the function stops and returns
an error.
* Fix a bug in `diffnet()` (issue
[#51](https://github.com/stefpeschel/NetCoMi/issues/51)), where colors in
differential networks could not be changed.
* `diffnet()` threw an error if the `netConstruct()` argument `jointPrepro` was
set to `TRUE`.
## NetCoMi 1.0.2
This release includes a range of new features and fixes known bugs and issues.
### New features
#### Improved installation process
Packages that are optionally required in certain settings are not installed
together with `NetCoMi` anymore.
Instead, there is a new function `installNetCoMiPacks()` for installing the
remaining packages.
If not installed via `installNetCoMiPacks()`, the required package is installed
by the respective NetCoMi function when needed.
#### installNetCoMiPacks()
New function for installing the R packages used in NetCoMi not listed as
`dependencies` or `imports` in NetCoMi's description file.
#### netConstruct()
* New argument `matchDesign`: Implements matched-group (i.e. matched-pair) designs,
which are used for permutation tests in `netCompare()` and `diffnet()`. `c(1,2)`,
for instance, means that one sample in the first group is matched to two samples
in the second group. If the argument is not `NULL`, the matched-group design is
kept when generating permuted data.
* New argument `jointPrepro`: Specifies whether two data sets (of group one and
two) should be preprocessed together. Preprocessing includes sample and taxa
filtering, zero treatment, and normalization. Defaults to `TRUE` if `data` and
`group` are given, and to `FALSE` if `data` and `data2` are given, which is
similar to the behavior of `NetCoMi 1.0.1`. For dissimilarity networks, no joint
preprocessing is possible.
* `mclr(){SPRING}` is now available as normalization method.
* `clr{SpiecEasi}` is used for centered log-ratio transformation
instead of `cenLR(){robCompositions}`.
* `"symBetaMode"` is accepted as list element of `measurePar`, which is passed to
`symBeta(){SpiecEasi}`. Only needed for SpiecEasi or SPRING associations.
* The pseudocount (if `zeroMethod = "pseudo"`) may be freely specified. In
v1.0.1, only unit pseudocounts were possible.
#### netAnalyze()
* Global network properties are now computed for the whole network as well as
for the largest connected component (LCC).
The summary of network properties now contains for the whole network only
statistics that are not based on shortest paths (or, more generally, also
meaningful for disconnected networks).
For the LCC, all global properties available in NetCoMi are shown.
* New global network properties (see the docu of `netAnalyze()` for definitions):
+ Number of components (only whole network)
+ Relative LCC size (only LCC)
+ Positive edge percentage
+ Natural connectivity
+ Average dissimilarity (only meaningful for the LCC)
+ Average path length (only meaningful for the LCC)
* New argument `centrLCC`: Specifies whether to compute centralities only for
the LCC. If `TRUE`, centrality values of disconnected components are zero.
* New argument `avDissIgnoreInf`: Indicates whether infinite values should be
ignored in the average dissimilarity. If `FALSE`, infinities are set to 1.
* New argument `sPathAlgo`: Algorithm used for computing shortest paths
* New argument `sPathNorm`: Indicates whether shortest paths should be normalized
by average dissimilarity to improve interpretability.
* New argument `normNatConnect`: Indicates whether to normalize natural
connectivity values.
* New argument `weightClustCoef`: Specifies the algorithm used for computing the
global clustering coefficient. If `FALSE`, `transitivity(){igraph}` with
`type = "global"` is used (similar to `NetCoMi 1.0.1`). If `TRUE`, the local
clustering coefficient is computed using `transitivity(){igraph}` with
`type = "barrat"`. The global clustering coefficient is then the arithmetic
mean of local values.
* Argument `connect` has been changed to `connectivity`.
* Documentation extended by definitions of network properties.
#### summary.microNetProps()
* New argument `clusterLCC`: Indicates whether clusters should be shown for the
whole network or only for the LCC.
* The `print` method for `summary.microNetProps` was completely revised.
#### plot.microNetProps()
* All normalization methods available for network construction can now be used
for scaling node sizes (argument `nodeSize`).
* New argument `normPar`: Optional parameters used for normalization.
* Usage of `colorVec` changed: Node colors can now be set separately in both
groups (`colorVec` can be a single vector or a list with two vectors). Usage
depends on `nodeColor` (see docu of `colorVec`).
* New argument `sameFeatCol`: If `nodeColor = "feature"` and `colorVec` is not
given, `sameFeatCol` indicates whether same features should have same colors in
both groups.
* Argument `colorNegAsso` has been renamed to `negDiffCol`. Using the old name
leads to a warning.
* New functionality for using the same layout in both groups (if two networks
are plotted). In addition to computing the layout for one group and adopting it
for the other group, a union of both layout can be computed and used in both
groups so that nodes are placed as optimal as possible equally for both networks.
This option is applied via `sameLayout = TRUE` and `layoutGroup = "union"`.
Many thanks to [Christian L. Müller](https://github.com/muellsen?tab=followers)
and [Alice Sommer](https://www.iq.harvard.edu/people/alice-sommer) for providing
the idea and R code for this new feature!
#### netCompare()
* New arguments for storing association and count matrices of the permuted data
into an external file:
+ `fileLoadAssoPerm`
+ `fileLoadCountsPerm`
+ `storeAssoPerm`
+ `fileStoreAssoPerm`
+ `storeCountsPerm`
+ `fileStoreCountsPerm`
* New argument `returnPermProps`: If `TRUE`, global network properties and the
respective absolute group differences of the permuted data are returned.
* New argument `returnPermCentr`: If `TRUE`, the computed centrality values
and the respective absolute group differences of the permuted data are returned
as list with a matrix for each centrality measure.
* The arguments `assoPerm` and `dissPerm` are still existent for compatibility
with `NetCoMi 1.0.1` but the former elements `assoPerm` and `dissPerm` are not
returned anymore (matrices are stored in an external file instead).
#### createAssoPerm()
New function for creating association/dissimilarity matrices for permuted count
data. The stored count or association/dissimilarity matrices can then be passed
to `netCompare()` or `diffnet()` to decrease runtime. The function also
allows to generate a matrix permuted group labels without computing associations.
Using this matrix, `createAssoPerm()` furthermore allows to estimate the
permutation associations/dissimilarities in blocks
(by passing only a subset of the permuted group matrix to `createAssoPerm()`).
#### summary.microNetComp()
Summary method has been adapted to the new network properties (analogous to the
summary of `microNetProps` objects, which are returned from `netAnalyze()`)
#### diffnet()
* New arguments for storing association and count matrices of the permuted data
into an external file:
+ `fileLoadAssoPerm`
+ `fileLoadCountsPerm`
+ `storeAssoPerm`
+ `fileStoreAssoPerm`
+ `storeCountsPerm`
+ `fileStoreCountsPerm`
* The argument `assoPerm` is still existent for compatibility with `NetCoMi 1.0.1`
but the former element `assoPerm` is not returned anymore (matrices are
stored in an external file instead).
* Changed output: For permutation tests and Fisher's z-test, a vector and matrix
with p-values and the corresponding matrix with group differences are returned
for both with and without multiple testing adjustment.
* Documentation has been revised.
#### plot.diffnet()
* New argument `adjusted`: Indicates whether the adjacency matrix (matrix with
group differences) based on adjusted or unadjusted p-values should be plotted.
* New argument `legendPos` for positioning the legend.
* New argument `legendArgs` for specifying further arguments passed to `legend`.
#### colToTransp()
* The function is now exported and its name has changed from `col_to_transp()`
to `colToTransp()`. The function expects a color vector as input and adds
transparency to each color.
### Bug fixes
The major issues fixed in this release are:
* The following error is solved: `Error in update.list(...): argument "new" is
missing`. The error was caused by a conflict between `SpiecEasi` and
`metagenomeSeq`, in particular by `gplot` as a dependency of `metagenomeSeq`.
A former version of `gplot` was dependend on `gdata`, which caused the conflict.
So, please update `gplot` and remove the package `gdata` to fix the error.
* `sparcc()` from SpiecEasi package is now used for estimating SparCC associations.
For some users, NetCoMi's `Rccp` implementation of SparCC caused errors when
installing NetCoMi. If these are fixed, the Rcpp implementation will
be included again, so that users can decide between the two SparCC versions.
* VST transformations are now computed correctly.
* Error when plotting two networks, where one network is empty, has been fixed.