Calculation of CLR transformation #2

Vlasovets · 2022-02-02T20:33:06Z

the currently used function looks as follows:

def transform_features(
    features: pd.DataFrame, transformation: Str = "clr", coef: float = 0.5
) -> pd.DataFrame:
    if transformation == "clr":
        X = features.values
        #null_set = X <= 0.0 # just ignore zero replacement for the sake of experiment
        #X[null_set] = coef
        X = np.log(X)
		____________
		#bug is here
        X = (X.T - np.mean(X, axis=1)).T
		# one could change to 
		X = (X - np.mean(X, axis=0))
		____________

        return pd.DataFrame(
            data=X, index=list(features.index), columns=list(features.columns)
        )

    else:
        raise ValueError(
            "Unknown transformation name, use clr and not %r" % transformation
        )

a small test example, if check the first two values by hand, you will see the mistake

df = pd.DataFrame(np.random.randint(0,10,size=(3, 3)), columns=list('ABC'))
transform_features(df)

Cheers,
Oleg

# Conflicts: # .idea/workspace.xml

Leo-Simpson · 2022-02-02T23:44:26Z

Hello Oleg,
Thank you for the pull request, and for "taking the lead" on the project somehow.

So it might be yes, then there might be something I do not understand. Shouldn't the mean be performed on each row of matrix X (if row i contains the features of sample i ) ?

Vlasovets · 2022-02-03T13:14:31Z

oh, if row i contains the features of sample i", so your X is an (nxp) matrix where n - samples, p - features then axis=1 is correct since we need a geometric mean per sample
thanks for the clarification and sorry if I confused you.

Vlasovets added 9 commits September 23, 2021 17:04

cloned the version edbf9d2 of q2-classo

95c17f9

cloned the version edbf9d2 of q2-classo

03c1236

cloned the version edbf9d2 of q2-classo

686954f

cloned the version edbf9d2 of q2-classo

c063580

ran the tutorial

e72f7ed

test all

be25d05

cloned the version edbf9d2 of q2-classo

949a50b

Merge remote-tracking branch 'o-q2-classo/o-q2-classo' into o-q2-classo

5792449

# Conflicts: # .idea/workspace.xml

fixed the error in CLR transformation

1fe52c0

Vlasovets changed the title ~~Mistake in calculation CLR transform~~ Mistake in calculation of CLR transformation Feb 2, 2022

Vlasovets changed the title ~~Mistake in calculation of CLR transformation~~ Calculation of CLR transformation Feb 2, 2022

Vlasovets added 2 commits February 2, 2022 22:23

fixed the naming bug

5c401a4

add consistency with qiime2 naming

21eadd9

Vlasovets added 15 commits February 3, 2022 14:17

clarified the ordination of X and respawned original function from Leo

4e5d611

fixed the type of parameters

51a536d

clean up repository

ab02426

update .gitignore

2a60e9a

update .gitignore

a253022

update .gitignore

41f1e30

update .gitignore

eb4c6b0

add acm count table

29b89fa

set q2 STR type for parameters

98c0955

allow Composition table as output

52c8934

change color pallete

c9c386e

revert changes in output format

abb14ab

fix typo in type assignment

0cc3af7

fix a bug with plotting CV

36b2970

use python native str type instead of q2 Str

57d5b28

Vlasovets added 6 commits August 18, 2023 17:52

make compatible with q2-gglasso

637fdc8

use Design type instead of Composition

7936964

use Design type instead of Composition

f494f89

save svg images

15c0fe1

save svg images

5b3e0bc

change Table type to frequency

44f429f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Calculation of CLR transformation #2

Calculation of CLR transformation #2

Vlasovets commented Feb 2, 2022 •

edited

Loading

Leo-Simpson commented Feb 2, 2022

Vlasovets commented Feb 3, 2022

Calculation of CLR transformation #2

Are you sure you want to change the base?

Calculation of CLR transformation #2

Conversation

Vlasovets commented Feb 2, 2022 • edited Loading

Leo-Simpson commented Feb 2, 2022

Vlasovets commented Feb 3, 2022

Vlasovets commented Feb 2, 2022 •

edited

Loading