Removal of adata preprocessing #3

YiFuiC · 2024-08-12T14:35:09Z

Hi, I'm currently working on improving GPcounts and found that removal of sc.pp.log1p(adata) would result in better model performance. This is especially true for lower sigma values and could be possibly due to the logarithm decreasing the dispersion of the counts that are sampled from poisson distribution.

Additionally, we noticed a bug when generating scales for the use of GPcounts. When setting the family parameter of smf.glm() it should be set to sm.families.NegativeBinomial(sm.families.links.identity())).fit() instead of sm.families.NegativeBinomial(sm.families.links.log())).fit(). This fixes the previous errors where most real data counts could not be fitted.

These changes have been edited in the fork accompanying this pull request. I eagerly await for your response.

Removal of adata preprocessing

723b625

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Removal of adata preprocessing #3

Removal of adata preprocessing #3

Uh oh!

YiFuiC commented Aug 12, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Removal of adata preprocessing #3

Are you sure you want to change the base?

Removal of adata preprocessing #3

Uh oh!

Conversation

YiFuiC commented Aug 12, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant