Codecov Report
✅ All modified and coverable lines are covered by tests.
@@ Coverage Diff @@
## main #1010 +/- ##
=======================================
Coverage 92.20% 92.20%
=======================================
Files 49 49
Lines 7560 7560
=======================================
Hits 6971 6971
Misses 589 589
|
Hi @jonas2612, thanks for the PR. However, before I look further into this, could you please check whether the problem persists when you use the PR that unpins dask, #1006? I had a different issue with partitioning, but it could be connected, so I want to see whether it would be fixed. Also, in general, |
|
Hi @melonora. Yes, of course. I'll check it today and get back to you |
|
Cheers, please also check this https://github.com/melonora/spatialdata/blob/e017ca7d6107623750196606a07fe8e4407c242f/src/spatialdata/_core/operations/rasterize.py#L674-L677. This might provide some context for what I mentioned in my message above. |
|
@jonas2612, the reason why it is important is because |
|
Thanks @jonas2612 for the PR, I will merge it now. It introduces a performance regression, but we can address it later. I did some quick tests and it seems that data[c] = data[c].cat.as_known() takes the same time as data[c] = data[c].cat.set_categories(data[c].compute().cat.categories). Since there may be some concern around the order of categories with |
|
I added a test, which fails before the new change in the PR. As I explain in the test, |
|
I'll skip the CI from the latest push since I just removed a comment. |
Possible solution to Issue #1009