Add option to disable checksums for external files and blobs #1025
guzman-raphael merged 32 commits into datajoint:master
Perhaps a better way would be to switch checksums globally rather than in each fetch call.
@dimitri-yatsenko What if we added a [...] The main reason we had included this at the scope of each [...]
I agree that a global [...]
How about to summarize:
@dimitri-yatsenko Thumbs up if you agree, and let me know if you want to catch up on this.
@dimitri-yatsenko and I met yesterday and discussed this. We converged on a simpler approach that could address the issue more directly. To summarize:
guzman-raphael left a comment:
@jverswijver ✨ Great work! This is coming together nicely. ✨
Suggesting minor points to simplify some sections.
Additionally, could you:
- Add changelog/releaselog
- Add a test
This now depends on #1031 being merged first, because the test needs the new logger to check whether a warning is emitted when checksums are disabled.
guzman-raphael left a comment:
@jverswijver 👏 Nicely done! 👏
Adds an option to disable checksums when calling fetch or fetch1. This has big implications when fetching large (100 GB+) files, as verifying their checksums can take a long time. I also added a warning/confirmation message telling the user that they are losing the guarantee that their data is unmodified. You can disable the warning by setting dj.config['safemode'] = False.
[Screenshot: example of the warning prompt and safemode usage]
I have implemented a solution to disable checksums when the file size goes over a certain threshold. To use it, set filepath_checksum_size_limit in your DJ config to the size threshold that you want, in bytes. It defaults to None, in which case checksums are always used.
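The threshold behavior described above can be sketched in plain Python. This is a minimal illustration and not the code from this PR. Only the key name filepath_checksum_size_limit and its None default come from the description. The `config` dict, the `checksum_or_skip` helper, the choice of SHA-256, and the strict "greater than" comparison are all assumptions made for the example.

```python
import hashlib
import os
import tempfile

# Stand-in for the DataJoint config (assumption); only the key name and
# its None default are taken from the PR description.
config = {"filepath_checksum_size_limit": None}


def checksum_or_skip(path, size_limit=None):
    """Hypothetical helper: return the SHA-256 hex digest of the file,
    or None when the file is larger than size_limit (in bytes)."""
    if size_limit is not None and os.path.getsize(path) > size_limit:
        # Checksum skipped: the file's integrity is no longer verified.
        return None
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # Read in 1 MiB chunks so large files are not loaded into memory.
        for chunk in iter(lambda: f.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


# Demo file of 2048 bytes.
with tempfile.NamedTemporaryFile(delete=False) as tmp:
    tmp.write(b"x" * 2048)
    demo_path = tmp.name

# Default (None): the checksum is always computed.
with_default = checksum_or_skip(demo_path, config["filepath_checksum_size_limit"])

# Limit of 1024 bytes: the 2048-byte file exceeds it, so the checksum is skipped.
config["filepath_checksum_size_limit"] = 1024
with_limit = checksum_or_skip(demo_path, config["filepath_checksum_size_limit"])

os.remove(demo_path)
```

In DataJoint itself, going by the description above, the corresponding setting would be made with dj.config['filepath_checksum_size_limit'] set to a byte count.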