Checksums: use blocks to read the files by wpoely86 · Pull Request #836 · easybuilders/easybuild-framework

wpoely86 · 2014-02-05T10:59:40Z

We need to read a file in blocks to calculate the checksum because else, the memory blows up if you try to install icc.

This PR fixes it for sha1 and md5 but not yet for addler32 or crc32. I will try to fix them when I find the time.

When calculating the checksums of a large file, we need to read it into blocks (currently 16MB) to keep memory usage acceptable.

JensTimmerman · 2014-02-05T11:15:34Z

this would be nicer if it was just on the next line...(starting at the (")

itkovian · 2014-02-05T11:21:43Z

"parents", while you are cleaning up stuff.

itkovian · 2014-02-05T12:30:45Z

Looks OK to me.

stdweird · 2014-02-06T10:41:09Z

split off the hash functions from the dicts/lambda's. too much duplicated code

try: import hashlib md5_func=hashlib.md5 sha1_func=hashlib.sha1 except ImportError: import md5, sha md5_func=md5.md5 sha1_func=sha.sha CHECKSUM_FUNCTIONS['md5'] = lambda p: calc_block_checksum(p, md5_func()) CHECKSUM_FUNCTIONS['sha1'] = lambda p: calc_block_checksum(p, sha1_func())

stdweird · 2014-02-06T12:15:27Z

class names are camelcase

stdweird · 2014-02-06T12:44:33Z

do the try/except a bit higher, and all this in the dict above. this looks odd now.

stdweird · 2014-02-06T13:05:30Z

@wpoely86 nice!
@boegel ready

JensTimmerman · 2014-02-06T16:49:53Z

are you sure you need to call md5_func and sha1_func here? It seems like you just want to pass them along.
(did this pass the untit tests?)

We call them to have a md5 or sha1 object to work with. It has passed the unit tests (the last one failed because the disk was full, @boegel was going to fix it).

so better to call the md5_class and sha1_class then? (and i wasn't going to make any more remarks...)

Fair point, the would be more correct.

Represents more correct what the variable is/does.

boegel · 2014-02-08T20:29:42Z

Thanks @wpoely86, thanks for the reviewing @itkovian, @stdweird, @JensTimmerman!

Checksums: use blocks to read the files

wpoely86 added 2 commits February 5, 2014 10:55

filetools.py: pep8 clean up

805ae58

checksums: read sha1 and md5 into blocks

d046f6c

When calculating the checksums of a large file, we need to read it into blocks (currently 16MB) to keep memory usage acceptable.

JensTimmerman reviewed Feb 5, 2014
View reviewed changes

filetools.py: fix indenting

ec5e106

itkovian reviewed Feb 5, 2014
View reviewed changes

Comment thread easybuild/tools/filetools.py Outdated

Copy link
Copy Markdown

Contributor

itkovian Feb 5, 2014

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

"parents", while you are cleaning up stuff.

wpoely86 added 3 commits February 5, 2014 13:31

filetools.py: fix typo

5e5870c

filetools.py: use python 2.x stuff

9bf488f

filetools.py: python 2.4 comp. => don't use blocksize

ef2949b

stdweird reviewed Feb 6, 2014
View reviewed changes

wpoely86 added 2 commits February 6, 2014 12:12

filetools.py: nicier imports of checksums and explain magic number

8cebe18

filetools.py: use hashlib like wrapper for addler32 and crc32

e43c0ab

stdweird reviewed Feb 6, 2014
View reviewed changes

Comment thread easybuild/tools/filetools.py Outdated

Copy link
Copy Markdown

Contributor

stdweird Feb 6, 2014

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

class names are camelcase

filetools.py: small changes after remarks

429b3cf

stdweird reviewed Feb 6, 2014
View reviewed changes

filetools.py: reshuffled a bit

7c1d372

JensTimmerman reviewed Feb 6, 2014
View reviewed changes

wpoely86 added 2 commits February 6, 2014 18:01

ZlibChecksum: use new style class

550033d

filetools.py: change some variable names

9822a97

Represents more correct what the variable is/does.

boegel added a commit that referenced this pull request Feb 8, 2014

Merge pull request #836 from wpoely86/checksums

c76599c

Checksums: use blocks to read the files

boegel merged commit c76599c into easybuilders:develop Feb 8, 2014

wpoely86 deleted the checksums branch February 9, 2014 19:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Checksums: use blocks to read the files#836

Checksums: use blocks to read the files#836
boegel merged 12 commits intoeasybuilders:developfrom
wpoely86:checksums

wpoely86 commented Feb 5, 2014

Uh oh!

JensTimmerman Feb 5, 2014

Uh oh!

itkovian Feb 5, 2014

Uh oh!

itkovian commented Feb 5, 2014

Uh oh!

stdweird Feb 6, 2014

Uh oh!

stdweird Feb 6, 2014

Uh oh!

stdweird Feb 6, 2014

Uh oh!

stdweird commented Feb 6, 2014

Uh oh!

JensTimmerman Feb 6, 2014

Uh oh!

wpoely86 Feb 6, 2014

Uh oh!

stdweird Feb 6, 2014

Uh oh!

wpoely86 Feb 6, 2014

Uh oh!

boegel commented Feb 8, 2014

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

wpoely86 commented Feb 5, 2014

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

itkovian commented Feb 5, 2014

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

stdweird commented Feb 6, 2014

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

boegel commented Feb 8, 2014

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants