Comments on: About https://rforwork.wordpress.com Data for Life :) Thu, 28 May 2015 22:22:46 +0000 hourly 1 http://wordpress.com/ By: prradep https://rforwork.wordpress.com/about/#comment-10108 Thu, 28 May 2015 22:22:46 +0000 http://rforwork.wordpress.com/?page_id=2#comment-10108 Nice explanation.

]]>
By: Craig Offman https://rforwork.wordpress.com/about/#comment-3999 Tue, 22 Jul 2014 16:44:41 +0000 http://rforwork.wordpress.com/?page_id=2#comment-3999 Hi. It’s Craig Offman, the Globe reporter you spoke to last fall about ebikes. I’d like to follow up with you. Are you free in the next day or two for a chat? Much appreciated.

]]>
By: Alec https://rforwork.wordpress.com/about/#comment-641 Wed, 23 Oct 2013 21:19:19 +0000 http://rforwork.wordpress.com/?page_id=2#comment-641 Hi, I’m a community blog curator for DZone. I wanted to talk with you about potentially featuring your blog on DZone’s content portals. If you’re interested, please send me an email at [email protected] and I’ll explain the details.

Thank you!

]]>
By: inkhorn82 https://rforwork.wordpress.com/about/#comment-367 Sun, 14 Apr 2013 01:10:09 +0000 http://rforwork.wordpress.com/?page_id=2#comment-367 In reply to zelazny7.

Hi there,

I can’t say I’ve had to work with data sets containing millions of records and thousands of variables, but I can say that ff will definitely be able to handle it. Like SAS, ff allows you to analyze data without loading all of it into memory at once. I would for sure recommend using ff to solve your work needs. Just remember that when working with ff, everything needs to be treated accordingly. When making new vectors, they always seem to need to be declared (e.g. “as.ff(ifelse(x[,”something”] == “blah”,1,0)) ) and not every R function works the same when your data is in ff form.

Good luck!

]]>
By: zelazny7 https://rforwork.wordpress.com/about/#comment-364 Sat, 06 Apr 2013 22:26:01 +0000 http://rforwork.wordpress.com/?page_id=2#comment-364 Hi, I’m just curious about the size of the datasets you typically work with? I found this site when searching for help on using the ff package. I regularly use SAS analyze credit risk data on datasets consisting of millions of records and thousands of variables. The data I analyze are also of mixed modes with a roughly 50/50 split between character and numeric.

Is this kind of data similar to what you analyze? I have been struggling to find a consistent workflow for analyzing large datasets (not “big” data) using open-source tools like R. I need to read in csv files too big to fit in memory, explore the data, create new column vectors, and store it all on disk somehow. Would you recommend the ff package as a solution that can handle this?

Cheers, and thanks for the interesting blog posts!

]]>
By: inkhorn82 https://rforwork.wordpress.com/about/#comment-171 Thu, 03 Jan 2013 16:47:22 +0000 http://rforwork.wordpress.com/?page_id=2#comment-171 In reply to PT.

Thanks for your comment. I sent an email to the address associated with the account you’ve posted from. Is that the right email to reach you at?

]]>
By: inkhorn82 https://rforwork.wordpress.com/about/#comment-169 Thu, 03 Jan 2013 14:30:26 +0000 http://rforwork.wordpress.com/?page_id=2#comment-169 In reply to ramsi haddad.

Hi there,

I wish it were so easy that I could just analyze the English translations, but if I were to do that then the results would probably say a lot more about the person doing the translating than the original text.

]]>
By: PT https://rforwork.wordpress.com/about/#comment-167 Thu, 03 Jan 2013 02:21:41 +0000 http://rforwork.wordpress.com/?page_id=2#comment-167 Your analysis on the authorship of the Hebrew Bible is interesting. I can help you with your questions, but it will be more suitable to an email exchange rather than posting a brief comment.

]]>
By: ramsi haddad https://rforwork.wordpress.com/about/#comment-164 Wed, 02 Jan 2013 23:31:22 +0000 http://rforwork.wordpress.com/?page_id=2#comment-164 dear sir,
i wonder if you repeated the analysis on an english translation of the first 5 books of the christian bible (the Torah isn’t translated, is it?). the nice thing there is that there are cifferent translations. king james, new standard american, whatever… presumably, the translations should give the same classifications. i wonder if that would give your analysis more legitamcy. i guess the translations are like adding noise to the undelying text. if tne results are the same with the addition of noise, then i would think that provides some measure of confidence?
best regards.
ramsi
[email protected]

]]>