Sys Admin/DevOps and junior developer at Whisk.
This blog is mainly a place to show off my projects and to share and discuss technology news and interests.
Here’s a common problem: have you ever wanted to add up a very large list (hundreds of megabytes), grep through it, or run some other operation that is embarrassingly parallel? Data scientists, I am talking to you. You probably have four or more cores, but our tried-and-true tools like grep, bzip2, wc, awk, sed and so forth are single-threaded and will only use one of them.
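To make "embarrassingly parallel" concrete, here is a minimal Python sketch of the add-up-a-big-list case. It is only an illustration: the names `chunked` and `parallel_sum` and the default chunk size are my own assumptions, not part of any tool mentioned above. The idea is to cut the list into independent chunks, sum each chunk in its own worker process, and then add up the partial sums.

```python
from multiprocessing import Pool, cpu_count


def chunked(numbers, size):
    """Split a list into consecutive slices of at most `size` items."""
    return [numbers[i:i + size] for i in range(0, len(numbers), size)]


def parallel_sum(numbers, workers=None, size=100_000):
    """Sum each chunk in a worker process, then add the partial sums.

    `workers` and `size` are hypothetical tuning knobs for this sketch.
    """
    chunks = chunked(numbers, size)
    with Pool(processes=workers or cpu_count()) as pool:
        # The built-in sum is applied to every chunk; chunks do not
        # depend on each other, so they can run on separate cores.
        partial_sums = pool.map(sum, chunks)
    return sum(partial_sums)


if __name__ == "__main__":
    data = list(range(1_000_000))
    print(parallel_sum(data))
```

Because no chunk needs anything from any other chunk, the work spreads across as many cores as you have. In practice the cost of shipping chunks to the workers can eat into the gain, so treat this as a sketch of the pattern rather than a benchmark.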