Ask HN: Processing multiple GBs of data on local machine
2 by EdwardDiego | 2 comments on Hacker News.
I recall a few blog posts on this, but I'm struggling to find them now that I need them. I have about 100GB of Log4j formatted logs to process to find a particular needle in the haystack, and am looking for a decent way to process those files locally without breaking out Spark in EMR etc. I recall a few blog posts on this subject, but my search fu is letting me down. Is this ringing bells for anyone? Thanks in advance :)

Post a Comment

Previous Post Next Post