I plan to write a series of articles discussing some simple but not embarrassingly parallel algorithms. These algorithms have practical uses and will most likely target many-core CPUs or CUDA GPUs. Today’s article is the first in the series, and it discusses a parallel implementation of a CSV file parser. In the old days, when spinning-disk speeds maxed out at 100 MiB/s, we only had two choices: either we don’t care a