This is one of the worst performance I have ever seen. I think even VBA can do better than that. This is what happens when you let amateurs lose on a linux environment. Goodness...well I am happy Python did the job for you, after all you can always brew some fresh coffee in between
The 1,000,000 Line Test
I created a simple Python script that would loop 20,000 times through the 59 line CSV file and output a 1,180,000 line file to test the efficiency of the script.
Python CSV to CSV Script
- INPUT: 1,180,000 lines of raw data from a 105mb CSV file.
- OUPUT: 1,000,000 lines of manipulated data to a 75mb CSV file.
- OUPUT: Print to linux terminal to show progress.
- TIME: 2 minutes 53 seconds.
Computer Specs
- Acer Aspire AM5641-E5651A desktop computer
- PCLinuxOS 2014.12 with Mate desktop.
- Intel(R) Core(TM)2 Duo CPU E7200 @ 2.53GHz
- 3 GB DDR2 Memory
- 640 GB SATA Hard disk
The output CSV file was then loaded into Gnumeric spreadsheet for some bold text and currency formatting. No math functions or entries were made in the spreadsheet.
![]()
Screenshot of the truncated 1,000,000 line CSV file in Gnumeric.
![]()