Improved method for reading the first/last/specific line from csv into a DataFrame
Ever dealt with multiple huge csv files and found that the pandas read_csv/skiprows method is slowing you down? You are not alone.
read-csv-turbo is an improved method of reading the first and last lines of a csv: it uses the unix head and tail commands to get the data you want into a dataframe as fast as possible. I may include Windows support in the future if requested.
Reading a large csv once is “fine”, but I often find myself looping through many files, and that process is painfully slow, which is why the StackOverflow suggestions didn’t cut it. There may be a newer/smarter way of approaching this, but this method should be about as fast as you can get.
At the moment the use case is quite limited: it just provides a fast way to read the first, last or nth row of a csv into a dataframe.
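To make the head/tail idea concrete, here is a minimal sketch of the general technique. It is not this package's actual API: the function names (`read_first_rows`, `read_last_rows`, `read_nth_row`) are hypothetical and exist only for illustration. It assumes a unix-like system with `head` and `tail` on the PATH, a csv whose first line is the header, and a file with more data rows than you ask for.

```python
import io
import subprocess

import pandas as pd


def _run(cmd):
    # Run a command and return its stdout as text; raises on a non-zero exit.
    return subprocess.run(cmd, capture_output=True, text=True, check=True).stdout


def read_first_rows(path, n=1):
    # Hypothetical helper: `head` returns the header plus the first n data rows.
    out = _run(["head", "-n", str(n + 1), path])
    return pd.read_csv(io.StringIO(out))


def read_last_rows(path, n=1):
    # Hypothetical helper: header from `head`, last n lines from `tail`.
    # Assumes the file has more than n data rows, so `tail` never
    # returns the header line as well.
    header = _run(["head", "-n", "1", path])
    rows = _run(["tail", "-n", str(n), path])
    return pd.read_csv(io.StringIO(header + rows))


def read_nth_row(path, n):
    # Hypothetical helper: n is the 1-based index of a data row.
    # `head` stops reading after line n + 1, so the rest of the file is skipped.
    lines = _run(["head", "-n", str(n + 1), path]).splitlines(keepends=True)
    if len(lines) < n + 1:
        raise IndexError(f"file has fewer than {n} data rows")
    return pd.read_csv(io.StringIO(lines[0] + lines[n]))
```

Only the handful of lines returned by `head` or `tail` ever reach pandas, which is where the saving over parsing the whole file comes from. Calling `read_last_rows("big.csv")` on a hypothetical `big.csv` would return a one-row DataFrame holding the final record.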
Hashes for readcsvturbo-0.0.6-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | 40470ed64cfff1781001b7b6f999fb1774d84ff89bfcaa867bb370b925883f5b
MD5 | 9246530c0b60c66b7d25428a0582718f
BLAKE2b-256 | c48831de83d028372866fbffecc0653120ccda32912f93f207ab7997669483cb