To read only specific columns from CSV file using Pandas read_csv method we need to use parameter
Steps to read specific columns from CSV file
- Import pandas
- Define columns to be read
usecols=fields- to list columns to be read
- Subset of columns to select, denoted either by column labels or column indices.
low_memory = True- reading in chunks
- Internally process the file in chunks, resulting in lower memory use while parsing, but possibly mixed type inference.
index_col- to identify column X as index
- Column(s) to use as row label(s), denoted either by column labels or column indices.
headerto override the column names.
More information can be found: DataFrame.plot - secondary_y
Suppose we have the following CSV file which we like to read with Pandas. We want to read only single column from this CSV file into DataFrame:
,x,y,z 0,a,e,1 1,b,f,2 2,c,g,3 3,d,i,4
import pandas as pd cols = ['x', 'y'] df = pd.read_csv('data/data_0.csv', usecols = cols, low_memory = True)