How to Read CSV Only Specific Columns Into Pandas DataFrame
-
Use the
read_csv()
Function and Pass a List of Column Names to Be Read -
Use the
read_csv()
Function and Pass List of Column Numbers to Be Read
This tutorial will discuss different methods to read specific columns from a csv file using Pandas. We’ll use the following CSV file for the demo.
Course | Mentor | price $ |
|
---|---|---|---|
0 | Python | Robert | 199 |
1 | Spark | Elizibeth | 299 |
2 | Java | Nolan | 99 |
3 | JavaScript | Chris | 250 |
4 | C# | johnson | 399 |
Use the read_csv()
Function and Pass a List of Column Names to Be Read
The Pandas library provides read_csv()
function. The read_csv()
is used to read a comma-separated values (csv) file into DataFrame
.
The read_csv()
takes the file path of the csv file, and it has more than 40 optional parameters. For this tutorial, let’s concentrate usecols
parameter.
Syntax - pd.read_csv()
:
pd.read_csv(filepath, usecols)
Parameters
filepath
- The path or location of the csv file to be loaded.usecols
- List of specific columns to be read.
Returns
DataFrame
Follow the steps below to use the read_csv()
function and pass the list of column names to be read.
-
Import pandas library.
-
Pass the path of csv file and list of columns to be read to the
read_csv()
method. -
The
read_csv()
method will return a data frame with that specific columns. -
Print the
DataFrame
.
The following code is the implementation of the above approach.
# import the pandas
import pandas as pd
# Reading the specific columns from csv
df = pd.read_csv("data.csv", usecols=["Course", "price$"])
# printing the dataframe
print(df)
Output:
Course price$
0 Python 199
1 Spark 299
2 Java 99
3 JavaScript 250
4 C# 399
Use the read_csv()
Function and Pass List of Column Numbers to Be Read
We can also use the column numbers to read_csv()
to read-only specific columns.
-
Import pandas library.
-
Pass the path of csv file and list of columns numbers to be read to the
read_csv()
method. -
The
read_csv()
method will return a data frame with that specific columns. -
Print the
DataFrame
.
The following code is the implementation of the above approach.
# import the pandas
import pandas as pd
# Reading the specific columns from csv
df = pd.read_csv("data.csv", usecols=[2, 3])
# printing the dataframe
print(df)
Output:
Mentor price$
0 Robert 199
1 Elizibeth 299
2 Nolan 99
3 Chris 250
4 johnson 399