Let us consider this csv file:
We can read a csv file using pandas' read_csv function:
import pandas as pd
Let us firstly read and save our csv file and define index_col = 0, telling Python that 1st column is the row names.
hrdata = pd.read_csv("HR_data.csv",index_col = 0)
hrdata.head()
Let us firstly see what happens if we iterate over hrdata?
for i in hrdata:
print(i)
It returns the column names:
To iterate over various rows and column entries in a data frame we use iterrows function
Here we have two iterator variables: row and col
and our iterable object is hrdata.iterrows( )
for row,col in hrdata.iterrows():
print("For employee id:" + str(row))
print(col)
print('--')
Here row is taking the row name (our employee id)
while col is denoting the values for each row-col combination
Task: Fetch the row and column entries only for department column using iterrows
To achieve this we have filtered our iterator col by 'department'
for row,col in hrdata.iterrows():
print("For employee id:" + str(row))
print(col['department'])
print('--')
Task: Fetch the row and column entries only for department and age columns using iterrows
To achieve this we have filtered our iterator col by providing a list ['department','age']
for row,col in hrdata.iterrows():
print("For employee id:" + str(row))
print(col[['department','age']])
print('--')
Comments