>>df[['empid','ename']]
empid ename
0 101 Sachin
1 102 Vinod
2 103 Lakhbir
3 104 Anil
4 105 Devinder
5 106 UmaSelvi
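The double square brackets in the selection above pass a list of column names to the data frame. A minimal sketch of the difference between selecting one column and selecting several, assuming a small three-row data frame modelled on the employee table (the variable names one_col and two_cols are illustrative only):

```python
import pandas as pd

# Hypothetical sample data, modelled on the employee table above
df = pd.DataFrame({
    "Doj": ["12-01-2012", "15-01-2012", "05-09-2007"],
    "empid": [101, 102, 103],
    "ename": ["Sachin", "Vinod", "Lakhbir"],
})

one_col = df["ename"]               # a single label gives a Series
two_cols = df[["empid", "ename"]]   # a list of labels gives a DataFrame

print(type(one_col).__name__)       # Series
print(type(two_cols).__name__)      # DataFrame
print(list(two_cols.columns))       # ['empid', 'ename']
```

The columns come back in the order given in the list, not in the order in which they are stored.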
To Add & Rename a column in data frame
import pandas as pd
s = pd.Series([10,15,18,22])
df=pd.DataFrame(s)
df.columns=['List1']
# To rename the default column of the data frame as List1
df['List2']=20
# To create a new column List2 with all values as 20
df['List3']=df['List1']+df['List2']
# Add List1 and List2 and store the result in the new column List3
print(df)
Output-
List1 List2 List3
0 10 20 30
1 15 20 35
2 18 20 38
3 22 20 42
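Assigning to df.columns replaces every column label at once. As an alternative sketch (not the approach used in these notes), the rename() method changes only the labels named in a mapping. It relies on the fact that a data frame built from an unnamed Series has the integer 0 as its default column label:

```python
import pandas as pd

df = pd.DataFrame(pd.Series([10, 15, 18, 22]))

# rename() maps old labels to new ones and returns a new data frame
df = df.rename(columns={0: "List1"})

df["List2"] = 20                         # same scalar in every row
df["List3"] = df["List1"] + df["List2"]  # element-wise sum of two columns

print(list(df.columns))       # ['List1', 'List2', 'List3']
print(df["List3"].tolist())   # [30, 35, 38, 42]
```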
To Delete a Column in data frame
We can delete a column from a data frame by using any of the following:
1. del
2. pop()
3. drop()
>>del df['List3']
# We can delete a column by passing the column name in subscript with df
>>df
List1 List2
0 10 20
1 15 20
2 18 20
3 22 20
>>df.pop('List2')
# We can delete a column by passing the column name to the pop() method
>>df
List1
0 10
1 15
2 18
3 22
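One practical difference between del and pop() is that pop() hands back the column it removes, so the values can still be used afterwards. A minimal sketch, assuming the same List1 and List2 columns as above (the variable name removed is illustrative only):

```python
import pandas as pd

df = pd.DataFrame({"List1": [10, 15, 18, 22], "List2": [20, 20, 20, 20]})

# pop() deletes the column from df and returns it as a Series
removed = df.pop("List2")

print(removed.tolist())   # [20, 20, 20, 20]
print(list(df.columns))   # ['List1']
```

Both del and pop() change the data frame in place.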
To Delete a Column Using drop()
import pandas as pd
s= pd.Series([10,20,30,40])
df=pd.DataFrame(s)
df.columns=['List1']
df['List2']=40
df1=df.drop('List2',axis=1) # axis=1 means delete column wise
df2=df.drop(index=[2,3],axis=0) # axis=0 means delete row wise for the given index labels
print(df)
print(" After deletion::")
print(df1)
print(" After row deletion::")
print(df2)
Output-
List1 List2
0 10 40
1 20 40
2 30 40
3 40 40
After deletion::
List1
0 10
1 20
2 30
3 40
After row deletion::
List1 List2
0 10 40
1 20 40
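Unlike del and pop(), drop() returns a new data frame and leaves the original unchanged by default, which is why df still shows both columns in the output above. A short sketch of this, using the columns= and index= keywords of drop() as an alternative spelling of the axis argument (the names no_col and no_rows are illustrative only):

```python
import pandas as pd

df = pd.DataFrame({"List1": [10, 20, 30, 40], "List2": [40, 40, 40, 40]})

no_col = df.drop(columns="List2")   # same effect as df.drop('List2', axis=1)
no_rows = df.drop(index=[2, 3])     # same effect as df.drop([2, 3], axis=0)

print(df.shape)        # (4, 2) -> the original is untouched
print(no_col.shape)    # (4, 1)
print(no_rows.shape)   # (2, 2)
```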
Accessing the data frame through loc and iloc or indexing using Labels
Pandas provides loc and iloc to access a subset of a data frame by row and column. Both are
written with square brackets, as in df.loc[...] and df.iloc[...].
Accessing the data frame through loc
It is used to access a group of rows and columns by their labels.
Syntax- df.loc[StartRow : EndRow, StartColumn : EndColumn]
Note - If we pass : in the row or column part, pandas returns all the rows or all the columns
respectively.
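A minimal sketch of label-based selection with loc, assuming a small data frame with hypothetical row labels 'a' to 'd' (these labels are not part of the notes). With loc, both the start label and the end label are included in the result:

```python
import pandas as pd

df = pd.DataFrame(
    {"empid": [101, 102, 103, 104],
     "ename": ["Sachin", "Vinod", "Lakhbir", "Anil"]},
    index=["a", "b", "c", "d"],   # hypothetical row labels
)

subset = df.loc["b":"c", "empid":"ename"]   # rows b and c, both columns
all_rows = df.loc[:, "ename"]               # ':' selects every row

print(list(subset.index))   # ['b', 'c']
print(all_rows.tolist())    # ['Sachin', 'Vinod', 'Lakhbir', 'Anil']
```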
Accessing the data frame through iloc
It is used to access a group of rows and columns based on numeric index value.
Syntax-
df.iloc[StartRowIndex : EndRowIndex, StartColumnIndex : EndColumnIndex]
Note - If we pass : in the row or column part, pandas returns all the rows or all the columns
respectively.
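A minimal sketch of position-based selection with iloc, assuming a four-row version of the employee data. Unlike label slices with loc, the end position of an iloc slice is not included, as with ordinary Python slicing:

```python
import pandas as pd

df = pd.DataFrame({
    "Doj": ["12-01-2012", "15-01-2012", "05-09-2007", "17-01-2012"],
    "empid": [101, 102, 103, 104],
    "ename": ["Sachin", "Vinod", "Lakhbir", "Anil"],
})

subset = df.iloc[1:3, 0:2]   # rows at positions 1 and 2, columns 0 and 1
last_col = df.iloc[:, -1]    # ':' selects every row, -1 is the last column

print(list(subset.columns))       # ['Doj', 'empid']
print(subset["empid"].tolist())   # [102, 103]
print(last_col.tolist())          # ['Sachin', 'Vinod', 'Lakhbir', 'Anil']
```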
head() and tail() Method
By default, the method head() returns the first 5 rows and the method tail() returns the last 5 rows.
import pandas as pd
empdata={ 'Doj':['12-01-2012','15-01-2012','05-09-2007',
'17-01-2012','05-09-2007','16-01-2012'],
'empid':[101,102,103,104,105,106],
'ename':['Sachin','Vinod','Lakhbir','Anil','Devinder','UmaSelvi'] }
df=pd.DataFrame(empdata)
print(df)
print(df.head())
print(df.tail())
Output-
Data Frame:
Doj empid ename
0 12-01-2012 101 Sachin
1 15-01-2012 102 Vinod
2 05-09-2007 103 Lakhbir
3 17-01-2012 104 Anil
4 05-09-2007 105 Devinder
5 16-01-2012 106 UmaSelvi
head() displays the first 5 rows:
Doj empid ename
0 12-01-2012 101 Sachin
1 15-01-2012 102 Vinod
2 05-09-2007 103 Lakhbir
3 17-01-2012 104 Anil
4 05-09-2007 105 Devinder
tail() displays the last 5 rows:
Doj empid ename
1 15-01-2012 102 Vinod
2 05-09-2007 103 Lakhbir
3 17-01-2012 104 Anil
4 05-09-2007 105 Devinder
5 16-01-2012 106 UmaSelvi
To display the first 2 rows we can use head(2), to return the last 2 rows we can use tail(2), and
to return the 3rd to 5th rows we can write df[2:5].
import pandas as pd
empdata={ 'Doj':['12-01-2012','15-01-2012','05-09-2007',
'17-01-2012','05-09-2007','16-01-2012'],
'empid':[101,102,103,104,105,106],
'ename':['Sachin','Vinod','Lakhbir','Anil','Devinder','UmaSelvi'] }
df=pd.DataFrame(empdata)
print(df)
print(df.head(2))
print(df.tail(2))
print(df[2:5])
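The slice df[2:5] works on row positions, so it starts at position 2 (the 3rd row) and stops before position 5. A small sketch that checks this against iloc, assuming only the empid column of the data above:

```python
import pandas as pd

df = pd.DataFrame({"empid": [101, 102, 103, 104, 105, 106]})

print(df.head(2)["empid"].tolist())   # [101, 102]
print(df.tail(2)["empid"].tolist())   # [105, 106]
print(df[2:5]["empid"].tolist())      # [103, 104, 105]

# A plain row slice selects the same rows as the equivalent iloc slice
same = df[2:5].equals(df.iloc[2:5])
print(same)                           # True
```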