Pandas: Get the numeric representation of an array by identifying distinct values of a given column of a DataFrame
Write a Pandas program to get the numeric representation of an array by identifying distinct values of a given column of a DataFrame.
Sample Solution :
Python Code :
import pandas as pd
df = pd.DataFrame({
'Name': ['Alberto Franco','Gino Mcneill','Ryan Parkes', 'Eesha Hinton', 'Gino Mcneill'],
'Date_Of_Birth ': ['17/05/2002','16/02/1999','25/09/1998','11/05/2002','15/09/1997'],
'Age': [18.5, 21.2, 22.5, 22, 23]
})
print("Original DataFrame:")
print(df)
label1, unique1 = pd.factorize(df['Name'])
print("\nNumeric representation of an array by identifying distinct values:")
print(label1)
print(unique1)
Sample Output:
Original DataFrame: Name Date_Of_Birth Age 0 Alberto Franco 17/05/2002 18.5 1 Gino Mcneill 16/02/1999 21.2 2 Ryan Parkes 25/09/1998 22.5 3 Eesha Hinton 11/05/2002 22.0 4 Gino Mcneill 15/09/1997 23.0 Numeric representation of an array by identifying distinct values: [0 1 2 3 1] Index(['Alberto Franco', 'Gino Mcneill', 'Ryan Parkes', 'Eesha Hinton'], dtype='object')
Python-Pandas Code Editor:
Have another way to solve this solution? Contribute your code (and comments) through Disqus.
Previous: Write a Pandas program to clean object column with mixed data of a given DataFrame using regular expression.
Next: Write a Pandas program to replace the current value in a dataframe column based on last largest value. If the current value is less than last largest value replaces the value with 0.
What is the difficulty level of this exercise?
Test your Programming skills with w3resource's quiz.
- Weekly Trends and Language Statistics
- Weekly Trends and Language Statistics