w3resource

Merging DataFrames with Duplicate Keys in Pandas


Pandas: Custom Function Exercise-9 with Solution


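Write a Pandas program to merge DataFrames with duplicate keys.

The following program shows how to merge DataFrames in which a key value appears more than once. Each row is paired with every row in the other DataFrame that has the same key, so a key repeated m times on one side and n times on the other produces m × n rows (a Cartesian product within each key).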

Sample Solution:

Code:

import pandas as pd

# Create two sample DataFrames with duplicate keys
df1 = pd.DataFrame({
    'ID': [1, 1, 2],
    'Name': ['Annabel', 'Annabel', 'Selena']
})

df2 = pd.DataFrame({
    'ID': [1, 2],
    'Age': [25, 30]
})

# Merge the DataFrames with duplicate keys
merged_df = pd.merge(df1, df2, on='ID')

# Output the result
print(merged_df)

Output:

   ID     Name  Age
0   1  Annabel   25
1   1  Annabel   25
2   2   Selena   30

Explanation:

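  • Created two DataFrames, df1 and df2; the key ID 1 appears twice in df1 and once in df2.
  • Used pd.merge() to join them on the 'ID' column (an inner join by default).
  • Each df1 row is paired with every df2 row that has the same ID, so both ID 1 rows receive Age 25. This is a Cartesian product within each key: if ID 1 also appeared twice in df2, it would produce 2 × 2 = 4 rows.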
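
The sample data repeats a key on one side only, so the row count does not grow. As a minimal sketch of the many-to-many case, the example below uses hypothetical data (the names left and right and the extra Age value 26 are assumptions, not part of the exercise) in which ID 1 is duplicated in both DataFrames. It also tries the validate argument of pd.merge(), which asks pandas to check the expected key relationship and raises pandas.errors.MergeError when the check fails.

```python
import pandas as pd

# Hypothetical data: unlike the exercise, BOTH frames repeat ID 1
left = pd.DataFrame({
    'ID': [1, 1, 2],
    'Name': ['Annabel', 'Annabel', 'Selena']
})
right = pd.DataFrame({
    'ID': [1, 1, 2],
    'Age': [25, 26, 30]
})

# ID 1 appears twice on each side, so it yields 2 * 2 = 4 rows;
# ID 2 appears once on each side, so it yields 1 row
many_to_many = pd.merge(left, right, on='ID')
print(len(many_to_many))  # 5

# validate= makes pandas check the key relationship before merging;
# 'many_to_one' requires the keys in the right frame to be unique
try:
    pd.merge(left, right, on='ID', validate='many_to_one')
    relationship_ok = True
except pd.errors.MergeError:
    relationship_ok = False
print(relationship_ok)  # False, because ID 1 is duplicated in right
```

Passing validate is one way to catch unexpected row multiplication early; whether 'one_to_one', 'one_to_many', or 'many_to_one' is appropriate depends on what the data is supposed to look like.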
