w3resource

Pandas Practice Set-1: Drop a row if any or all values in a row are missing of diamonds DataFrame on two specific columns


42. Drop Row if Any or All Values Missing in Two Specific Columns

Write a Pandas program to drop a row if any or all values in a row are missing of diamonds DataFrame on two specific columns..

Sample Solution:

Python Code:

import pandas as pd
diamonds = pd.read_csv('https://raw.githubusercontent.com/mwaskom/seaborn-data/master/diamonds.csv')
print("Original Dataframe:")
print(diamonds.head())
print("\nAfter droping those rows where any value in a row is missing in carat and cut columns:")
print(diamonds.dropna(subset=['carat', 'cut'], how='any').shape)
print("\nAfter droping those rows where all values in a row are missing in carat and cut columns:")
print(diamonds.dropna(subset=['carat', 'cut'], how='all').shape)

Sample Output:

Original Dataframe:
   carat      cut color clarity  depth  table  price     x     y     z
0   0.23    Ideal     E     SI2   61.5   55.0    326  3.95  3.98  2.43
1   0.21  Premium     E     SI1   59.8   61.0    326  3.89  3.84  2.31
2   0.23     Good     E     VS1   56.9   65.0    327  4.05  4.07  2.31
3   0.29  Premium     I     VS2   62.4   58.0    334  4.20  4.23  2.63
4   0.31     Good     J     SI2   63.3   58.0    335  4.34  4.35  2.75

After droping those rows where any value in a row is missing in carat and cut columns:
(53940, 10)

After droping those rows where all values in a row are missing in carat and cut columns:
(53940, 10)

For more Practice: Solve these Related Problems:

  • Write a Pandas program to drop rows from the diamonds DataFrame if any values are missing in two specified columns.
  • Write a Pandas program to remove rows where both specified columns have missing values and display the result.
  • Write a Pandas program to conditionally drop rows based on missing values in two columns using the how parameter in dropna().
  • Write a Pandas program to filter out rows with missing data in either of two specific columns and then check the updated DataFrame dimensions.

Go to:


Previous: Write a Pandas program to check the number of rows and columns and drop those row if 'any' values are missing in a row of diamonds DataFrame.
Next: Write a Pandas program to set an existing column as the index of diamonds DataFrame.

Python Code Editor:

Have another way to solve this solution? Contribute your code (and comments) through Disqus.

What is the difficulty level of this exercise?



Follow us on Facebook and Twitter for latest update.