Handling Missing data before Resampling
Pandas Resampling and Frequency Conversion: Exercise-13 with Solution
Write a Pandas program to handle missing data before Resampling.
Sample Solution:
Python Code :
# Import necessary libraries
import pandas as pd
import numpy as np
# Create a time series data with hourly frequency
date_rng = pd.date_range(start='2023-01-01', end='2023-01-05', freq='H')
ts = pd.Series(np.random.randn(len(date_rng)), index=date_rng)
# Introduce some missing values
ts.iloc[10:20] = np.nan
# Fill missing values using forward fill method
ts_filled = ts.ffill()
# Resample the time series to daily frequency
ts_daily = ts_filled.resample('D').mean()
# Display the resampled time series
print(ts_daily)
Output:
2023-01-01 0.672487 2023-01-02 -0.014764 2023-01-03 0.043870 2023-01-04 -0.190643 2023-01-05 -0.639927 Freq: D, dtype: float64
Explanation:
- Import Pandas and NumPy libraries.
- Create a date range with hourly frequency.
- Generate a random time series data with the created date range.
- Introduce missing values in the time series.
- Fill missing values using the forward fill method.
- Resample the time series data to daily frequency by calculating the mean.
- Print the resampled time series data.
Python Code Editor:
Have another way to solve this solution? Contribute your code (and comments) through Disqus.
Previous: Calculating Rolling Mean of Resampled data.
Next: Creating Custom Resampling periods.
What is the difficulty level of this exercise?
Test your Programming skills with w3resource's quiz.
It will be nice if you may share this link in any developer community or anywhere else, from where other developers may find this content. Thanks.
https://198.211.115.131/python-exercises/pandas/resampling-frequency-conversion/handling-missing-data-before-resampling-time-series-data.php
- Weekly Trends and Language Statistics
- Weekly Trends and Language Statistics