w3resource

Pandas: Extract hashtag words from Twitter text in a specified column of a given DataFrame


25. Extract Hashtags from Twitter Text

Write a Pandas program to extract the words attached to a hash sign (hashtags) from Twitter text in a specified column of a given DataFrame.

Sample Solution:

Python Code:

import pandas as pd
import re
pd.set_option('display.max_columns', 10)
df = pd.DataFrame({
    'tweets': ['#Obama says goodbye','Retweets for #cash','A political endorsement in #Indonesia', '1 dog = many #retweets', 'Just a simple #egg']
    })
print("Original DataFrame:")
print(df)
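def find_hash(text):
    # (?<=#) is a lookbehind: match word characters that directly follow '#'
    hword = re.findall(r'(?<=#)\w+', text)
    # Join multiple hashtags from one tweet with a space
    return " ".join(hword)
df['hash_word'] = df['tweets'].apply(find_hash)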
print("\nExtracting hashtag words from the DataFrame column:")
print(df)

Sample Output:

Original DataFrame:
                                  tweets
0                    #Obama says goodbye
1                     Retweets for #cash
2  A political endorsement in #Indonesia
3                 1 dog = many #retweets
4                     Just a simple #egg

Extracting hashtag words from the DataFrame column:
                                  tweets  hash_word
0                    #Obama says goodbye      Obama
1                     Retweets for #cash       cash
2  A political endorsement in #Indonesia  Indonesia
3                 1 dog = many #retweets   retweets
4                     Just a simple #egg        egg
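
The solution above calls a Python function once per row. As a minimal sketch of an alternative, assuming the vectorized string accessor is acceptable for this exercise, the same regular expression can be passed to Series.str.findall, and Series.str.join then joins each resulting list with a space. The `sample` DataFrame, including the tweet with two hashtags and the tweet with none, is illustrative and not part of the original exercise.

```python
import pandas as pd

# Illustrative data (not from the exercise): one, two, and zero hashtags
sample = pd.DataFrame({
    'tweets': ['#Obama says goodbye', 'Win #cash and #prizes', 'No tags here']
})

# (?<=#) is a lookbehind: match word characters only when preceded by '#'
tags = sample['tweets'].str.findall(r'(?<=#)\w+')

# Join each list of hashtags with a space; an empty list becomes ''
sample['hash_word'] = tags.str.join(' ')
print(sample)
```

With this approach a tweet containing several hashtags yields all of them separated by spaces, and a tweet without any hashtag yields an empty string, which matches the behaviour of find_hash.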

For more practice, solve these related problems:

  • Write a Pandas program to extract hashtag words from a tweet column using regex and then output a list of hashtags.
  • Write a Pandas program to capture all words starting with '#' in a DataFrame column and then count the frequency of each hashtag.
  • Write a Pandas program to extract hashtags from a text column and then create a new column listing the hashtags as a comma-separated string.
  • Write a Pandas program to filter a DataFrame column to retrieve only hashtag words and then remove any duplicates.
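
As a starting point for the problems above, the following is a rough sketch rather than a reference solution. It assumes the same regular expression as the sample solution and uses a small illustrative Series named `tweets`; all variable names are hypothetical. Note that matching is case-sensitive here, so '#Egg' and '#egg' count as different hashtags.

```python
import pandas as pd

# Illustrative data (not from the exercise)
tweets = pd.Series(['#egg and #cash', 'More #cash', 'Plain text', '#Egg #egg'])

# A list of hashtags per tweet
tag_lists = tweets.str.findall(r'(?<=#)\w+')

# One row per hashtag; tweets without hashtags become NaN and are dropped
all_tags = tag_lists.explode().dropna()

# Frequency of each hashtag
tag_counts = all_tags.value_counts()

# Hashtags with duplicates removed, in order of first appearance
unique_tags = all_tags.drop_duplicates().tolist()

# Hashtags of each tweet as a comma-separated string
comma_separated = tag_lists.str.join(', ')

print(tag_counts)
print(unique_tags)
print(comma_separated)
```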

