8000 GitHub - ChrisNiekler/NLP: NLP
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

ChrisNiekler/NLP

Repository files navigation

Emotion Data For Whatsapp Status

Text Data For Whatsapp Status Emotion Prediction

This is a labeled dataset for a sentiment analysis. The original dataset is from kaggle.

The original data is somewhat poor, meaning that the given label might not actually fit the true emotion of the text. We are slowly going through the data to clean it. While doing that we created a fourth sentiment neutral.

Cleaning process

  • removing Mojibake and features that are clearly no WhatsApp Status (mostly residue from scrapping)
  • relabeling
  • removing racist, homophobic and abusive comments

We will give a little definition of each sentiment below.

happy

We label a text as happy if it sounds clearly happy and has a positive connotation and expresses light hearted and whimsical thoughts.

angry

We label a text as angry if the text sounds angry, sarcastic or sassy. And it is usually accompanied by strong language such as curse words and aggresive wording.

sad

To be labeled as sad the text expresses fear or insecurity and usually has a dark or gloomy tone.

neutral

Status that are neither happy or sad nor express any particular sentiment are labeled as neutral. It can be advice or facts.


We also try to add more features by scrapping them from different websites that offer inspiration for WhatsApp Status.

About

NLP

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

0