

Sentiment Analysis of Code Mixed Text Consisting of English-Punjabi Lexicon
Sentiment analysis is a field of study that analyzes people's emotions, such as happiness, sadness, and anger, towards the entities and attributes expressed in written text. In this study, textual data was collected from different sources, including Facebook, YouTube, Twitter, and WhatsApp, and then pre-processed. Next, language identification was performed on the code-mixed text, which involved tokenization and the handling of word-play, misspelled words, abbreviations, slang words, phonetic typing, etc. After the identification task, an English-Punjabi dictionary was created, consisting of lists of opinionated words: positive, negative, and neutral. The remaining words were stored in an unsorted word list. Finally, a statistical technique was applied to determine sentence-level sentiment polarity on the English-Punjabi code-mixed dataset. The results obtained with the tri-gram and five-gram approaches were found to be similar.
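The dictionary lookup and sentence-level polarity steps described above can be illustrated with a minimal sketch. It is not the study's implementation: the lexicon entries, the function names (`tokenize`, `ngrams`, `sentence_polarity`), and the simple count-based decision rule are all assumptions made for illustration, since the abstract does not specify the statistical technique or the dictionary contents.

```python
import re

# Hypothetical, illustrative lexicon entries (NOT the study's dictionary).
# Romanized Punjabi spellings vary widely; these are assumed examples.
POSITIVE = {"good", "happy", "changa", "vadiya"}
NEGATIVE = {"bad", "sad", "maada", "bura"}


def tokenize(sentence):
    """Lowercase the sentence and keep runs of letters/apostrophes as tokens."""
    return re.findall(r"[a-z']+", sentence.lower())


def ngrams(tokens, n):
    """Return all contiguous n-grams of the token list as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def sentence_polarity(sentence):
    """Assumed rule: compare counts of positive and negative lexicon hits.

    Returns the polarity label and the tokens found in neither list,
    mirroring the 'unsorted word list' mentioned in the abstract.
    """
    tokens = tokenize(sentence)
    pos = sum(t in POSITIVE for t in tokens)
    neg = sum(t in NEGATIVE for t in tokens)
    unsorted_words = [t for t in tokens
                      if t not in POSITIVE and t not in NEGATIVE]
    if pos > neg:
        label = "positive"
    elif neg > pos:
        label = "negative"
    else:
        label = "neutral"
    return label, unsorted_words
```

For example, `sentence_polarity("Movie bahut vadiya si, very good")` labels the sentence positive under this assumed rule, and `ngrams(tokenize(...), 3)` yields the tri-grams that an n-gram based scorer could use in place of single-word counts.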
Keywords
Code Mixed Text, Romanized Text, Natural Language Processing, Text Processing, Sentiment Analysis, Microblogging.