{"id":3066,"date":"2019-12-27T16:14:30","date_gmt":"2019-12-27T16:14:30","guid":{"rendered":"http:\/\/34.74.67.11\/?page_id=3066"},"modified":"2021-05-17T07:07:04","modified_gmt":"2021-05-17T07:07:04","slug":"sentiment-analysis","status":"publish","type":"page","link":"https:\/\/www.iventura.ai\/index.php\/sentiment-analysis\/","title":{"rendered":"Sentiment Analysis"},"content":{"rendered":"<section class=\"kc-elm kc-css-858254 kc_row\"><div class=\"kc-row-container  kc-container\"><div class=\"kc-wrap-columns\"><div class=\"kc-elm kc-css-621681 kc_col-sm-12 kc_column kc_col-sm-12\"><div class=\"kc-col-container\"><div class=\"kc-elm kc-css-498777 kc_text_block\"><\/p>\n<p style=\"text-align: center;\">Sentiment analysis is a common natural language processing task. The goal is to analyze a text and predict whether the underlying sentiment is positive, negative, or neutral. It can be done on either labeled data (supervised) or unlabeled data (unsupervised).<\/p>\n<p>\n<\/div><div class=\"kc-elm kc-css-67961 kc_shortcode kc_single_image\">\n\n        <img decoding=\"async\" src=\"https:\/\/www.iventura.ai\/wp-content\/uploads\/2019\/12\/H2.png\" class=\"\" alt=\"\" \/>    <\/div>\n<div class=\"kc-elm kc-css-975714 kc_row kc_row_inner\"><div class=\"kc-elm kc-css-503844 kc_col-sm-12 kc_column_inner kc_col-sm-12\"><div class=\"kc_wrapper kc-col-inner-container\">\n<div class=\"kc-elm kc-css-224092 kc-title-wrap \">\n\n\t<h4 class=\"kc_title\">Sentiment Analysis for drugs\/medicines<\/h4>\n<\/div>\n<div class=\"kc-elm kc-css-238749 kc_text_block\"><\/p>\n<h4 dir=\"ltr\" style=\"line-height: 1.2; margin-top: 0pt; margin-bottom: 0pt; color: #00aeef;\"><strong>Problem Statement:<\/strong><\/h4>\n<ol>\n<li>Nowadays the narrative of a brand is no longer built and controlled only by the company that owns it. 
For this reason, companies constantly monitor blogs, forums, and other social media platforms to gauge the sentiment toward their own products and competitor products, and to learn how their brand resonates in the market. This kind of analysis supports their post-launch market research. It is relevant to many industries, including pharma and its drugs.<\/li>\n<li>Sentiment can be grouped into three major buckets: positive, negative, and neutral.<\/li>\n<li>The data contains samples of text retrieved from various social media platforms. A text can mention one or more drug names. Each row contains a unique combination of the text and a drug name. Note that the same text can carry a different sentiment for each drug it mentions.<\/li>\n<\/ol>\n<h4 dir=\"ltr\" style=\"line-height: 1.2; margin-top: 0pt; margin-bottom: 0pt; color: #00aeef;\"><strong>Challenge:<\/strong><\/h4>\n<ul>\n<li>The challenge is that the language used in this type of content is often not grammatically correct. Some posts use sarcasm. Others cover several topics with different sentiments in one post. Some users express their sentiment about the medicine through comments and replies.<\/li>\n<\/ul>\n<p>\n<\/div><\/div><\/div><\/div><div class=\"kc-elm kc-css-79411 kc_row kc_row_inner\"><div class=\"kc-elm kc-css-66808 kc_col-sm-12 kc_column_inner kc_col-sm-12\"><div class=\"kc_wrapper kc-col-inner-container\"><div class=\"kc-elm kc-css-342323 kc_shortcode kc_single_image\">\n\n        <img decoding=\"async\" src=\"https:\/\/www.iventura.ai\/wp-content\/uploads\/2020\/01\/2020-01-03.jpg\" class=\"\" alt=\"\" \/>    <\/div>\n<div class=\"kc-elm kc-css-493233 kc_text_block\"><\/p>\n<h4 dir=\"ltr\" style=\"line-height: 1.2; margin-top: 0pt; margin-bottom: 0pt; color: #00aeef;\"><strong>Solution:<\/strong><\/h4>\n<ol>\n<li>The iVentura Machine Learning Platform was used to build the solution. 
iVentura provides a complete ecosystem for data scientists to build models without worrying about the underlying infrastructure and security. It is suited to both teams and individual data scientists.<\/li>\n<li>To address the problem statement above, the dataset needs to be analyzed and the model evaluated with metrics to get the best outcome. The steps were:<\/li>\n<\/ol>\n<ul>\n<li>1) The input dataset is in the form of text, so the unstructured data goes through raw data preprocessing followed by text preprocessing.<\/li>\n<li>2) TF-IDF featurization is used to convert the preprocessed text into vectors.<\/li>\n<li>3) The sentiment class data is imbalanced, so it is over-sampled using SMOTE.<\/li>\n<li>4) The misclassification error is plotted for each alpha value (the smoothing parameter), and the best alpha value is used in the Naive Bayes classifier.<\/li>\n<li>5) A MultinomialNB classifier is used to predict the sentiment of the text.<\/li>\n<li>6) The confusion matrix is plotted, the macro F1 score is evaluated, and sentiment is predicted on the test dataset.<\/li>\n<li>7) The data is saved to a pickle file.<\/li>\n<li>8) Deployment &amp; 
Visualization<\/li>\n<\/ul>\n<p>\n<\/div><\/div><\/div><\/div><\/div><\/div><\/div><\/div><\/section>\n","protected":false},"excerpt":{"rendered":"","protected":false},"author":1,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"tpl-king-composer.php","meta":{"footnotes":""},"yst_prominent_words":[145,146,144,87,147,149,150,142,153,140,151,81,133,143,139,141,138,148,134,152],"class_list":["post-3066","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/www.iventura.ai\/index.php\/wp-json\/wp\/v2\/pages\/3066","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.iventura.ai\/index.php\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/www.iventura.ai\/index.php\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/www.iventura.ai\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.iventura.ai\/index.php\/wp-json\/wp\/v2\/comments?post=3066"}],"version-history":[{"count":43,"href":"https:\/\/www.iventura.ai\/index.php\/wp-json\/wp\/v2\/pages\/3066\/revisions"}],"predecessor-version":[{"id":3802,"href":"https:\/\/www.iventura.ai\/index.php\/wp-json\/wp\/v2\/pages\/3066\/revisions\/3802"}],"wp:attachment":[{"href":"https:\/\/www.iventura.ai\/index.php\/wp-json\/wp\/v2\/media?parent=3066"}],"wp:term":[{"taxonomy":"yst_prominent_words","embeddable":true,"href":"https:\/\/www.iventura.ai\/index.php\/wp-json\/wp\/v2\/yst_prominent_words?post=3066"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}