Sentiments expressed in online news, comments, reviews, etc plays a vital role in todayโs business. The public is increasingly more likely to buy the product of a business, when it has positive reviews on e-commerce and social media websites. For instance, people tend to buy a book which has 4 and upย star rating on sites such as Flipkart and Amazon.
However awareness is spreading about bought and paid reviews on the web and today many customers have started to think, โIs this review a fake one?โ Yes, its hard to believe but true!ย Services such as buyazonreviews.com and buyamazonreviews.com will give you a good review; for a price ofcourse.
There are also services and groups of reviewers or spammers who will write negative reviews, again for a price. Recently, Amazon raised a suit opposing those websites for writing fake reviews on their site for financial gain.
Data analytics to the rescue again! Yes today there are some data analytics techniques that can help identify fake reviews. A few years ago three data enthusiasts, Arjun Mukherjee, Bing Liu, and Natalie Glance, proposed an algorithm in โSpotting Fake Reviewer Groups in Consumer Reviewsโ, WWW 2012, April 16โ20, 2012 in Lyon, France. The algorithm has the following three steps: a) Frequent pattern mining b) Identifying set of feature indicators c) Ranking based on relational model. The datasets used were product reviews of manufactured category extracted from the Amazon website.
Letโs now understand what these steps are:
Step 1:ย Frequent pattern mining – finding groups of people, candidate group, who jointly reviewed several products. In frequent itemset mining, the items are the reviewer ids and the transactions are the set of reviewer ids of a product. The result of the frequent itemsets give us the candidate groups who have reviewed various product together.
Step 2: Identifying set of feature indicators – the following set of spamming behaviour indicators were used.
* Group Time Window
* Group Deviation
* Group Content Similarity
* Group Member Content similarity
* Group Early Time Frame
* Group Size Ratio
* Group Size
* Group Support Count
* Individual Rating Deviation
* Individual Content Similarity
* Individual Early Time Frame
* Individual Member coupling in a group
Step 3 : Ranking Group Spam – GSRank (based on relational model)
Input: Weight matrices
Output: Ranked list of candidate spam groups
The real problem is identifying the true group spammers from the group who coincidentally happen to review the similar set of products. ย Another challenge is that it is tedious to manually label these fake reviews for model building. In other words, it is not so easy to identify an opinion spam review.
Companies these days use sentient analysis quite regularly to evaluate the sentiments of their customers. So naturally, finding and discounting fake reviews or opinions as spam is quite critical to credible sentiment analysis. Therefore any savvy analyst attempting sentiment analysis must be familiar with this technique. Hereโs to Frequent Itemset Mining!
Referencse :
https://www.cs.uic.edu/~liub/publications/WWW-2012-group-spam-camera-final.pdf
https://www.forbes.com/sites/retailwire/2015/04/13/amazon-lawsuit-takes-on-fake-reviewers/
https://www.cs.uic.edu/~liub/FBS/fake-reviews.html
Suggested Reads:
Whatโs the sentiment on โsentiment analysisโ?
The Power of Social Media Analytics
Fill in the details to know more
Understanding the Staffing Pyramid!
May 15, 2023
From The Eyes Of Emerging Technologies: IPL Through The Ages
April 29, 2023
Understanding HR Terminologies!
April 24, 2023
How Does HR Work in an Organization?
A Brief Overview: Measurement Maturity Model!
April 20, 2023
HR Analytics: Use Cases and Examples
Expert Speak: Analytics at Big Basket
March 29, 2017
What’s Exciting, Useful and Gives you Exactly the Kind of Information You Need to Hear?
June 30, 2015
As a Web Analyst this is How you Would Bring Value to the Business
May 27, 2015
Why the Attention Web or CPH is Causing Much Excitement in the Web Analytics World
May 20, 2015
A Quick Guide to Choosing the Best Web Analytics Tool
April 8, 2015
Course Video- Introduction to Web Analytics
February 10, 2015
Add your details:
By proceeding, you agree to our privacy policy and also agree to receive information from UNext through WhatsApp & other means of communication.
Upgrade your inbox with our curated newletters once every month. We appreciate your support and will make sure to keep your subscription worthwhile