Clickbait Detection using Multiple Categorization Techniques

03/29/2020
by   Abinash Pujahari, et al.
0

Clickbaits are online articles with deliberately designed misleading titles for luring more and more readers to open the intended web page. Clickbaits are used to tempted visitors to click on a particular link either to monetize the landing page or to spread the false news for sensationalization. The presence of clickbaits on any news aggregator portal may lead to unpleasant experience to readers. Automatic detection of clickbait headlines from news headlines has been a challenging issue for the machine learning community. A lot of methods have been proposed for preventing clickbait articles in recent past. However, the recent techniques available in detecting clickbaits are not much robust. This paper proposes a hybrid categorization technique for separating clickbait and non-clickbait articles by integrating different features, sentence structure, and clustering. During preliminary categorization, the headlines are separated using eleven features. After that, the headlines are recategorized using sentence formality, syntactic similarity measures. In the last phase, the headlines are again recategorized by applying clustering using word vector similarity based on t-Stochastic Neighbourhood Embedding (t-SNE) approach. After categorization of these headlines, machine learning models are applied to the data set to evaluate machine learning algorithms. The obtained experimental results indicate the proposed hybrid model is more robust, reliable and efficient than any individual categorization techniques for the real-world dataset we used.

READ FULL TEXT
research
09/25/2010

Web Page Categorization Using Artificial Neural Networks

Web page categorization is one of the challenging tasks in the world of ...
research
10/27/2018

Suspicious News Detection Using Micro Blog Text

We present a new task, suspicious news detection using micro blog text. ...
research
06/07/2018

An Exploration of Unreliable News Classification in Brazil and The U.S

The propagation of unreliable information is on the rise in many places ...
research
03/18/2022

Fake News Detection Using Majority Voting Technique

Due to the evolution of the Web and social network platforms it becomes ...
research
11/03/2017

"Attention" for Detecting Unreliable News in the Information Age

An Unreliable news is any piece of information which is false or mislead...
research
02/09/2022

High-performance automatic categorization and attribution of inventory catalogs

Techniques of machine learning for automatic text categorization are app...
research
03/27/2013

Machine Learning, Clustering, and Polymorphy

This paper describes a machine induction program (WITT) that attempts to...

Please sign up or login with your details

Forgot password? Click here to reset