|
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
|
| Volume 187 - Issue 96 |
| Published: April 2026 |
| Authors: Neha Tyagi, Gopal Krishna Sharma, Narendra Kumar Sharma |
10.5120/ijca2ad422afaaf6
|
Neha Tyagi, Gopal Krishna Sharma, Narendra Kumar Sharma . A Hybrid MuRIL–Attention–Random Forest Framework for Hate Speech Detection Against Women in Hindi. International Journal of Computer Applications. 187, 96 (April 2026), 51-59. DOI=10.5120/ijca2ad422afaaf6
@article{ 10.5120/ijca2ad422afaaf6,
author = { Neha Tyagi,Gopal Krishna Sharma,Narendra Kumar Sharma },
title = { A Hybrid MuRIL–Attention–Random Forest Framework for Hate Speech Detection Against Women in Hindi },
journal = { International Journal of Computer Applications },
year = { 2026 },
volume = { 187 },
number = { 96 },
pages = { 51-59 },
doi = { 10.5120/ijca2ad422afaaf6 },
publisher = { Foundation of Computer Science (FCS), NY, USA }
}
%0 Journal Article
%D 2026
%A Neha Tyagi
%A Gopal Krishna Sharma
%A Narendra Kumar Sharma
%T A Hybrid MuRIL–Attention–Random Forest Framework for Hate Speech Detection Against Women in Hindi%T
%J International Journal of Computer Applications
%V 187
%N 96
%P 51-59
%R 10.5120/ijca2ad422afaaf6
%I Foundation of Computer Science (FCS), NY, USA
Societal hate speech against women on social media is growing, especially in dialects with limited resources like Hindi, where diversity of linguistics, unofficial writing styles, and social stratification make machine-generated detection fail. Modern ML and learning methods struggle to capture contextual semantics and control differences, resulting in low accuracy. This study proposes a blended framework that combines MuRIL, a multilingual transformer-inspired language model for Indian languages, with a focus mechanism and a random forest classifier to recognize Hindi sexist comments directed at women. MuRIL embeds provide deep background visualizations, while the attention layer reveals patterns of hateful language. The 2,020 professionally labelled Hindi social network database is used for comprehensive assessments. The proposed hybrid framework is compared against TF-IDF with SVM, CNN, Bi-LSTM with attention, and separate MuRIL-based models. Studies show that the MuRIL–Attention–Random Forest design outperforms traditional models in targeted detection, with an average precision of 92.82% and a greater group-wise difference. Using transformer-driven meaning representations with machine learning ensembles improves spotting accuracy in limited-resource and unbalanced situations. configurations. The current arrangement is an effective and durable solution for Hindi offensive language identification and a solid foundation for multilingual and continuing regulatory system.