Detection of Misogyny in Hindi Code-Mixed Texts by BiGRU with Bahdanau Attention using ByT5 Embeddings
Abstract
Social media platforms became the hub for conveying messages and responses to current events, but contain more harmful aspects by influencing negative stereotypes, spreading false information, and enabling misogyny in some scenarios. Detecting misogynistic language in social media is challenging for code-mixed languages due to demographic variations, transliteration, and noisy texts. The model has been evaluated on a Hindi-English code-mixed dataset of misogynistic comments. We proposed a hybrid misogyny classification model that combines byte-level ByT5 encoder embeddings with a Bidirectional Gated Recurrent Unit (BiGRU) augmented by the Bahdanau attention mechanism. ByT5 produces robust, subword-agnostic representations that reduce sensitivity to spelling variations and code switching. The BiGRU captures contextual sequential patterns and bidirectional dependencies, while attention emphasizes the most indicative tokens of abusive intent. It is demonstrated that the proposed hybrid model outperforms recurrent neural networks with static and dynamic embeddings, producing more stable misogyny predictions in low-resource and noisy texts.
Keywords
Bahdanau Attention Mechanism, Bidirectional Gated Recurrent Unit, ByT5 Embedding, Code-Mixed, Misogyny, Noisy Text
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
S. Karishma and V. Akila, "Detection of Misogyny in Hindi Code-Mixed Texts by BiGRU with Bahdanau Attention using ByT5 Embeddings," in Journal of Communications Software and Systems, vol. 22, no. 2, pp. 257-264, June 2026, doi: https://doi.org/10.24138/jcomss-2025-0240
@article{karishma2026detectionmisogyny,
author = {S. Karishma and V. Akila},
title = {Detection of Misogyny in Hindi Code-Mixed Texts by BiGRU with Bahdanau Attention using ByT5
Embeddings},
journal = {Journal of Communications Software and Systems},
month = {6},
year = {2026},
volume = {22},
number = {2},
pages = {257--264},
doi = {https://doi.org/10.24138/jcomss-2025-0240},
url = {https://doi.org/https://doi.org/10.24138/jcomss-2025-0240}
}