View a PDF of the paper titled Investigating Annotator Bias in Giant Language Fashions for Hate Speech Detection, by Amit Das and 9 different authors
Summary:Information annotation, the follow of assigning descriptive labels to uncooked information, is pivotal in optimizing the efficiency of machine studying fashions. Nonetheless, it’s a resource-intensive course of vulnerable to biases launched by annotators. The emergence of refined Giant Language Fashions (LLMs), like ChatGPT presents a novel alternative to modernize and streamline this advanced process. Whereas present analysis extensively evaluates the efficacy of LLMs, as annotators, this paper delves into the biases current in LLMs, particularly GPT 3.5 and GPT 4o when annotating hate speech information. Our analysis contributes to understanding biases in 4 key classes: gender, race, faith, and incapacity. Particularly concentrating on extremely weak teams inside these classes, we analyze annotator biases. Moreover, we conduct a complete examination of potential components contributing to those biases by scrutinizing the annotated information. We introduce our customized hate speech detection dataset, HateSpeechCorpus, to conduct this analysis. Moreover, we carry out the identical experiments on the ETHOS (Mollas et al., 2022) dataset additionally for comparative evaluation. This paper serves as a vital useful resource, guiding researchers and practitioners in harnessing the potential of LLMs for dataannotation, thereby fostering developments on this crucial subject. The HateSpeechCorpus dataset is out there right here: this https URL
Submission historical past
 From: Aman Chadha Mr. [view email]      
 [v1]
        Mon, 17 Jun 2024 00:18:31 UTC (118 KB)
 [v2]
        Tue, 18 Jun 2024 06:21:16 UTC (118 KB)
 [v3]
        Sat, 12 Oct 2024 21:46:04 UTC (118 KB)
Source link 
 
 #Investigating #Annotator #Bias #Giant #Language #Fashions #Hate #Speech #Detection
 
   
Unlock the potential of cutting-edge AI options with our complete choices. As a number one supplier within the AI panorama, we harness the facility of synthetic intelligence to revolutionize industries. From machine studying and information analytics to pure language processing and laptop imaginative and prescient, our AI options are designed to reinforce effectivity and drive innovation. Discover the limitless potentialities of AI-driven insights and automation that propel your corporation ahead. With a dedication to staying on the forefront of the quickly evolving AI market, we ship tailor-made options that meet your particular wants. Be part of us on the forefront of technological development, and let AI redefine the best way you use and reach a aggressive panorama. Embrace the long run with AI excellence, the place potentialities are limitless, and competitors is surpassed.
![[2406.11109] Investigating Annotator Bias in Large Language Models for Hate Speech Detection [2406.11109] Investigating Annotator Bias in Large Language Models for Hate Speech Detection](https://i0.wp.com/arxiv.org/static/browse/0.3.4/images/arxiv-logo-fb.png?w=1200&resize=1200,700&ssl=1) 
 








