research-article

Overview of the HASOC track at FIRE 2019: Hate Speech and Offensive Content Identification in Indo-European Languages

Authors:
Thomas Mandl

University of Hildesheim, Germany

University of Hildesheim, Germany
View Profile

,
Sandip Modha

DA-IICT, Gandhinagar, India

DA-IICT, Gandhinagar, India
View Profile

,
Prasenjit Majumder

DA-IICT, Gandhinagar, India

DA-IICT, Gandhinagar, India
View Profile

,
Daksh Patel

Dalhousie University, Halifax, Canada

Dalhousie University, Halifax, Canada
View Profile

,
Mohana Dave

LDRP-ITR, Gandhinagar, India

LDRP-ITR, Gandhinagar, India
View Profile

,
Chintak Mandlia

infoAnalytica Consulting Pvt. Ltd

infoAnalytica Consulting Pvt. Ltd
View Profile

,
Aditya Patel

Dalhousie University, Halifax, Canada

Dalhousie University, Halifax, Canada
View Profile

FIRE '19: Proceedings of the 11th Annual Meeting of the Forum for Information Retrieval EvaluationDecember 2019Pages 14–17https://doi.org/10.1145/3368567.3368584

Published:12 December 2019Publication History

FIRE '19: Proceedings of the 11th Annual Meeting of the Forum for Information Retrieval Evaluation

Pages 14–17

ABSTRACT

The identification of Hate Speech in Social Media is of great importance and receives much attention in the text classification community. There is a huge demand for research for languages other than English. The HASOC track intends to stimulate development in Hate Speech for Hindi, German and English. Three datasets were developed from Twitter and Facebook and made available. Binary classification and more fine-grained subclasses were offered in 3 subtasks. For all subtasks, 321 experiments were submitted. The approaches used most often were LSTM networks processing word embedding input. The performance of the best system for identification of Hate Speech for English, Hindi, and German was a Marco-F1 score of 0.78, 0.81 and 0.61, respectively.

References

Fortuna, P., & Nunes, S. (2018). A survey on automatic detection of hate speech in text. ACM Computing Surveys (CSUR), 51(4), 85.Google ScholarDigital Library
Schmidt, A., & Wiegand, M. (2017, April). A survey on hate speech detection using natural language processing. In Proceedings of the Fifth International Workshop on Natural Language Processing for Social Media (pp. 1--10).Google ScholarCross Ref
Wiegand, M., Siegel, M., & Ruppenhofer, J. (2018). Overview of the Germeval 2018 shared task on the identification of offensive language. Proceedings of GermEval 2018, https://ids-pub.bsz-bw.de/files/8493/Wiegand_Siegel_Ruppenhofer_Overview_of_the_GermEval_2018.pdfGoogle Scholar
Zampieri, M., Malmasi, S., Nakov, P., Rosenthal, S., Farra, N., & Kumar, R. (2019). Semeval-2019 task 6: Identifying and categorizing offensive language in social media (offenseval). arXiv preprint arXiv:1903.08983.Google Scholar

Recommendations

Overview of the HASOC Subtrack at FIRE 2021: Hate Speech and Offensive Content Identification in English and Indo-Aryan Languages and Conversational Hate Speech
FIRE '21: Proceedings of the 13th Annual Meeting of the Forum for Information Retrieval Evaluation

The HASOC track is dedicated to the evaluation of technology for finding Offensive Language and Hate Speech. HASOC is creating a multilingual data corpus mainly for English and under-resourced languages(Hindi and Marathi). This paper presents one HASOC ...
Read More
Overview of the HASOC Subtrack at FIRE 2022: Hate Speech and Offensive Content Identification in English and Indo-Aryan Languages
FIRE '22: Proceedings of the 14th Annual Meeting of the Forum for Information Retrieval Evaluation

In recent years, the spread of online offensive content has become of great concern, motivating researchers to develop robust systems capable of identifying such content automatically. To carry out a fair evaluation of these systems, several ...
Read More
Tracking Hate in Social Media: Evaluation, Challenges and Approaches
Abstract
This paper presents online hate speech as a societal and computational challenge. Offensive content detection in social media is considered as a multilingual, multi-level, multi-class classification problem for three Indo-European languages. This ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
FIRE '19: Proceedings of the 11th Annual Meeting of the Forum for Information Retrieval Evaluation
December 2019
77 pages
ISBN:9781450377508
DOI:10.1145/3368567
Editors:
Prasenjit Majumder,
Mandar Mitra,
Surupendu Gangopadhyay,
Parth Mehta
Copyright © 2019 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 12 December 2019
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Deep Learning
Evaluation
Hate Speech
Text Classification
Qualifiers
- research-article
- Research
- Refereed limited
Conference

Acceptance Rates
Overall Acceptance Rate19of64submissions,30%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 119
  Total Citations
  View Citations
- 1,749
  Total Downloads
- Downloads (Last 12 months)288
- Downloads (Last 6 weeks)14
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Overview of the HASOC track at FIRE 2019: Hate Speech and Offensive Content Identification in Indo-European Languages

FIRE '19: Proceedings of the 11th Annual Meeting of the Forum for Information Retrieval Evaluation

ABSTRACT

References

Cited By

Recommendations

Overview of the HASOC Subtrack at FIRE 2021: Hate Speech and Offensive Content Identification in English and Indo-Aryan Languages and Conversational Hate Speech

Overview of the HASOC Subtrack at FIRE 2022: Hate Speech and Offensive Content Identification in English and Indo-Aryan Languages

Tracking Hate in Social Media: Evaluation, Challenges and Approaches

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Overview of the HASOC track at FIRE 2019: Hate Speech and Offensive Content Identification in Indo-European Languages

FIRE '19: Proceedings of the 11th Annual Meeting of the Forum for Information Retrieval Evaluation

ABSTRACT

References

Cited By

Recommendations

Overview of the HASOC Subtrack at FIRE 2021: Hate Speech and Offensive Content Identification in English and Indo-Aryan Languages and Conversational Hate Speech

Overview of the HASOC Subtrack at FIRE 2022: Hate Speech and Offensive Content Identification in English and Indo-Aryan Languages

Tracking Hate in Social Media: Evaluation, Challenges and Approaches

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media