Text classification using improved bidirectional transformer
No Thumbnail Available
Date
2022
Journal Title
Journal ISSN
Volume Title
Publisher
Wiley
Open Access Color
OpenAIRE Downloads
OpenAIRE Views
Abstract
Text data have an important place in our daily life. A huge amount of text data is generated everyday. As a result, automation becomes necessary to handle these large text data. Recently, we are witnessing important developments with the adaptation of new approaches in text processing. Attention mechanisms and transformers are emerging as methods with significant potential for text processing. In this study, we introduced a bidirectional transformer (BiTransformer) constructed using two transformer encoder blocks that utilize bidirectional position encoding to take into account the forward and backward position information of text data. We also created models to evaluate the contribution of attention mechanisms to the classification process. Four models, including long short term memory, attention, transformer, and BiTransformer, were used to conduct experiments on a large Turkish text dataset consisting of 30 categories. The effect of using pretrained embedding on models was also investigated. Experimental results show that the classification models using transformer and attention give promising results compared with classical deep learning models. We observed that the BiTransformer we proposed showed superior performance in text classification.
Description
YILDIZ, Beytullah/0000-0001-7664-5145; Tezgider, Murat/0000-0002-4918-5697
Keywords
attention, deep learning, machine learning, text classification, text processing, transformer
Turkish CoHE Thesis Center URL
Fields of Science
Citation
14
WoS Q
Q3
Scopus Q
Q2
Source
Volume
34
Issue
9