A Rule-Based Approach To Embedding Techniques for Text Document Classification
dc.authorid | Mishra, Alok/0000-0003-1275-2050 | |
dc.authorscopusid | 57204948522 | |
dc.authorscopusid | 7201441575 | |
dc.authorwosid | Mishra, Alok/AAE-2673-2019 | |
dc.authorwosid | aubaid, asmaa/AAY-4014-2021 | |
dc.contributor.author | Aubaid, Asmaa M. | |
dc.contributor.author | Mishra, Alok | |
dc.contributor.other | Software Engineering | |
dc.date.accessioned | 2024-07-05T15:38:09Z | |
dc.date.available | 2024-07-05T15:38:09Z | |
dc.date.issued | 2020 | |
dc.department | Atılım University | en_US |
dc.department-temp | [Aubaid, Asmaa M.; Mishra, Alok] Atilim Univ, Dept Modeling & Design Engn Syst Modes, Dept Software Engn, TR-06830 Ankara, Turkey; [Aubaid, Asmaa M.] Minist Higher Educ & Sci Res Sci & Technol, Directorate Informat Technol, Baghdad 10070, Iraq; [Mishra, Alok] Molde Univ Coll Specialized Univ Logist, Fac Logist, N-6410 Molde, Norway | en_US |
dc.description | Mishra, Alok/0000-0003-1275-2050; | en_US |
dc.description.abstract | With the growth of online information and sudden expansion in the number of electronic documents provided on websites and in electronic libraries, there is difficulty in categorizing text documents. Therefore, a rule-based approach is a solution to this problem; the purpose of this study is to classify documents by using a rule-based. This paper deals with the rule-based approach with the embedding technique for a document to vector (doc2vec) files. An experiment was performed on two data sets Reuters-21578 and the 20 Newsgroups to classify the top ten categories of these data sets by using a document to vector rule-based (D2vecRule). Finally, this method provided us a good classification result according to the F-measures and implementation time metrics. In conclusion, it was observed that our algorithm document to vector rule-based (D2vecRule) was good when compared with other algorithms such as JRip, One R, and ZeroR applied to the same Reuters-21578 dataset. | en_US |
dc.identifier.citationcount | 15 | |
dc.identifier.doi | 10.3390/app10114009 | |
dc.identifier.issn | 2076-3417 | |
dc.identifier.issue | 11 | en_US |
dc.identifier.scopus | 2-s2.0-85086934327 | |
dc.identifier.uri | https://doi.org/10.3390/app10114009 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14411/3048 | |
dc.identifier.volume | 10 | en_US |
dc.identifier.wos | WOS:000543385900346 | |
dc.identifier.wosquality | Q2 | |
dc.institutionauthor | Mıshra, Alok | |
dc.language.iso | en | en_US |
dc.publisher | Mdpi | en_US |
dc.relation.publicationcategory | Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı | en_US |
dc.rights | info:eu-repo/semantics/openAccess | en_US |
dc.scopus.citedbyCount | 31 | |
dc.subject | text classification | en_US |
dc.subject | rule-based | en_US |
dc.subject | word embedding | en_US |
dc.subject | Doc2vec | en_US |
dc.title | A Rule-Based Approach To Embedding Techniques for Text Document Classification | en_US |
dc.type | Article | en_US |
dc.wos.citedbyCount | 22 | |
dspace.entity.type | Publication | |
relation.isAuthorOfPublication | de97bc0b-032d-4567-835e-6cd0cb17b98b | |
relation.isAuthorOfPublication.latestForDiscovery | de97bc0b-032d-4567-835e-6cd0cb17b98b | |
relation.isOrgUnitOfPublication | d86bbe4b-0f69-4303-a6de-c7ec0c515da5 | |
relation.isOrgUnitOfPublication.latestForDiscovery | d86bbe4b-0f69-4303-a6de-c7ec0c515da5 |