Reinforcement Learning Using Fully Connected, Attention, and Transformer Models in Knapsack Problem Solving
No Thumbnail Available
Date
2022
Journal Title
Journal ISSN
Volume Title
Publisher
Wiley
Open Access Color
Green Open Access
No
OpenAIRE Downloads
OpenAIRE Views
Publicly Funded
No
Abstract
Knapsack is a combinatorial optimization problem that involves a variety of resource allocation challenges. It is defined as non-deterministic polynomial time (NP) hard and has a wide range of applications. Knapsack problem (KP) has been studied in applied mathematics and computer science for decades. Many algorithms that can be classified as exact or approximate solutions have been proposed. Under the category of exact solutions, algorithms such as branch-and-bound and dynamic programming and the approaches obtained by combining these algorithms can be classified. Due to the fact that exact solutions require a long processing time, many approximate methods have been introduced for knapsack solution. In this research, deep Q-learning using models containing fully connected layers, attention, and transformer as function estimators were used to provide the solution for KP. We observed that deep Q-networks, which continued their training by observing the reward signals provided by the knapsack environment we developed, optimized the total reward gained over time. The results showed that our approaches give near-optimum solutions and work about 40 times faster than an exact algorithm using dynamic programming.
Description
YILDIZ, Beytullah/0000-0001-7664-5145
ORCID
Keywords
attention, combinatorial optimization problem, deep Q-learning, knapsack, reinforcement learning, transformer
Turkish CoHE Thesis Center URL
Fields of Science
0202 electrical engineering, electronic engineering, information engineering, 02 engineering and technology, 01 natural sciences, 0105 earth and related environmental sciences
Citation
WoS Q
Q3
Scopus Q
Q2

OpenCitations Citation Count
14
Source
Concurrency and Computation: Practice and Experience
Volume
34
Issue
9
Start Page
End Page
PlumX Metrics
Citations
CrossRef : 2
Scopus : 19
Captures
Mendeley Readers : 23
SCOPUS™ Citations
19
checked on Feb 04, 2026
Web of Science™ Citations
11
checked on Feb 04, 2026
Page Views
2
checked on Feb 04, 2026
Google Scholar™

OpenAlex FWCI
2.57350944
Sustainable Development Goals
3
GOOD HEALTH AND WELL-BEING

5
GENDER EQUALITY

17
PARTNERSHIPS FOR THE GOALS


