Reinforcement Learning Using Fully Connected, Attention, and Transformer Models in Knapsack Problem Solving

No Thumbnail Available

Date

2022

Journal Title

Journal ISSN

Volume Title

Publisher

Wiley

Open Access Color

Green Open Access

No

OpenAIRE Downloads

OpenAIRE Views

Publicly Funded

No
Impulse
Top 10%
Influence
Top 10%
Popularity
Top 10%

Research Projects

Journal Issue

Abstract

Knapsack is a combinatorial optimization problem that involves a variety of resource allocation challenges. It is defined as non-deterministic polynomial time (NP) hard and has a wide range of applications. Knapsack problem (KP) has been studied in applied mathematics and computer science for decades. Many algorithms that can be classified as exact or approximate solutions have been proposed. Under the category of exact solutions, algorithms such as branch-and-bound and dynamic programming and the approaches obtained by combining these algorithms can be classified. Due to the fact that exact solutions require a long processing time, many approximate methods have been introduced for knapsack solution. In this research, deep Q-learning using models containing fully connected layers, attention, and transformer as function estimators were used to provide the solution for KP. We observed that deep Q-networks, which continued their training by observing the reward signals provided by the knapsack environment we developed, optimized the total reward gained over time. The results showed that our approaches give near-optimum solutions and work about 40 times faster than an exact algorithm using dynamic programming.

Description

YILDIZ, Beytullah/0000-0001-7664-5145

Keywords

attention, combinatorial optimization problem, deep Q-learning, knapsack, reinforcement learning, transformer

Turkish CoHE Thesis Center URL

Fields of Science

0202 electrical engineering, electronic engineering, information engineering, 02 engineering and technology, 01 natural sciences, 0105 earth and related environmental sciences

Citation

WoS Q

Q3

Scopus Q

Q2
OpenCitations Logo
OpenCitations Citation Count
14

Source

Concurrency and Computation: Practice and Experience

Volume

34

Issue

9

Start Page

End Page

Collections

PlumX Metrics
Citations

CrossRef : 2

Scopus : 19

Captures

Mendeley Readers : 23

SCOPUS™ Citations

19

checked on Feb 04, 2026

Web of Science™ Citations

11

checked on Feb 04, 2026

Page Views

2

checked on Feb 04, 2026

Google Scholar Logo
Google Scholar™
OpenAlex Logo
OpenAlex FWCI
2.57350944

Sustainable Development Goals

3

GOOD HEALTH AND WELL-BEING
GOOD HEALTH AND WELL-BEING Logo

5

GENDER EQUALITY
GENDER EQUALITY Logo

17

PARTNERSHIPS FOR THE GOALS
PARTNERSHIPS FOR THE GOALS Logo