Transfer Learning for Operator Selection: A Reinforcement Learning Approach

Rafet Durgut, Mehmet Emin Aydin, Abdur Rakib

Research output: Contribution to journalArticlepeer-review

4 Citations (Scopus)
32 Downloads (Pure)


In the past two decades, metaheuristic optimisation algorithms (MOAs) have been increasingly popular, particularly in logistic, science, and engineering problems. The fundamental characteristics of such algorithms are that they are dependent on a parameter or a strategy. Some online and offline strategies are employed in order to obtain optimal configurations of the algorithms. Adaptive operator selection is one of them, and it determines whether or not to update a strategy from the strategy pool during the search process. In the field of machine learning, Reinforcement Learning (RL) refers to goal-oriented algorithms, which learn from the environment how to achieve a goal. On MOAs, reinforcement learning has been utilised to control the operator selection process. However, existing research fails to show that learned information may be transferred from one problem-solving procedure to another. The primary goal of the proposed research is to determine the impact of transfer learning on RL and MOAs. As a test problem, a set union knapsack problem with 30 separate benchmark problem instances is used. The results are statistically compared in depth. The learning process, according to the findings, improved the convergence speed while significantly reducing the CPU time.
Original languageEnglish
Article number24
Number of pages17
Issue number1
Publication statusPublished - 17 Jan 2022
Externally publishedYes

Bibliographical note

This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.


  • transfer learning
  • reinforcement learning
  • adaptive operator selection
  • artificial bee colony


Dive into the research topics of 'Transfer Learning for Operator Selection: A Reinforcement Learning Approach'. Together they form a unique fingerprint.

Cite this