Optimizing the Structures of Transformer Neural Networks Using Parallel Simulated Annealing
Authors
AGH University of Krakow, Faculty of Physics and Applied Computer Science, Poland
NASK National Research Institute, Warsaw, Poland
Systems Research Institute, Polish Academy of Sciences, Warsaw, Poland
Language: English
Page range: 267 - 282
Submitted on: Jan 9, 2024
Accepted on: May 26, 2024
Published on: Jun 11, 2024
Published by: SAN University
In partnership with: Paradigm Publishing Services
Publication frequency: 4 issues per year
Related subjects:
© 2024 Maciej Trzciński, Szymon Łukasik, Amir H. Gandomi, published by SAN University
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.