Open Access

Optimizing the Structures of Transformer Neural Networks Using Parallel Simulated Annealing

Maciej Trzciński, Szymon Łukasik and Amir H. Gandomi
June 11, 2024
About this article

Trzciński, Maciej
AGH University of Krakow, Faculty of Physics and Applied Computer Science, Poland
NASK National Research Institute, Warsaw, Poland
Łukasik, Szymon
AGH University of Krakow, Faculty of Physics and Applied Computer Science, Poland
NASK National Research Institute, Warsaw, Poland
Systems Research Institute, Polish Academy of Sciences, Warsaw, Poland
Gandomi, Amir H.
University of Technology Sydney, Faculty of Engineering and Information Technology, Australia
University Research and Innovation Center (EKIK), Óbuda University, Budapest, Hungary