{"id":9644,"date":"2024-10-03T16:58:00","date_gmt":"2024-10-03T14:58:00","guid":{"rendered":"https:\/\/eurocc.nscc.sk\/?p=9644"},"modified":"2024-10-03T16:58:00","modified_gmt":"2024-10-03T14:58:00","slug":"privitajte-mistral-sk-7b","status":"publish","type":"post","link":"https:\/\/eurocc.nscc.sk\/en\/privitajte-mistral-sk-7b\/","title":{"rendered":"Welcome, Mistral-sk-7b!"},"content":{"rendered":"<div class=\"is-layout-flow wp-block-group alignfull posts-all\"><div class=\"wp-block-group__inner-container\">\n<div class=\"is-layout-flex wp-container-4 wp-block-columns\">\n<div class=\"is-layout-flow wp-block-column\" style=\"flex-basis:60%\">\n<div class=\"is-layout-flow wp-block-group alignfull\"><div class=\"wp-block-group__inner-container\">\n<p><strong>Welcome, Mistral-sk-7b!<\/strong><\/p>\n\n\n\n<h5 class=\"post-h\"><hr style=\"border: 1px solid #b8870b; width: 100px;\"><\/h5>\n\n\n\n<p>After a long wait, the Slovak AI community has a new and, this time, truly large language model for the Slovak language. 
Almost three years after the release of the first Slovak language model, SlovakBERT, a team of authors consisting of Peter Bedn\u00e1r from the Department of Cybernetics and Artificial Intelligence FEI TUKE, Marek Dobe\u0161 from the Center for Social and Psychological Sciences SAV and Radovan Garab\u00edk from the \u013dudov\u00edt \u0160t\u00far Institute of Linguistics SAV built on the multilingual Mistral-7B-v0.1 model, which has seven billion parameters.<\/p>\n<\/div><\/div>\n<\/div>\n\n\n\n<div class=\"is-layout-flow wp-block-column\" style=\"flex-basis:50%\">\n<figure class=\"wp-block-image size-large\"><img decoding=\"async\" loading=\"lazy\" width=\"1024\" height=\"575\" src=\"https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/10\/Capture-1024x575.png\" alt=\"\" class=\"wp-image-9646\" srcset=\"https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/10\/Capture-1024x575.png 1024w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/10\/Capture-300x169.png 300w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/10\/Capture-768x431.png 768w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/10\/Capture-18x10.png 18w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/10\/Capture.png 1200w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n<\/div>\n<\/div>\n\n\n\n<h5 class=\"post-h\"><\/h5>\n\n\n\n<p>Data from the web corpus \"Araneum Slovacum VII Maximum\" were used to fine-tune all parameters. As the authors state, the quality of the pre-trained model can be improved, if necessary, by further training on a specific task. They also point out that the model does not contain any moderation mechanism.<\/p>\n\n\n\n<p>The Mistral-sk-7b model was trained on Leonardo, currently the seventh most powerful supercomputer in the world, procured by the consortium of the same name, of which Slovakia is also a member. 
The authors obtained the necessary computing time through a successful project in the national call for access to the Leonardo supercomputer, which was coordinated by the Computing Center of the Slovak Academy of Sciences in cooperation with the National Supercomputer Center.<\/p>\n\n\n\n<p>The <a href=\"https:\/\/huggingface.co\/slovak-nlp\/mistral-sk-7b\">Mistral-sk-7b model<\/a> is published under the Apache 2.0 license on the Hugging Face platform, under the organization \"Slovak NLP Community\".<\/p>\n\n\n\n<div class=\"is-layout-flow wp-block-group alignfull call-bot\"><div class=\"wp-block-group__inner-container\">\n<div class=\"is-horizontal is-content-justification-center is-layout-flex wp-container-5 wp-block-buttons\">\n<div class=\"wp-block-button\"><a class=\"wp-block-button__link wp-element-button\" href=\"\/en\/calls-for-proposals\/\">BACK<\/a><\/div>\n<\/div>\n<\/div><\/div>\n<\/div><\/div>","protected":false},"excerpt":{"rendered":"<p>After a long wait, the Slovak AI community has a new and, this time, truly large language model for the Slovak 
language.<\/p>","protected":false},"author":2,"featured_media":9646,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"templates\/template-full-width.php","format":"standard","meta":[],"categories":[1],"tags":[],"_links":{"self":[{"href":"https:\/\/eurocc.nscc.sk\/en\/wp-json\/wp\/v2\/posts\/9644"}],"collection":[{"href":"https:\/\/eurocc.nscc.sk\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/eurocc.nscc.sk\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/eurocc.nscc.sk\/en\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/eurocc.nscc.sk\/en\/wp-json\/wp\/v2\/comments?post=9644"}],"version-history":[{"count":4,"href":"https:\/\/eurocc.nscc.sk\/en\/wp-json\/wp\/v2\/posts\/9644\/revisions"}],"predecessor-version":[{"id":9649,"href":"https:\/\/eurocc.nscc.sk\/en\/wp-json\/wp\/v2\/posts\/9644\/revisions\/9649"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/eurocc.nscc.sk\/en\/wp-json\/wp\/v2\/media\/9646"}],"wp:attachment":[{"href":"https:\/\/eurocc.nscc.sk\/en\/wp-json\/wp\/v2\/media?parent=9644"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/eurocc.nscc.sk\/en\/wp-json\/wp\/v2\/categories?post=9644"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/eurocc.nscc.sk\/en\/wp-json\/wp\/v2\/tags?post=9644"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}