Kategórie
General

Webinar amália: Towards a Multimodal LLM for European Portuguese

Join us for an inspiring session on the development of amália, Portugal’s large language model designed to bring the richness of European Portuguese into the new era of multimodal artificial intelligence.
This webinar will feature Prof. João Magalhães from NOVA LINCS, Universidade NOVA de Lisboa, who will present the goals, architecture, and progress of this national AI initiative. The talk will explore how amália combines text, speech, image, and video understanding, and how it contributes to building culturally aligned and trustworthy AI systems for public, academic, and enterprise use.

Webinar amália: Towards a Multimodal LLM for European Portuguese

Join us for an inspiring session on the development of amália, Portugal’s large language model designed to bring the richness of European Portuguese into the new era of multimodal artificial intelligence.

This webinar will feature Prof. João Magalhães from NOVA LINCS, Universidade NOVA de Lisboa, who will present the goals, architecture, and progress of this national AI initiative. The talk will explore how amália combines text, speech, image, and video understanding, and how it contributes to building culturally aligned and trustworthy AI systems for public, academic, and enterprise use.

Date and Time:
Wednesday, November 12th, 2025 | 10:00 AM CEST (9:00 PT)
Online | Free Registration

This webinar is organized by the Slovak National Supercomputing Centre as part of the EuroCC project (National Competence Centre – NCC Slovakia) in cooperation with NCC Portugal , within the LLM Webinar Series connecting high-performance computing with artificial intelligence, culture, and innovation. The webinar will be held in English.

The webinar will be held in English.

Abstract:

amália is a government-backed large language model focused on European Portuguese, designed to capture linguistic nuances, cultural context, and multimodal capability across text, speech, image, and video. The project aims for high-impact applications in public administration, research, and industry, with a strong emphasis on trust, alignment, and data sovereignty.

This presentation outlines the process and key methodologies employed by the amália LLM Team at NOVA University and Instituto Superior Técnico / UL, with a focus on utilizing European HPC resources such as the Marenostrum 5 (MN5) and Deucalion supercomputers. Prior Portuguese-language initiatives (like GlórIA) and European initiatives (such as EuroLLM) inform benchmarks and evaluation and help define the roadmap from beta to public release.

The core development pipeline is explored through three key dimensions:
1️ Initial data preparation and training, involving an extensive, multi-month process of transforming noisy HTML and PDFs into high-quality raw texts, followed by tokenization and core training methodologies in language modeling and instruction tuning.
2️. Model alignment, achieved through reinforcement learning approaches, Direct Preference Optimization (DPO) and Group Relative Policy Optimization (GRPO) with Verifiable Rewards (VR) to ensure safer and more trustworthy responses.
3️. Infrastructure setup for advanced RLVR (GRPO) training on MN5, which uses inference nodes for sampling and training nodes for collecting samples and running Verifiable Rewards, highlighting the complexity of configuring multiple custom environments (e.g., mathematics, programming, biology).

The talk concludes with key insights into the computational and methodological rigor required to efficiently develop state-of-the-art LLMs, positioning this work at the forefront of Europe’s innovation path in AI.

Speaker:

Prof. João Magalhães – CMU Portugal co-Director; Head of the Multimodal Systems Group, NOVA LINCS, Universidade NOVA de Lisboa


Prof. João Miguel da Costa Magalhães is a Full Professor in the Department of Informatics at Universidade NOVA de Lisboa and a senior researcher at NOVA LINCS. He serves as co-Director of the CMU Portugal Program and leads the Multimodal Systems Group. His research focuses on vision-language models, multimodal learning, and AI systems for semantic multimedia. He earned his PhD from Imperial College London and has been a key figure in Portugal’s AI and digital innovation ecosystem.

Topics Include:

  • Building the amália model: architecture, training, and data curation
  • Multimodal AI for text, speech, image, and video
  • Cultural alignment and linguistic sovereignty in LLMs
  • Evaluation, transparency, and responsible AI governance
  • Future roadmap and collaboration opportunities

Outline:

  1. Introduction: Why a Portuguese multimodal LLM
  2. From Language to Multimodality: Scope and capabilities of amália
  3. Data and Alignment: Linguistic diversity and cultural fidelity
  4. Model Architecture and Training Process
  5. Status and Roadmap: Beta achievements and next steps
  6. Applications and Impact Scenarios
  7. Discussion and Q&A

Online Lecture: HPC & AI – Competition or Collaboration? 7 Nov - On 26 November 2025 at 10:00, we are hosting a special online lecture dedicated to the rapidly evolving topic of the interplay between high-performance computing (HPC) and artificial intelligence (AI). Our guest will be Erwin Laure, one of the most prominent figures in the European HPC ecosystem.
Success story: The Future of Soil Hidden in Data  5 Nov - High-Performance Computing (HPC) offers researchers the ability to process enormous volumes of data and uncover connections that would otherwise remain hidden. Today, it is no longer just a tool for technical disciplines – it is increasingly valuable in social and environmental research as well. A great example is a project that harnessed the power of HPC to gain deeper insight into the relationship between humans, soil, and the landscape.
AI-Driven Archaeology with LLMs — Detecting Archaeological Sites from Aerial Imagery 30 Oct - Objavte, ako umelá inteligencia a veľké jazykové modely menia spôsob, akým odhaľujeme stopy minulosti. Webinár priblíži, ako pokročilé techniky umelej inteligencie inšpirované LLM dokážu analyzovať letecké a LiDARové snímky a detegovať archeologické lokality s bezprecedentnou presnosťou. Dr. Daniel Canedo z Univerzity v Aveire predstaví praktické príklady, v ktorých Vision Transformers a multimodálne učenie odhaľujú skryté vzory v krajine a spájajú moderné technológie s kultúrnym dedičstvom.