{"id":9083,"date":"2024-06-21T17:49:42","date_gmt":"2024-06-21T15:49:42","guid":{"rendered":"https:\/\/eurocc.nscc.sk\/?p=9083"},"modified":"2024-06-21T18:21:06","modified_gmt":"2024-06-21T16:21:06","slug":"implementacia-metody-ciastocne-riadeneho-ucenia-uni-match-do-metody-frame-field-learning-pre-ulohu-extrakcie-budov-z-leteckych-snimok","status":"publish","type":"post","link":"https:\/\/eurocc.nscc.sk\/en\/implementacia-metody-ciastocne-riadeneho-ucenia-uni-match-do-metody-frame-field-learning-pre-ulohu-extrakcie-budov-z-leteckych-snimok\/","title":{"rendered":"<strong>Semi-Supervised Learning in Aerial Imagery: Implementing Uni-Match with Frame Field learning for Building Extraction<\/strong>"},"content":{"rendered":"<div class=\"is-layout-flow wp-block-group alignfull posts-all\"><div class=\"wp-block-group__inner-container\">\n<div class=\"is-layout-flex wp-container-4 wp-block-columns\">\n<div class=\"is-layout-flow wp-block-column\" style=\"flex-basis:60%\">\n<div class=\"is-layout-flow wp-block-group alignfull\"><div class=\"wp-block-group__inner-container\">\n<p><strong>Semi-Supervised Learning in Aerial Imagery: Implementing Uni-Match with Frame Field learning for Building Extraction<\/strong><\/p>\n\n\n\n<p><\/p>\n<\/div><\/div>\n\n\n\n<p>Building extraction in GIS (geographic information system) is pivotal for urban planning, environmental studies, and infrastructure management, allowing for accurate mapping of structures, including the detection of illegal constructions for regulatory compliance. Integrating extracted building data with other geospatial layers enhances the understanding of urban dynamics and spatial relationships. 
Given the scale and complexity of these tasks, there is a growing need to automate building extraction using deep learning techniques, which offer improved accuracy and efficiency in handling large-scale geospatial data.<\/p>\n\n\n\n<p> <\/p>\n<\/div>\n\n\n\n<div class=\"is-layout-flow wp-block-column\">\n<figure class=\"wp-block-image alignwide size-large\"><a href=\"https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure1.png\"><img decoding=\"async\" loading=\"lazy\" width=\"1024\" height=\"853\" src=\"https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure1-1024x853.png\" alt=\"\" class=\"wp-image-9084\" srcset=\"https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure1-1024x853.png 1024w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure1-300x250.png 300w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure1-768x640.png 768w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure1-14x12.png 14w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure1.png 1200w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><\/a><figcaption class=\"wp-element-caption\">Illustrative image<\/figcaption><\/figure>\n<\/div>\n<\/div>\n\n\n\n<p>State-of-the-art image segmentation models primarily produce raster output, whereas GIS applications often require vector polygons. One method that meets this requirement is Frame Field learning, which bridges the gap between the raster outputs of image segmentation models and the vector format needed in GIS. This approach significantly enhances the accuracy of building vectorization by aligning with ground truth contours, and it provides topologically clean vector objects.<\/p>\n\n\n\n<p>These models are trained using a 'supervised learning' method, which requires a large number of labeled examples. However, obtaining such a significant volume of data can be extremely challenging and expensive. 
A potential solution to this problem is 'semi-supervised learning,' a method that reduces reliance on labeled data. In semi-supervised learning, the model is trained with a mix of a small set of labeled data and a larger set of unlabeled data. Hence, the goal of this collaboration between the Slovak National Competence Center for High-Performance Computing and Geodeticca Vision s.r.o. was to identify, implement, and evaluate an appropriate semi-supervised method for Frame Field learning.<\/p>\n\n\n\n<h5>Methods<\/h5>\n\n\n\n<h5>Frame Field learning<\/h5>\n\n\n\n<p>The key idea of Frame Field learning [1] is to help the polygonization method resolve ambiguous cases caused by discrete probability maps (the output of image segmentation models). This is accomplished by adding an extra output to the image segmentation network, namely a frame field (see Fig. 1), which represents the structural features and geometrical characteristics of the building.<\/p>\n\n\n\n<p>Frame fields<\/p>\n\n\n\n<p>A frame field is a 4-PolyVector field that assigns four vectors to each point on a plane. Specifically, the first two vectors are constrained to be opposite to the other two, meaning each point is assigned a set of vectors {u, \u2212u, v, \u2212v}. 
This approach is particularly necessary for buildings, as they are regular structures with sharp corners, and capturing directionality at these sharp corners requires two directions.<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large is-resized\"><a href=\"https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure1.png\"><img decoding=\"async\" loading=\"lazy\" src=\"https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure1-1024x853.png\" alt=\"\" class=\"wp-image-9084\" width=\"512\" height=\"427\" srcset=\"https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure1-1024x853.png 1024w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure1-300x250.png 300w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure1-768x640.png 768w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure1-14x12.png 14w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure1.png 1200w\" sizes=\"(max-width: 512px) 100vw, 512px\" \/><\/a><figcaption class=\"wp-element-caption\"><br><em>Figure 1: Visualization of the frame field output on the image from training set [1].<\/em><\/figcaption><\/figure><\/div>\n\n\n<p>Frame Field learning<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large is-resized\"><a href=\"https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure2.png\"><img decoding=\"async\" loading=\"lazy\" src=\"https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure2-1024x485.png\" alt=\"\" class=\"wp-image-9085\" width=\"512\" height=\"243\" srcset=\"https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure2-1024x485.png 1024w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure2-300x142.png 300w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure2-768x364.png 768w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure2-1536x728.png 1536w, 
https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure2-2048x971.png 2048w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure2-18x9.png 18w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure2-1200x569.png 1200w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure2-1980x939.png 1980w\" sizes=\"(max-width: 512px) 100vw, 512px\" \/><\/a><figcaption class=\"wp-element-caption\">Figure 2: Diagram of Frame Field learning [1]<\/figcaption><\/figure><\/div>\n\n\n<p>The learning process of frame fields can be summarized as follows:<\/p>\n\n\n\n<ol type=\"1\">\n<li>The network's input is a 3\u00d7H\u00d7W RGB image.<\/li>\n\n\n\n<li>Any deep segmentation model, such as U-Net, can be used to generate a feature map, which is then processed to output detailed segmentation maps.<\/li>\n\n\n\n<li>The training is supervised with ground truth rasterized polygons for interiors and edges, utilizing a mix of cross-entropy and Dice loss for accurate segmentation.<\/li>\n\n\n\n<li>To train the frame field, three losses are used:\n<ol>\n<li>L<sub>align<\/sub> enforces alignment of the frame field to the tangent direction.<\/li>\n<li>L<sub>align90<\/sub> prevents the frame field from collapsing to a line field.<\/li>\n<li>L<sub>smooth<\/sub> measures the smoothness of the frame field.<\/li>\n<\/ol>\n<\/li>\n\n\n\n<li>Additional regularization losses are introduced to maintain output consistency, aligning the spatial gradients of the predicted maps with the frame field.<\/li>\n<\/ol>\n\n\n\n<p>Vectorization<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large is-resized\"><a href=\"https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure3.png\"><img decoding=\"async\" loading=\"lazy\" src=\"https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure3-1024x186.png\" alt=\"\" class=\"wp-image-9086\" width=\"512\" height=\"93\" 
srcset=\"https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure3-1024x186.png 1024w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure3-300x55.png 300w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure3-768x140.png 768w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure3-1536x279.png 1536w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure3-2048x372.png 2048w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure3-18x3.png 18w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure3-1200x218.png 1200w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure3-1980x360.png 1980w\" sizes=\"(max-width: 512px) 100vw, 512px\" \/><\/a><figcaption class=\"wp-element-caption\">Figure 3: Visualization of the vectorization process [1]<\/figcaption><\/figure><\/div>\n\n\n<p>The vectorization process transforms classified raster images into vector polygons with a polygonization method based on the Active Skeleton Model (ASM). The principle of this algorithm is the iterative shifting of the vertices of the skeleton graph to their ideal positions. The method optimizes a skeleton graph (a network of pixels outlining the building's structure) created by applying a thinning method to the building wall probability map. The iterative shifting is controlled by a gradient optimization method that minimizes an energy function with the following components:<\/p>\n\n\n\n<ol type=\"1\">\n<li>E<sub>probability<\/sub> fits the skeleton paths to the contour of the building interior probability map at a certain probability threshold, e.g. 
0.5<\/li>\n\n\n\n<li>E<sub>frame field align<\/sub> aligns each edge of the skeleton graph to the frame field.<\/li>\n\n\n\n<li>E<sub>length<\/sub> ensures that the node distribution along paths remains homogeneous as well as tight.<\/li>\n<\/ol>\n\n\n\n<p><strong>UniMatch semi-supervised learning<\/strong><\/p>\n\n\n\n<p>UniMatch [2], an advanced semi-supervised learning method in the consistency regularization category, builds upon the foundational principles established by FixMatch [3], a baseline method in this domain. FixMatch primarily operates on the principle of pseudo-labeling combined with consistency regularization.<\/p>\n\n\n\n<p>The basic principle of the FixMatch method involves generating pseudo-labels for unlabeled data from the predictions of a neural network. Specifically, for a weakly perturbed unlabeled input <em>x<sup>w<\/sup><\/em>, a prediction <em>p<sup>w<\/sup><\/em> is generated, which serves as a pseudo-label for the prediction <em>p<sup>s<\/sup><\/em> obtained from <em>x<sup>s<\/sup><\/em>, a strongly perturbed input. Subsequently, the loss function value, for example cross-entropy(<em>p<sup>w<\/sup>, p<sup>s<\/sup><\/em>), is calculated, considering only areas of <em>p<sup>w<\/sup><\/em> with a probability value greater than a certain threshold, e.g., &gt;0.95.<\/p>\n\n\n\n<p>UniMatch builds upon and extends the FixMatch methodology, introducing two core enhancements:<\/p>\n\n\n\n<ol type=\"1\">\n<li>UniPerb (Unified Perturbations for Images and Features): perturbations are also applied at the feature level. In practice, this means applying a dropout function to the output (i.e., the features) of the encoder part of the neural network, which randomly ignores features; the remaining features then proceed to the decoder part of the network, generating <em>p<sup>fp<\/sup><\/em>.<\/li>\n\n\n\n<li>Instead of using one strong perturbation, two strong perturbations are utilized. 
These are <em>x<sup>s1<\/sup><\/em> and <em>x<sup>s2<\/sup><\/em>.<\/li>\n<\/ol>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large is-resized\"><a href=\"https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure4.png\"><img decoding=\"async\" loading=\"lazy\" src=\"https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure4-1024x743.png\" alt=\"\" class=\"wp-image-9087\" width=\"512\" height=\"372\" srcset=\"https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure4-1024x743.png 1024w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure4-300x218.png 300w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure4-768x557.png 768w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure4-1536x1115.png 1536w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure4-2048x1487.png 2048w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure4-18x12.png 18w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure4-1200x871.png 1200w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure4-1980x1437.png 1980w\" sizes=\"(max-width: 512px) 100vw, 512px\" \/><\/a><figcaption class=\"wp-element-caption\">Figure 4: (a) The FixMatch baseline and (b) the UniMatch method used here. FP denotes feature perturbation; w and s denote weak and strong perturbation, respectively [2].<\/figcaption><\/figure><\/div>\n\n\n<p>Ultimately, there are three error functions: cross-entropy(<em>p<sup>w<\/sup>, p<sup>fp<\/sup><\/em>), cross-entropy(<em>p<sup>w<\/sup>, p<sup>s1<\/sup><\/em>), and cross-entropy(<em>p<sup>w<\/sup>, p<sup>s2<\/sup><\/em>). These are then linearly combined with the supervised error function.<\/p>\n\n\n\n<p>This method is currently among the state-of-the-art semi-supervised learning methods. 
Its main advantage is that it is simple to implement; its main drawback is its sensitivity to the choice of suitable weak and strong perturbations.<\/p>\n\n\n\n<h4><strong>Integrating UniMatch Semi-Supervised Learning with Frame Field Learning<\/strong><\/h4>\n\n\n\n<h4><strong>Implementation Strategy for UniMatch in Frame Field Learning<\/strong><\/h4>\n\n\n\n<p>To integrate UniMatch into our Frame Field learning framework, we first differentiated between weak and strong perturbations. For weak perturbations, we chose basic spatial transformations such as rotation, mirroring, and vertical\/horizontal flips. These are well-suited for aerial imagery and straightforward to implement.<\/p>\n\n\n\n<p>For strong perturbations, we opted for photometric transformations. These include adjustments in hue, color, and brightness, providing a more significant alteration to the images compared to spatial transformations.<\/p>\n\n\n\n<p>Incorporating the feature perturbation loss was a crucial step. We implemented this by introducing a dropout mechanism between the encoder and decoder parts of the network. This dropout randomly omits encoder features before they reach the decoder, which is essential for the UniMatch approach.<\/p>\n\n\n\n<p>Regarding the dual-stream perturbations of UniMatch, we adapted our model to handle two strong perturbations. The dual-stream approach uses the prediction on the weakly perturbed input as a pseudo-label and trains the model so that its predictions on the strongly perturbed inputs match that pseudo-label. We have two strong perturbations, hence the term 'dual-stream'. 
Each of these perturbations contributes to the overall robustness and effectiveness of the model in semi-supervised learning scenarios, especially in the context of building extraction from complex aerial imagery.<\/p>\n\n\n\n<p>Through these modifications, the UniMatch method was successfully integrated into the Frame Field learning algorithm, increasing its ability to efficiently process and learn from labeled and, above all, unlabeled data.<\/p>\n\n\n\n<h4><strong>Experiments<br>Dataset<br>Labeled Data<\/strong><\/h4>\n\n\n\n<p>Our labeled data comes from three different sources, detailed in Table 1.<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full is-resized\"><a href=\"https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/Snimka-obrazovky-2024-06-21-173457.png\"><img decoding=\"async\" loading=\"lazy\" src=\"https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/Snimka-obrazovky-2024-06-21-173457.png\" alt=\"\" class=\"wp-image-9089\" width=\"452\" height=\"102\" srcset=\"https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/Snimka-obrazovky-2024-06-21-173457.png 903w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/Snimka-obrazovky-2024-06-21-173457-300x67.png 300w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/Snimka-obrazovky-2024-06-21-173457-768x173.png 768w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/Snimka-obrazovky-2024-06-21-173457-18x4.png 18w\" sizes=\"(max-width: 452px) 100vw, 452px\" \/><\/a><figcaption class=\"wp-element-caption\">Table 1: Overview of the three sources of labeled data used for training the models.<\/figcaption><\/figure><\/div>\n\n\n<h4><strong>Unlabeled Data<\/strong><\/h4>\n\n\n\n<p>For the unlabeled dataset, we selected high-quality aerial images from Geodetick\u00fd a kartografick\u00fd \u00fastav (GK\u00da) [6], available for 
free public use. We specifically targeted a diverse area of 7000 km<sup>2<\/sup>, ensuring a wide representation of various landscapes and urban settings.<\/p>\n\n\n\n<h4><strong>Data Processing: Patching<\/strong><\/h4>\n\n\n\n<p>We processed both labeled and unlabeled images into patches of size 320\u00d7320 px. This patch size was chosen to match the input requirements of our neural network. From the labeled data, this process resulted in approximately 55,000 patches. Similarly, from the unlabeled dataset, we obtained around 244,000 patches.<\/p>\n\n\n\n<h4><strong>Training setup<br>Model Architecture<\/strong><\/h4>\n\n\n\n<p>We designed our model using a U-Net architecture with an EfficientNet-B4 backbone. This combination provides a good balance of accuracy and efficiency, crucial for handling the complexity of our segmentation tasks. The EfficientNet-B4 backbone was specifically chosen for its balance between memory usage and performance. In Frame Field learning, the U-Net architecture has been shown to be highly effective, as evidenced by its strong performance in prior studies.<\/p>\n\n\n\n<h4><strong>Training Process<\/strong><\/h4>\n\n\n\n<p>For training, we used the AdamW optimizer, which combines the advantages of Adam optimization with weight decay, aiding in better model generalization. To prevent overfitting, we implemented L2 regularization. Additionally, we used the ReduceLROnPlateau learning rate scheduler. This scheduler adjusts the learning rate based on validation loss, ensuring efficient training progress.<\/p>\n\n\n\n<h4><strong>Semi-Supervised Learning Adjustments<\/strong><\/h4>\n\n\n\n<p>A key aspect of our training was adjusting the ratio of labeled to unlabeled patches. We experimented with ratios ranging from 1:1 to 1:5 (labeled:unlabeled). This variability allowed us to explore the impact of different amounts of unlabeled data on the learning process. 
It enabled us to identify the optimal balance for training our model, ensuring effective learning while leveraging the advantages of semi-supervised learning in handling large and diverse datasets.<\/p>\n\n\n\n<h4><strong>Model evaluation<\/strong><\/h4>\n\n\n\n<p>In our evaluation of the building footprint extraction model, we chose metrics that precisely measure how well our predictions align with real-world structures.<\/p>\n\n\n\n<h4><strong>Intersection over Union (IoU)<\/strong><\/h4>\n\n\n\n<p>A key metric we used is Intersection over Union (IoU). It quantifies the overlap between the model's predictions and the actual shapes of the buildings. An IoU score close to 1 means that our predictions closely match the real buildings. This metric is essential for assessing the geometric accuracy of the segmented areas, because it reflects how precisely the building boundaries are delineated. Moreover, by evaluating the ratio of the correctly predicted area to the combined area (the union of the predicted and the actual area), IoU gives us a clear measure of how effectively the model captures the actual context and shape of buildings in a complex urban landscape.<\/p>\n\n\n\n<h4><strong>Precision, Recall and F1<\/strong><\/h4>\n\n\n\n<p>Precision measures the accuracy of the model's building predictions, indicating the proportion of correctly identified buildings out of all identified buildings and thus how few false positives the model produces. Recall assesses the model's ability to capture all actual buildings, with a high recall score highlighting its sensitivity in detecting buildings. 
The F1 Score combines precision and recall into a single metric, offering a balanced view of the model's performance by ensuring that high scores result from both high precision and high recall.<\/p>\n\n\n\n<h4><strong>Complexity Aware IoU (cIoU)<\/strong><\/h4>\n\n\n\n<p>We also utilized Complexity Aware IoU (cIoU) [7]. This metric addresses a shortfall in IoU by balancing segmentation accuracy and the complexity of the polygon shapes. While IoU alone can lead models to create overly complex polygons, cIoU ensures that the complexity of the polygons (number of vertices) is kept realistic, reflecting the typically less complex structure of real buildings.<\/p>\n\n\n\n<h4><strong>N Ratio Metric<\/strong><\/h4>\n\n\n\n<p>The N ratio metric was an additional component of our evaluation strategy. It contrasts the number of vertices in our predicted shapes with those in the actual buildings [7]. This helps in understanding whether our model accurately replicates the detailed structure of the buildings.<\/p>\n\n\n\n<h4><strong>Max Tangent Angle Error<\/strong><\/h4>\n\n\n\n<p>To ensure clean geometry in building extraction tasks, accurately measuring contour regularity is essential. The Max Tangent Angle Error (MTAE) [1] metric is designed to address this need by supplementing the Intersection over Union (IoU) metric. It specifically targets the limitation of IoU, where segmentations with rounded corners may receive higher scores than those with more precise, sharp corners. By evaluating the alignment of edges through the comparison of tangent angles at sampled points along predicted and ground truth contours, MTAE effectively penalizes inaccuracies in edge orientation. 
This focus on edge precision is critical for producing clean vector representations of buildings, emphasizing the importance of accurate edge delineation in segmentation tasks.<\/p>\n\n\n\n<h4><strong>Evaluation Process<\/strong><\/h4>\n\n\n\n<p>The trained models were tested on a large dataset of full-size aerial images (rather than the small patches on which the network was trained). Such testing gives a more accurate picture of how these models behave in real-world use. To extract buildings from the full-size images, we used a sliding-window technique, producing predictions for individual segments of the image. An advanced averaging technique was applied to the edges of overlapping segments, which is important for minimizing unwanted artifacts and keeping the prediction map consistent. 
The resulting full-size prediction map was then vectorized into precise vector polygons using the Active Skeleton Model (ASM) algorithm.<\/p>\n\n\n\n<p><strong>Results<\/strong><\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full is-resized\"><a href=\"https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/Snimka-obrazovky-2024-06-21-173601.png\"><img decoding=\"async\" loading=\"lazy\" src=\"https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/Snimka-obrazovky-2024-06-21-173601.png\" alt=\"\" class=\"wp-image-9090\" width=\"414\" height=\"135\" srcset=\"https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/Snimka-obrazovky-2024-06-21-173601.png 828w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/Snimka-obrazovky-2024-06-21-173601-300x97.png 300w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/Snimka-obrazovky-2024-06-21-173601-768x250.png 768w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/Snimka-obrazovky-2024-06-21-173601-18x6.png 18w\" sizes=\"(max-width: 414px) 100vw, 414px\" \/><\/a><figcaption class=\"wp-element-caption\">Table 2: Results of model training for the baseline approach (supervised learning) and for the semi-supervised approaches with different ratios of labeled to unlabeled images.<\/figcaption><\/figure><\/div>\n\n\n<p>The results of our experiments, which reflect the performance of the segmentation model trained under different conditions, reveal significant insights (see Table 2). 
We evaluated the model's performance in a baseline scenario without semi-supervised learning and in scenarios where semi-supervised learning was applied with varying ratios of labeled to unlabeled data (1:1, 1:3, and 1:5).<\/p>\n\n\n\n<ol type=\"1\">\n<li><strong>IoU: <\/strong>Starting from the baseline IoU of 80.50%, we observed a steady increase in this metric as we introduced more unlabeled data into the training process, reaching up to 85.77% with a 1:5 labeled to unlabeled ratio.<\/li>\n\n\n\n<li><strong>Precision, Recall, and F1 Score: <\/strong>The precision of the model, which measures how accurate the predictions are, improved from 85.75% in the baseline to 90.04% in the 1:5 ratio setup. Similarly, recall, which indicates how well the model can find all relevant instances, slightly increased from 94.27% to 94.76%. The F1 Score, which balances precision and recall, also saw an improvement from 89.81% to 92.34%. These improvements suggest that the model became more accurate and reliable in its predictions when semi-supervised learning was used.<\/li>\n\n\n\n<li><strong>N Ratio and cIoU: <\/strong>The results show a notable decrease in the N Ratio from 2.33 in the baseline to 1.65 in the semi-supervised 1:5 ratio setup, indicating that the semi-supervised model generates simpler, yet accurate, vector shapes that more closely resemble the actual structures. This simplification likely contributes to the enhanced usability of the output in practical GIS applications. 
Concurrently, the complexity-aware IoU (cIoU) significantly improved from 48.89% in the baseline to 64.75% in the 1:5 ratio, suggesting that the semi-supervised learning approach not only improves the overlap between the predicted and actual building footprints but also produces simpler vector shapes, which are closer to real-world buildings in terms of geometry.<\/li>\n\n\n\n<li><strong>Mean Max Tangent Angle Error (MTAE): <\/strong>The reduction in mean MTAE from 18.60\u00b0 in the baseline to 17.45\u00b0 in the 1:5 semi-supervised setting signifies an improvement in the geometric precision of the model's predictions. This suggests that the semi-supervised learning model is better at capturing the architectural features of buildings with more accurately defined angles, contributing to the production of topologically simpler and cleaner vector polygons.<\/li>\n<\/ol>\n\n\n\n<h4><strong>Training on High-Performance Computing (HPC) Machine<\/strong><\/h4>\n\n\n\n<h4><strong>HPC Configuration<\/strong><\/h4>\n\n\n\n<p>Our training was conducted on a High-Performance Computing (HPC) machine equipped with substantial computational resources. The HPC system had 8 nodes, each outfitted with 4 NVIDIA A100 GPUs with 40 GB of VRAM, 64 CPU cores, and 256 GB of RAM. For task scheduling, the system utilized Slurm.<\/p>\n\n\n\n<h4><strong>PyTorch Lightning Framework<\/strong><\/h4>\n\n\n\n<p>We employed the PyTorch Lightning framework, which offers user-friendly multi-GPU settings. 
This framework allows the specification of the number of GPUs per node, the total number of nodes, various distributed strategies, and the option for mixed-precision training.<\/p>\n\n\n\n<h4><strong>Experiences with Slurm and PyTorch Lightning<\/strong><\/h4>\n\n\n\n<p>When training on a single GPU, our Slurm configuration was as follows:<br>#SBATCH --partition=ngpu<br>#SBATCH --gres=gpu:1<br>#SBATCH --cpus-per-task=16<br>#SBATCH --mem=64000<\/p>\n\n\n\n<p>In PyTorch Lightning, we set up the <em>Trainer<\/em> as follows:<\/p>\n\n\n\n<p>trainer = Trainer(accelerator=&quot;gpu&quot;, devices=1)<\/p>\n\n\n\n<p>Because we allocated one of the four GPUs available in a node, we also allocated 16 of the 64 available CPU cores. This gave us 16 workers for the data loaders. Since semi-supervised learning uses two data loaders (one for labeled and one for unlabeled data), we allocated 8 workers to each. It was critical to ensure that the total number of data loader workers did not exceed the number of allocated CPU cores, to prevent training crashes.<\/p>\n\n\n\n<h4><strong>Distributed Data Parallel (DDP) Strategy<\/strong><\/h4>\n\n\n\n<p>Using PyTorch Lightning's Distributed Data Parallel (DDP) option, we ensured each GPU across the nodes operated independently:<\/p>\n\n\n\n<ul>\n<li>Each GPU processed a portion of the dataset.<\/li>\n\n\n\n<li>All processes initialized the model independently.<\/li>\n\n\n\n<li>Each conducted forward and backward passes in parallel.<\/li>\n\n\n\n<li>Gradients were synchronized and averaged across processes.<\/li>\n\n\n\n<li>Each process updated its optimizer individually.<\/li>\n<\/ul>\n\n\n\n<p>With this approach, the total number of data loaders equaled the number of GPUs multiplied by the number of data loaders per process. 
For example, in a semi-supervised learning setup with 4 GPUs and two types of data loaders (labeled and unlabeled), we ended up with 8 data loaders, each with 8 workers, or 64 workers in total.<\/p>\n\n\n\n<p>To fully utilize one node with four GPUs, we used the following configuration:<\/p>\n\n\n\n<p>#SBATCH --partition=ngpu<\/p>\n\n\n\n<p>#SBATCH --gres=gpu:4<\/p>\n\n\n\n<p>#SBATCH --exclusive<\/p>\n\n\n\n<p>#SBATCH --cpus-per-task=64<\/p>\n\n\n\n<p>#SBATCH --mem=256000<\/p>\n\n\n\n<p>In PyTorch Lightning, we set up the <em>Trainer<\/em> as follows:<\/p>\n\n\n\n<p>trainer = Trainer(accelerator=&quot;gpu&quot;, devices=4, strategy=&quot;ddp&quot;)<\/p>\n\n\n\n<h4><strong>Utilizing Multiple Nodes<\/strong><\/h4>\n\n\n\n<p>With PyTorch Lightning, it is possible to leverage multiple nodes of the HPC system. For instance, a run on 4 nodes with 4 GPUs each (16 GPUs in total) was configured as:<\/p>\n\n\n\n<p>trainer = Trainer(accelerator=&quot;gpu&quot;, devices=4, strategy=&quot;ddp&quot;, num_nodes=4)<\/p>\n\n\n\n<p>Correspondingly, the Slurm configuration was set to:<\/p>\n\n\n\n<p>#SBATCH --nodes=4<\/p>\n\n\n\n<p>#SBATCH --ntasks-per-node=4<\/p>\n\n\n\n<p>#SBATCH --gres=gpu:4<\/p>\n\n\n\n<p>These settings and experiences highlight the scalability and flexibility of training complex machine learning models in an HPC environment, especially for tasks demanding significant computational resources, such as semi-supervised learning in geospatial data analysis.<\/p>\n\n\n\n<h4><strong>Training Scalability Analysis<\/strong><\/h4>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full is-resized\"><a href=\"https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/Snimka-obrazovky-2024-06-21-173746.png\"><img decoding=\"async\" loading=\"lazy\" src=\"https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/Snimka-obrazovky-2024-06-21-173746.png\" alt=\"\" 
class=\"wp-image-9091\" width=\"411\" height=\"158\" srcset=\"https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/Snimka-obrazovky-2024-06-21-173746.png 822w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/Snimka-obrazovky-2024-06-21-173746-300x115.png 300w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/Snimka-obrazovky-2024-06-21-173746-768x295.png 768w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/Snimka-obrazovky-2024-06-21-173746-18x7.png 18w\" sizes=\"(max-width: 411px) 100vw, 411px\" \/><\/a><figcaption class=\"wp-element-caption\">Table 3: Training results of the supervised and semi-supervised learning approaches with 1, 2, 4, and 8 GPUs. For each configuration, the time per epoch and the speedup ratio relative to 1 GPU are given.<\/figcaption><\/figure><\/div>\n\n\n<p>In the training scalability analysis, we examined how adding computational resources affects training efficiency with the PyTorch Lightning framework. The analysis covered both supervised and semi-supervised learning, with particular emphasis on the effect of increasing the number of GPUs, including a setup with 2 nodes (8 GPUs).<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large is-resized\"><a href=\"https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure5.png\"><img decoding=\"async\" loading=\"lazy\" src=\"https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure5-1024x683.png\" alt=\"\" class=\"wp-image-9088\" width=\"512\" height=\"342\" srcset=\"https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure5-1024x683.png 1024w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure5-300x200.png 300w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure5-768x512.png 768w, 
https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure5-1536x1024.png 1536w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure5-2048x1365.png 2048w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure5-18x12.png 18w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure5-1200x800.png 1200w, https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/figure5-1980x1320.png 1980w\" sizes=\"(max-width: 512px) 100vw, 512px\" \/><\/a><figcaption class=\"wp-element-caption\">Figure 5: This graph compares the actual speedup ratios for supervised and semi-supervised learning against the number of GPUs, alongside the ideal linear speedup ratio. It shows that semi-supervised learning aligns more closely with ideal scalability, reflecting its greater efficiency gains from additional computational resources.<\/figcaption><\/figure><\/div>\n\n\n<p>A key finding from this analysis was that the speedup for supervised learning did not scale in proportion to the number of GPUs. Ideally, the speedup ratio would equal the number of GPUs (e.g., 4 GPUs would yield a 4x speedup). However, the measured speedup ratios were lower than this ideal. The discrepancy can be attributed to the overhead of managing multiple GPUs and nodes, particularly the need to synchronize data across all GPUs, which introduces efficiency losses.<\/p>\n\n\n\n<p>Semi-supervised learning showed a slightly different trend, coming closer to the ideal (linear) increase in speedup. It appears that the complexity and higher computational demands of semi-supervised learning mitigate the impact of this overhead, thereby allowing more efficient use of multiple GPUs. 
Despite the challenges of synchronizing data across multiple GPUs and compute nodes, the higher computational demands of semi-supervised learning enable more efficient scaling of resources, i.e., a speedup closer to the ideal scenario.<\/p>\n\n\n\n<h4><strong>Conclusion<\/strong><\/h4>\n\n\n\n<p>The research presented in this whitepaper demonstrates the effectiveness of integrating UniMatch semi-supervised learning with Frame Field learning for building extraction from aerial imagery. This integration addresses the scarcity of labeled data in deep learning applications for geographic information systems (GIS), providing a cost-effective and scalable solution.<\/p>\n\n\n\n<p>Our findings show that semi-supervised learning significantly improves the model's performance across several key metrics, including Intersection over Union (IoU), precision, recall, F1 Score, N Ratio, complexity-aware IoU (cIoU), and Mean Max Tangent Angle Error (MTAE). Notably, the improvements in the IoU and cIoU metrics underscore the model's increased accuracy in delineating building footprints and generating vector shapes that closely resemble actual structures. This outcome is pivotal for applications in urban planning, environmental studies, and infrastructure management, where precise mapping and analysis of building data are crucial.<\/p>\n\n\n\n<p>The methodology adopted, which combines Frame Field learning with the UniMatch approach, has proven highly effective in leveraging both labeled and unlabeled data. This strategy not only improves the geometric precision of the model's predictions but also ensures the generation of cleaner, topologically accurate vector polygons. 
Furthermore, the scalability and efficiency of training on a High-Performance Computing (HPC) machine using the PyTorch Lightning framework and the Distributed Data Parallel (DDP) strategy have been instrumental in handling the extensive computational demands of semi-supervised learning on the data at hand, with training times ranging from tens of minutes to hours.<\/p>\n\n\n\n<p>This work highlights the potential of semi-supervised learning for improving the automatic extraction of buildings from aerial imagery. Implementing UniMatch in the Frame Field learning method represents a significant step forward, providing a robust solution to the challenges posed by data scarcity and the need for high accuracy in geospatial data analysis. This approach improves the efficiency and accuracy of building extraction and also opens new possibilities for applying semi-supervised learning methods in GIS and related fields.<\/p>\n\n\n\n<p><strong>Acknowledgment<\/strong><\/p>\n\n\n\n<p>Research results were obtained with the support of the Slovak National competence centre for HPC, the EuroCC 2 project and Slovak National Supercomputing Centre under grant agreement 101101903-EuroCC 2-DIGITAL-EUROHPC-JU-2022-NCC-01.<\/p>\n\n\n\n<p>Computational resources were procured in the national project National competence centre for high performance computing (project code: 311070AKF2) funded by European Regional Development Fund, EU Structural Funds Informatization of society, Operational Program Integrated Infrastructure.<\/p>\n\n\n\n<p><strong>Authors<\/strong><\/p>\n\n\n\n<p>Patrik Sabol <em>\u2013 Geodeticca Vision s.r.o., Flori\u00e1nska 19, 044 01 Ko\u0161ice, Slovakia<\/em><\/p>\n\n\n\n<p>&nbsp;Bibi\u00e1na Laj\u010dinov<em>\u00e1<sup> <\/sup> \u2013 
National Supercomputing Center, D\u00fabravsk\u00e1 cesta 3484\/9, 84104 Bratislava-Karlov\u00e1 Ves, Slovakia<\/em><\/p>\n\n\n\n<p><a href=\"https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/Geodeticca-report-SK.pdf\">Full version of the article SK<\/a><\/p>\n\n\n\n<p><a href=\"https:\/\/eurocc.nscc.sk\/wp-content\/uploads\/2024\/06\/Geodeticca-report-EN.pdf\">Full version of the article EN<\/a><\/p>\n\n\n\n<h4><strong>References:<\/strong><\/h4>\n\n\n\n<p>[1] Nicolas Girard, Dmitriy Smirnov, Justin Solomon, and Yuliya Tarabalka. &#8220;Polygonal Building Extraction by Frame Field Learning&#8221;. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (June 2021), pp. 5891-5900.<\/p>\n\n\n\n<p>[2] L. Yang, L. Qi, L. Feng, W. Zhang, and Y. Shi. &#8220;Revisiting Weak-to-Strong Consistency in Semi-Supervised Semantic Segmentation&#8221;. In: 2023 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (June 2023), pp. 7236-7246. doi: 10.1109\/CVPR52729.2023.00699.<\/p>\n\n\n\n<p>[3] Kihyuk Sohn, David Berthelot, Chun-Liang Li, Zizhao Zhang, Nicholas Carlini, Ekin D. Cubuk, Alex Kurakin, Han Zhang, and Colin Raffel. &#8220;FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence&#8221;. In: CoRR, vol. abs\/2001.07685 (2020). Available: <a href=\"https:\/\/arxiv.org\/abs\/2001.07685\">https:\/\/arxiv.org\/abs\/2001.07685<\/a>.<\/p>\n\n\n\n<p>[4] Emmanuel Maggiori, Yuliya Tarabalka, Guillaume Charpiat, and Pierre Alliez. &#8220;Can Semantic Labeling Methods Generalize to Any City? The Inria Aerial Image Labeling Benchmark&#8221;. In: IEEE International Geoscience and Remote Sensing Symposium (IGARSS) (2017). IEEE.<\/p>\n\n\n\n<p>[5] Adrian Boguszewski, Dominik Batorski, Natalia Ziemba-Jankowska, Tomasz Dziedzic, and Anna Zambrzycka. &#8220;LandCover.ai: Dataset for Automatic Mapping of Buildings, Woodlands, Water and Roads from Aerial Imagery&#8221;. 
In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops (June 2021), pp. 1102-1110.<\/p>\n\n\n\n<p>[6] &#8220;Ortofotomozaika.&#8221; Geoportal SK. Accessed February 14, 2024. <a href=\"https:\/\/www.geoportal.sk\/sk\/zbgis\/ortofotomozaika\/\">https:\/\/www.geoportal.sk\/sk\/zbgis\/ortofotomozaika\/<\/a>.<\/p>\n\n\n\n<p>[7] Stefano Zorzi, Shabab Bazrafkan, Stefan Habenschuss, and Friedrich Fraundorfer. &#8220;PolyWorld: Polygonal Building Extraction with Graph Neural Networks in Satellite Images&#8221;. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (2022), pp. 1848-1857.<\/p>\n\n\n\n<div class=\"is-horizontal is-content-justification-center is-layout-flex wp-container-5 wp-block-buttons\">\n<div class=\"wp-block-button\"><a class=\"wp-block-button__link wp-element-button\" href=\"\/en\/success-stories\/\">Success-Stories<\/a><\/div>\n<\/div>\n<\/div><\/div>","protected":false},"excerpt":{"rendered":"<p>Building extraction in geographic information systems (GIS) is pivotal for urban planning, environmental studies, and infrastructure management, because it allows accurate mapping of structures, including the detection of illegal constructions for regulatory compliance or more efficient tax collection. 
Integrating extracted building data with other geospatial layers enhances the understanding of urban dynamics and spatial relationships.<\/p>","protected":false},"author":2,"featured_media":9084,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"templates\/template-full-width.php","format":"standard","meta":[],"categories":[9,1],"tags":[],"_links":{"self":[{"href":"https:\/\/eurocc.nscc.sk\/en\/wp-json\/wp\/v2\/posts\/9083"}],"collection":[{"href":"https:\/\/eurocc.nscc.sk\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/eurocc.nscc.sk\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/eurocc.nscc.sk\/en\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/eurocc.nscc.sk\/en\/wp-json\/wp\/v2\/comments?post=9083"}],"version-history":[{"count":18,"href":"https:\/\/eurocc.nscc.sk\/en\/wp-json\/wp\/v2\/posts\/9083\/revisions"}],"predecessor-version":[{"id":9113,"href":"https:\/\/eurocc.nscc.sk\/en\/wp-json\/wp\/v2\/posts\/9083\/revisions\/9113"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/eurocc.nscc.sk\/en\/wp-json\/wp\/v2\/media\/9084"}],"wp:attachment":[{"href":"https:\/\/eurocc.nscc.sk\/en\/wp-json\/wp\/v2\/media?parent=9083"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/eurocc.nscc.sk\/en\/wp-json\/wp\/v2\/categories?post=9083"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/eurocc.nscc.sk\/en\/wp-json\/wp\/v2\/tags?post=9083"}],"curies":[{"name
":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}