<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">INFORMATICA</journal-id>
<journal-title-group><journal-title>Informatica</journal-title></journal-title-group>
<issn pub-type="epub">1822-8844</issn><issn pub-type="ppub">0868-4952</issn><issn-l>0868-4952</issn-l>
<publisher>
<publisher-name>Vilnius University</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">INFOR633</article-id>
<article-id pub-id-type="doi">10.15388/26-INFOR633</article-id>
<article-categories><subj-group subj-group-type="heading">
<subject>Research Article</subject></subj-group></article-categories>
<title-group>
<article-title>Transformer-Based Detection of Propaganda Techniques in a Low-Resource Language: A Case Study in Lithuanian</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Rizgelienė</surname><given-names>Ieva</given-names></name><email xlink:href="ieva.rizgeliene@mif.vu.lt">ieva.rizgeliene@mif.vu.lt</email><xref ref-type="aff" rid="j_infor633_aff_001"/><xref ref-type="corresp" rid="cor1">∗</xref><bio>
<p><bold>I. Rizgelienė</bold> is a PhD student at the Institute of Data Science and Digital Technologies, Vilnius University. Her primary research interests include propaganda detection and analysis, with an emphasis on low-resource languages.</p></bio>
</contrib>
<contrib contrib-type="author">
<name><surname>Zaranka</surname><given-names>Paulius</given-names></name><email xlink:href="paulius.zaranka@mif.vu.lt">paulius.zaranka@mif.vu.lt</email><xref ref-type="aff" rid="j_infor633_aff_001"/><bio>
<p><bold>P. Zaranka</bold> received his master’s degree in computer modelling from Vilnius University in 2025 and is currently a lecturer in NLP at Vilnius University. His primary research interests include large language models, natural language processing, and agent-based modelling.</p></bio>
</contrib>
<contrib contrib-type="author">
<name><surname>Korvel</surname><given-names>Gražina</given-names></name><email xlink:href="grazina.korvel@mif.vu.lt">grazina.korvel@mif.vu.lt</email><xref ref-type="aff" rid="j_infor633_aff_001"/><bio>
<p><bold>P. Zaranka</bold> received his master’s degree in computer modelling from Vilnius University in 2025 and is currently a lecturer in NLP at Vilnius University. His primary research interests include large language models, natural language processing, and agent-based modelling.</p></bio>
</contrib>
<contrib contrib-type="author">
<name><surname>Marcinkevičius</surname><given-names>Virginijus</given-names></name><email xlink:href="virginijus.marcinkevicius@mif.vu.lt">virginijus.marcinkevicius@mif.vu.lt</email><xref ref-type="aff" rid="j_infor633_aff_001"/><bio>
<p><bold>P. Zaranka</bold> received his master’s degree in computer modelling from Vilnius University in 2025 and is currently a lecturer in NLP at Vilnius University. His primary research interests include large language models, natural language processing, and agent-based modelling.</p></bio>
</contrib>
<aff id="j_infor633_aff_001"><label>1</label><institution>Vilnius University, Faculty of Mathematics and Informatics, Institute of Data Science and Digital Technologies</institution>, Akademijos st. 4, Vilnius, LT-08412, <country>Lithuania</country></aff>
</contrib-group>
<author-notes>
<corresp id="cor1"><label>∗</label>Corresponding author.</corresp>
</author-notes>
<pub-date pub-type="ppub"><year>2026</year></pub-date><pub-date pub-type="epub"><day>12</day><month>6</month><year>2026</year></pub-date><volume content-type="ahead-of-print">0</volume><issue>0</issue><fpage>1</fpage><lpage>30</lpage><history><date date-type="received"><month>4</month><year>2026</year></date><date date-type="accepted"><month>6</month><year>2026</year></date></history>
<permissions><copyright-statement>© 2026 Vilnius University</copyright-statement><copyright-year>2026</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/4.0/">
<license-p>Open access article under the <ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/licenses/by/4.0/">CC BY</ext-link> license.</license-p></license></permissions>
<abstract>
<p>Propaganda techniques are a key tool for creating misleading content, often disseminated in native languages to increase their impact. Therefore, it is increasingly important to develop detection models not only for high-resource languages but also for low-resource languages, which still face significant limitations in propaganda detection. This study presents the first approach to automated propaganda technique detection in Lithuanian using the HALT-PROP corpus. We adapt the standard framework to account for frequent overlap between techniques. Experiments with the Lithuanian transformer LT-MLKM-modernBERT show that BILOU tagging improves span identification, while sentence classification based on span-level information enhances technique detection for most techniques. The results also indicate that training separate binary classifiers is more effective than multi-label classification in this setting. Overall, the proposed approach outperforms GPT-5.3 on most techniques and provides a strong baseline for propaganda technique detection in Lithuanian.</p>
</abstract>
<kwd-group>
<label>Key words</label>
<kwd>propaganda technique detection</kwd>
<kwd>low-resource language</kwd>
<kwd>transformers</kwd>
</kwd-group>
<funding-group><funding-statement>This research was supported by the Lithuanian Government Priority Research Program “Building Societal Resilience and Crisis Management in the Context of Con temporary Geopolitical Developments” (implemented through the Lithuania Research Council) under grant number S-VIS-23-8. Project title: “Propaganda and Disinformation Research: Machine Learning-Based Automatic Detection, Impact and Societal Resilience.”</funding-statement></funding-group>
</article-meta>
</front>
<body>
<sec id="j_infor633_s_001">
<label>1</label>
<title>Introduction</title>
<p>Over time, propaganda has increasingly shifted toward digital dissemination through social media and online news platforms. As propaganda often relies on rhetorical manipulation, sometimes incorporating factual elements while still misleading audiences through specific persuasive techniques, it is important not only to identify whether an article contains propagandistic content, but also to detect the specific techniques employed. Moreover, recent experimental research demonstrates that labelling content as propaganda and explicitly highlighting the rhetorical techniques used can significantly reduce users’ intentions to share such content online (Jose <italic>et al.</italic>, <xref ref-type="bibr" rid="j_infor633_ref_010">2025</xref>).</p>
<p>With the advancement of machine learning technologies, particularly in natural language processing, substantial efforts have been devoted to automated propaganda detection. This is reflected in a series of shared tasks and studies focusing on propaganda and persuasion technique detection (Rashkin <italic>et al.</italic>, <xref ref-type="bibr" rid="j_infor633_ref_015">2017</xref>; Barrón-Cedeño <italic>et al.</italic>, <xref ref-type="bibr" rid="j_infor633_ref_002">2019</xref>; Da San Martino <italic>et al.</italic>, <xref ref-type="bibr" rid="j_infor633_ref_004">2019</xref>, <xref ref-type="bibr" rid="j_infor633_ref_005">2020</xref>; Dimitrov <italic>et al.</italic>, <xref ref-type="bibr" rid="j_infor633_ref_006">2021</xref>; Piskorski <italic>et al.</italic>, <xref ref-type="bibr" rid="j_infor633_ref_014">2023</xref>; Dimitrov <italic>et al.</italic>, <xref ref-type="bibr" rid="j_infor633_ref_007">2024</xref>; Alam <italic>et al.</italic>, <xref ref-type="bibr" rid="j_infor633_ref_001">2022</xref>; Hasanain <italic>et al.</italic>, <xref ref-type="bibr" rid="j_infor633_ref_008">2024</xref>; Moral <italic>et al.</italic>, <xref ref-type="bibr" rid="j_infor633_ref_011">2023</xref>, <xref ref-type="bibr" rid="j_infor633_ref_012">2024</xref>; Horák <italic>et al.</italic>, <xref ref-type="bibr" rid="j_infor633_ref_009">2024</xref>). Despite this progress, significant challenges remain, particularly for low-resource languages. A key limitation is the scarcity of essential resources, particularly annotated corpora tailored to these languages.</p>
<p>To address this gap, particularly for the Lithuanian language, the first human-annotated corpus, HALT-PROP, was released in 2025 (Rizgelienė <italic>et al.</italic>, <xref ref-type="bibr" rid="j_infor633_ref_017">2025</xref>). The corpus contains Lithuanian texts annotated by five experts for propaganda techniques and narratives using a cross-annotation methodology, allowing multiple techniques and narratives to be assigned to the same text. As this is a newly released resource, there is currently no research on propaganda technique detection based on this corpus, nor are there any dedicated studies on propaganda technique detection for the Lithuanian language.</p>
<p>The main goal of this research is to develop the first approach for propaganda techniques detection in Lithuanian. To achieve this, the study addresses the following research questions:</p>
<list>
<list-item id="j_infor633_li_001">
<label>1.</label>
<p>Does incorporating span boundary information and increasing the maximum input sequence length improve propaganda span identification performance in Lithuanian?</p>
</list-item>
<list-item id="j_infor633_li_002">
<label>2.</label>
<p>How does incorporating span-level information, indicating where propaganda techniques occur, affect the performance of propaganda techniques classification?</p>
</list-item>
<list-item id="j_infor633_li_003">
<label>3.</label>
<p>Does modelling propaganda techniques as independent binary classification tasks improve detection performance compared to multi-class classification in scenarios with high label overlap?</p>
</list-item>
</list>
<p>In this study, a <italic>span</italic> refers to an uninterrupted fragment of text annotated as containing at least one propaganda technique.</p>
<p>To address these questions, we first perform an exploratory analysis of the HALT-PROP corpus. Based on these insights, we propose a methodology for detecting propaganda techniques in Lithuanian news articles that reflects the corpus characteristics. We then leverage the monolingual Lithuanian transformer model LT-MLKM-modernBERT to conduct experiments on both span identification and techniques classification. We explore different training setups, including the use of span-level information, as well as binary and multi-class classification strategies. Finally, we evaluate the approach under multiple conditions, including gold and predicted spans, and compare the results with zero-shot and few-shot performance of large language model.</p>
</sec>
<sec id="j_infor633_s_002">
<label>2</label>
<title>Related Work</title>
<p>Until 2019, propaganda detection research primarily focused on document-level classification (Rashkin <italic>et al.</italic>, <xref ref-type="bibr" rid="j_infor633_ref_015">2017</xref>; Barrón-Cedeño <italic>et al.</italic>, <xref ref-type="bibr" rid="j_infor633_ref_002">2019</xref>). However, because propaganda often contain both propagandistic and non-propagandistic content, assigning a single label to an entire document is overly coarse and has motivated a shift toward fine-grained analysis. One of the early approaches to propaganda technique detection focused on sentence-level classification and defined two tasks based on an expert-annotated English dataset covering 18 propaganda techniques: (i) Sentence-Level Classification (SLC) and (ii) Fragment-Level Classification (FLC) (Da San Martino <italic>et al.</italic>, <xref ref-type="bibr" rid="j_infor633_ref_004">2019</xref>). Subsequently, the SemEval-2020 Task 11 shared task was introduced (Da San Martino <italic>et al.</italic>, <xref ref-type="bibr" rid="j_infor633_ref_005">2020</xref>), which extended propaganda technique detection from sentence-level classification to span-level analysis. It defined two subtasks: (i) Span Identification (SI), a binary sequence labelling task aimed at identifying text fragments containing at least one propaganda technique, and (ii) Technique Classification (TC), a multi-class classification task that assigns a specific propaganda technique label to the identified spans. Later SemEval tasks further expanded the scope to include propaganda technique detection in multimodal content (text and images) (Dimitrov <italic>et al.</italic>, <xref ref-type="bibr" rid="j_infor633_ref_006">2021</xref>), multilingual analysis (Piskorski <italic>et al.</italic>, <xref ref-type="bibr" rid="j_infor633_ref_014">2023</xref>), and multilingual meme analysis (Dimitrov <italic>et al.</italic>, <xref ref-type="bibr" rid="j_infor633_ref_007">2024</xref>). Beyond SemEval, several initiatives have also explored non-English and multilingual settings, including Arabic shared tasks on multi-label technique classification and persuasion detection in tweets (Alam <italic>et al.</italic>, <xref ref-type="bibr" rid="j_infor633_ref_001">2022</xref>; Hasanain <italic>et al.</italic>, <xref ref-type="bibr" rid="j_infor633_ref_008">2024</xref>), as well as the DIPROMATS shared tasks, which address propaganda identification, characterization, technique classification, and narrative detection in English and Spanish (Moral <italic>et al.</italic>, <xref ref-type="bibr" rid="j_infor633_ref_011">2023</xref>, <xref ref-type="bibr" rid="j_infor633_ref_012">2024</xref>).</p>
<p>However, despite several initiatives extending propaganda technique detection beyond the English language, research in this area still faces significant limitations, particularly for low-resource languages. One of the few dedicated efforts is the work of Horák <italic>et al.</italic> (<xref ref-type="bibr" rid="j_infor633_ref_009">2024</xref>), which introduces an annotated corpus for the Czech language and proposes an initial approach to propaganda technique detection. This approach combines stylometric features with representations from pretrained transformer models. However, it focuses on document-level classification, identifying whether an article contains specific techniques. While this represents an improvement over binary propaganda detection, it still lacks fine-grained explainability, as it does not indicate where in the text the techniques occur.</p>
<p>In addition, some studies have also explored technique classification for non-English languages, such as Arabic (Alam <italic>et al.</italic>, <xref ref-type="bibr" rid="j_infor633_ref_001">2022</xref>), which is generally considered a high-resource language, as well as multilingual approaches (Piskorski <italic>et al.</italic>, <xref ref-type="bibr" rid="j_infor633_ref_014">2023</xref>) covering languages such as Italian, Russian, French, German, and Polish. However, none of the existing resources include the Lithuanian language, nor do they cover languages spoken in the Baltic countries. More broadly, languages spoken in countries neighbouring Russia, as well as in former Eastern Bloc countries, remain severely underrepresented, with entire regions lacking datasets and models for propaganda identification, despite being primary targets of information warfare. This work addresses this gap by proposing a method for propaganda technique detection in the Lithuanian language, representing the first such approach in the Baltic region, as well as one of the first approaches in general for a low-resource language spoken in the Russian neighbourhood, and one of the few in European languages.</p>
</sec>
<sec id="j_infor633_s_003">
<label>3</label>
<title>Data</title>
<p>In this study, we used the first Lithuanian corpus for propaganda narratives and techniques (Rizgelienė <italic>et al.</italic>, <xref ref-type="bibr" rid="j_infor633_ref_017">2025</xref>). The corpus consists of two complementary datasets: (1) 2 870 news articles manually labelled by five experts at the article level to identify the presence of propaganda; and (2) a subset of 1 000 articles annotated for specific propaganda techniques and narratives using a cross-annotation approach, in which each article was independently annotated by two of the five experts, and the final annotation was confirmed through pairwise discussion. In this study, we focus only on propaganda techniques annotations. Our selected corpus is annotated for the following ten propaganda techniques: 
<list>
<list-item id="j_infor633_li_004">
<label>1.</label>
<p><italic>Emotional Expression</italic>. Intentionally uses emotionally charged language (e.g. fear, anger, pride, sympathy) to provoke strong feelings and influence audience beliefs or actions. Often avoids logical reasoning and relies on exaggeration, personal attacks, or vague but positive terms to shape perception.</p>
</list-item>
<list-item id="j_infor633_li_005">
<label>2.</label>
<p><italic>Whataboutism/Red Herring/Straw Man</italic>. Distracts from the main issue by shifting blame or criticism to others (Whataboutism), introducing irrelevant information or arguments (Red Herring), or misrepresenting an opponent’s view by exaggerating, distorting, or oversimplifying it to attack a weaker version (Straw Man). These strategies serve to divert or deflect attention from the core argument.</p>
</list-item>
<list-item id="j_infor633_li_006">
<label>3.</label>
<p><italic>Simplification</italic>. Deliberately reduces complex issues to overly simple explanations by attributing blame or responsibility to a single cause or group, framing problems as having only two opposing options, using stereotyped phrases that shut down deeper thinking, and relying on short, catchy slogans that appeal more to emotions than logic. These strategies limit critical analysis and obscure the true complexity of issues.</p>
</list-item>
<list-item id="j_infor633_li_007">
<label>4.</label>
<p><italic>Intentional Vagueness</italic> (<italic>Obfuscation</italic>). Uses ambiguous or imprecise language to obscure meaning, allowing multiple interpretations and helping avoid accountability or direct scrutiny.</p>
</list-item>
<list-item id="j_infor633_li_008">
<label>5.</label>
<p><italic>Appeal to Authority</italic>. Refers to perceived authoritative figures or institutions to legitimize a claim, implying it is true based solely on the authority’s status, often without supporting evidence.</p>
</list-item>
<list-item id="j_infor633_li_009">
<label>6.</label>
<p><italic>Flag-Waving</italic>. Promotes a position by invoking patriotism or national pride, suggesting the idea serves the country’s interests, even in the absence of a clear rationale or evidence.</p>
</list-item>
<list-item id="j_infor633_li_010">
<label>7.</label>
<p><italic>Bandwagon</italic>. Encourages alignment with a belief or action by implying widespread acceptance. Leverages social pressure and the desire to conform to persuade individuals to adopt the majority view.</p>
</list-item>
<list-item id="j_infor633_li_011">
<label>8.</label>
<p><italic>Doubt/Smears</italic>. Seeks to undermine credibility by casting suspicion or attacking character – either subtly (Doubt) or directly through baseless accusations or insinuations (Smears) – without presenting concrete evidence.</p>
</list-item>
<list-item id="j_infor633_li_012">
<label>9.</label>
<p><italic>Reduction ad Hitlerum/Stalinum</italic>. Discredits a person, idea, or group by associating them with historically vilified figures (e.g. Hitler, Stalin), appealing to emotion rather than addressing the actual argument.</p>
</list-item>
<list-item id="j_infor633_li_013">
<label>10.</label>
<p><italic>Repetition</italic>. Reinforces a message through frequent repetition. Over time, repeated statements may appear more familiar and thus more believable, even in the absence of evidence, a psychological effect known as the “illusion of truth.”</p>
</list-item>
</list>
</p>
<sec id="j_infor633_s_004">
<label>3.1</label>
<title>Dataset Analysis</title>
<fig id="j_infor633_fig_001">
<label>Fig. 1</label>
<caption>
<p>Statistics of propaganda techniques in the HALT-PROP corpus. (A) Number of articles containing each technique. (B) Average share of article text covered by the technique. (C) Average number of spans per article. (D) Average span length in characters.</p>
</caption>
<graphic xlink:href="infor633_g001.jpg"/>
</fig>
<sec id="j_infor633_s_005">
<label>3.1.1</label>
<title>Techniques Distribution</title>
<p>We conducted an exploratory analysis of the HALT-PROP dataset to examine the distribution of propaganda techniques and the density and coverage of annotated spans (see Fig. <xref rid="j_infor633_fig_001">1</xref>). The results show that the dataset is imbalanced, with a small number of techniques dominating the corpus in both frequency and coverage. Additionally, techniques differ substantially in typical span length and annotation density, reflecting their different manipulation strategies: some are expressed through short lexical cues, while others appear as longer discourse-level segments. Below, we summarize the main findings for the techniques. 
<list>
<list-item id="j_infor633_li_014">
<label>•</label>
<p><bold>Emotional Expression</bold>, <bold>Simplification</bold>, and <bold>Doubt/Smears</bold> are the three most frequent propaganda techniques in the corpus. Emotional Expression appears in 820 articles, Simplification in 689 articles, and Doubt/Smears in 649 articles. These techniques also have the highest text coverage: on average, 39.5% of the text is annotated with Emotional Expression, 23.2% with Simplification, and 22.8% with Doubt/Smears in articles where they appear. They also show relatively high span densities, with Emotional Expression appearing in 5–6 spans per article on average, Simplification in 4 spans, and Doubt/Smears in 3–4 spans. All three techniques have moderate-to-long span lengths compared to other techniques in the corpus.</p>
</list-item>
<list-item id="j_infor633_li_015">
<label>•</label>
<p><bold>Whataboutism/Red Herring/Straw Man</bold>, <bold>Appeal to Authority</bold>, and <bold>Uncertainty</bold> appear with medium to lower frequency in the corpus. Whataboutism/Red Herring/Straw Man occurs in 275 articles, Appeal to Authority in 272 articles, and Uncertainty in 210 articles. Among these techniques, Whataboutism/Red Herring/Straw Man has the highest text coverage, covering approximately 19% of the article text on average, while Appeal to Authority covers about 12.3% and Uncertainty 8.7%. The span density is similar for Whataboutism/Red Herring/Straw Man and Uncertainty, which, on average, appear in about 1–2 spans per article, whereas Appeal to Authority has a slightly higher span density, appearing in approximately 2–3 spans per article. In terms of span length, Whataboutism/Red Herring/Straw Man has the longest spans, averaging about 271 characters. This is likely because the goal of this technique is to redirect attention from the main issue or topic, which often requires longer text segments to shift the context.</p>
</list-item>
<list-item id="j_infor633_li_016">
<label>•</label>
<p><bold>Waving the Flag</bold> and <bold>Repetition</bold> occur with relatively lower frequency, appearing in 190 and 184 articles, respectively. These techniques show distinct annotation patterns. In particular, Repetition stands out among all techniques, with the smallest text coverage (5.84%), the highest span density (approximately 5–6 spans per article), and the shortest spans (an average length of only 33 characters). This reflects the main characteristic of this technique: repeating the same message, phrase, or word multiple times within an article rather than expressing a complete argument or cue in a single span. Waving the Flag shows a pattern more similar to other techniques, with relatively low coverage (10.2%), a moderate span density (approximately 2–3 spans per article), and medium-length spans (about 143 characters on average).</p>
</list-item>
<list-item id="j_infor633_li_017">
<label>•</label>
<p><bold>Reductio ad Hitlerum</bold> and <bold>Following Behind</bold> are the least frequent techniques in the corpus. Reductio ad Hitlerum appears in 114 articles, while Following Behind appears in only 78 articles. Following Behind has higher text coverage (13.2%) compared to Reductio ad Hitlerum (8.8%). The two techniques have similar span densities, typically appearing in about 1–2 spans per article. However, Following Behind generally has longer spans (about 191 characters on average), whereas Reductio ad Hitlerum has moderately long spans (about 143 characters on average).</p>
</list-item>
</list>
</p>
</sec>
<sec id="j_infor633_s_006">
<label>3.1.2</label>
<title>Span Overlap Analysis</title>
<p>To better understand how propaganda techniques interact, we conducted a span-overlap analysis. Since a single phrase can express multiple manipulation strategies, overlap analysis helps identify which techniques commonly co-occur and which appear more independently. Figure <xref rid="j_infor633_fig_002">2</xref> shows overall and pairwise overlap between techniques at the character level. The results indicate that overlap is common: for most techniques, more than 50% of spans overlap with at least one other technique, although the strength of overlap varies. Emotional Expression shows moderate overall overlap (46.7%), but pairwise analysis reveals that many techniques most frequently co-occur with it, which is expected given that it is the most common technique in the HALT-PROP corpus. The other frequent techniques, Simplification and Doubt/Smears, show slightly higher overall overlap (53.7% and 52.9%, respectively), but their pairwise overlaps are generally weaker than those involving Emotional Expression.</p>
<p>Two notable exceptions emerge. Repetition has extremely high overlap (85.8%), indicating that its spans almost always co-occur with other techniques, most often with Emotional Expression (63.6%). In contrast, Appeal to Authority is relatively independent, with only 26.3% of its spans overlapping with other techniques.</p>
<p>Overall, Emotional Expression is the most common overlapping technique, followed by Simplification and Doubt/Smears. An additional pattern appears for Following Behind, which frequently overlaps with Waving the Flag, likely reflecting their shared appeals to collective identity or patriotism. Taken together, the analysis shows that multi-technique spans are common in the dataset and that overlap is a regular characteristic of the annotations.</p>
<fig id="j_infor633_fig_002">
<label>Fig. 2</label>
<caption>
<p>Overlap between propaganda techniques at the character-span level. (A) Percentage of annotated characters for each technique that overlap with at least one other technique. (B) Pairwise overlap matrix showing the percentage of characters of the row technique that overlap with the column technique. Bubble size and colour indicate the magnitude of overlap.</p>
</caption>
<graphic xlink:href="infor633_g002.jpg"/>
</fig>
</sec>
<sec id="j_infor633_s_007">
<label>3.1.3</label>
<title>Sentence-Level Analysis</title>
<p>As the previous analysis showed that technique spans are generally annotated as longer segments rather than short phrases marking only a few terms (with the exception of Repetition), we also conducted a sentence-level analysis to examine annotation coverage at the sentence level. To assess how much of each sentence is annotated, we measured the percentage of characters covered by technique spans within annotated sentences (see Fig. <xref rid="j_infor633_fig_003">3</xref>). The results show that the average coverage exceeds 78% for all techniques except Repetition, and for the most frequent techniques: Emotional Expression, Simplification, and Doubt/Smears – it exceeds 90%. This indicates that annotations are typically applied at the sentence level, capturing broader rhetorical context rather than isolated phrases or individual words. The main exception is Repetition, which often consists of short repeated words or phrases and therefore covers a smaller portion of the sentence.</p>
<p>We also analysed the distribution of techniques at the sentence level by measuring the proportion of sentences expressing each technique across the corpus and within sentences containing annotated propaganda spans. In the latter case, only sentences containing at least one propaganda technique were considered, excluding non-propagandistic sentences. The results again confirm the imbalance of the dataset. Restricting the analysis to span sentences slightly improves the balance. This effect is most evident for Emotional Expression, which appears in 30.5% of all sentences in the corpus but rises to 55.3% when only span sentences are considered. For rarely annotated techniques such as Uncertainty, Reductio ad Hitlerum, and Following Behind, the difference between the all and span sentences is small, and each still accounts for less than 5% of sentences.</p>
<fig id="j_infor633_fig_003">
<label>Fig. 3</label>
<caption>
<p>Distribution and coverage of propaganda techniques at the sentence level. (A) Percentage of sentences containing each technique, shown for all sentences and for sentences that include annotated spans. (B) Average proportion of sentence text covered by the annotated technique within sentences where it appears.</p>
</caption>
<graphic xlink:href="infor633_g003.jpg"/>
</fig>
</sec>
<sec id="j_infor633_s_008">
<label>3.1.4</label>
<title>Key Findings from the Dataset Analysis</title>
<p>After a detailed analysis of the HALT-PROP corpus, several main insights can be drawn:</p>
<list>
<list-item id="j_infor633_li_018">
<label>•</label>
<p>The corpus is generally imbalanced, with <bold>Emotional Expression</bold>, <bold>Simplification</bold>, and <bold>Doubt/Smears</bold> as the dominant techniques. These techniques are not only the most frequent, but also have the highest text coverage. Among them, Emotional Expression has the highest share in the corpus, and other techniques often overlap with it.</p>
</list-item>
<list-item id="j_infor633_li_019">
<label>•</label>
<p>Overlap between techniques is very common in the annotations, meaning that a single annotated span often contains multiple propaganda techniques at the same time.</p>
</list-item>
<list-item id="j_infor633_li_020">
<label>•</label>
<p>In general, propaganda techniques are annotated as longer fragments rather than short phrases or individual words. With the exception of <italic>Repetition</italic>, most techniques are annotated at the sentence level, with annotated spans covering more than 78% of the sentence characters on average, indicating that they are typically expressed across most of the sentence rather than through isolated lexical markers.</p>
</list-item>
<list-item id="j_infor633_li_021">
<label>•</label>
<p><italic>Repetition</italic> shows a distinct annotation pattern compared to the other techniques: it has the shortest spans, the lowest text coverage, and the highest span density, reflecting its nature as a recurring word- or phrase-level phenomenon.</p>
</list-item>
</list>
<p>Based on the results of the data analysis, we decided to exclude the <italic>Repetition</italic> technique. This decision is motivated by the fact that <italic>Repetition</italic> exhibits a different usage pattern compared to the other techniques. Unlike most techniques, repetition is not defined solely by individual annotated spans, but also by relations between multiple occurrences of the same word, phrase, or message across a document. Consequently, detecting repetition would require a different training strategy that models links between repeated elements throughout the document rather than relying only on local contextual features.</p>
</sec>
</sec>
</sec>
<sec id="j_infor633_s_009" sec-type="methods">
<label>4</label>
<title>Methodology</title>
<sec id="j_infor633_s_010">
<label>4.1</label>
<title>Task Formulation</title>
<p>The main goal of this research is to develop the first approach for propaganda technique detection in Lithuanian. Propaganda technique detection is typically formulated following the SemEval framework (Da San Martino <italic>et al.</italic>, <xref ref-type="bibr" rid="j_infor633_ref_005">2020</xref>), which splits the task into two subtasks: (i) span identification and (ii) technique classification. In this pipeline, the algorithm first identifies text fragments containing propaganda and then assigns a specific propaganda technique label to each detected span using multi-class classification.</p>
<p>However, the HALT-PROP corpus differs substantially from the datasets used in SemEval-style propaganda techniques detection frameworks. In particular, our analysis shows that propaganda techniques in HALT-PROP frequently overlap and are typically annotated as longer textual fragments, often covering large parts of sentences. In contrast, in the PTC-SemEval20 corpus, overlapping annotations are relatively rare: only about 1.8% of spans are associated with multiple techniques (Da San Martino <italic>et al.</italic>, <xref ref-type="bibr" rid="j_infor633_ref_005">2020</xref>). Due to this small proportion of overlapping cases, the SemEval task formulation simplified the problem by treating technique classification as a single-label multi-class task. When a span was annotated with multiple techniques, the dataset included duplicate instances of the same span, each associated with one label, effectively avoiding a multi-label formulation.</p>
<p>Another important difference concerns the length of annotated spans. In the PTC-SemEval20 corpus, most techniques are typically expressed through short lexical cues and therefore correspond to relatively short spans. In contrast, the HALT-PROP corpus contains a much higher degree of overlap between techniques, with more than 50% of spans overlapping with other techniques, and annotations often covering larger textual units such as full clauses or sentences. Because of this high overlap and broader span coverage, applying the standard SemEval pipeline directly would be problematic. In particular, reducing the problem to a single-label classification task would fail to capture the multi-technique nature of many annotated fragments. Therefore, the detection approach must be adapted to account for the multi-label and highly overlapping structure of propaganda annotations in the HALT-PROP corpus.</p>
<p>Based on the insights discussed earlier, we adopt the standard two-subtask framework consisting of <italic>span identification</italic> and <italic>technique classification</italic>, while incorporating adaptations that account for the nuances of the HALT-PROP corpus. For the span identification task, we follow the same standard formulation used in prior approaches. However, the second subtask, technique classification, is modified to address the high degree of overlap between techniques and the longer annotated fragments present in our corpus. As a result, our approach consists of the following two subtasks:</p>
<list>
<list-item id="j_infor633_li_022">
<label>1.</label>
<p><italic><bold>Span Identification</bold></italic>. Given a text, the task is to identify spans that contain at least one propaganda technique. This task is formulated as a token-level sequence tagging problem.</p>
</list-item>
<list-item id="j_infor633_li_023">
<label>2.</label>
<p><italic><bold>Technique Classification</bold></italic>. Given spans identified as containing propaganda techniques, the task is to determine which specific technique is expressed in each span. Due to the high overlap between techniques and the presence of long annotated fragments that often cover entire sentences in our corpus, we fine-tune a separate model for each technique. Consequently, the task is formulated as a binary sentence classification problem, where a sentence is assigned label 1 if it contains the target technique and 0 otherwise.</p>
</list-item>
</list>
<p>In addition to developing models for the selected subtasks, we also investigate several task-specific research questions and modelling decisions. In particular, we examine different sequence tagging approaches for span identification and explore alternative formulations of the technique classification task, including whether sentence classification should be performed on annotated spans or on all sentences, as well as whether a binary or multi-class formulation is more appropriate. Furthermore, we define the evaluation metrics used to assess the performance of the proposed approaches. The details of these investigations and modelling choices are described in the subsections dedicated to each task.</p>
</sec>
<sec id="j_infor633_s_011">
<label>4.2</label>
<title>Span Identification Task</title>
<p>In the span identification task, the objective is to detect text fragments that contain any propaganda technique, without distinguishing between specific technique types. A span is defined as a continuous segment of text corresponding to the annotated region of a propaganda technique in the corpus. Figure <xref rid="j_infor633_fig_004">4</xref> illustrates an example of spans in an article.</p>
<fig id="j_infor633_fig_004">
<label>Fig. 4</label>
<caption>
<p>Example of propaganda spans in an article. The English translation of the example is provided in Appendix <xref rid="j_infor633_app_001">A</xref>.</p>
</caption>
<graphic xlink:href="infor633_g004.jpg"/>
</fig>
<sec id="j_infor633_s_012">
<label>4.2.1</label>
<title>Tagging Schemes</title>
<p>Span identification task is formulated as a sequence tagging problem at the token level, where each token in the document is assigned a label indicating whether it belongs to a propaganda span. Specifically, the model predicts a label for every token in the sequence, allowing contiguous tokens with propaganda labels to form the boundaries of propaganda fragments. We investigate two tagging schemes for the span identification task: <italic>binary tagging</italic> and <italic>BILOU tagging</italic> (Ratinov and Roth, <xref ref-type="bibr" rid="j_infor633_ref_016">2009</xref>).</p>
<p>In the binary tagging approach, each token is assigned one of two labels indicating whether it belongs to a propaganda span or not. Specifically, tokens that are part of a propaganda fragment receive the label 1, while all other tokens receive the label 0. This formulation focuses only on identifying whether a token is inside a propaganda span, without explicitly modelling span boundaries.</p>
<p>In contrast, the BILOU tagging scheme explicitly represents span boundaries using five labels: <italic>B</italic> (Beginning), <italic>I</italic> (Inside), <italic>L</italic> (Last), <italic>O</italic> (Outside), and <italic>U</italic> (Unit). Tokens labelled <italic>B</italic> mark the beginning of a multi-token span, <italic>I</italic> indicate tokens inside the span, and <italic>L</italic> mark the final token of the span. The label <italic>U</italic> is used for spans consisting of a single token, while <italic>O</italic> denotes tokens outside any propaganda span. Table <xref rid="j_infor633_tab_001">1</xref> illustrates an example of span tagging using both tagging schemes. In this example, the BILOU tagging does not include the <italic>U</italic> label, since it is only used when a span consists of a single token, which occurs very rarely in our corpus.</p>
<table-wrap id="j_infor633_tab_001">
<label>Table 1</label>
<caption>
<p>Example of token-level binary and BILOU tagging for the propaganda span “<italic>Lietuvą masiškai palieka jauni žmonės</italic>”.</p>
</caption>
<table>
<thead>
<tr>
<td style="vertical-align: top; text-align: left; border-top: solid thin; border-bottom: solid thin">Representation</td>
<td style="vertical-align: top; text-align: left; border-top: solid thin; border-bottom: solid thin">Content</td>
</tr>
</thead>
<tbody>
<tr>
<td style="vertical-align: top; text-align: left">Original fragment</td>
<td style="vertical-align: top; text-align: left"><bold>Lietuvą masiškai palieka jauni žmonės</bold>, dalis jų užsienyje jau įsitvirtino, ten sukūrė šeimas ir pasiliko gyventi. <!--br role="newline" /--><italic>Translation: <bold>Young people are leaving Lithuania en masse</bold>, some of them have already settled abroad, started families there, and stayed to live.</italic></td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left">Tokenized fragment</td>
<td style="vertical-align: top; text-align: left"><bold>Lietuvą, mas, iškai, palieka, jauni, žmonės</bold>, „ dalis, jų, už, sienyje, jau, įsitvirtin, o, ten, sukūrė, šeima, s, ir, pasilik, o, gyventi, .</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left">Binary tagging</td>
<td style="vertical-align: top; text-align: left"><bold>1 1 1 1 1 1</bold> 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">BILOU tagging</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin"><bold>B I I I I L</bold> O O O O O O O O O O O O O O O O</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="j_infor633_s_013">
<label>4.2.2</label>
<title>Evaluation Metrics for Span Identification</title>
<p>For the evaluation of the span identification task, we use an <italic>overlap-based span F</italic><sub>1</sub> <italic>score</italic> (Da San Martino <italic>et al.</italic>, <xref ref-type="bibr" rid="j_infor633_ref_005">2020</xref>), which assigns partial credit to predictions that partially overlap with the gold annotations.</p>
<p>Let <italic>d</italic> denote a document, <inline-formula id="j_infor633_ineq_001"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">T</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mo fence="true" stretchy="false">{</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">t</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">t</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">n</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" stretchy="false">}</mml:mo></mml:math><tex-math><![CDATA[${T_{d}}=\{{t_{1}},\dots ,{t_{n}}\}$]]></tex-math></alternatives></inline-formula> the set of gold technique spans in document <italic>d</italic>, and <inline-formula id="j_infor633_ineq_002"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">S</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mo fence="true" stretchy="false">{</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">s</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">s</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">m</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" stretchy="false">}</mml:mo></mml:math><tex-math><![CDATA[${S_{d}}=\{{s_{1}},\dots ,{s_{m}}\}$]]></tex-math></alternatives></inline-formula> the set of predicted spans. Each span is represented as a contiguous sequence of character indices. Predicted spans may overlap and are merged prior to evaluation. Precision (P) and recall (R) are computed by measuring the character-level overlap between predicted and gold spans: <disp-formula-group id="j_infor633_dg_001">
<disp-formula id="j_infor633_eq_001">
<label>(1)</label><alternatives><mml:math display="block">
<mml:mtable displaystyle="true" columnalign="right left" columnspacing="0pt">
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mi mathvariant="italic">P</mml:mi>
<mml:mo>=</mml:mo><mml:mstyle displaystyle="true">
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">|</mml:mo>
<mml:mi mathvariant="italic">S</mml:mi>
<mml:mo stretchy="false">|</mml:mo>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
<mml:munder>
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:mo largeop="true" movablelimits="false">∑</mml:mo></mml:mstyle>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:mi mathvariant="italic">D</mml:mi>
</mml:mrow>
</mml:munder>
<mml:munder>
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:mo largeop="true" movablelimits="false">∑</mml:mo></mml:mstyle>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">s</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">S</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:munder>
<mml:munder>
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:mo largeop="true" movablelimits="false">∑</mml:mo></mml:mstyle>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">t</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">T</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:munder><mml:mstyle displaystyle="true">
<mml:mfrac>
<mml:mrow>
<mml:mo stretchy="false">|</mml:mo>
<mml:mi mathvariant="italic">s</mml:mi>
<mml:mo>∩</mml:mo>
<mml:mi mathvariant="italic">t</mml:mi>
<mml:mo stretchy="false">|</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">|</mml:mo>
<mml:mi mathvariant="italic">s</mml:mi>
<mml:mo stretchy="false">|</mml:mo>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
<mml:mo mathvariant="normal">,</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[\begin{aligned}{}& P=\frac{1}{|S|}\sum \limits_{d\in D}\sum \limits_{s\in {S_{d}}}\sum \limits_{t\in {T_{d}}}\frac{|s\cap t|}{|s|},\end{aligned}\]]]></tex-math></alternatives>
</disp-formula>
<disp-formula id="j_infor633_eq_002">
<label>(2)</label><alternatives><mml:math display="block">
<mml:mtable displaystyle="true" columnalign="right left" columnspacing="0pt">
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mi mathvariant="italic">R</mml:mi>
<mml:mo>=</mml:mo><mml:mstyle displaystyle="true">
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">|</mml:mo>
<mml:mi mathvariant="italic">T</mml:mi>
<mml:mo stretchy="false">|</mml:mo>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
<mml:munder>
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:mo largeop="true" movablelimits="false">∑</mml:mo></mml:mstyle>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:mi mathvariant="italic">D</mml:mi>
</mml:mrow>
</mml:munder>
<mml:munder>
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:mo largeop="true" movablelimits="false">∑</mml:mo></mml:mstyle>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">s</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">S</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:munder>
<mml:munder>
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:mo largeop="true" movablelimits="false">∑</mml:mo></mml:mstyle>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">t</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">T</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:munder><mml:mstyle displaystyle="true">
<mml:mfrac>
<mml:mrow>
<mml:mo stretchy="false">|</mml:mo>
<mml:mi mathvariant="italic">s</mml:mi>
<mml:mo>∩</mml:mo>
<mml:mi mathvariant="italic">t</mml:mi>
<mml:mo stretchy="false">|</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">|</mml:mo>
<mml:mi mathvariant="italic">t</mml:mi>
<mml:mo stretchy="false">|</mml:mo>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
<mml:mo mathvariant="normal">,</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[\begin{aligned}{}& R=\frac{1}{|T|}\sum \limits_{d\in D}\sum \limits_{s\in {S_{d}}}\sum \limits_{t\in {T_{d}}}\frac{|s\cap t|}{|t|},\end{aligned}\]]]></tex-math></alternatives>
</disp-formula>
</disp-formula-group> where <italic>D</italic> denotes the set of all documents in the dataset, <inline-formula id="j_infor633_ineq_003"><alternatives><mml:math>
<mml:mo stretchy="false">|</mml:mo>
<mml:mi mathvariant="italic">S</mml:mi>
<mml:mo stretchy="false">|</mml:mo></mml:math><tex-math><![CDATA[$|S|$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_infor633_ineq_004"><alternatives><mml:math>
<mml:mo stretchy="false">|</mml:mo>
<mml:mi mathvariant="italic">T</mml:mi>
<mml:mo stretchy="false">|</mml:mo></mml:math><tex-math><![CDATA[$|T|$]]></tex-math></alternatives></inline-formula> are the total numbers of predicted and gold spans, respectively. For a span <italic>s</italic>, <inline-formula id="j_infor633_ineq_005"><alternatives><mml:math>
<mml:mo stretchy="false">|</mml:mo>
<mml:mi mathvariant="italic">s</mml:mi>
<mml:mo stretchy="false">|</mml:mo></mml:math><tex-math><![CDATA[$|s|$]]></tex-math></alternatives></inline-formula> denotes its length in characters, and <inline-formula id="j_infor633_ineq_006"><alternatives><mml:math>
<mml:mo stretchy="false">|</mml:mo>
<mml:mi mathvariant="italic">s</mml:mi>
<mml:mo>∩</mml:mo>
<mml:mi mathvariant="italic">t</mml:mi>
<mml:mo stretchy="false">|</mml:mo></mml:math><tex-math><![CDATA[$|s\cap t|$]]></tex-math></alternatives></inline-formula> denotes the number of overlapping characters between spans <italic>s</italic> and <italic>t</italic>.</p>
<p>Precision measures the proportion of predicted characters that are correctly assigned to a technique and therefore penalizes <italic>over-tagging</italic>. Recall measures the proportion of gold characters that are successfully recovered by the model and therefore penalizes <italic>under-tagging</italic>. Figure <xref rid="j_infor633_fig_005">5</xref> illustrates an example showing how precision and recall are calculated, providing a clearer understanding of what the metric measures. The final evaluation score is computed as the harmonic mean of precision and recall: 
<disp-formula id="j_infor633_eq_003">
<label>(3)</label><alternatives><mml:math display="block">
<mml:mtable displaystyle="true">
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">F</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo><mml:mstyle displaystyle="true">
<mml:mfrac>
<mml:mrow>
<mml:mn>2</mml:mn>
<mml:mi mathvariant="italic">P</mml:mi>
<mml:mi mathvariant="italic">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">P</mml:mi>
<mml:mo>+</mml:mo>
<mml:mi mathvariant="italic">R</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
<mml:mo>.</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[ {F_{1}}=\frac{2PR}{P+R}.\]]]></tex-math></alternatives>
</disp-formula>
</p>
<fig id="j_infor633_fig_005">
<label>Fig. 5</label>
<caption>
<p>Example illustrating the calculation of precision and recall for span identification based on character-level overlap between predicted and gold spans. The full sentence translates as: “Everyone understands that Ukraine must be liberated <bold>from Nazism and Banderites</bold>”. The bold part corresponds to the gold span.</p>
</caption>
<graphic xlink:href="infor633_g005.jpg"/>
</fig>
</sec>
</sec>
<sec id="j_infor633_s_014">
<label>4.3</label>
<title>Technique Classification Task</title>
<p>This task is formulated as a binary sentence classification problem. We investigate two settings: (i) a setting without span information, in which all sentences in an article are considered, and (ii) a setting with span information, in which only sentences containing annotated propaganda spans, i.e. text fragments labelled with at least one propaganda technique, are considered.</p>
<p>The data for this task is prepared as follows. First, documents are split into sentences. Then, binary labels are assigned for each propaganda technique separately. In the span-sentence setting, sentences that do not overlap with any annotated span are excluded; in other words, sentences that receive zero labels for all techniques are removed. An example of the data preparation process for this task is illustrated in Fig. <xref rid="j_infor633_fig_006">6</xref>.</p>
<fig id="j_infor633_fig_006">
<label>Fig. 6</label>
<caption>
<p>Illustration of converting span-level annotations into sentence-level binary labels. Highlighted spans correspond to annotated techniques. The tables show the resulting sentence-level labels when using all sentences and when using only sentences containing annotated spans. The English translation of the example is provided in Appendix <xref rid="j_infor633_app_002">B</xref>.</p>
</caption>
<graphic xlink:href="infor633_g006.jpg"/>
</fig>
<p>In this task, we investigate several research questions to assess whether formulating technique detection as a sentence-level classification task is appropriate for our corpus. First, we evaluate the effect of span-based filtering by comparing models trained on all sentences with models trained only on sentences containing propaganda spans. Second, we examine whether a binary classification formulation is more suitable than a multi-class approach. To this end, we also fine-tune models in a multi-class setting, where each sentence may receive multiple technique labels.</p>
<p>Overall, for the technique classification task we fine-tune the same transformer-based model under several experimental configurations:</p>
<list>
<list-item id="j_infor633_li_024">
<label>•</label>
<p>Fine-tuning the transformer model separately for each technique using all sentences, formulated as a binary classification task.</p>
</list-item>
<list-item id="j_infor633_li_025">
<label>•</label>
<p>Fine-tuning the transformer model separately for each technique using only sentences that contain propaganda spans (i.e. sentences containing at least one propaganda technique). When a span starts or ends in the middle of a sentence, the entire sentence is still used for classification.</p>
</list-item>
<list-item id="j_infor633_li_026">
<label>•</label>
<p>Fine-tuning the transformer model jointly for all techniques as a multi-label classification task using all sentences.</p>
</list-item>
<list-item id="j_infor633_li_027">
<label>•</label>
<p>Fine-tuning the transformer model jointly for all techniques as a multi-label classification task using only sentences containing propaganda spans.</p>
</list-item>
</list>
<sec id="j_infor633_s_015">
<label>4.3.1</label>
<title>Evaluation Metrics for Techniques Classification on Sentence Level</title>
<p>To compare the binary and multi-label approaches, predictions are evaluated separately for each propaganda technique as a binary classification task. We use <italic>macro-F1</italic> as the main evaluation metric. For each technique, macro-F1 is computed as the unweighted average of the F1 scores for the positive and negative classes: 
<disp-formula id="j_infor633_eq_004">
<label>(4)</label><alternatives><mml:math display="block">
<mml:mtable displaystyle="true">
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">F</mml:mi>
<mml:mn mathvariant="normal">1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="normal">macro</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo><mml:mstyle displaystyle="true">
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">F</mml:mi>
<mml:mn mathvariant="normal">1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo>+</mml:mo>
</mml:mrow>
</mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">F</mml:mi>
<mml:mn mathvariant="normal">1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[ {\mathrm{F}\mathrm{1}_{\mathrm{macro}}}=\frac{1}{2}({\mathrm{F}\mathrm{1}_{+}}+{\mathrm{F}\mathrm{1}_{-}}),\]]]></tex-math></alternatives>
</disp-formula> 
where <inline-formula id="j_infor633_ineq_007"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">F</mml:mi>
<mml:mn mathvariant="normal">1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo>+</mml:mo>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${\mathrm{F}\mathrm{1}_{+}}$]]></tex-math></alternatives></inline-formula> denotes the F1 score for the positive class, corresponding to sentences containing the target technique, and <inline-formula id="j_infor633_ineq_008"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">F</mml:mi>
<mml:mn mathvariant="normal">1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${\mathrm{F}\mathrm{1}_{-}}$]]></tex-math></alternatives></inline-formula> denotes the F1 score for the negative class, corresponding to sentences not containing the target technique. This metric is more suitable than accuracy in the presence of class imbalance, since it assigns equal importance to both classes.</p>
<p>Accuracy is also reported as a supplementary metric to provide a general indication of the proportion of correctly classified instances.</p>
</sec>
<sec id="j_infor633_s_016">
<label>4.3.2</label>
<title>Multi-Class Case Evaluation</title>
<p>Since we also fine-tune a propaganda technique detection model in a multi-class setting, we use multi-class evaluation metrics during training, specifically for monitoring model performance and selecting the best model. In particular, the performance of the multi-class model is monitored using the macro-F1 score computed over the positive classes, which is defined as: 
<disp-formula id="j_infor633_eq_005">
<label>(5)</label><alternatives><mml:math display="block">
<mml:mtable displaystyle="true">
<mml:mtr>
<mml:mtd>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mtext>macro</mml:mtext>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo><mml:mstyle displaystyle="true">
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">K</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
<mml:munderover accentunder="false" accent="false">
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:mo largeop="true" movablelimits="false">∑</mml:mo></mml:mstyle>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">K</mml:mi>
</mml:mrow>
</mml:munderover>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">,</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[ F{1_{\text{macro}}}=\frac{1}{K}{\sum \limits_{k=1}^{K}}F{1_{k}},\]]]></tex-math></alternatives>
</disp-formula> 
where <inline-formula id="j_infor633_ineq_009"><alternatives><mml:math>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[$F{1_{k}}$]]></tex-math></alternatives></inline-formula> is the F1-score computed for technique <italic>k</italic>, and <italic>K</italic> is the total number of techniques.</p>
</sec>
</sec>
<sec id="j_infor633_s_017">
<label>4.4</label>
<title>Transformer Model</title>
<p>In this study, we employ the <italic>LT-MLKM-modernBERT</italic> model (State Digital Solutions Agency, <xref ref-type="bibr" rid="j_infor633_ref_018">2025</xref>). <italic>LT-MLKM-modernBERT</italic> is a monolingual, encoder-only transformer based on the <italic>ModernBERT-base</italic> architecture and specifically pretrained for the Lithuanian language. Pretraining was conducted on approximately 1.87 billion words (around 49 billion tokens) collected from diverse Lithuanian sources, including news, legal, academic, and public sector texts. The model comprises 22 Transformer encoder layers with 12 attention heads and a hidden representation size of 768 dimensions, resulting in approximately 149 million parameters. It employs a custom Lithuanian tokenizer with a vocabulary of 64 000 tokens and supports a maximum input sequence length of 8 192 tokens. We selected this model because it is currently the largest publicly available Lithuanian language model and supports the longest input sequence length among Lithuanian pretrained transformer models, making it particularly suitable for processing longer textual contexts.</p>
<p>To assess whether <italic>LT-MLKM-modernBERT</italic> indeed provides superior performance, we additionally evaluate two alternative transformer models specifically on the span identification task. These models are included solely for comparative purposes and are not used in other tasks within this study. In particular, we consider two multilingual models that support the Lithuanian language: XLM-RoBERTa (Conneau <italic>et al.</italic>, <xref ref-type="bibr" rid="j_infor633_ref_003">2020</xref>) and LitLatBERT (Ulčar and Robnik-Šikonja, <xref ref-type="bibr" rid="j_infor633_ref_019">2020</xref>). This comparison allows us to determine whether the selected monolingual model offers a tangible advantage over widely used multilingual alternatives for span identification.</p>
<fig id="j_infor633_fig_007">
<label>Fig. 7</label>
<caption>
<p>Overview of a Transformer encoder architecture and its application. (A) Encoder producing contextual token representations. (B) Sentence classification using the [CLS] representation. (C) Token-level sequence tagging.</p>
</caption>
<graphic xlink:href="infor633_g007.jpg"/>
</fig>
<p>Figure <xref rid="j_infor633_fig_007">7</xref> illustrates the general Transformer encoder architecture and its adaptations for the tasks analysed in this study: sequence tagging and sentence classification. In both tasks, the input data undergoes the same processing stages, including tokenization, embedding, encoding, and generation of contextual embeddings. The primary difference lies in the final stage, where task-specific classification layers are applied. In the following section, we describe the architecture in detail.</p>
<p><italic>Transformer Encoder Architecture</italic>. A Transformer encoder maps an input token sequence <inline-formula id="j_infor633_ineq_010"><alternatives><mml:math>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo>=</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">n</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">n</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$x=({x_{0}},{x_{1}},\dots ,{x_{n}},{x_{n+1}})$]]></tex-math></alternatives></inline-formula> into contextualized hidden representations <inline-formula id="j_infor633_ineq_011"><alternatives><mml:math>
<mml:mi mathvariant="italic">H</mml:mi>
<mml:mo>=</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">h</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">h</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">h</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">n</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">h</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">n</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$H=({h_{0}},{h_{1}},\dots ,{h_{n}},{h_{n+1}})$]]></tex-math></alternatives></inline-formula>, where <inline-formula id="j_infor633_ineq_012"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mo fence="true" stretchy="false">[</mml:mo>
<mml:mtext>CLS</mml:mtext>
<mml:mo fence="true" stretchy="false">]</mml:mo></mml:math><tex-math><![CDATA[${x_{0}}=[\text{CLS}]$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_infor633_ineq_013"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">n</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mo fence="true" stretchy="false">[</mml:mo>
<mml:mtext>SEP</mml:mtext>
<mml:mo fence="true" stretchy="false">]</mml:mo></mml:math><tex-math><![CDATA[${x_{n+1}}=[\text{SEP}]$]]></tex-math></alternatives></inline-formula> denote special tokens. First, the input text is tokenized and converted into token identifiers. Each token identifier is mapped to a trainable token embedding vector, which is combined with a positional embedding to encode word order information. The resulting sequence of embeddings is then passed through a stack of Transformer encoder layers.</p>
<p>Each encoder layer applies multi-head self-attention followed by a position-wise feed-forward network, together with residual connections and layer normalization. Self-attention allows each token representation to attend to all other tokens in the sequence, enabling the model to capture long-range dependencies and contextual interactions. As a result, the final hidden state <inline-formula id="j_infor633_ineq_014"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">h</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${h_{i}}$]]></tex-math></alternatives></inline-formula> corresponding to token <inline-formula id="j_infor633_ineq_015"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${x_{i}}$]]></tex-math></alternatives></inline-formula> represents a contextual embedding that incorporates information from the entire input sequence.</p>
<p><italic>Sentence-Level Classification</italic>. In the sentence classification setting, the model predicts a single label for the entire input sequence based on the contextual representation of the special classification token <inline-formula id="j_infor633_ineq_016"><alternatives><mml:math>
<mml:mo fence="true" stretchy="false">[</mml:mo>
<mml:mtext>CLS</mml:mtext>
<mml:mo fence="true" stretchy="false">]</mml:mo></mml:math><tex-math><![CDATA[$[\text{CLS}]$]]></tex-math></alternatives></inline-formula>, i.e. the hidden state <inline-formula id="j_infor633_ineq_017"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">h</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo fence="true" stretchy="false">[</mml:mo>
<mml:mtext>CLS</mml:mtext>
<mml:mo fence="true" stretchy="false">]</mml:mo>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${h_{[\text{CLS}]}}$]]></tex-math></alternatives></inline-formula>. This vector serves as a fixed-dimensional representation of the whole sequence.</p>
<p>A classification layer maps this representation to a two-dimensional output space corresponding to binary classes. The resulting vector <inline-formula id="j_infor633_ineq_018"><alternatives><mml:math>
<mml:mi mathvariant="italic">z</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="double-struck">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[$z\in {\mathbb{R}^{2}}$]]></tex-math></alternatives></inline-formula> contains unnormalized scores for each class and is computed as: 
<disp-formula id="j_infor633_eq_006">
<label>(6)</label><alternatives><mml:math display="block">
<mml:mtable displaystyle="true">
<mml:mtr>
<mml:mtd>
<mml:mi mathvariant="italic">z</mml:mi>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">W</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>sent</mml:mtext>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">h</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo fence="true" stretchy="false">[</mml:mo>
<mml:mtext>CLS</mml:mtext>
<mml:mo fence="true" stretchy="false">]</mml:mo>
</mml:mrow>
</mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">b</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>sent</mml:mtext>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">,</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[ z={W_{\text{sent}}}{h_{[\text{CLS}]}}+{b_{\text{sent}}},\]]]></tex-math></alternatives>
</disp-formula> 
where <inline-formula id="j_infor633_ineq_019"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">W</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>sent</mml:mtext>
</mml:mrow>
</mml:msub>
<mml:mo stretchy="false">∈</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="double-struck">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
<mml:mo>×</mml:mo>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[${W_{\text{sent}}}\in {\mathbb{R}^{2\times d}}$]]></tex-math></alternatives></inline-formula>, <inline-formula id="j_infor633_ineq_020"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">b</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>sent</mml:mtext>
</mml:mrow>
</mml:msub>
<mml:mo stretchy="false">∈</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="double-struck">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[${b_{\text{sent}}}\in {\mathbb{R}^{2}}$]]></tex-math></alternatives></inline-formula>, and <italic>d</italic> is the hidden dimension of the encoder.</p>
<p>The predicted label distribution is obtained by applying the softmax function to these scores. During training, the model is optimized using cross-entropy loss with respect to the gold binary label.</p>
<p><italic>Sequence Tagging</italic>. In the sequence tagging setting, the model predicts a label for each input token in the sequence, excluding special tokens such as <inline-formula id="j_infor633_ineq_021"><alternatives><mml:math>
<mml:mo fence="true" stretchy="false">[</mml:mo>
<mml:mtext>CLS</mml:mtext>
<mml:mo fence="true" stretchy="false">]</mml:mo></mml:math><tex-math><![CDATA[$[\text{CLS}]$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_infor633_ineq_022"><alternatives><mml:math>
<mml:mo fence="true" stretchy="false">[</mml:mo>
<mml:mtext>SEP</mml:mtext>
<mml:mo fence="true" stretchy="false">]</mml:mo></mml:math><tex-math><![CDATA[$[\text{SEP}]$]]></tex-math></alternatives></inline-formula>. Instead of using only the sequence-level representation <inline-formula id="j_infor633_ineq_023"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">h</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo fence="true" stretchy="false">[</mml:mo>
<mml:mtext>CLS</mml:mtext>
<mml:mo fence="true" stretchy="false">]</mml:mo>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${h_{[\text{CLS}]}}$]]></tex-math></alternatives></inline-formula>, the model uses the contextual representations of individual tokens, i.e. <inline-formula id="j_infor633_ineq_024"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">h</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">h</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">h</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">n</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${h_{1}},{h_{2}},\dots ,{h_{n}}$]]></tex-math></alternatives></inline-formula>. A token-level classification layer is applied independently to each token representation: 
<disp-formula id="j_infor633_eq_007">
<label>(7)</label><alternatives><mml:math display="block">
<mml:mtable displaystyle="true">
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">z</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">W</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>tok</mml:mtext>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">h</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">b</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>tok</mml:mtext>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">,</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[ {z_{i}}={W_{\text{tok}}}{h_{i}}+{b_{\text{tok}}},\]]]></tex-math></alternatives>
</disp-formula> 
where <inline-formula id="j_infor633_ineq_025"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">z</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${z_{i}}$]]></tex-math></alternatives></inline-formula> denotes the vector of unnormalized scores for token <italic>i</italic>.</p>
<p>In this work, token-level labels are modelled using two different tagging formulations: a binary tagging scheme and the BILOU tagging scheme. In the binary tagging formulation, each token is assigned one of two labels indicating whether the token belongs to the target span or not. Formally, the label set is <inline-formula id="j_infor633_ineq_026"><alternatives><mml:math>
<mml:mo fence="true" stretchy="false">{</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo fence="true" stretchy="false">}</mml:mo></mml:math><tex-math><![CDATA[$\{0,1\}$]]></tex-math></alternatives></inline-formula>, where label 1 denotes that the token is part of a target span and label 0 indicates that the token does not belong to any labelled span. This formulation simplifies the sequence labelling problem by focusing only on the presence or absence of the target phenomenon at the token level.</p>
<p>In addition to the binary formulation, we also employ the BILOU tagging scheme. The BILOU label set includes the tag <italic>O</italic> and structured span labels of the form <inline-formula id="j_infor633_ineq_027"><alternatives><mml:math>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mtext>-</mml:mtext>
<mml:mi mathvariant="italic">t</mml:mi></mml:math><tex-math><![CDATA[$B\text{-}t$]]></tex-math></alternatives></inline-formula>, <inline-formula id="j_infor633_ineq_028"><alternatives><mml:math>
<mml:mi mathvariant="italic">I</mml:mi>
<mml:mtext>-</mml:mtext>
<mml:mi mathvariant="italic">t</mml:mi></mml:math><tex-math><![CDATA[$I\text{-}t$]]></tex-math></alternatives></inline-formula>, <inline-formula id="j_infor633_ineq_029"><alternatives><mml:math>
<mml:mi mathvariant="italic">L</mml:mi>
<mml:mtext>-</mml:mtext>
<mml:mi mathvariant="italic">t</mml:mi></mml:math><tex-math><![CDATA[$L\text{-}t$]]></tex-math></alternatives></inline-formula>, and <inline-formula id="j_infor633_ineq_030"><alternatives><mml:math>
<mml:mi mathvariant="italic">U</mml:mi>
<mml:mtext>-</mml:mtext>
<mml:mi mathvariant="italic">t</mml:mi></mml:math><tex-math><![CDATA[$U\text{-}t$]]></tex-math></alternatives></inline-formula>, where <italic>t</italic> denotes a target category. The tag <italic>B</italic> marks the beginning of a multi-token span, <italic>I</italic> marks a token inside the span, <italic>L</italic> marks the last token of the span, and <italic>U</italic> denotes a single-token span. The tag <italic>O</italic> indicates that the token does not belong to any labelled span.</p>
<p>Compared to binary tagging, the BILOU scheme explicitly models span boundaries, allowing the model to distinguish between the beginning, inside, and end of multi-token spans, as well as single-token spans. This provides richer structural information about entity boundaries.</p>
</sec>
</sec>
<sec id="j_infor633_s_018">
<label>5</label>
<title>Experimental Setup</title>
<p>First, we separate a test set that is used exclusively for the final evaluation in all tasks. This set remains untouched during all fine-tuning procedures and is used only for testing. Specifically, 105 articles are selected using stratified sampling by propaganda technique. The remaining data are used for model training and validation. For each task, the data are further split into training and validation sets, with 15% reserved for validation to monitor model performance. The split is performed at the article level to ensure that text chunks from the same article do not appear in both the training and validation sets, thereby preventing data leakage.</p>
<p>For all tasks, we use the same main fine-tuning hyperparameters. We did not perform an exhaustive hyperparameter optimization procedure, such as grid search or Bayesian optimization. Instead, the main fine-tuning hyperparameters were fixed across experiments using commonly adopted settings for transformer-based models. Specifically, all models were trained for 10 epochs using a batch size of 16 and the AdamW optimizer with a learning rate of <inline-formula id="j_infor633_ineq_031"><alternatives><mml:math>
<mml:mn>3</mml:mn>
<mml:mo>×</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mn>10</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>5</mml:mn>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[$3\times {10^{-5}}$]]></tex-math></alternatives></inline-formula> and a weight decay of 0.01. Full fine-tuning was applied in all experiments: all transformer encoder parameters and the task-specific classification layers were updated during training, and no layers were frozen. The reported results are based on a single fine-tuning run for each experimental configuration. These hyperparameters and fine-tuning settings were kept constant across experiments to ensure comparability between different modelling settings.</p>
<sec id="j_infor633_s_019">
<label>5.1</label>
<title>Task 1: Span Identification</title>
<p>In this task, we fine-tune the transformer model using different input sequence lengths (512, 1024, and 2048 tokens) and two tagging schemes: Binary and BILOU. In total, the model is fine-tuned six times, covering all combinations of these parameters: (512, Binary), (1024, Binary), (2048, Binary), (512, BILOU), (1024, BILOU), and (2048, BILOU).</p>
<p>Since some articles exceed the maximum input length, they are split into shorter textual fragments so that each fragment does not exceed this limit at the sentence level. This ensures that each fragment ends at a sentence boundary rather than in the middle of a sentence. The best-performing model is selected based on the overall span-level F1 score (see Section <xref rid="j_infor633_s_013">4.2.2</xref>). The models are trained using the standard cross-entropy loss without class weighting. The goal of this step is to obtain the span identification model with the highest performance, which is selected based on the performance on test set.</p>
<p>Additionally, for comparative purposes, we fine-tune the XLM-RoBERTa and LitLatBERT models on the same span identification task using the same parameters. These experiments are conducted solely to compare their performance with the <italic>LT-MLKM-modernBERT</italic> model and to assess whether the selected model provides superior results.</p>
</sec>
<sec id="j_infor633_s_020">
<label>5.2</label>
<title>Task 2: Technique Classification</title>
<p>In this task, we fine-tune a Transformer model for sentence-level classification. The model is trained separately for each technique using two different training settings: (i) using all sentences extracted from the articles, without incorporating span-level information, and (ii) using only sentences corresponding to gold-annotated spans, thereby explicitly leveraging span-level information. Overall, the model is fine-tuned for nine techniques under both settings, resulting in a total of 18 training runs.</p>
<p>Before fine-tuning, the data is preprocessed by splitting all articles into individual sentences. Based on the annotations, each sentence is assigned nine binary labels corresponding to the nine techniques. Each label indicates whether the respective technique appears anywhere within the sentence, where label 1 denotes the presence of the technique and label 0 indicates its absence. For the experiments using only span sentences, we remove sentences that contain only negative labels (i.e. sentences where all nine technique labels are 0).</p>
<p>The best performing model is selected based on the macro-averaged F1 score (see Section <xref rid="j_infor633_s_015">4.3.1</xref>). Since most techniques are highly imbalanced, we apply a weighted cross-entropy loss during training. The only exception is the <italic>Emotional Expression</italic> technique, for which the class distribution is relatively balanced; therefore, the standard (unweighted) cross-entropy loss is used.</p>
</sec>
<sec id="j_infor633_s_021">
<label>5.3</label>
<title>Overall Performance Evaluation</title>
<p>For the final evaluation, we assess the technique classification models in three different ways. First, we report the macro-F1 score of the sentence classification model trained using all sentences, without incorporating any span information. Second, we report sentence classification results when using gold span information. Third, we evaluate sentence classification performance using spans predicted by the best-performing span identification model obtained in Task 1 (Span Identification). This evaluation setup allows us to analyse the overall effectiveness of span-based information for technique classification and to estimate how much bias or performance variation is introduced when span identification predictions are used instead of gold annotations.</p>
<sec id="j_infor633_s_022">
<label>5.3.1</label>
<title>Comparison with ChatGPT</title>
<p>For the final evaluation, we also compare our technique classification results with the GPT-5.3 model, one of the latest GPT models. The GPT model is accessed through the agent interface and configured as a sentence-level labelling agent for each propaganda technique. We investigate two variants: zero-shot, where the prompt contains only the definition of the technique, and few-shot, where the prompt additionally includes ten examples: five sentences labelled with 1 (technique present) and five labelled with 0 (technique absent).</p>
<p>We aim to replicate the logic used in our approach, where each model is fine-tuned separately as a binary sentence-level classifier for each technique. Following the same setup with GPT-5.3, we provide a separate prompt for each technique and instruct the model to focus only on the specified technique during annotation. The prompts used in this approach are shown in Fig. <xref rid="j_infor633_fig_008">8</xref>.</p>
<fig id="j_infor633_fig_008">
<label>Fig. 8</label>
<caption>
<p>Zero-shot and few-shot prompts used for propaganda technique annotation.</p>
</caption>
<graphic xlink:href="infor633_g008.jpg"/>
</fig>
</sec>
</sec>
</sec>
<sec id="j_infor633_s_023">
<label>6</label>
<title>Results</title>
<sec id="j_infor633_s_024">
<label>6.1</label>
<title>Span Identification</title>
<p>Table <xref rid="j_infor633_tab_002">2</xref> reports the results of span identification models fine-tuned with different input sequence lengths and tagging schemes. Overall, the results on the test set clearly show that the BILOU tagging scheme consistently outperforms the Binary scheme across all input lengths. This outcome is expected, since BILOU explicitly models span boundaries by distinguishing the beginning, inside, last, and unit tokens of spans, whereas the Binary scheme only indicates whether a token belongs to a span or not. The results also indicate that increasing the maximum input sequence length does not improve performance. For both tagging schemes, the best results are achieved with the smallest input size of 512 tokens.</p>
<p>From the perspective of precision and recall, a consistent pattern can be observed. Binary tagging achieves higher recall than precision across all experimental settings on the test set, indicating that it favours broader coverage of gold spans rather than strict boundary accuracy. In some cases, Binary tagging even achieves higher recall than BILOU. For example, with an input length of 512 tokens, Binary tagging reaches a recall of 77.08%, compared to 71.49% for BILOU. A similar pattern appears for the input length of 1024 tokens, where Binary achieves 75.86% recall compared to 67.23% for BILOU.</p>
<p>However, BILOU tagging consistently achieves substantially higher precision across all configurations, often outperforming Binary tagging by nearly 20 percentage points. Overall, BILOU tagging demonstrates a more balanced trade-off between precision and recall, whereas Binary tagging shows a clear imbalance between these two metrics. This suggests that BILOU tagging produces more stable span predictions by simultaneously capturing a larger proportion of gold span characters while also maintaining more accurate span boundaries. In contrast, Binary tagging primarily focuses on identifying tokens that belong to gold spans but does not explicitly model span boundaries. As a result, it often predicts spans that are overly broad or include additional characters that should not belong to the span. Considering the overall performance measured by the F1 score on the test set, the BILOU tagging scheme consistently yields better results. The best performance is achieved with BILOU tagging and a maximum input length of 512 tokens, reaching an F1 score of 71.95%. Therefore, this configuration is selected as the final span identification model.</p>
<p>Additionally, for comparative purposes, we fine-tuned the multilingual transformer models XLM-RoBERTa and LitLatBERT using the BILOU tagging scheme and a maximum input size of 512 tokens, which is the largest supported sequence length for these models. The results, presented in Table <xref rid="j_infor633_tab_003">3</xref>, confirm that <italic>LT-MLKM-modernBERT</italic> outperforms these transformers and achieves the highest performance in span identification. Based on these results, the other transformer models are not used in subsequent experiments, as <italic>LT-MLKM-modernBERT</italic> demonstrates superior performance.</p>
<table-wrap id="j_infor633_tab_002">
<label>Table 2</label>
<caption>
<p>Span identification performance of LT-MLKM-modernBERT under different input sizes and tagging schemes.</p>
</caption>
<table>
<thead>
<tr>
<td rowspan="2" style="vertical-align: top; text-align: left; border-top: solid thin; border-bottom: solid thin">Max input size</td>
<td rowspan="2" style="vertical-align: top; text-align: left; border-top: solid thin; border-bottom: solid thin">Tagging scheme</td>
<td colspan="3" style="vertical-align: top; text-align: left; border-top: solid thin; border-bottom: solid thin">Train</td>
<td colspan="3" style="vertical-align: top; text-align: left; border-top: solid thin; border-bottom: solid thin">Validation</td>
<td colspan="3" style="vertical-align: top; text-align: left; border-top: solid thin; border-bottom: solid thin">Test</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">Precision</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">Recall</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">F1</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">Precision</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">Recall</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">F1</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">Precision</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">Recall</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">F1</td>
</tr>
</thead>
<tbody>
<tr>
<td style="vertical-align: top; text-align: left">512</td>
<td style="vertical-align: top; text-align: left">Binary</td>
<td style="vertical-align: top; text-align: left">61.34%</td>
<td style="vertical-align: top; text-align: left">80.44%</td>
<td style="vertical-align: top; text-align: left">69.60%</td>
<td style="vertical-align: top; text-align: left">55.77%</td>
<td style="vertical-align: top; text-align: left">74.92%</td>
<td style="vertical-align: top; text-align: left">63.94%</td>
<td style="vertical-align: top; text-align: left">55.52%</td>
<td style="vertical-align: top; text-align: left">77.08%</td>
<td style="vertical-align: top; text-align: left">64.55%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left"><bold>512</bold></td>
<td style="vertical-align: top; text-align: left"><bold>BILOU</bold></td>
<td style="vertical-align: top; text-align: left">96.09%</td>
<td style="vertical-align: top; text-align: left">89.05%</td>
<td style="vertical-align: top; text-align: left">92.44%</td>
<td style="vertical-align: top; text-align: left">66.84%</td>
<td style="vertical-align: top; text-align: left">71.91%</td>
<td style="vertical-align: top; text-align: left">69.28%</td>
<td style="vertical-align: top; text-align: left">72.41%</td>
<td style="vertical-align: top; text-align: left">71.49%</td>
<td style="vertical-align: top; text-align: left"><bold>71.95%</bold></td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left">1024</td>
<td style="vertical-align: top; text-align: left">Binary</td>
<td style="vertical-align: top; text-align: left">68.46%</td>
<td style="vertical-align: top; text-align: left">88.28%</td>
<td style="vertical-align: top; text-align: left">77.12%</td>
<td style="vertical-align: top; text-align: left">54.53%</td>
<td style="vertical-align: top; text-align: left">75.93%</td>
<td style="vertical-align: top; text-align: left">63.47%</td>
<td style="vertical-align: top; text-align: left">55.75%</td>
<td style="vertical-align: top; text-align: left">75.86%</td>
<td style="vertical-align: top; text-align: left">64.27%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left">1024</td>
<td style="vertical-align: top; text-align: left">BILOU</td>
<td style="vertical-align: top; text-align: left">95.72%</td>
<td style="vertical-align: top; text-align: left">78.90%</td>
<td style="vertical-align: top; text-align: left">86.50%</td>
<td style="vertical-align: top; text-align: left">70.86%</td>
<td style="vertical-align: top; text-align: left">66.58%</td>
<td style="vertical-align: top; text-align: left">68.65%</td>
<td style="vertical-align: top; text-align: left">73.31%</td>
<td style="vertical-align: top; text-align: left">67.23%</td>
<td style="vertical-align: top; text-align: left">70.14%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left">2048</td>
<td style="vertical-align: top; text-align: left">Binary</td>
<td style="vertical-align: top; text-align: left">67.15%</td>
<td style="vertical-align: top; text-align: left">90.81%</td>
<td style="vertical-align: top; text-align: left">77.21%</td>
<td style="vertical-align: top; text-align: left">55.89%</td>
<td style="vertical-align: top; text-align: left">76.14%</td>
<td style="vertical-align: top; text-align: left">64.46%</td>
<td style="vertical-align: top; text-align: left">54.85%</td>
<td style="vertical-align: top; text-align: left">60.85%</td>
<td style="vertical-align: top; text-align: left">57.70%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">2048</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">BILOU</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">94.91%</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">79.25%</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">86.38%</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">71.19%</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">70.05%</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">70.62%</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">75.09%</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">64.55%</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">69.42%</td>
</tr>
</tbody>
</table>
</table-wrap>
<table-wrap id="j_infor633_tab_003">
<label>Table 3</label>
<caption>
<p>Comparison of different models for span identification using a maximum input size of 512 tokens and the BILOU tagging scheme.</p>
</caption>
<table>
<thead>
<tr>
<td rowspan="2" style="vertical-align: middle; text-align: left; border-top: solid thin; border-bottom: solid thin">Model</td>
<td colspan="3" style="vertical-align: top; text-align: left; border-top: solid thin; border-bottom: solid thin">Train</td>
<td colspan="3" style="vertical-align: top; text-align: left; border-top: solid thin; border-bottom: solid thin">Validation</td>
<td colspan="3" style="vertical-align: top; text-align: left; border-top: solid thin; border-bottom: solid thin">Test</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">Precision</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">Recall</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">F1</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">Precision</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">Recall</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">F1</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">Precision</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">Recall</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">F1</td>
</tr>
</thead>
<tbody>
<tr>
<td style="vertical-align: top; text-align: left">LT-MLKM-modernBERT</td>
<td style="vertical-align: top; text-align: left">96.09%</td>
<td style="vertical-align: top; text-align: left">89.05%</td>
<td style="vertical-align: top; text-align: left">92.44%</td>
<td style="vertical-align: top; text-align: left">66.84%</td>
<td style="vertical-align: top; text-align: left">71.91%</td>
<td style="vertical-align: top; text-align: left">69.28%</td>
<td style="vertical-align: top; text-align: left">72.41%</td>
<td style="vertical-align: top; text-align: left">71.49%</td>
<td style="vertical-align: top; text-align: left"><bold>71.95%</bold></td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left">XLM-RoBERTa</td>
<td style="vertical-align: top; text-align: left">59.26%</td>
<td style="vertical-align: top; text-align: left">92.28%</td>
<td style="vertical-align: top; text-align: left">72.17%</td>
<td style="vertical-align: top; text-align: left">55.27%</td>
<td style="vertical-align: top; text-align: left">79.10%</td>
<td style="vertical-align: top; text-align: left">65.07%</td>
<td style="vertical-align: top; text-align: left">60.44%</td>
<td style="vertical-align: top; text-align: left">81.71%</td>
<td style="vertical-align: top; text-align: left">69.49%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">LitLatBERT</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">61.51%</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">75.02%</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">67.59%</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">59.35%</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">60.74%</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">60.03%</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">62.35%</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">67.15%</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">64.66%</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="j_infor633_s_025">
<label>6.2</label>
<title>Techniques Classification</title>
<p>Table <xref rid="j_infor633_tab_004">4</xref> reports the results for the sentence-level propaganda technique classification task. It should be noted that in the span-sentence setting, the sentences are selected based on gold spans obtained directly from the annotations.</p>
<table-wrap id="j_infor633_tab_004">
<label>Table 4</label>
<caption>
<p>Sentence-level classification results for all techniques under two training settings: using all sentences and using only span sentences.</p>
</caption>
<table>
<thead>
<tr>
<td rowspan="2" style="vertical-align: middle; text-align: left; border-top: solid thin; border-bottom: solid thin">Technique</td>
<td rowspan="2" style="vertical-align: middle; text-align: left; border-top: solid thin; border-bottom: solid thin">Setting</td>
<td colspan="2" style="vertical-align: top; text-align: left; border-top: solid thin; border-bottom: solid thin">Training</td>
<td colspan="2" style="vertical-align: top; text-align: left; border-top: solid thin; border-bottom: solid thin">Validation</td>
<td colspan="2" style="vertical-align: top; text-align: left; border-top: solid thin; border-bottom: solid thin">Testing</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">Acc.</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">F1</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">Acc.</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">F1</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">Acc.</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">F1</td>
</tr>
</thead>
<tbody>
<tr>
<td rowspan="2" style="vertical-align: middle; text-align: left">Emotional Expression</td>
<td style="vertical-align: top; text-align: left"><bold>All sentences</bold></td>
<td style="vertical-align: top; text-align: left">76.83%</td>
<td style="vertical-align: top; text-align: left">71.38%</td>
<td style="vertical-align: top; text-align: left">69.09%</td>
<td style="vertical-align: top; text-align: left">65.18%</td>
<td style="vertical-align: top; text-align: left">70.84%</td>
<td style="vertical-align: top; text-align: left"><bold>66.08</bold>%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left">Span sentences</td>
<td style="vertical-align: top; text-align: left">73.13%</td>
<td style="vertical-align: top; text-align: left">72.75%</td>
<td style="vertical-align: top; text-align: left">63.13%</td>
<td style="vertical-align: top; text-align: left">62.61%</td>
<td style="vertical-align: top; text-align: left">61.24%</td>
<td style="vertical-align: top; text-align: left">60.62%</td>
</tr>
<tr>
<td rowspan="2" style="vertical-align: middle; text-align: left">Simplification</td>
<td style="vertical-align: top; text-align: left">All sentences</td>
<td style="vertical-align: top; text-align: left">98.28%</td>
<td style="vertical-align: top; text-align: left">97.27%</td>
<td style="vertical-align: top; text-align: left">78.51%</td>
<td style="vertical-align: top; text-align: left">61.11%</td>
<td style="vertical-align: top; text-align: left">78.23%</td>
<td style="vertical-align: top; text-align: left">60.25%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left"><bold>Span sentences</bold></td>
<td style="vertical-align: top; text-align: left">60.69%</td>
<td style="vertical-align: top; text-align: left">57.61%</td>
<td style="vertical-align: top; text-align: left">66.53%</td>
<td style="vertical-align: top; text-align: left">62.94%</td>
<td style="vertical-align: top; text-align: left">70.22%</td>
<td style="vertical-align: top; text-align: left"><bold>61.70</bold>%</td>
</tr>
<tr>
<td rowspan="2" style="vertical-align: middle; text-align: left">Doubt</td>
<td style="vertical-align: top; text-align: left">All sentences</td>
<td style="vertical-align: top; text-align: left">82.14%</td>
<td style="vertical-align: top; text-align: left">70.71%</td>
<td style="vertical-align: top; text-align: left">76.39%</td>
<td style="vertical-align: top; text-align: left">61.99%</td>
<td style="vertical-align: top; text-align: left">76.18%</td>
<td style="vertical-align: top; text-align: left">59.90%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left"><bold>Span sentences</bold></td>
<td style="vertical-align: top; text-align: left">64.51%</td>
<td style="vertical-align: top; text-align: left">58.32%</td>
<td style="vertical-align: top; text-align: left">76.92%</td>
<td style="vertical-align: top; text-align: left">68.45%</td>
<td style="vertical-align: top; text-align: left">74.22%</td>
<td style="vertical-align: top; text-align: left"><bold>64.97</bold>%</td>
</tr>
<tr>
<td rowspan="2" style="vertical-align: middle; text-align: left">Whataboutism/Red Herring/Strawman</td>
<td style="vertical-align: top; text-align: left">All sentences</td>
<td style="vertical-align: top; text-align: left">85.86%</td>
<td style="vertical-align: top; text-align: left">50.73%</td>
<td style="vertical-align: top; text-align: left">89.37%</td>
<td style="vertical-align: top; text-align: left">53.83%</td>
<td style="vertical-align: top; text-align: left">87.75%</td>
<td style="vertical-align: top; text-align: left">49.36%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left"><bold>Span sentences</bold></td>
<td style="vertical-align: top; text-align: left">75.38%</td>
<td style="vertical-align: top; text-align: left">51.35%</td>
<td style="vertical-align: top; text-align: left">89.99%</td>
<td style="vertical-align: top; text-align: left">53.87%</td>
<td style="vertical-align: top; text-align: left">90.21%</td>
<td style="vertical-align: top; text-align: left"><bold>51.74</bold>%</td>
</tr>
<tr>
<td rowspan="2" style="vertical-align: middle; text-align: left">Appeal to Authority</td>
<td style="vertical-align: top; text-align: left">All sentences</td>
<td style="vertical-align: top; text-align: left">75.35%</td>
<td style="vertical-align: top; text-align: left">50.97%</td>
<td style="vertical-align: top; text-align: left">79.65%</td>
<td style="vertical-align: top; text-align: left">52.49%</td>
<td style="vertical-align: top; text-align: left">77.08%</td>
<td style="vertical-align: top; text-align: left">48.85%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left"><bold>Span sentences</bold></td>
<td style="vertical-align: top; text-align: left">99.34%</td>
<td style="vertical-align: top; text-align: left">97.62%</td>
<td style="vertical-align: top; text-align: left">91.58%</td>
<td style="vertical-align: top; text-align: left">69.37%</td>
<td style="vertical-align: top; text-align: left">91.54%</td>
<td style="vertical-align: top; text-align: left"><bold>67.06</bold>%</td>
</tr>
<tr>
<td rowspan="2" style="vertical-align: middle; text-align: left">Waving the Flag</td>
<td style="vertical-align: top; text-align: left">All sentences</td>
<td style="vertical-align: top; text-align: left">97.91%</td>
<td style="vertical-align: top; text-align: left">80.34%</td>
<td style="vertical-align: top; text-align: left">96.29%</td>
<td style="vertical-align: top; text-align: left">67.62%</td>
<td style="vertical-align: top; text-align: left">96.31%</td>
<td style="vertical-align: top; text-align: left">68.85%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left"><bold>Span sentences</bold></td>
<td style="vertical-align: top; text-align: left">99.91%</td>
<td style="vertical-align: top; text-align: left">99.40%</td>
<td style="vertical-align: top; text-align: left">95.38%</td>
<td style="vertical-align: top; text-align: left">70.61%</td>
<td style="vertical-align: top; text-align: left">94.67%</td>
<td style="vertical-align: top; text-align: left"><bold>69.85</bold>%</td>
</tr>
<tr>
<td rowspan="2" style="vertical-align: middle; text-align: left">Uncertainty</td>
<td style="vertical-align: top; text-align: left">All sentences</td>
<td style="vertical-align: top; text-align: left">96.60%</td>
<td style="vertical-align: top; text-align: left">66.97%</td>
<td style="vertical-align: top; text-align: left">97.05%</td>
<td style="vertical-align: top; text-align: left">53.90%</td>
<td style="vertical-align: top; text-align: left">97.33%</td>
<td style="vertical-align: top; text-align: left">50.46%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left"><bold>Span sentences</bold></td>
<td style="vertical-align: top; text-align: left">94.51%</td>
<td style="vertical-align: top; text-align: left">68.36%</td>
<td style="vertical-align: top; text-align: left">89.29%</td>
<td style="vertical-align: top; text-align: left">52.57%</td>
<td style="vertical-align: top; text-align: left">91.02%</td>
<td style="vertical-align: top; text-align: left"><bold>57.47</bold>%</td>
</tr>
<tr>
<td rowspan="2" style="vertical-align: middle; text-align: left">Reductio Ad Hitlerum</td>
<td style="vertical-align: top; text-align: left">All sentences</td>
<td style="vertical-align: top; text-align: left">99.20%</td>
<td style="vertical-align: top; text-align: left">85.23%</td>
<td style="vertical-align: top; text-align: left">98.12%</td>
<td style="vertical-align: top; text-align: left">61.93%</td>
<td style="vertical-align: top; text-align: left">98.76%</td>
<td style="vertical-align: top; text-align: left">67.43%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left"><bold>Span sentences</bold></td>
<td style="vertical-align: top; text-align: left">99.83%</td>
<td style="vertical-align: top; text-align: left">97.76%</td>
<td style="vertical-align: top; text-align: left">97.85%</td>
<td style="vertical-align: top; text-align: left">72.98%</td>
<td style="vertical-align: top; text-align: left">98.32%</td>
<td style="vertical-align: top; text-align: left"><bold>74.13</bold>%</td>
</tr>
<tr>
<td rowspan="2" style="vertical-align: middle; text-align: left; border-bottom: solid thin">Following Behind</td>
<td style="vertical-align: top; text-align: left">All sentences</td>
<td style="vertical-align: top; text-align: left">97.44%</td>
<td style="vertical-align: top; text-align: left">52.20%</td>
<td style="vertical-align: top; text-align: left">99.16%</td>
<td style="vertical-align: top; text-align: left">51.92%</td>
<td style="vertical-align: top; text-align: left">98.67%</td>
<td style="vertical-align: top; text-align: left">51.89%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin"><bold>Span sentences</bold></td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">95.73%</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">52.89%</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">96.89%</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">58.86%</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">96.93%</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin"><bold>71.32</bold>%</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Overall, the results show that for most techniques using only propaganda span sentences improves classification performance. The only exception is the <italic>emotional expression</italic> technique, for which better results are achieved when training on all sentences.</p>
<p>One possible explanation relates to the distribution of this technique in the dataset. Emotional expression accounts for a large proportion of span sentences, covering approximately 56% of all annotated spans (Fig. <xref rid="j_infor633_fig_003">3</xref>). However, this becomes more complex when considering the overlap between techniques (Fig. <xref rid="j_infor633_fig_002">2</xref>), as emotional expression frequently co-occurs with other propaganda techniques and often appears in sentences containing multiple rhetorical patterns.</p>
<p>When training only on span sentences, non-propagandistic sentences are removed, reducing the number of negative examples that help distinguish emotional expression from other techniques. As a result, the model may learn less distinctive features for emotional expression and struggle to differentiate them from similar rhetorical patterns, such as simplification.</p>
<p>For all other techniques, a clear improvement can be observed when using span sentences instead of all sentences. This suggests that span identification helps the classification model focus on propagandistic content and improves technique detection, particularly for less frequent techniques. A likely explanation is that removing non-propagandistic sentences effectively increases the proportion of sentences containing the target technique, which leads to a more balanced training distribution.</p>
<p>Interestingly, the highest performance is achieved for the techniques <italic>Reductio Ad Hitlerum</italic> (F1 = 74.13%) and <italic>Following Behind</italic> (F1 = 71.32%). This may be explained by the fact that these techniques often contain very distinctive linguistic cues. For example, the <italic>Following Behind</italic> (Bandwagon) technique is typically expressed through phrases that signal broad consensus, such as “everyone”, “the majority”, or “the whole nation”. Similarly, <italic>Reductio Ad Hitlerum</italic> frequently appears in contexts referring to Nazism, fascism and etc. Such clearly identifiable patterns allow the model to learn more discriminative features.</p>
<p>A similar observation can be made for other relatively rare techniques such as <italic>Waving the Flag</italic> (69.85%) and <italic>Appeal to Authority</italic> (67.06%). These techniques also tend to appear in recognizable rhetorical contexts. For example, <italic>Waving the Flag</italic> often relies on patriotic language, while <italic>Appeal to Authority</italic> references influential figures or institutions, which makes these patterns easier for the model to detect.</p>
<p>For the most dominant techniques in the dataset, such as <italic>Emotional Expression</italic>, <italic>Simplification</italic>, and <italic>Doubt</italic>, the performance remains above 60% F1. However, the frequent overlap between these techniques may make it more difficult for the model to distinguish their boundaries. These techniques often appear together in the same sentences or in similar rhetorical contexts, which can complicate the learning of clearly separable features.</p>
<p>The lowest performance is observed for the <italic>Uncertainty</italic> and <italic>Whataboutism/Red Herring/Strawman</italic> techniques. This may be explained by the fact that these techniques are generally more difficult to identify, even for human annotators. In the HALT-PROP corpus (Rizgelienė <italic>et al.</italic>, <xref ref-type="bibr" rid="j_infor633_ref_017">2025</xref>), these techniques were reported to have the lowest inter-annotator agreement scores.</p>
</sec>
<sec id="j_infor633_s_026">
<label>6.3</label>
<title>Final Evaluation</title>
<table-wrap id="j_infor633_tab_005">
<label>Table 5</label>
<caption>
<p>Comparison of LT-MLKM-modernBERT and GPT-5.3 performance across propaganda techniques under different sentence settings. Binary and multi-class classification results are reported.</p>
</caption>
<table>
<thead>
<tr>
<td rowspan="2" style="vertical-align: middle; text-align: left; border-top: solid thin; border-bottom: solid thin">Technique</td>
<td rowspan="2" style="vertical-align: middle; text-align: left; border-top: solid thin; border-bottom: solid thin">Model</td>
<td rowspan="2" style="vertical-align: middle; text-align: left; border-top: solid thin; border-bottom: solid thin">Setting</td>
<td colspan="2" style="vertical-align: top; text-align: left; border-top: solid thin; border-bottom: solid thin">Binary</td>
<td colspan="2" style="vertical-align: top; text-align: left; border-top: solid thin; border-bottom: solid thin">Multi-class</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">Acc.</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">F1</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">Acc.</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">F1</td>
</tr>
</thead>
<tbody>
<tr>
<td rowspan="5" style="vertical-align: middle; text-align: left">Emotional Expression</td>
<td rowspan="3" style="vertical-align: middle; text-align: left"><bold>LT-MLKM-modernBERT</bold></td>
<td style="vertical-align: top; text-align: left"><bold>All sentences</bold></td>
<td style="vertical-align: top; text-align: left">70.84%</td>
<td style="vertical-align: top; text-align: left"><bold>66.08</bold>%</td>
<td style="vertical-align: top; text-align: left">51.05%</td>
<td style="vertical-align: top; text-align: left">47.49%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left">Span sentences (gold)</td>
<td style="vertical-align: top; text-align: left">61.24%</td>
<td style="vertical-align: top; text-align: left">60.62%</td>
<td style="vertical-align: top; text-align: left">48.78%</td>
<td style="vertical-align: top; text-align: left">48.67%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left">Span sentences (predicted)</td>
<td style="vertical-align: top; text-align: left">58.81%</td>
<td style="vertical-align: top; text-align: left">57.82%</td>
<td style="vertical-align: top; text-align: left">50.79%</td>
<td style="vertical-align: top; text-align: left">48.69%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left">GPT-5.3 Zero Shot</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">62.22%</td>
<td style="vertical-align: top; text-align: left">52.64%</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">–</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left">GPT-5.3 Few Shot</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">60.08%</td>
<td style="vertical-align: top; text-align: left">52.23%</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">–</td>
</tr>
<tr>
<td rowspan="5" style="vertical-align: middle; text-align: left">Simplification</td>
<td rowspan="3" style="vertical-align: middle; text-align: left"><bold>LT-MLKM-modernBERT</bold></td>
<td style="vertical-align: top; text-align: left">All sentences</td>
<td style="vertical-align: top; text-align: left">78.23%</td>
<td style="vertical-align: top; text-align: left">60.25%</td>
<td style="vertical-align: top; text-align: left">55.15%</td>
<td style="vertical-align: top; text-align: left">47.30%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left"><bold>Span sentences (gold)</bold></td>
<td style="vertical-align: top; text-align: left">70.22%</td>
<td style="vertical-align: top; text-align: left"><bold>61.70</bold>%</td>
<td style="vertical-align: top; text-align: left">52.43%</td>
<td style="vertical-align: top; text-align: left">50.21%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left"><bold>Span sentences (predicted)</bold></td>
<td style="vertical-align: top; text-align: left">69.67%</td>
<td style="vertical-align: top; text-align: left"><bold>61.21</bold>%</td>
<td style="vertical-align: top; text-align: left">54.27%</td>
<td style="vertical-align: top; text-align: left">47.31%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left">GPT-5.3 Zero Shot</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">51.98%</td>
<td style="vertical-align: top; text-align: left">43.46%</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">–</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left">GPT-5.3 Few Shot</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">84.89%</td>
<td style="vertical-align: top; text-align: left">46.12%</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">–</td>
</tr>
<tr>
<td rowspan="5" style="vertical-align: middle; text-align: left">Doubt</td>
<td rowspan="3" style="vertical-align: middle; text-align: left"><bold>LT-MLKM-modernBERT</bold></td>
<td style="vertical-align: top; text-align: left">All sentences</td>
<td style="vertical-align: top; text-align: left">76.18%</td>
<td style="vertical-align: top; text-align: left">59.90%</td>
<td style="vertical-align: top; text-align: left">50.28%</td>
<td style="vertical-align: top; text-align: left">40.61%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left"><bold>Span sentences (gold)</bold></td>
<td style="vertical-align: top; text-align: left">74.22%</td>
<td style="vertical-align: top; text-align: left"><bold>64.97</bold>%</td>
<td style="vertical-align: top; text-align: left">52.89%</td>
<td style="vertical-align: top; text-align: left">47.02%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left"><bold>Span sentences (predicted)</bold></td>
<td style="vertical-align: top; text-align: left">73.57%</td>
<td style="vertical-align: top; text-align: left"><bold>60.59</bold>%</td>
<td style="vertical-align: top; text-align: left">48.77%</td>
<td style="vertical-align: top; text-align: left">40.21%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left">GPT-5.3 Zero Shot</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">74.53%</td>
<td style="vertical-align: top; text-align: left">52.75%</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">–</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left">GPT-5.3 Few Shot</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">86.56%</td>
<td style="vertical-align: top; text-align: left">52.45%</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">–</td>
</tr>
<tr>
<td rowspan="5" style="vertical-align: middle; text-align: left">Whataboutism/Red Herring/Strawman</td>
<td rowspan="3" style="vertical-align: middle; text-align: left"><bold>LT-MLKM-modernBERT</bold></td>
<td style="vertical-align: top; text-align: left">All sentences</td>
<td style="vertical-align: top; text-align: left">87.75%</td>
<td style="vertical-align: top; text-align: left">49.36%</td>
<td style="vertical-align: top; text-align: left">54.31%</td>
<td style="vertical-align: top; text-align: left">38.18%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left"><bold>Span sentences (gold)</bold></td>
<td style="vertical-align: top; text-align: left">90.21%</td>
<td style="vertical-align: top; text-align: left"><bold>51.74</bold>%</td>
<td style="vertical-align: top; text-align: left">51.11%</td>
<td style="vertical-align: top; text-align: left">38.79%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left"><bold>Span sentences (predicted)</bold></td>
<td style="vertical-align: top; text-align: left">93.60%</td>
<td style="vertical-align: top; text-align: left"><bold>51.64</bold>%</td>
<td style="vertical-align: top; text-align: left">51.63%</td>
<td style="vertical-align: top; text-align: left">37.11%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left">GPT-5.3 Zero Shot</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">71.18%</td>
<td style="vertical-align: top; text-align: left">45.16%</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">–</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left">GPT-5.3 Few Shot</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">89.60%</td>
<td style="vertical-align: top; text-align: left">51.10%</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">–</td>
</tr>
<tr>
<td rowspan="5" style="vertical-align: middle; text-align: left">Appeal to Authority</td>
<td rowspan="3" style="vertical-align: middle; text-align: left"><bold>LT-MLKM-modernBERT</bold></td>
<td style="vertical-align: top; text-align: left">All sentences</td>
<td style="vertical-align: top; text-align: left">77.08%</td>
<td style="vertical-align: top; text-align: left">48.85%</td>
<td style="vertical-align: top; text-align: left">46.71%</td>
<td style="vertical-align: top; text-align: left">35.01%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left"><bold>Span sentences (gold)</bold></td>
<td style="vertical-align: top; text-align: left">91.54%</td>
<td style="vertical-align: top; text-align: left"><bold>67.06</bold>%</td>
<td style="vertical-align: top; text-align: left">56.73%</td>
<td style="vertical-align: top; text-align: left">43.41%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left"><bold>Span sentences (predicted)</bold></td>
<td style="vertical-align: top; text-align: left">92.19%</td>
<td style="vertical-align: top; text-align: left"><bold>54.97</bold>%</td>
<td style="vertical-align: top; text-align: left">52.59%</td>
<td style="vertical-align: top; text-align: left">38.38%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left">GPT-5.3 Zero Shot</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">79.40%</td>
<td style="vertical-align: top; text-align: left">50.36%</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">–</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left">GPT-5.3 Few Shot</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">94.45%</td>
<td style="vertical-align: top; text-align: left">52.67%</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">–</td>
</tr>
<tr>
<td rowspan="5" style="vertical-align: middle; text-align: left">Waving the Flag</td>
<td rowspan="3" style="vertical-align: middle; text-align: left"><bold>LT-MLKM-modernBERT</bold></td>
<td style="vertical-align: top; text-align: left">All sentences</td>
<td style="vertical-align: top; text-align: left">96.31%</td>
<td style="vertical-align: top; text-align: left">68.85%</td>
<td style="vertical-align: top; text-align: left">47.18%</td>
<td style="vertical-align: top; text-align: left">33.71%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left"><bold>Span sentences (gold)</bold></td>
<td style="vertical-align: top; text-align: left">94.67%</td>
<td style="vertical-align: top; text-align: left"><bold>69.85</bold>%</td>
<td style="vertical-align: top; text-align: left">57.53%</td>
<td style="vertical-align: top; text-align: left">42.66%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left"><bold>Span sentences (predicted)</bold></td>
<td style="vertical-align: top; text-align: left">96.61%</td>
<td style="vertical-align: top; text-align: left"><bold>71.35</bold>%</td>
<td style="vertical-align: top; text-align: left">60.15%</td>
<td style="vertical-align: top; text-align: left">41.59%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left">GPT-5.3 Zero Shot</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">97.05%</td>
<td style="vertical-align: top; text-align: left">52.22%</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">–</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left">GPT-5.3 Few Shot</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">97.52%</td>
<td style="vertical-align: top; text-align: left">53.92%</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">–</td>
</tr>
<tr>
<td rowspan="5" style="vertical-align: middle; text-align: left">Uncertainty</td>
<td rowspan="3" style="vertical-align: middle; text-align: left"><bold>LT-MLKM-modernBERT</bold></td>
<td style="vertical-align: top; text-align: left">All sentences</td>
<td style="vertical-align: top; text-align: left">97.33%</td>
<td style="vertical-align: top; text-align: left">50.46%</td>
<td style="vertical-align: top; text-align: left">53.81%</td>
<td style="vertical-align: top; text-align: left">37.39%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left"><bold>Span sentences (gold)</bold></td>
<td style="vertical-align: top; text-align: left">91.02%</td>
<td style="vertical-align: top; text-align: left"><bold>57.47</bold>%</td>
<td style="vertical-align: top; text-align: left">54.87%</td>
<td style="vertical-align: top; text-align: left">39.26%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left">Span sentences (predicted)</td>
<td style="vertical-align: top; text-align: left">90.21%</td>
<td style="vertical-align: top; text-align: left">51.64%</td>
<td style="vertical-align: top; text-align: left">55.07%</td>
<td style="vertical-align: top; text-align: left">37.81%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left">GPT-5.3 Zero Shot</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">95.81%</td>
<td style="vertical-align: top; text-align: left">51.73%</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">–</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left">GPT-5.3 Few Shot</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">95.07%</td>
<td style="vertical-align: top; text-align: left">52.78%</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">–</td>
</tr>
<tr>
<td rowspan="5" style="vertical-align: middle; text-align: left">Reductio Ad Hitlerum</td>
<td rowspan="3" style="vertical-align: middle; text-align: left">LT-MLKM-modernBERT</td>
<td style="vertical-align: top; text-align: left">All sentences</td>
<td style="vertical-align: top; text-align: left">98.76%</td>
<td style="vertical-align: top; text-align: left">67.43%</td>
<td style="vertical-align: top; text-align: left">49.47%</td>
<td style="vertical-align: top; text-align: left">34.26%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left">Span sentences (gold)</td>
<td style="vertical-align: top; text-align: left">98.32%</td>
<td style="vertical-align: top; text-align: left">74.13%</td>
<td style="vertical-align: top; text-align: left">52.37%</td>
<td style="vertical-align: top; text-align: left">36.45%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left">Span sentences (predicted)</td>
<td style="vertical-align: top; text-align: left">98.64%</td>
<td style="vertical-align: top; text-align: left">74.66%</td>
<td style="vertical-align: top; text-align: left">56.93%</td>
<td style="vertical-align: top; text-align: left">37.65%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left"><bold>GPT-5.3 Zero Shot</bold></td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">98.86%</td>
<td style="vertical-align: top; text-align: left"><bold>77.42</bold>%</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">–</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left">GPT-5.3 Few Shot</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">98.44%</td>
<td style="vertical-align: top; text-align: left">66.71%</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">–</td>
</tr>
<tr>
<td rowspan="4" style="vertical-align: middle; text-align: left">Following Behind</td>
<td rowspan="3" style="vertical-align: middle; text-align: left"><bold>LT-MLKM-modernBERT</bold></td>
<td style="vertical-align: top; text-align: left">All sentences</td>
<td style="vertical-align: top; text-align: left">98.67%</td>
<td style="vertical-align: top; text-align: left">51.89%</td>
<td style="vertical-align: top; text-align: left">66.35%</td>
<td style="vertical-align: top; text-align: left">41.17%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left"><bold>Span sentences (gold)</bold></td>
<td style="vertical-align: top; text-align: left">96.93%</td>
<td style="vertical-align: top; text-align: left"><bold>71.32</bold>%</td>
<td style="vertical-align: top; text-align: left">50.00%</td>
<td style="vertical-align: top; text-align: left">35.92%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left"><bold>Span sentences (predicted)</bold></td>
<td style="vertical-align: top; text-align: left">97.14%</td>
<td style="vertical-align: top; text-align: left"><bold>65.34</bold>%</td>
<td style="vertical-align: top; text-align: left">49.80%</td>
<td style="vertical-align: top; text-align: left">34.76%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left">GPT-5.3 Zero Shot</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">98.07%</td>
<td style="vertical-align: top; text-align: left">59.77%</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">–</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left"/>
<td style="vertical-align: top; text-align: left">GPT-5.3 Few Shot</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">94.78%</td>
<td style="vertical-align: top; text-align: left">49.50%</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">–</td>
</tr>
<tr>
<td rowspan="4" style="vertical-align: middle; text-align: left">Average across techniques</td>
<td rowspan="3" style="vertical-align: middle; text-align: left"><bold>LT-MLKM-modernBERT</bold></td>
<td style="vertical-align: top; text-align: left">All sentences</td>
<td style="vertical-align: top; text-align: left">86.79%</td>
<td style="vertical-align: top; text-align: left">58.12%</td>
<td style="vertical-align: top; text-align: left">52.70%</td>
<td style="vertical-align: top; text-align: left">39.46%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left"><bold>Span sentences (gold)</bold></td>
<td style="vertical-align: top; text-align: left">85.37%</td>
<td style="vertical-align: top; text-align: left"><bold>64.32%</bold></td>
<td style="vertical-align: top; text-align: left">52.97%</td>
<td style="vertical-align: top; text-align: left">42.49%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left"><bold>Span sentences (predicted)</bold></td>
<td style="vertical-align: top; text-align: left">85.60%</td>
<td style="vertical-align: top; text-align: left"><bold>61.02</bold>%</td>
<td style="vertical-align: top; text-align: left">53.33%</td>
<td style="vertical-align: top; text-align: left">40.39%</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left">GPT-5.3 Zero Shot</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">81.01%</td>
<td style="vertical-align: top; text-align: left">53.95%</td>
<td style="vertical-align: top; text-align: left">–</td>
<td style="vertical-align: top; text-align: left">–</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin"/>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">GPT-5.3 Few Shot</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">–</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">89.04%</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">53.05%</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">–</td>
<td style="vertical-align: top; text-align: left; border-bottom: solid thin">–</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Table <xref rid="j_infor633_tab_005">5</xref> presents the overall results for propaganda technique detection. The table reports the performance of the LT-MLKM-modernBERT model fine-tuned under different experimental settings: using all sentences and using only sentences containing propaganda spans. <italic>Gold spans</italic> refer to results obtained using the span annotations from the corpus, while <italic>predicted spans</italic> refer to spans generated by the best-performing span identification model (see Table <xref rid="j_infor633_tab_002">2</xref>). In addition to the per-technique results, we report the average values of the evaluation metrics across all techniques.</p>
<p>For comparison, we also include results for a <italic>multi-class multi-label</italic> setup, where instead of fine-tuning LT-MLKM-modernBERT separately for each propaganda technique, the model was fine-tuned jointly for all techniques. In this setting, we again experimented with both all sentences and span-only sentences, and evaluated performance using both gold spans and predicted spans. We also report the results of the GPT-5.3 model obtained through prompting in both zero-shot and few-shot settings (Section <xref rid="j_infor633_s_022">5.3.1</xref>).</p>
<p>Overall, the results indicate that the selected <italic>binary per-technique training approach</italic>, in which a separate classifier is fine-tuned for each propaganda technique, achieves substantially better detection performance than the multi-class multi-label model across all techniques. This outcome can be explained by the high degree of overlap between propaganda techniques in the corpus. In a joint multi-label setting, the model must learn several overlapping technique patterns simultaneously, which may make it more difficult to capture features that are specific to individual techniques. In contrast, training separate binary classifiers allows each model to focus on detecting one technique at a time.</p>
<p>When comparing the fine-tuned models trained using span sentences and those trained using all sentences, span-based training performs better for all techniques except <italic>emotional expression</italic>. This observation is consistent with earlier findings discussed in Section <xref rid="j_infor633_s_025">6.2</xref>. Furthermore, when predicted spans were used instead of gold spans, the results remained comparable and still outperformed the all-sentences setting. These findings suggest that span identification indeed improves propaganda technique detection.</p>
<p>However, high performance degradation is observed for some techniques when predicted spans are used instead of gold spans. For example, the macro-F1 score for <italic>appeal to authority</italic> decreases from 67.06% to 54.97%, for <italic>uncertainty</italic> from 57.47% to 51.64%, and for <italic>following behind</italic> from 71.32% to 65.34%. This can be explained by the strong class imbalance present in these techniques, each representing less than 4% of all span sentences. If the predicted span model fails to identify even a small number of sentences that contain these techniques, their proportion in the dataset decreases further, significantly affecting overall performance.</p>
<p>Comparing the fine-tuned transformer models with GPT results, the fine-tuned model outperforms GPT-5.3 for all techniques except <italic>reductio ad hitlerum</italic>. This exception can be explained by the fact that this technique has a highly distinctive contextual pattern and is generally easier to identify, as it involves references to Nazism or fascism. Consequently, providing GPT with a clear definition of the technique appears sufficient for accurate detection. In the zero-shot setting, GPT achieved a macro-F1 score of 77.42%, outperforming the fine-tuned transformer model. Interestingly, this high performance was achieved only in the zero-shot setting, while the few-shot configuration produced the lowest results for this technique. This may suggest that the additional examples unintentionally biased the model.</p>
<p>Overall, the best-performing techniques were <italic>waving the flag</italic>, <italic>reductio ad hitlerum</italic>, and <italic>following behind</italic>. For <italic>waving the flag</italic>, the span-based model achieved 69.85% (gold spans) and 71.35% (predicted spans). For <italic>reductio ad hitlerum</italic>, GPT-5.3 zero-shot achieved 77.42%, while the span-based models achieved 74.13% (gold) and 74.66% (predicted). For <italic>following behind</italic>, the span-based model achieved 71.32% (gold) and 65.34% (predicted). As previously noted, these techniques have clear contextual patterns and are among the easiest to recognize. They also achieved the highest annotation agreement rates in the HALT-PROP corpus (Rizgelienė <italic>et al.</italic>, <xref ref-type="bibr" rid="j_infor633_ref_017">2025</xref>), indicating that they are relatively easy for human annotators to identify as well.</p>
<p>The averaged results across all techniques show the same general trend: the binary per-technique approach outperforms the multi-class multi-label model. In addition, the transformer model fine-tuned on span-only sentences outperforms the model fine-tuned on all sentences, both when using gold spans and predicted spans. The span-based transformer models also outperform GPT-5.3 in both zero-shot and few-shot settings.</p>
</sec>
</sec>
<sec id="j_infor633_s_027">
<label>7</label>
<title>Conclusion</title>
<p>In this study, we present the first approach for propaganda technique detection in Lithuanian. In addition, we investigate several research questions that are not only specific to Lithuanian but are also applicable to other languages, particularly low-resource settings. These questions are designed to inform the development of propaganda techniques detection models in similar settings, focusing on: (i) whether span boundary information and longer input sequences improve span identification, (ii) whether incorporating span-level information improves technique classification, and (iii) whether modelling techniques as separate binary classifiers outperform a multi-class approach in scenarios with high overlap between techniques.</p>
<p>Using the monolingual Lithuanian transformer <italic>LT-MLKM-modernBERT</italic>, we experiment with different input lengths, tagging schemes, and classification setups (binary vs. multi-class, with and without span-level information). Overall, the results show that: 
<list>
<list-item id="j_infor633_li_028">
<label>•</label>
<p>Incorporating span boundary information improves span identification performance. The BILOU tagging scheme consistently outperforms binary tagging (71.95% vs 64.55%), producing more accurate and consistent spans and capturing a larger proportion of gold span characters.</p>
</list-item>
<list-item id="j_infor633_li_029">
<label>•</label>
<p>Incorporating span-level information improves techniques classification for most techniques, particularly those with class imbalance. Training only on sentences within propaganda spans yields better performance, and this effect persists even when using predicted spans instead of gold annotations.</p>
</list-item>
<list-item id="j_infor633_li_030">
<label>•</label>
<p>In scenarios with high overlap between techniques, modelling each technique as a separate binary classifier outperforms the multi-class approach, highlighting the benefit of learning technique-specific features.</p>
</list-item>
<list-item id="j_infor633_li_031">
<label>•</label>
<p>Fine-tuned transformer models outperform GPT-5.3 in both zero-shot and few-shot settings for most techniques. The only exception is reductio ad Hitlerum, where GPT-5.3 achieves the best zero-shot performance. This result is likely driven by the distinctive context and clear definition of this technique, which also makes it one of the easiest for humans to recognize. Additionally, the results indicate that fine-tuned models also perform better on techniques with clearly separable contexts, despite class imbalance. In contrast, techniques that are more frequent in the corpus but lack a clearly distinctive context tend to achieve lower performance, even when the training data is relatively balanced.</p>
</list-item>
</list>
</p>
<sec id="j_infor633_s_028">
<label>7.1</label>
<title>Proposed Approach</title>
<fig id="j_infor633_fig_009">
<label>Fig. 9</label>
<caption>
<p>Illustration of the proposed two-stage approach: full-text input, propaganda span identification, and technique classification of sentences containing the identified spans. English translations are provided alongside the original Lithuanian examples.</p>
</caption>
<graphic xlink:href="infor633_g009.jpg"/>
</fig>
<p>Since the results show that span identification improves propaganda technique classification, we propose a two-stage approach for propaganda technique detection in Lithuanian. Figure <xref rid="j_infor633_fig_009">9</xref> illustrates the proposed method. First, the full text is given as input to the span identification model, which detects propaganda spans. Then, only the identified spans are further analysed, and the corresponding sentences are classified according to propaganda technique. In this example, the pink colour indicates the <italic>emotional expression</italic> technique, while the blue colour indicates the <italic>appeal to authority</italic> technique.</p>
</sec>
</sec>
<sec id="j_infor633_s_029">
<label>8</label>
<title>Limitations and Future Work</title>
<p>Although this study presents the first approach for propaganda technique detection in Lithuanian, several limitations remain. These limitations also point to important directions for future work.</p>
<list>
<list-item id="j_infor633_li_032">
<label>•</label>
<p><italic><bold>Model comparison under fixed hyperparameters.</bold></italic> This work mainly focused on LT-MLKM-modernBERT, one of the latest monolingual Lithuanian transformer models. However, for the span identification task, we also compared its performance with two multilingual transformer models: XLM-RoBERTa and LitLatBERT. Although all three models are transformer encoders, they differ in architecture. XLM-RoBERTa and LitLatBERT are RoBERTa-based models, whereas LT-MLKM-modernBERT is based on the ModernBERT architecture. To ensure comparability, the same hyperparameters were used across all models and tasks. We also did not investigate layer-freezing strategies; instead, all model layers were fine-tuned in each experiment. Future work should include model-specific hyperparameter optimization and different fine-tuning strategies, such as freezing selected layers, to provide a more comprehensive comparison of transformer models for Lithuanian propaganda detection, since different hyperparameter choices may affect classification behaviour (Perišić <italic>et al.</italic>, <xref ref-type="bibr" rid="j_infor633_ref_013">2025</xref>).</p>
</list-item>
<list-item id="j_infor633_li_033">
<label>•</label>
<p><italic><bold>Single-run evaluation.</bold></italic> Another limitation of this study is that each experimental configuration was evaluated using a single fine-tuning run. Since transformer fine-tuning can be affected by stochastic factors, such as random initialization of classification heads, performance may vary across runs with different random seeds. Future work should evaluate each configuration over multiple runs and report averaged results together with standard deviations or confidence intervals.</p>
</list-item>
<list-item id="j_infor633_li_034">
<label>•</label>
<p><italic><bold>Technique overlap.</bold></italic> The technique overlap analysis conducted in this study shows that techniques frequently co-occur, suggesting that some of them may share similar linguistic patterns. In the current modelling approach, techniques are treated as independent labels, which may overlook important relationships between them. When several techniques occur within the same sentence or span, modelling them as fully independent categories may limit the model’s ability to capture these dependencies. A more fine-grained linguistic analysis could examine whether different co-occurrence patterns are associated with different ways in which techniques are expressed in text. For example, <italic>emotional expression</italic> may be expressed differently when it co-occurs with <italic>simplification</italic> than when it co-occurs with <italic>doubt</italic>. Future research could therefore explore modelling approaches that explicitly capture dependencies between techniques. This may improve the detection of techniques that frequently overlap.</p>
</list-item>
<list-item id="j_infor633_li_035">
<label>•</label>
<p><italic><bold>LLM-based approaches.</bold></italic> Another limitation of this study is that the LLM-based comparison is limited to one model. Future research could extend this analysis by evaluating multiple LLMs, including multilingual and open-weight models. In addition, fine-tuning such models on the annotated dataset could be explored, as they may better capture broader contextual information relevant to propaganda technique detection.</p>
</list-item>
</list>
</sec>
</body>
<back>
<app-group>
<app id="j_infor633_app_001"><label>A</label>
<title>English Translation of the Example in Fig. <xref rid="j_infor633_fig_004">4</xref></title><disp-quote>
<p>“The United Kingdom, Germany, Italy, Canada, the United States, France, and Japan recognized the success of the special military operation of the Russian armed forces,” he emphasized.</p>
<p>According to the former Prime Minister of the State Duma, <bold>the statement adopted by the leaders of the G7 countries about the intention “not to allow Russia to win in Ukraine” can only be assessed in this way.</bold> Vyacheslav Volodin added that <bold>in this way, the G7 countries oppose the demilitarization and denazification of Ukraine, the status of a non-aligned and non-nuclear Ukraine, as well as the right to self-determination of the citizens of the Donetsk and Luhansk People’s Republics.</bold></p>
<p>“Now it is clear why there was no response to our President’s proposed initiatives on security issues. Russia thwarted their plans to expand NATO further east,” he concluded.</p>
<p><bold>“Everyone understands that Ukraine must be liberated from Nazism and Banderites. Everyone understands that the decision of the President of the Russian Federation to begin the operation for the demilitarization and denazification of Ukraine is historic. However, not everyone understands that if we stop and the Nazis remain in power, the situation will become even worse.”</bold></p>
<p>I would like to remind you that today we are not ending the Great Patriotic War. <bold>At that time, we failed to stop the Banderites;</bold> during the ten post-war years, attempts were made to eliminate them; 57 thousand Soviet soldiers, military personnel, state security officers, party and Komsomol Soviet activists were killed,” said Gennady Zyuganov, the head of the Communist Party faction, at the plenary session.</p></disp-quote></app>
<app id="j_infor633_app_002"><label>B</label>
<title>English Translation of the Example in Fig. <xref rid="j_infor633_fig_006">6</xref></title><disp-quote>
<p><bold>While Ukrainians heroically defend their homeland and us, while the entire Lithuanian nation is fighting in the information war, leftist traitors strike Lithuania in the back with the Partnership Law that destroys the core of the nation – the family.</bold></p>
<p>As if that were not enough, <bold>they equate defenders of the natural family with the Kremlin regime and personally with Vladimir Putin.</bold></p>
<p><bold>Apparently, he opposes genderism, so everyone who opposes genderism sits at the same table as the Moscow aggressor.</bold></p>
<p><bold>Reality is completely the opposite.</bold></p>
<p>There is no partnership law in Ukraine.</p>
<p>Nor is there such a law in the countries of our strategic partners – Poland, Latvia, and Georgia.</p>
<p><bold>Ukrainians are fighting not for genderism or globalism, but for their national state, their families, and their freedom.</bold></p></disp-quote></app></app-group>
<ref-list id="j_infor633_reflist_001">
<title>References</title>
<ref id="j_infor633_ref_001">
<mixed-citation publication-type="chapter"><string-name><surname>Alam</surname>, <given-names>F.</given-names></string-name>, <string-name><surname>Mubarak</surname>, <given-names>H.</given-names></string-name>, <string-name><surname>Zaghouani</surname>, <given-names>W.</given-names></string-name>, <string-name><surname>Da San Martino</surname>, <given-names>G.</given-names></string-name>, <string-name><surname>Nakov</surname>, <given-names>P.</given-names></string-name> (<year>2022</year>). <chapter-title>Overview of the WANLP 2022 shared task on propaganda detection in arabic</chapter-title>. In: <string-name><surname>Bouamor</surname>, <given-names>H.</given-names></string-name>, <string-name><surname>Al-Khalifa</surname>, <given-names>H.</given-names></string-name>, <string-name><surname>Darwish</surname>, <given-names>K.</given-names></string-name>, <string-name><surname>Rambow</surname>, <given-names>O.</given-names></string-name>, <string-name><surname>Bougares</surname>, <given-names>F.</given-names></string-name>, <string-name><surname>Abdelali</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Tomeh</surname>, <given-names>N.</given-names></string-name>, <string-name><surname>Khalifa</surname>, <given-names>S.</given-names></string-name>, <string-name><surname>Zaghouani</surname>, <given-names>W.</given-names></string-name> (Eds.), <source>Proceedings of the Seventh Arabic Natural Language Processing Workshop (WANLP)</source>. <publisher-name>Association for Computational Linguistics</publisher-name>, <publisher-loc>Abu Dhabi, United Arab Emirates</publisher-loc>, pp. <fpage>108</fpage>–<lpage>118</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.18653/v1/2022.wanlp-1.11" xlink:type="simple">https://doi.org/10.18653/v1/2022.wanlp-1.11</ext-link>.</mixed-citation>
</ref>
<ref id="j_infor633_ref_002">
<mixed-citation publication-type="journal"><string-name><surname>Barrón-Cedeño</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Jaradat</surname>, <given-names>I.</given-names></string-name>, <string-name><surname>Da San Martino</surname>, <given-names>G.</given-names></string-name>, <string-name><surname>Nakov</surname>, <given-names>P.</given-names></string-name> (<year>2019</year>). <article-title>Proppy: organizing the news based on their propagandistic content</article-title>. <source>Information Processing &amp; Management</source>, <volume>56</volume>(<issue>5</issue>), <fpage>1849</fpage>–<lpage>1864</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1016/j.ipm.2019.03.005" xlink:type="simple">https://doi.org/10.1016/j.ipm.2019.03.005</ext-link>. <uri>https://www.sciencedirect.com/science/article/pii/S0306457318306058</uri>.</mixed-citation>
</ref>
<ref id="j_infor633_ref_003">
<mixed-citation publication-type="chapter"><string-name><surname>Conneau</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Khandelwal</surname>, <given-names>K.</given-names></string-name>, <string-name><surname>Goyal</surname>, <given-names>N.</given-names></string-name>, <string-name><surname>Chaudhary</surname>, <given-names>V.</given-names></string-name>, <string-name><surname>Wenzek</surname>, <given-names>G.</given-names></string-name>, <string-name><surname>Guzmán</surname>, <given-names>F.</given-names></string-name>, <string-name><surname>Grave</surname>, <given-names>E.</given-names></string-name>, <string-name><surname>Ott</surname>, <given-names>M.</given-names></string-name>, <string-name><surname>Zettlemoyer</surname>, <given-names>L.</given-names></string-name>, <string-name><surname>Stoyanov</surname>, <given-names>V.</given-names></string-name> (<year>2020</year>). <chapter-title>Unsupervised cross-lingual representation learning at scale</chapter-title>. In: <string-name><surname>Jurafsky</surname>, <given-names>D.</given-names></string-name>, <string-name><surname>Chai</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Schluter</surname>, <given-names>N.</given-names></string-name>, <string-name><surname>Tetreault</surname>, <given-names>J.</given-names></string-name> (Eds.), <source>Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics</source>. <publisher-name>Association for Computational Linguistics</publisher-name>, pp. <fpage>8440</fpage>–<lpage>8451</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.18653/v1/2020.acl-main.747" xlink:type="simple">https://doi.org/10.18653/v1/2020.acl-main.747</ext-link>.</mixed-citation>
</ref>
<ref id="j_infor633_ref_004">
<mixed-citation publication-type="chapter"><string-name><surname>Da San Martino</surname>, <given-names>G.</given-names></string-name>, <string-name><surname>Yu</surname>, <given-names>S.</given-names></string-name>, <string-name><surname>Barrón-Cedeño</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Petrov</surname>, <given-names>R.</given-names></string-name>, <string-name><surname>Nakov</surname>, <given-names>P.</given-names></string-name> (<year>2019</year>). <chapter-title>Fine-grained analysis of propaganda in news articles</chapter-title>. In: <string-name><surname>Inui</surname>, <given-names>K.</given-names></string-name>, <string-name><surname>Jiang</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Ng</surname>, <given-names>V.</given-names></string-name>, <string-name><surname>Wan</surname>, <given-names>X.</given-names></string-name> (Eds.), <source>Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)</source>. <publisher-name>Association for Computational Linguistics</publisher-name>, <publisher-loc>Hong Kong, China</publisher-loc>, pp. <fpage>5636</fpage>–<lpage>5646</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.18653/v1/D19-1565" xlink:type="simple">https://doi.org/10.18653/v1/D19-1565</ext-link>.</mixed-citation>
</ref>
<ref id="j_infor633_ref_005">
<mixed-citation publication-type="chapter"><string-name><surname>Da San Martino</surname>, <given-names>G.</given-names></string-name>, <string-name><surname>Barrón-Cedeño</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Wachsmuth</surname>, <given-names>H.</given-names></string-name>, <string-name><surname>Petrov</surname>, <given-names>R.</given-names></string-name>, <string-name><surname>Nakov</surname>, <given-names>P.</given-names></string-name> (<year>2020</year>). <chapter-title>SemEval-2020 Task 11: detection of propaganda techniques in news articles</chapter-title>. In: <string-name><surname>Herbelot</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Zhu</surname>, <given-names>X.</given-names></string-name>, <string-name><surname>Palmer</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Schneider</surname>, <given-names>N.</given-names></string-name>, <string-name><surname>May</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Shutova</surname>, <given-names>E.</given-names></string-name> (Eds.), <source>Proceedings of the Fourteenth Workshop on Semantic Evaluation</source>. <publisher-name>International Committee for Computational Linguistics</publisher-name>, <publisher-loc>Barcelona</publisher-loc>, pp. <fpage>1377</fpage>–<lpage>1414</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.18653/v1/2020.semeval-1.186" xlink:type="simple">https://doi.org/10.18653/v1/2020.semeval-1.186</ext-link>.</mixed-citation>
</ref>
<ref id="j_infor633_ref_006">
<mixed-citation publication-type="chapter"><string-name><surname>Dimitrov</surname>, <given-names>D.</given-names></string-name>, <string-name><surname>Bin Ali</surname>, <given-names>B.</given-names></string-name>, <string-name><surname>Shaar</surname>, <given-names>S.</given-names></string-name>, <string-name><surname>Alam</surname>, <given-names>F.</given-names></string-name>, <string-name><surname>Silvestri</surname>, <given-names>F.</given-names></string-name>, <string-name><surname>Firooz</surname>, <given-names>H.</given-names></string-name>, <string-name><surname>Nakov</surname>, <given-names>P.</given-names></string-name>, <string-name><surname>Da San Martino</surname>, <given-names>G.</given-names></string-name> (<year>2021</year>). <chapter-title>SemEval-2021 Task 6: detection of persuasion techniques in texts and images</chapter-title>. In: <string-name><surname>Palmer</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Schneider</surname>, <given-names>N.</given-names></string-name>, <string-name><surname>Schluter</surname>, <given-names>N.</given-names></string-name>, <string-name><surname>Emerson</surname>, <given-names>G.</given-names></string-name>, <string-name><surname>Herbelot</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Zhu</surname>, <given-names>X.</given-names></string-name> (Eds.), <source>Proceedings of the 15th International Workshop on Semantic Evaluation (SemEval-2021)</source>. <publisher-name>Association for Computational Linguistics</publisher-name>, pp. <fpage>70</fpage>–<lpage>98</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.18653/v1/2021.semeval-1.7" xlink:type="simple">https://doi.org/10.18653/v1/2021.semeval-1.7</ext-link>.</mixed-citation>
</ref>
<ref id="j_infor633_ref_007">
<mixed-citation publication-type="chapter"><string-name><surname>Dimitrov</surname>, <given-names>D.</given-names></string-name>, <string-name><surname>Alam</surname>, <given-names>F.</given-names></string-name>, <string-name><surname>Hasanain</surname>, <given-names>M.</given-names></string-name>, <string-name><surname>Hasnat</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Silvestri</surname>, <given-names>F.</given-names></string-name>, <string-name><surname>Nakov</surname>, <given-names>P.</given-names></string-name>, <string-name><surname>Da San Martino</surname>, <given-names>G.</given-names></string-name> (<year>2024</year>). <chapter-title>SemEval-2024 Task 4: multilingual detection of persuasion techniques in memes</chapter-title>. In: <string-name><surname>Ojha</surname>, <given-names>A.K.</given-names></string-name>, <string-name><surname>Doğruöz</surname>, <given-names>A.S.</given-names></string-name>, <string-name><surname>Tayyar Madabushi</surname>, <given-names>H.</given-names></string-name>, <string-name><surname>Da San Martino</surname>, <given-names>G.</given-names></string-name>, <string-name><surname>Rosenthal</surname>, <given-names>S.</given-names></string-name>, <string-name><surname>Rosá</surname>, <given-names>A.</given-names></string-name> (Eds.), <source>Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024)</source>. <publisher-name>Association for Computational Linguistics</publisher-name>, <publisher-loc>Mexico City, Mexico</publisher-loc>, pp. <fpage>2009</fpage>–<lpage>2026</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.18653/v1/2024.semeval-1.275" xlink:type="simple">https://doi.org/10.18653/v1/2024.semeval-1.275</ext-link>.</mixed-citation>
</ref>
<ref id="j_infor633_ref_008">
<mixed-citation publication-type="chapter"><string-name><surname>Hasanain</surname>, <given-names>M.</given-names></string-name>, <string-name><surname>Hasan</surname>, <given-names>M.A.</given-names></string-name>, <string-name><surname>Ahmad</surname>, <given-names>F.</given-names></string-name>, <string-name><surname>Suwaileh</surname>, <given-names>R.</given-names></string-name>, <string-name><surname>Biswas</surname>, <given-names>M.R.</given-names></string-name>, <string-name><surname>Zaghouani</surname>, <given-names>W.</given-names></string-name>, <string-name><surname>Alam</surname>, <given-names>F.</given-names></string-name> (<year>2024</year>). <chapter-title>ArAIEval shared task: propagandistic techniques detection in unimodal and multimodal arabic content</chapter-title>. In: <string-name><surname>Habash</surname>, <given-names>N.</given-names></string-name>, <string-name><surname>Bouamor</surname>, <given-names>H.</given-names></string-name>, <string-name><surname>Eskander</surname>, <given-names>R.</given-names></string-name>, <string-name><surname>Tomeh</surname>, <given-names>N.</given-names></string-name>, <string-name><surname>Abu Farha</surname>, <given-names>I.</given-names></string-name>, <string-name><surname>Abdelali</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Touileb</surname>, <given-names>S.</given-names></string-name>, <string-name><surname>Hamed</surname>, <given-names>I.</given-names></string-name>, <string-name><surname>Onaizan</surname>, <given-names>Y.</given-names></string-name>, <string-name><surname>Alhafni</surname>, <given-names>B.</given-names></string-name>, <string-name><surname>Antoun</surname>, <given-names>W.</given-names></string-name>, <string-name><surname>Khalifa</surname>, <given-names>S.</given-names></string-name>, <string-name><surname>Haddad</surname>, <given-names>H.</given-names></string-name>, <string-name><surname>Zitouni</surname>, <given-names>I.</given-names></string-name>, <string-name><surname>AlKhamissi</surname>, <given-names>B.</given-names></string-name>, <string-name><surname>Almatham</surname>, <given-names>R.</given-names></string-name>, <string-name><surname>Mrini</surname>, <given-names>K.</given-names></string-name> (Eds.), <source>Proceedings of the Second Arabic Natural Language Processing Conference</source>. <publisher-name>Association for Computational Linguistics</publisher-name>, <publisher-loc>Bangkok, Thailand</publisher-loc>, pp. <fpage>456</fpage>–<lpage>466</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.18653/v1/2024.arabicnlp-1.44" xlink:type="simple">https://doi.org/10.18653/v1/2024.arabicnlp-1.44</ext-link>.</mixed-citation>
</ref>
<ref id="j_infor633_ref_009">
<mixed-citation publication-type="journal"><string-name><surname>Horák</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Sabol</surname>, <given-names>R.</given-names></string-name>, <string-name><surname>Herman</surname>, <given-names>O.</given-names></string-name>, <string-name><surname>Baisa</surname>, <given-names>V.</given-names></string-name> (<year>2024</year>). <article-title>Recognition of propaganda techniques in newspaper texts: fusion of content and style analysis</article-title>. <source>Expert Systems with Applications</source>, <volume>251</volume>, <elocation-id>124085</elocation-id>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1016/j.eswa.2024.124085" xlink:type="simple">https://doi.org/10.1016/j.eswa.2024.124085</ext-link>. <uri>https://www.sciencedirect.com/science/article/pii/S0957417424009515</uri>.</mixed-citation>
</ref>
<ref id="j_infor633_ref_010">
<mixed-citation publication-type="journal"><string-name><surname>Jose</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Geeng</surname>, <given-names>C.</given-names></string-name>, <string-name><surname>Morales</surname>, <given-names>K.O.</given-names></string-name>, <string-name><surname>McCoy</surname>, <given-names>D.</given-names></string-name>, <string-name><surname>Greenstadt</surname>, <given-names>R.</given-names></string-name> (<year>2025</year>). <article-title>What’s in a label? Propaganda labels and user sharing behavior on social media platforms</article-title>. <source>Proceedings of the International AAAI Conference on Web and Social Media</source>, <volume>19</volume>(<issue>1</issue>), <fpage>918</fpage>–<lpage>934</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1609/icwsm.v19i1.35853" xlink:type="simple">https://doi.org/10.1609/icwsm.v19i1.35853</ext-link>.</mixed-citation>
</ref>
<ref id="j_infor633_ref_011">
<mixed-citation publication-type="journal"><string-name><surname>Moral</surname>, <given-names>P.</given-names></string-name>, <string-name><surname>Marco</surname>, <given-names>G.</given-names></string-name>, <string-name><surname>Gonzalo</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Carrillo-de-Albornoz</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Gonzalo-Verdugo</surname>, <given-names>I.</given-names></string-name> (<year>2023</year>). <article-title>Overview of DIPROMATS 2023: automatic detection and characterization of propaganda techniques in messages from diplomats and authorities of world powers</article-title>. <source>Procesamiento del Lenguaje Natural</source>, <volume>71</volume>, <fpage>397</fpage>–<lpage>407</lpage>. <uri>http://journal.sepln.org/sepln/ojs/ojs/index.php/pln/article/view/6569</uri>.</mixed-citation>
</ref>
<ref id="j_infor633_ref_012">
<mixed-citation publication-type="journal"><string-name><surname>Moral</surname>, <given-names>P.</given-names></string-name>, <string-name><surname>Fraile</surname>, <given-names>J.M.</given-names></string-name>, <string-name><surname>Marco</surname>, <given-names>G.</given-names></string-name>, <string-name><surname>Peñas</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Gonzalo</surname>, <given-names>J.</given-names></string-name> (<year>2024</year>). <article-title>Overview of DIPROMATS 2024: detection, characterization and tracking of propaganda in messages from diplomats and authorities of world powers</article-title>. <source>Procesamiento del Lenguaje Natural</source>, <volume>73</volume>, <fpage>347</fpage>–<lpage>358</lpage>.</mixed-citation>
</ref>
<ref id="j_infor633_ref_013">
<mixed-citation publication-type="journal"><string-name><surname>Perišić</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Vanbelle</surname>, <given-names>S.</given-names></string-name>, <string-name><surname>Petričević</surname>, <given-names>R.B.</given-names></string-name> (<year>2025</year>). <article-title>Quantifying binary classifier algorithms similarity with a consensus agreement approach</article-title>. <source>Informatica</source>, <volume>36</volume>(<issue>3</issue>), <fpage>657</fpage>–<lpage>676</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.15388/25-INFOR601" xlink:type="simple">https://doi.org/10.15388/25-INFOR601</ext-link>.</mixed-citation>
</ref>
<ref id="j_infor633_ref_014">
<mixed-citation publication-type="chapter"><string-name><surname>Piskorski</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Stefanovitch</surname>, <given-names>N.</given-names></string-name>, <string-name><surname>Da San Martino</surname>, <given-names>G.</given-names></string-name>, <string-name><surname>Nakov</surname>, <given-names>P.</given-names></string-name> (<year>2023</year>). <chapter-title>SemEval-2023 Task 3: detecting the category, the framing, and the persuasion techniques in online news in a multi-lingual setup</chapter-title>. In: <string-name><surname>Ojha</surname>, <given-names>A.K.</given-names></string-name>, <string-name><surname>Doğruöz</surname>, <given-names>A.S.</given-names></string-name>, <string-name><surname>Da San Martino</surname>, <given-names>G.</given-names></string-name>, <string-name><surname>Tayyar Madabushi</surname>, <given-names>H.</given-names></string-name>, <string-name><surname>Kumar</surname>, <given-names>R.</given-names></string-name>, <string-name><surname>Sartori</surname>, <given-names>E.</given-names></string-name> (Eds.), <source>Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023)</source>. <publisher-name>Association for Computational Linguistics</publisher-name>, <publisher-loc>Toronto, Canada</publisher-loc>, pp. <fpage>2343</fpage>–<lpage>2361</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.18653/v1/2023.semeval-1.317" xlink:type="simple">https://doi.org/10.18653/v1/2023.semeval-1.317</ext-link>.</mixed-citation>
</ref>
<ref id="j_infor633_ref_015">
<mixed-citation publication-type="chapter"><string-name><surname>Rashkin</surname>, <given-names>H.</given-names></string-name>, <string-name><surname>Choi</surname>, <given-names>E.</given-names></string-name>, <string-name><surname>Jang</surname>, <given-names>J.Y.</given-names></string-name>, <string-name><surname>Volkova</surname>, <given-names>S.</given-names></string-name>, <string-name><surname>Choi</surname>, <given-names>Y.</given-names></string-name> (<year>2017</year>). <chapter-title>Truth of varying shades: analyzing language in fake news and political fact-checking</chapter-title>. In: <string-name><surname>Palmer</surname>, <given-names>M.</given-names></string-name>, <string-name><surname>Hwa</surname>, <given-names>R.</given-names></string-name>, <string-name><surname>Riedel</surname>, <given-names>S.</given-names></string-name> (Eds.), <source>Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing</source>. <publisher-name>Association for Computational Linguistics</publisher-name>, <publisher-loc>Copenhagen, Denmark</publisher-loc>, pp. <fpage>2931</fpage>–<lpage>2937</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.18653/v1/D17-1317" xlink:type="simple">https://doi.org/10.18653/v1/D17-1317</ext-link>.</mixed-citation>
</ref>
<ref id="j_infor633_ref_016">
<mixed-citation publication-type="chapter"><string-name><surname>Ratinov</surname>, <given-names>L.</given-names></string-name>, <string-name><surname>Roth</surname>, <given-names>D.</given-names></string-name> (<year>2009</year>). <chapter-title>Design Challenges and Misconceptions in Named Entity Recognition</chapter-title>. In: <source>Proceedings of the Thirteenth Conference on Computational Natural Language Learning (CoNLL-2009)</source>. <publisher-name>Association for Computational Linguistics</publisher-name>, <publisher-loc>Boulder, Colorado</publisher-loc>, pp. <fpage>147</fpage>–<lpage>155</lpage>. <uri>https://aclanthology.org/W09-1119/</uri>.</mixed-citation>
</ref>
<ref id="j_infor633_ref_017">
<mixed-citation publication-type="journal"><string-name><surname>Rizgelienė</surname>, <given-names>I.</given-names></string-name>, <string-name><surname>Zubaitienė</surname>, <given-names>V.</given-names></string-name>, <string-name><surname>Maliukevičius</surname>, <given-names>N.</given-names></string-name>, <string-name><surname>Marcinkevičius</surname>, <given-names>V.</given-names></string-name> (<year>2025</year>). <article-title>HALT-PROP: Human-Annotated Lithuanian Textual Corpus for Propaganda Narratives and Techniques</article-title>. <source>Scientific Data</source>, <volume>13</volume>(<issue>1</issue>), <fpage>47</fpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1038/s41597-025-06367-w" xlink:type="simple">https://doi.org/10.1038/s41597-025-06367-w</ext-link>.</mixed-citation>
</ref>
<ref id="j_infor633_ref_018">
<mixed-citation publication-type="other"><string-name><surname>State Digital Solutions Agency (SDSA)</surname></string-name> (2025). LT-MLKM-modernBERT: Lithuanian ModernBERT Language Model. <uri>https://huggingface.co/VSSA-SDSA/LT-MLKM-modernBERT</uri>. Developed by Vytautas Magnus University (VMU), UAB Neurotechnology, UAB Tilde informacinės technologijos, MB Krilas.</mixed-citation>
</ref>
<ref id="j_infor633_ref_019">
<mixed-citation publication-type="other"><string-name><surname>Ulčar</surname>, <given-names>M.</given-names></string-name>, <string-name><surname>Robnik-Šikonja</surname>, <given-names>M.</given-names></string-name> (2020). EMBEDDIA: LitLat BERT: Model Card. <uri>https://huggingface.co/EMBEDDIA/litlat-bert</uri>. XLM-RoBERTa-base configuration; 12 layers, 12 heads; vocabulary size 84,201.</mixed-citation>
</ref>
</ref-list>
</back>
</article>
