{"id":1931,"date":"2025-06-22T00:56:33","date_gmt":"2025-06-22T00:56:33","guid":{"rendered":"https:\/\/catedramasmovil.uc3m.es\/2025\/06\/22\/tv-cover-title-extraction\/"},"modified":"2026-06-03T14:28:44","modified_gmt":"2026-06-03T14:28:44","slug":"tv-cover-title-extraction","status":"publish","type":"post","link":"https:\/\/catedramasmovil.uc3m.es\/en\/2025\/06\/22\/tv-cover-title-extraction\/","title":{"rendered":"Code-switching processing in ASR for low-resource languages \u2013 Use case: Basque"},"content":{"rendered":"\n[et_pb_section fb_built=&#8221;1&#8243; _builder_version=&#8221;4.17.0&#8243; custom_padding=&#8221;0px||||false|false&#8221; global_colors_info=&#8221;{}&#8221; theme_builder_area=&#8221;post_content&#8221;][et_pb_row _builder_version=&#8221;4.17.0&#8243; _module_preset=&#8221;default&#8221; custom_padding=&#8221;0px||||false|false&#8221; global_colors_info=&#8221;{}&#8221; theme_builder_area=&#8221;post_content&#8221;][et_pb_column type=&#8221;4_4&#8243; _builder_version=&#8221;4.17.0&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221; theme_builder_area=&#8221;post_content&#8221;][et_pb_gallery gallery_ids=&#8221;2101,2098,2096,2093,2091,2089&#8243; fullwidth=&#8221;on&#8221; _builder_version=&#8221;4.27.6&#8243; _module_preset=&#8221;default&#8221; hover_enabled=&#8221;0&#8243; global_colors_info=&#8221;{}&#8221; theme_builder_area=&#8221;post_content&#8221; sticky_enabled=&#8221;0&#8243;][\/et_pb_gallery][\/et_pb_column][\/et_pb_row][et_pb_row _builder_version=&#8221;4.16&#8243; background_size=&#8221;initial&#8221; background_position=&#8221;top_left&#8221; background_repeat=&#8221;repeat&#8221; custom_margin=&#8221;|auto||103px||&#8221; global_colors_info=&#8221;{}&#8221; theme_builder_area=&#8221;post_content&#8221; width=&#8221;80.2%&#8221;][et_pb_column type=&#8221;4_4&#8243; _builder_version=&#8221;4.16&#8243; custom_padding=&#8221;|||&#8221; global_colors_info=&#8221;{}&#8221; custom_padding__hover=&#8221;|||&#8221; theme_builder_area=&#8221;post_content&#8221;][et_pb_text _builder_version=&#8221;4.25.1&#8243; _module_preset=&#8221;default&#8221; custom_padding=&#8221;||0px|||&#8221; global_colors_info=&#8221;{}&#8221; theme_builder_area=&#8221;post_content&#8221;]<h2>Objective<\/h2>\n<p><span style=\"font-weight: 400\">In an ASR context where most phenomena have been studied for more widespread languages, the present work seeks to propose a solution for code-switching, a phenomenon whereby a multilingual speaker alternates between the different languages they master throughout their speech, in intersentential contexts where the speaker switches languages between sentences, typically due to syntactic differences among the languages they speak.<\/span><\/p>\n<p><span style=\"font-weight: 400\">To this end, the fine-tuning of Whisper, a technology widely used for audio transcription, is proposed in order to deeply train its models on the different languages involved in the speaker\u2019s code-switching phenomenon, and to create an architecture that enables reinforced transcription for languages with less coverage.<\/span><\/p>\n<p><span style=\"font-weight: 400\"><br\/>Specifically, the present work proposes an architecture for the use case of code-switching between Spanish and Basque, where both languages exhibit different syntactic structures and the language switch does not usually entail a loss of meaning in the message. <\/span><\/p>\n<p><span style=\"font-weight: 400\">This architecture begins with two blocks that properly process the audio in order to maximize accuracy in each of its segments: a VAD block, using Silero VAD, to divide the audio into segments where speech activity is present; followed by an LID block, which aims to identify the language of each of these audio segments in order to perform a transcription adapted to the corresponding language. <\/span><\/p>\n<p><span style=\"font-weight: 400\">Once the segments have been divided and labeled by language, a specific transcription for Basque is carried out using a fine-tuned Whisper model, while the base model is retained for Spanish (since it already has sufficient coverage). The transcriptions of each segment are then combined into a final text file.<\/span><\/p>\n<p><span style=\"font-weight: 400\">All of this is carried out based on two datasets: one for fine-tuning, obtained from the Mozilla Common Voice platform, and a second one for architecture validation, which is synthetic and contains code-switched audio samples that follow the intersentential structure addressed in this work.<\/span><\/p>\n<p dir=\"ltr\" style=\"text-align: justify\"><\/p>[\/et_pb_text][\/et_pb_column][\/et_pb_row][\/et_pb_section][et_pb_section fb_built=&#8221;1&#8243; _builder_version=&#8221;4.18.0&#8243; _module_preset=&#8221;default&#8221; custom_margin=&#8221;20px||||false|false&#8221; custom_padding=&#8221;0px||||false|false&#8221; global_colors_info=&#8221;{}&#8221; theme_builder_area=&#8221;post_content&#8221;][et_pb_row column_structure=&#8221;1_2,1_2&#8243; _builder_version=&#8221;4.25.1&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221; theme_builder_area=&#8221;post_content&#8221;][et_pb_column type=&#8221;1_2&#8243; _builder_version=&#8221;4.18.0&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221; theme_builder_area=&#8221;post_content&#8221;][et_pb_image src=&#8221;https:\/\/storage.googleapis.com\/wp-uploads.bucket.wp.uc3m.es\/wp-content\/uploads\/sites\/70\/2026\/05\/19111753\/Foto.jpeg&#8221; title_text=&#8221;Foto&#8221; _builder_version=&#8221;4.27.6&#8243; _module_preset=&#8221;default&#8221; hover_enabled=&#8221;0&#8243; global_colors_info=&#8221;{}&#8221; theme_builder_area=&#8221;post_content&#8221; sticky_enabled=&#8221;0&#8243;][\/et_pb_image][\/et_pb_column][et_pb_column type=&#8221;1_2&#8243; _builder_version=&#8221;4.18.0&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221; theme_builder_area=&#8221;post_content&#8221;][et_pb_text _builder_version=&#8221;4.27.6&#8243; _module_preset=&#8221;default&#8221; custom_padding=&#8221;||0px|||&#8221; hover_enabled=&#8221;0&#8243; global_colors_info=&#8221;{}&#8221; theme_builder_area=&#8221;post_content&#8221; sticky_enabled=&#8221;0&#8243;]<p><span style=\"color: #003366\"><strong>BACHELOR&#8217;S THESIS BY<\/strong><\/span><\/p>\n<p><span style=\"color: #003366\"><b>JAVIER RAM\u00cdREZ ZARZOSO<\/b><\/span><\/p>[\/et_pb_text][et_pb_text _builder_version=&#8221;4.25.1&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221; theme_builder_area=&#8221;post_content&#8221;]<p><strong><\/strong><\/p>\n<p><strong><\/strong><\/p>\n<p><strong><\/strong><\/p>\n<p><strong> Academic experience<\/strong><\/p>\n<ul>\n<li>Dual Bachelor in Computer Science and Engineering and Business Administration, Universidad Carlos III de Madrid (september 2021 &#8211; july 2026)<\/li>\n<li>Google Cloud Data Analyst Certificate (june &#8211; june 2025)<\/li>\n<\/ul>\n<ul><\/ul>\n<p>&nbsp;<\/p>[\/et_pb_text][\/et_pb_column][\/et_pb_row][et_pb_row _builder_version=&#8221;4.20.2&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221; theme_builder_area=&#8221;post_content&#8221;][et_pb_column type=&#8221;4_4&#8243; _builder_version=&#8221;4.20.2&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221; theme_builder_area=&#8221;post_content&#8221;][et_pb_text _builder_version=&#8221;4.25.1&#8243; _module_preset=&#8221;default&#8221; custom_padding=&#8221;||9px|||&#8221; global_colors_info=&#8221;{}&#8221; theme_builder_area=&#8221;post_content&#8221;]<p><strong>Work experience<\/strong><\/p>\n<ul>\n<li>Machine Learning Researcher \u2013 Universidad Carlos III de Madrid in collaboration with Grupo MasOrange (september 2025 \u2014 june 2026)<\/li>\n<li><span style=\"font-size: 14px\">Machine Learning Researcher &#8211; Universidad Carlos III de Madrid in collaboration with DEIMOS-SPACE (january 2025 \u2013 july 2025)<\/span><\/li>\n<li>Tennis instructor (julio 2019 \u2014 agosto 2020)<\/li>\n<\/ul>\n<p><strong><br>Skills<\/strong><\/p>\n<ul>\n<li>Programming languages: Python, C\/C++, SQL, HTML\/CSS, JavaScript.<\/li>\n<li>Development libraries: Pandas, OpenCV, Numpy, PyTorch, Keras, Sci-kit Learn.<\/li>\n<li>Cloud platforms: Google Cloud.<\/li>\n<li>Frameworks: GitHub, GitLab.<\/li>\n<\/ul>[\/et_pb_text][\/et_pb_column][\/et_pb_row][et_pb_row _builder_version=&#8221;4.25.1&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221; theme_builder_area=&#8221;post_content&#8221;][et_pb_column type=&#8221;4_4&#8243; _builder_version=&#8221;4.25.1&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221; theme_builder_area=&#8221;post_content&#8221;][et_pb_text _builder_version=&#8221;4.27.6&#8243; _module_preset=&#8221;default&#8221; hover_enabled=&#8221;0&#8243; global_colors_info=&#8221;{}&#8221; theme_builder_area=&#8221;post_content&#8221; sticky_enabled=&#8221;0&#8243;]<blockquote>\n<p><span style=\"color: #000000\"><span style=\"text-decoration: underline\"><a href=\"https:\/\/github.com\/javier-rmrz\" target=\"_blank\" rel=\"noopener\" style=\"color: #000000\">GitHub<\/a><\/span><\/span><\/p>\n<p><a href=\"https:\/\/www.linkedin.com\/in\/javier-ram\u00edrez-zarzoso-a1565b295\/\"><span style=\"color: #000000\"><span style=\"text-decoration: underline\">LinkedIn<\/span><\/span><\/a><\/p>\n<\/blockquote>[\/et_pb_text][\/et_pb_column][\/et_pb_row][\/et_pb_section]\n","protected":false},"excerpt":{"rendered":"","protected":false},"author":172,"featured_media":1922,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_et_pb_use_builder":"on","_et_pb_old_content":"","_et_gb_content_width":"","footnotes":""},"categories":[79],"tags":[],"class_list":["post-1931","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-proyectos-2025-2026"],"_links":{"self":[{"href":"https:\/\/catedramasmovil.uc3m.es\/en\/wp-json\/wp\/v2\/posts\/1931","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/catedramasmovil.uc3m.es\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/catedramasmovil.uc3m.es\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/catedramasmovil.uc3m.es\/en\/wp-json\/wp\/v2\/users\/172"}],"replies":[{"embeddable":true,"href":"https:\/\/catedramasmovil.uc3m.es\/en\/wp-json\/wp\/v2\/comments?post=1931"}],"version-history":[{"count":6,"href":"https:\/\/catedramasmovil.uc3m.es\/en\/wp-json\/wp\/v2\/posts\/1931\/revisions"}],"predecessor-version":[{"id":2266,"href":"https:\/\/catedramasmovil.uc3m.es\/en\/wp-json\/wp\/v2\/posts\/1931\/revisions\/2266"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/catedramasmovil.uc3m.es\/en\/wp-json\/wp\/v2\/media\/1922"}],"wp:attachment":[{"href":"https:\/\/catedramasmovil.uc3m.es\/en\/wp-json\/wp\/v2\/media?parent=1931"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/catedramasmovil.uc3m.es\/en\/wp-json\/wp\/v2\/categories?post=1931"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/catedramasmovil.uc3m.es\/en\/wp-json\/wp\/v2\/tags?post=1931"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}