{"id":8033,"date":"2017-05-23T07:17:13","date_gmt":"2017-05-23T05:17:13","guid":{"rendered":"https:\/\/www.uni.lu\/c2dh-en\/articles\/catching-speech-in-arezzo-a-clarin-workshop-for-developing-a-transcription-chain-for-oral-history\/"},"modified":"2025-03-26T10:12:55","modified_gmt":"2025-03-26T09:12:55","slug":"catching-speech-in-arezzo-a-clarin-workshop-for-developing-a-transcription-chain-for-oral-history","status":"publish","type":"articles","link":"https:\/\/www.uni.lu\/c2dh-en\/articles\/catching-speech-in-arezzo-a-clarin-workshop-for-developing-a-transcription-chain-for-oral-history\/","title":{"rendered":"Catching Speech in Arezzo: A Clarin workshop for developing a transcription-chain for Oral History"},"content":{"rendered":"\n<section class=\"wp-block-unilux-blocks-free-section section\"><div class=\"container xl:max-w-screen-xl\">\n\n<p><strong>The challenge of this workshop on &#039;transcription and technology\u2019, which took place from 10 to 12 May in Arezzo, consisted in turning recorded human speech into a textual representation that is as close as possible to what has been uttered. <\/strong><\/p>\n\n<p>Say <strong>Arezzo<\/strong> and any art historian\u2019s thoughts will divert to <strong>Piero della Francesca\u2019s<\/strong> innovation in the visual representation of reality. It was this renaissance painter of the 15<sup>th<\/sup> century who in his frescos dared to depicture religious figures as real humans. By creating the illusion of depth and light and by paying detailed attention to the anatomy of their bodies, Christ, the Madonna and other holy personalities were no longer spiritual creatures floating in the air, but <a href=\"http:\/\/www.wga.hu\/html_m\/p\/piero\/2\/index.html\" target=\"_blank\" rel=\"noopener\">looked like real people<\/a>.<\/p>\n\n<p>The efforts of a workshop on &#8216;transcription and technology\u2019, which we attended from 10 to 12 May in this beautiful Tuscan town, were also geared towards an accurate representation of reality. This time the challenge was turning recorded human speech into a textual representation that is as close as possible to what has been uttered. Speech retrieval technology can be relevant for humanities research in two ways: it can open up huge amounts of spoken data in archives of which the content is mostly unknown, and it can speed up the lengthy process of manual transcription for scholars who want to analyze their interviews in depth.<\/p>\n\n<p>The importance of studying speech is evident when taking into consideration the role of recorded voice and moving image for the human expression of the 20th century. Digital tools have already conquered the world of text, magnifying the scale and speed at which phenomena can be observed, but too little attention is given to how we use spoken language and memory to shape our lived experiences into a set of meaningful and coherent stories.<\/p>\n\n<p>With this agenda in mind, a mix of Italian, Dutch, British, Czech and German oral historians, linguists, data specialists and speech technologists got together to assess which <em>Digitization,<\/em> <em>Speech Retrieval<\/em>, <em>Alignment<\/em> and <em>Transcription<\/em> tools are suitable for creating a semi-automated workflow that can turn analogue recordings into readable transcripts. The workshop was supported by <a href=\"https:\/\/www.clarin.eu\/\" target=\"_blank\" rel=\"noopener\">CLARIN ERIC<\/a>, a European infrastructure offering digital data and tools for Digital Humanities scholarly research. It was created to serve a broad range of scholars, but until recently it was foremost a much cherished treasure trove for linguists. The increased interest for cross-disciplinary approaches to data is the appropriate context for making efforts to recruit more enthusiastic users from the humanities and social science fields. This objective begs the question of which requirements are relevant to which type of scholar who works with speech data. It also asks scholars to step out of their &#8216;comfort zone&#8217; and consider other approaches.<\/p>\n\n<p>Contributors were asked to present an overview of conventions and practices that should be considered to create a well suited workflow: What are the metadata schemes used in speech data? What are the guidelines for transcription? What are existing digital infrastructures capable of providing? And what has proprietary commercial software already have in store? After the technical partners presented a parade of tools, the real fun part started: testing the various tools with 5 minutes clips of audio. The speech retrieval tools are of course language-specific. For English they could try out the web service offered by Sheffield University: <a href=\"http:\/\/www.webasr.org\/\" target=\"_blank\" rel=\"noopener\">http:\/\/www.webasr.org\/<\/a> . The Dutch could try out <a href=\"https:\/\/webservices-lst.science.ru.nl\/oral_history\/\" target=\"_blank\" rel=\"noopener\">https:\/\/webservices-lst.science.ru.nl\/oral_history\/<\/a> from the Radboud University in Nijmegen, and the Italians could practice with the stand-alone alignment software <strong>\u2018Segmenta\u2019<\/strong> created by Piero Cosi at the CNR in Padua. As expected, the speech retrieval software performed poorly with clips containing language with strong regional accents, such as a corpus with Tuscan dialect, or an interview with a narrator speaking with a Flemish\/Moroccan accent. The good news is however, that it performed excellently with language clips that contained regular speech, and that this applied to all three languages.<\/p>\n\n\n<section class=\"alignfull wp-block-unilux-blocks-gallery-carousel\">\n    <div class=\"swiper swiper-gallery\" aria-roledescription=\"carousel\" aria-label=\"A gallery of images\">\n        <!-- Swiper button Next & Prev -->\n        <div class=\"swiper-nav\">\n            <div class=\"swiper-nav__container\">\n                <div class=\"swiper-nav__grid\">\n                    <button type=\"button\" class=\"swiper-button-next\">\n                        <svg aria-hidden=\"true\" focusable=\"false\" class=\"icon icon-outline icon--arrow-right \"><use xlink:href=\"https:\/\/www.uni.lu\/wp-content\/themes\/unilux-theme\/assets\/images\/icons\/icons-outline.svg#icon--arrow-right\"><\/use><\/svg>                    <\/button>\n                    <button type=\"button\" class=\"swiper-button-prev\">\n                        <svg aria-hidden=\"true\" focusable=\"false\" class=\"icon icon-outline icon--arrow-left \"><use xlink:href=\"https:\/\/www.uni.lu\/wp-content\/themes\/unilux-theme\/assets\/images\/icons\/icons-outline.svg#icon--arrow-left\"><\/use><\/svg>                    <\/button>\n                <\/div>\n            <\/div>\n        <\/div>\n\n        <!-- swiper slides -->\n        <ul class=\"swiper-wrapper\">\n            \n<li class=\"swiper-slide\" aria-roledescription=\"slide\">\n    <figure class=\"wp-block-dev4-reusable-blocks-image swiper-slide__bg object-fit--contain\">\n    \n<img decoding=\"async\" class=\"wp-block-image unilux-custom-image-block\"\n                alt=\"Arezzo paintings\"\n            src=\"https:\/\/www.uni.lu\/wp-content\/uploads\/sites\/7\/2025\/03\/arezzoworkshop-1.jpg\"\n                    style=\"object-position: 50.00% 50.00%; font-family: &quot;object-fit: contain; object-position: 50.00% 50.00%;&quot;; aspect-ratio: 16\/9; object-fit: contain; width: 100%;\"\n        loading=\"lazy\"\n\/>    <\/figure><\/li>\n<li class=\"swiper-slide\" aria-roledescription=\"slide\">\n    <figure class=\"wp-block-dev4-reusable-blocks-image swiper-slide__bg object-fit--contain\">\n    \n<img decoding=\"async\" class=\"wp-block-image unilux-custom-image-block\"\n                alt=\"\"\n            src=\"https:\/\/www.uni.lu\/wp-content\/uploads\/sites\/7\/2025\/03\/arezzoworkshop-3.jpg\"\n                    style=\"object-position: 50.00% 50.00%; font-family: &quot;object-fit: contain; object-position: 50.00% 50.00%;&quot;; aspect-ratio: 16\/9; object-fit: contain; width: 100%;\"\n        loading=\"lazy\"\n\/>    <\/figure><\/li>\n<li class=\"swiper-slide\" aria-roledescription=\"slide\">\n    <figure class=\"wp-block-dev4-reusable-blocks-image swiper-slide__bg object-fit--contain\">\n    \n<img decoding=\"async\" class=\"wp-block-image unilux-custom-image-block\"\n                alt=\"Arezzo workshop\"\n            src=\"https:\/\/www.uni.lu\/wp-content\/uploads\/sites\/7\/2025\/03\/arezzoworkshop-4.jpg\"\n                    style=\"object-position: 50.00% 50.00%; font-family: &quot;object-fit: contain; object-position: 50.00% 50.00%;&quot;; aspect-ratio: 16\/9; object-fit: contain; width: 100%;\"\n        loading=\"lazy\"\n\/>    <\/figure><\/li>        <\/ul>\n\n        <!-- Swiper pagination -->\n        <div class=\"swiper-pagination\">\n            <div class=\"swiper-pagination__bullets\"><\/div>\n        <\/div>\n    <\/div>\n<\/section>\n\n<p>For the less technically savvy scholars, it was a surprise to hear how confident speech technologists were about the chances of success when trying to customize speech recognition software to work on non-standard language varieties and lesser-researched languages. The most important requirements seem to be to have enough training material in the form of a lexicon, a language model and an acoustic model that can be fed into the software. Success and low word error rates (WER) appear to be a question of scale, training and perseverance. This might raise new hopes for mobilizing awareness and fostering research on small language groups such as <strong>Luxemburgisch<\/strong>.<\/p>\n\n<p>The next step in the workflow is the transcription. Several tools were presented, such as <strong>OCTRA-2D<\/strong> from Muenchen (<a href=\"https:\/\/www.phonetik.uni-muenchen.de\/apps\/octra\/octra\/\" target=\"_blank\" rel=\"noopener\">https:\/\/www.phonetik.uni-muenchen.de\/apps\/octra\/octra\/<\/a>), <strong>Subtitle Edit<\/strong> (<a href=\"http:\/\/www.nikse.dk\/subtitleedit\/\" target=\"_blank\" rel=\"noopener\">http:\/\/www.nikse.dk\/subtitleedit\/<\/a>), created by Danish developers, and an unexpected contribution from the world of journalism was <strong>OTranscribe<\/strong>, <a href=\"http:\/\/otranscribe.com\/\" target=\"_blank\" rel=\"noopener\">http:\/\/otranscribe.com\/<\/a>, which seemed to be the easiest to handle. The challenge is of course to customize these tools in a way that they can effectively import the outputs of the speech recognition, so that the correction can begin, without having to do any additional clean up or structuring.<\/p>\n\n<p>What was striking when observing the various conventions, is that only sound-based speech studies use time codes. When is comes to studying the interpretation of what is uttered, meaning that you need whole utterances to grasp the meaning and context, there is no tradition of documenting time codes in the metadata. This means that a lot of \u2018conversion&#8217; in the persuasive sense of the word has to be done, to have humanities and social science scholars make optimal use of digital tools.<\/p>\n\n<p>The last step in the workflow is the alignment, connecting the audio signal to the transcription. This facilitates browsing and searching through an entire corpus of recordings, and can easily be done with ASR output that is not completely correct. For this part of the chain, the Bavarian Archive for Speech Signals have provided <a href=\"https:\/\/clarin.phonetik.uni-muenchen.de\/BASWebServices\/#!\/services\" target=\"_blank\" rel=\"noopener\">WebMAUS<\/a>, an open source program for phoneticians. The demonstration showed that this resource has many more features that could be utilized than was initially known by the organizers. An example is the Bavarian Archive for Speech Signals, which already conducts online-experiments and web-based audio transcriptions via crowd-sourcing. Due to the lack of familiarity with other disciplines these functionalities had not been offered to other target groups. These \u2018surprises\u2019 were recurrent during the workshop, and showed that mixing disciplines opens up the bubble of your own research network. Programs that for some scholars represent mainstream technology had the impact of real revelations to others. This was certainly the case with a number of Italian PhD\u2019s. Two moments were exemplary for the diversity in the use of criteria for quality and terminology. The first was when speech technologist Piero Cosi informed linguist Silvia Calamai that her \u2018best piece of recording\u2019, had performed the \u2018worst\u2019 of all clips. The other was when it became clear that in Thomas Hain&#8217;s interface for speech recognition, the field \u2018metadata\u2019 did not refer to the convention of completing a template with the properties of a document. In this web resource, its function was to encourage uploading textual documents that cover the topic of the sound recording, in order to improve the recognition performance. It was also discovered that linguists and social scientists can mean very different things when they talk about \u2018annotation\u2019.<\/p>\n\n<p>A last component of the chain was also considered: creating a community to crowdsource the transcription of an interview collection. The sensational success of crowdsourcing personal written documents, promises good results as long as the workflow is arranged properly. The platform Crowdflower could provide such a structure. With such projects, there are advantages and disadvantages when we compare dedicated platforms such as Crowdflower, or Zooniverse, or consider using our own platforms for crowdsourced projects. Dedicated platforms provide lots of functionality for building and maintaining a community of volunteers, but allowing the researchers limited control over the data and software hosted on the platform. Using our own websites to carry out such projects would require lots of improvements in the user interfaces, and lots of effort to reach people and keep them involved.<\/p>\n\n<p>Of course there were also undercurrents of scepticism, which can \u2019spoil the party\u2019, but they deserve a prominent role in the assessment of the potential. These refer to the limits of the efficiency of customizing tools that are created by scholars with no commercial interest and who will eventually retire or change jobs. Another objection was to the top-down approach, the idea that there is a chain and that by customizing existing tools that were created for other purposes, you can cater for a variety of scholars. An alternative would be choosing one discipline, observing all practices attentively, and designing the best tool or tools to fit these practices. These objections warn against setting no limits to the customization and against presenting the chain as a service to all scholars that will maintained eternally. But academics are not eternal, they are mortal creatures who are supposed to produce new knowledge, not services. On the other hand, these type of arguments can also paralyze creativity and enthusiasm, and the will to collaborate for a common goal. The ideal setting for creating optimal services in a non commercial environment will probably remain a dream. So to push the further development of open source resources we are bound to reach compromises and to take small steps.\u00a0<\/p>\n\n<p>The setting in Arezzo was perfect. A mix of nationalities, generations and disciplines engaged in opening up stories about ordinary people, and last but not least, a warm and thoughtful reception by our hosts Silvia Calamai, Francesca Biliotti, Simona Matteini and Caterina Pesce. For people heading to Arezzo this summer: try <a href=\"https:\/\/www.ristorantelanciadoro.it\/en\/\" target=\"_blank\" rel=\"noopener\">La Lancia d\u2019Oro<\/a> and l\u2019<a href=\"http:\/\/www.agania.it\/\" target=\"_blank\" rel=\"noopener\">Agania<\/a>. Readers who want to know more about Oral History and Technology can take a look at the website curated by speech technologist Arjan van Hessen and Henk van de Heuvel: <a href=\"http:\/\/oralhistory.eu\/\" target=\"_blank\" rel=\"noopener\">http:\/\/oralhistory.eu<\/a>. If you are interested in the progress of our effort to create a transcription chain, or are willing to share your experiences with trying out the tools mentioned in this blog, this is the place to be.<\/p>\n\n\n<section class=\"section no-padding-y wp-block-unilux-blocks-hero\">\n    <div class=\"hero hero--2  \">\n        <header class=\"wp-block-unilux-blocks-wrapper hero__header\">\n\n<div class=\"wp-block-unilux-blocks-wrapper hero__container\">\n<span class=\"hero__title__subject wp-block-unilux-blocks-plain-text\"> <\/span>\n\n<h1 class=\"has-text-align-left wp-block-unilux-blocks-heading\"    >\nCLARIN &#8211; European Research Infrastructure for Language Resources and Technology <\/h1>\n<\/div>\n\n<\/header><figure class=\"wp-block-dev4-reusable-blocks-image hero__visual object-fit--cover\">\n    \n<img decoding=\"async\" class=\"wp-block-image unilux-custom-image-block\"\n                alt=\"\"\n            src=\"https:\/\/www.uni.lu\/wp-content\/uploads\/sites\/7\/2025\/03\/clarin_logo.png\"\n                    style=\"object-position: 50.00% 50.00%; font-family: &quot;object-fit: cover; object-position: 50.00% 50.00%;&quot;; aspect-ratio: 16\/9; object-fit: cover; width: 100%;\"\n        loading=\"lazy\"\n\/>    <\/figure>\n<div class=\"wp-block-unilux-blocks-wrapper hero__body\">\n<div class=\"wp-block-unilux-blocks-wrapper hero__container\">\n\n<p class=\"wp-block-unilux-blocks-plain-text\">CLARIN stands for &lsquo;Common Language Resources and Technology Infrastructure&rsquo;. It is a research infrastructure that was initiated from the vision that all digital language resources and tools from all over Europe and beyond are accessible through a single sign-on online environment for the support of researchers in the humanities and social sciences.<\/p>\n\n<ul class=\"wp-block-unilux-blocks-custom-buttons btn-list\"><li class=\"wp-block-unilux-blocks-custom-button\"    aria-disabled=\"false\"\n    >\n    <a\n        role=\"link\"\n        aria-disabled=\"false\"\n                    href=\"https:\/\/www.clarin.eu\/\"\n                target=\"_self\"\n        class=\"btn btn--primary\"\n            >CLARIN ERIC<\/a>\n<\/li>\n\n<\/ul>\n<\/div>\n\n\n<\/div>    <\/div>\n<\/section>\n\n<h3 class=\"has-text-align-left wp-block-unilux-blocks-heading\"    >\nCredits:<\/h3>\n\n\n<p>Blog: <a href=\"user\/128\">Stef Scagliola<\/a>, Martin Wynne, Henk van de Heuvel<\/p>\n\n<p>Photos: Christoph Draxler<\/p>\n\n\n<h3 class=\"has-text-align-left wp-block-unilux-blocks-heading\"    >\nAuthor(s)<\/h3>\n\n\n\n<ul class=\"ulux-list\">\n\n<li class=\"ulux-list-item\">Stefania Scagliola<\/li>\n\n\n<\/ul>\n\n\n<\/div><\/section>\n\n","protected":false},"excerpt":{"rendered":"<p>The challenge of this workshop on &#039;transcription and technology\u2019, which took place from 10 to 12 May in Arezzo, consisted in turning recorded human speech into a textual representation that is as close as possible to what has been uttered. <\/p>\n","protected":false},"author":1,"featured_media":7935,"template":"","format":"standard","meta":{"featured_image_focal_point":[],"show_featured_caption":false,"ulux_newsletter_groups":"","uluxPostTitle":"","uluxPrePostTitle":"","_trash_the_other_posts":false,"_price":"","_stock":"","_tribe_ticket_header":"","_tribe_default_ticket_provider":"","_tribe_ticket_capacity":"0","_ticket_start_date":"","_ticket_end_date":"","_tribe_ticket_show_description":"","_tribe_ticket_show_not_going":false,"_tribe_ticket_use_global_stock":"","_tribe_ticket_global_stock_level":"","_global_stock_mode":"","_global_stock_cap":"","_tribe_rsvp_for_event":"","_tribe_ticket_going_count":"","_tribe_ticket_not_going_count":"","_tribe_tickets_list":"[]","_tribe_ticket_has_attendee_info_fields":false},"articles-category":[],"articles-topic":[400,392,396],"organisation":[221],"authorship":[],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v22.3 (Yoast SEO v22.3) - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Catching Speech in Arezzo: A Clarin workshop for developing a transcription-chain for Oral History - C2DH EN<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.uni.lu\/c2dh-en\/articles\/catching-speech-in-arezzo-a-clarin-workshop-for-developing-a-transcription-chain-for-oral-history\/\" \/>\n<meta property=\"og:locale\" content=\"en_GB\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Catching Speech in Arezzo: A Clarin workshop for developing a transcription-chain for Oral History\" \/>\n<meta property=\"og:description\" content=\"The challenge of this workshop on &#039;transcription and technology\u2019, which took place from 10 to 12 May in Arezzo, consisted in turning recorded human speech into a textual representation that is as close as possible to what has been uttered.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.uni.lu\/c2dh-en\/articles\/catching-speech-in-arezzo-a-clarin-workshop-for-developing-a-transcription-chain-for-oral-history\/\" \/>\n<meta property=\"og:site_name\" content=\"C2DH EN\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/c2dh.lu\" \/>\n<meta property=\"article:modified_time\" content=\"2025-03-26T09:12:55+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.uni.lu\/wp-content\/uploads\/sites\/7\/2025\/03\/arezzoworkshop-2_full_width.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1380\" \/>\n\t<meta property=\"og:image:height\" content=\"720\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Estimated reading time\" \/>\n\t<meta name=\"twitter:data1\" content=\"9 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.uni.lu\/c2dh-en\/articles\/catching-speech-in-arezzo-a-clarin-workshop-for-developing-a-transcription-chain-for-oral-history\/\",\"url\":\"https:\/\/www.uni.lu\/c2dh-en\/articles\/catching-speech-in-arezzo-a-clarin-workshop-for-developing-a-transcription-chain-for-oral-history\/\",\"name\":\"Catching Speech in Arezzo: A Clarin workshop for developing a transcription-chain for Oral History - C2DH EN\",\"isPartOf\":{\"@id\":\"https:\/\/www.uni.lu\/c2dh-en\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.uni.lu\/c2dh-en\/articles\/catching-speech-in-arezzo-a-clarin-workshop-for-developing-a-transcription-chain-for-oral-history\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.uni.lu\/c2dh-en\/articles\/catching-speech-in-arezzo-a-clarin-workshop-for-developing-a-transcription-chain-for-oral-history\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.uni.lu\/wp-content\/uploads\/sites\/7\/2025\/03\/arezzoworkshop-2_full_width.jpg\",\"datePublished\":\"2017-05-23T05:17:13+00:00\",\"dateModified\":\"2025-03-26T09:12:55+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/www.uni.lu\/c2dh-en\/articles\/catching-speech-in-arezzo-a-clarin-workshop-for-developing-a-transcription-chain-for-oral-history\/#breadcrumb\"},\"inLanguage\":\"en-GB\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.uni.lu\/c2dh-en\/articles\/catching-speech-in-arezzo-a-clarin-workshop-for-developing-a-transcription-chain-for-oral-history\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-GB\",\"@id\":\"https:\/\/www.uni.lu\/c2dh-en\/articles\/catching-speech-in-arezzo-a-clarin-workshop-for-developing-a-transcription-chain-for-oral-history\/#primaryimage\",\"url\":\"https:\/\/www.uni.lu\/wp-content\/uploads\/sites\/7\/2025\/03\/arezzoworkshop-2_full_width.jpg\",\"contentUrl\":\"https:\/\/www.uni.lu\/wp-content\/uploads\/sites\/7\/2025\/03\/arezzoworkshop-2_full_width.jpg\",\"width\":1380,\"height\":720},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.uni.lu\/c2dh-en\/articles\/catching-speech-in-arezzo-a-clarin-workshop-for-developing-a-transcription-chain-for-oral-history\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.uni.lu\/en\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Luxembourg Centre for Contemporary and Digital History (C\u00b2DH)\",\"item\":\"https:\/\/www.uni.lu\/c2dh-en\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Articles\",\"item\":\"https:\/\/www.uni.lu\/c2dh-en\/articles\/\"},{\"@type\":\"ListItem\",\"position\":4,\"name\":\"Catching Speech in Arezzo: A Clarin workshop for developing a transcription-chain for Oral History\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.uni.lu\/c2dh-en\/#website\",\"url\":\"https:\/\/www.uni.lu\/c2dh-en\/\",\"name\":\"C2DH\",\"description\":\"Luxembourg Centre for Contemporary and Digital History I Uni.lu\",\"publisher\":{\"@id\":\"https:\/\/www.uni.lu\/c2dh-en\/#organization\"},\"alternateName\":\"Luxembourg Centre for Contemporary and Digital History I University of Luxembourg\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.uni.lu\/c2dh-en\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-GB\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.uni.lu\/c2dh-en\/#organization\",\"name\":\"C\u00b2DH - University of Luxembourg I Uni.lu\",\"alternateName\":\"Luxembourg Centre for Contemporary and Digital History\",\"url\":\"https:\/\/www.uni.lu\/c2dh-en\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-GB\",\"@id\":\"https:\/\/www.uni.lu\/c2dh-en\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/www.uni.lu\/wp-content\/uploads\/sites\/7\/2026\/03\/03113407\/C2DH_SM-Profile_1600x1600px-scaled.jpg\",\"contentUrl\":\"https:\/\/www.uni.lu\/wp-content\/uploads\/sites\/7\/2026\/03\/03113407\/C2DH_SM-Profile_1600x1600px-scaled.jpg\",\"width\":2560,\"height\":2560,\"caption\":\"C\u00b2DH - University of Luxembourg I Uni.lu\"},\"image\":{\"@id\":\"https:\/\/www.uni.lu\/c2dh-en\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/c2dh.lu\",\"http:\/\/www.instagram.com\/c2dh_lu\",\"http:\/\/www.linkedin.com\/showcase\/c2dh-university-of-luxembourg\"]}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"Catching Speech in Arezzo: A Clarin workshop for developing a transcription-chain for Oral History - C2DH EN","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.uni.lu\/c2dh-en\/articles\/catching-speech-in-arezzo-a-clarin-workshop-for-developing-a-transcription-chain-for-oral-history\/","og_locale":"en_GB","og_type":"article","og_title":"Catching Speech in Arezzo: A Clarin workshop for developing a transcription-chain for Oral History","og_description":"The challenge of this workshop on &#039;transcription and technology\u2019, which took place from 10 to 12 May in Arezzo, consisted in turning recorded human speech into a textual representation that is as close as possible to what has been uttered.","og_url":"https:\/\/www.uni.lu\/c2dh-en\/articles\/catching-speech-in-arezzo-a-clarin-workshop-for-developing-a-transcription-chain-for-oral-history\/","og_site_name":"C2DH EN","article_publisher":"https:\/\/www.facebook.com\/c2dh.lu","article_modified_time":"2025-03-26T09:12:55+00:00","og_image":[{"width":1380,"height":720,"url":"https:\/\/www.uni.lu\/wp-content\/uploads\/sites\/7\/2025\/03\/arezzoworkshop-2_full_width.jpg","type":"image\/jpeg"}],"twitter_card":"summary_large_image","twitter_misc":{"Estimated reading time":"9 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.uni.lu\/c2dh-en\/articles\/catching-speech-in-arezzo-a-clarin-workshop-for-developing-a-transcription-chain-for-oral-history\/","url":"https:\/\/www.uni.lu\/c2dh-en\/articles\/catching-speech-in-arezzo-a-clarin-workshop-for-developing-a-transcription-chain-for-oral-history\/","name":"Catching Speech in Arezzo: A Clarin workshop for developing a transcription-chain for Oral History - C2DH EN","isPartOf":{"@id":"https:\/\/www.uni.lu\/c2dh-en\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.uni.lu\/c2dh-en\/articles\/catching-speech-in-arezzo-a-clarin-workshop-for-developing-a-transcription-chain-for-oral-history\/#primaryimage"},"image":{"@id":"https:\/\/www.uni.lu\/c2dh-en\/articles\/catching-speech-in-arezzo-a-clarin-workshop-for-developing-a-transcription-chain-for-oral-history\/#primaryimage"},"thumbnailUrl":"https:\/\/www.uni.lu\/wp-content\/uploads\/sites\/7\/2025\/03\/arezzoworkshop-2_full_width.jpg","datePublished":"2017-05-23T05:17:13+00:00","dateModified":"2025-03-26T09:12:55+00:00","breadcrumb":{"@id":"https:\/\/www.uni.lu\/c2dh-en\/articles\/catching-speech-in-arezzo-a-clarin-workshop-for-developing-a-transcription-chain-for-oral-history\/#breadcrumb"},"inLanguage":"en-GB","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.uni.lu\/c2dh-en\/articles\/catching-speech-in-arezzo-a-clarin-workshop-for-developing-a-transcription-chain-for-oral-history\/"]}]},{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/www.uni.lu\/c2dh-en\/articles\/catching-speech-in-arezzo-a-clarin-workshop-for-developing-a-transcription-chain-for-oral-history\/#primaryimage","url":"https:\/\/www.uni.lu\/wp-content\/uploads\/sites\/7\/2025\/03\/arezzoworkshop-2_full_width.jpg","contentUrl":"https:\/\/www.uni.lu\/wp-content\/uploads\/sites\/7\/2025\/03\/arezzoworkshop-2_full_width.jpg","width":1380,"height":720},{"@type":"BreadcrumbList","@id":"https:\/\/www.uni.lu\/c2dh-en\/articles\/catching-speech-in-arezzo-a-clarin-workshop-for-developing-a-transcription-chain-for-oral-history\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.uni.lu\/en"},{"@type":"ListItem","position":2,"name":"Luxembourg Centre for Contemporary and Digital History (C\u00b2DH)","item":"https:\/\/www.uni.lu\/c2dh-en\/"},{"@type":"ListItem","position":3,"name":"Articles","item":"https:\/\/www.uni.lu\/c2dh-en\/articles\/"},{"@type":"ListItem","position":4,"name":"Catching Speech in Arezzo: A Clarin workshop for developing a transcription-chain for Oral History"}]},{"@type":"WebSite","@id":"https:\/\/www.uni.lu\/c2dh-en\/#website","url":"https:\/\/www.uni.lu\/c2dh-en\/","name":"C2DH","description":"Luxembourg Centre for Contemporary and Digital History I Uni.lu","publisher":{"@id":"https:\/\/www.uni.lu\/c2dh-en\/#organization"},"alternateName":"Luxembourg Centre for Contemporary and Digital History I University of Luxembourg","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.uni.lu\/c2dh-en\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-GB"},{"@type":"Organization","@id":"https:\/\/www.uni.lu\/c2dh-en\/#organization","name":"C\u00b2DH - University of Luxembourg I Uni.lu","alternateName":"Luxembourg Centre for Contemporary and Digital History","url":"https:\/\/www.uni.lu\/c2dh-en\/","logo":{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/www.uni.lu\/c2dh-en\/#\/schema\/logo\/image\/","url":"https:\/\/www.uni.lu\/wp-content\/uploads\/sites\/7\/2026\/03\/03113407\/C2DH_SM-Profile_1600x1600px-scaled.jpg","contentUrl":"https:\/\/www.uni.lu\/wp-content\/uploads\/sites\/7\/2026\/03\/03113407\/C2DH_SM-Profile_1600x1600px-scaled.jpg","width":2560,"height":2560,"caption":"C\u00b2DH - University of Luxembourg I Uni.lu"},"image":{"@id":"https:\/\/www.uni.lu\/c2dh-en\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/c2dh.lu","http:\/\/www.instagram.com\/c2dh_lu","http:\/\/www.linkedin.com\/showcase\/c2dh-university-of-luxembourg"]}]}},"blog_id":7,"_links":{"self":[{"href":"https:\/\/www.uni.lu\/c2dh-en\/wp-json\/wp\/v2\/articles\/8033"}],"collection":[{"href":"https:\/\/www.uni.lu\/c2dh-en\/wp-json\/wp\/v2\/articles"}],"about":[{"href":"https:\/\/www.uni.lu\/c2dh-en\/wp-json\/wp\/v2\/types\/articles"}],"author":[{"embeddable":true,"href":"https:\/\/www.uni.lu\/c2dh-en\/wp-json\/wp\/v2\/users\/1"}],"version-history":[{"count":1,"href":"https:\/\/www.uni.lu\/c2dh-en\/wp-json\/wp\/v2\/articles\/8033\/revisions"}],"predecessor-version":[{"id":8204,"href":"https:\/\/www.uni.lu\/c2dh-en\/wp-json\/wp\/v2\/articles\/8033\/revisions\/8204"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.uni.lu\/c2dh-en\/wp-json\/wp\/v2\/media\/7935"}],"wp:attachment":[{"href":"https:\/\/www.uni.lu\/c2dh-en\/wp-json\/wp\/v2\/media?parent=8033"}],"wp:term":[{"taxonomy":"articles-category","embeddable":true,"href":"https:\/\/www.uni.lu\/c2dh-en\/wp-json\/wp\/v2\/articles-category?post=8033"},{"taxonomy":"articles-topic","embeddable":true,"href":"https:\/\/www.uni.lu\/c2dh-en\/wp-json\/wp\/v2\/articles-topic?post=8033"},{"taxonomy":"organisation","embeddable":true,"href":"https:\/\/www.uni.lu\/c2dh-en\/wp-json\/wp\/v2\/organisation?post=8033"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}