class: inverse, center, middle # Introduction to Fieldwork: From elicitation to ELAN <br> ## Session 3: Introduction to ELAN ## Naomi Peck ### Albert-Ludwigs-Universität Freiburg <br> 2022-02-12 (updated: 2022-02-12) <img src="freiburg-logo.png" height="125px"/> <!-- insert VJS logo too? figure this out --> --- class: middle, center # Why ELAN? --- # Why ELAN? ELAN allows researchers to directly work with audiovisual data. <br><br> -- ELAN supports multiple levels of annotation. <br><br> -- ELAN enables both qualitative and quantitative research. --- class: center, middle # ELAN Interface --- class: center, middle <div style="position: relative; padding-bottom: 56.25%; height: 0;"><iframe src="https://www.loom.com/embed/721ce9dba3fd4dc39beb81a31de1c986" frameborder="0" webkitallowfullscreen mozallowfullscreen allowfullscreen style="position: absolute; top: 0; left: 0; width: 100%; height: 100%;"></iframe></div> --- # Modes - Annotation Mode - Media Synchronization Mode - Segmentation Mode - Transcription Mode - Interlinearization Mode --- label: media-sync class: center, middle <div style="position: relative; padding-bottom: 56.25%; height: 0;"><iframe src="https://www.loom.com/embed/3856556266764e02aa3ab667d47431cc" frameborder="0" webkitallowfullscreen mozallowfullscreen allowfullscreen style="position: absolute; top: 0; left: 0; width: 100%; height: 100%;"></iframe></div> --- label: segmentation class: center, middle <div style="position: relative; padding-bottom: 56.25%; height: 0;"><iframe src="https://www.loom.com/embed/f42af26bdfe843ec9703890304c5ecd2" frameborder="0" webkitallowfullscreen mozallowfullscreen allowfullscreen style="position: absolute; top: 0; left: 0; width: 100%; height: 100%;"></iframe></div> --- label: transcription class: center, middle <div style="position: relative; padding-bottom: 56.25%; height: 0;"><iframe src="https://www.loom.com/embed/67726e1a877e441ab79e99b87b57324c" frameborder="0" webkitallowfullscreen mozallowfullscreen allowfullscreen style="position: absolute; top: 0; left: 0; width: 100%; height: 100%;"></iframe></div> --- label: interlinear class: center, middle <div style="position: relative; padding-bottom: 56.25%; height: 0;"><iframe src="https://www.loom.com/embed/1c5018f22f2f4a4b8c70c912b3a88789" frameborder="0" webkitallowfullscreen mozallowfullscreen allowfullscreen style="position: absolute; top: 0; left: 0; width: 100%; height: 100%;"></iframe></div> --- class: center, middle # Tiers and Types --- # Tiers and Types Tiers and types are possibly the most complicated part about learning to use ELAN. Having a good understanding of how these relate to each other will help you in your journey to master ELAN immensely. Simply put, (tier) types provide a template for what kind of annotations you want to make, and how annotations will relate to each other. Tiers are the “physical” places where these annotations are made. Always make sure to set up your tier types before creating your tiers. --- class: center, middle <div style="position: relative; padding-bottom: 56.25%; height: 0;"><iframe src="https://www.loom.com/embed/d4758e2d9c354385bea8a545c757e805" frameborder="0" webkitallowfullscreen mozallowfullscreen allowfullscreen style="position: absolute; top: 0; left: 0; width: 100%; height: 100%;"></iframe></div> --- # Tier Types All tier types must minimally have a name. If you want to create dependent tiers (“children”), then you must select a stereotype. These stereotypes differ primarily depending on whether the child will be time-aligned or symbolically-aligned. You can also limit what values annotations can have using a controlled vocabulary, and associate types with lexicon and data categories. --- # Which tier type do I need? Question 1: Are my annotations dependent or independent? -- Yes -> No tier type needed. -- No -> Q2. -- Annotations on an independent tier make *direct* reference to the audiovisual tracks. --- # Which tier type do I need? Question 2: Should my annotations be time-aligned or symbolically-aligned? --- # Time-aligned. -> Q3. -- Question 3: Do I want gaps in between my annotations? -- Yes -> Included In -- No -> Time Subdivision -- *Time-aligned* annotations are useful if you want to tie smaller annotations, e.g. words or IUs within utterances, to the audio track. Time-aligned annotations (including on independent tiers) are by default outlined in black. --- # Symbolically-aligned. -> Q4. -- Question 4: Should my annotations have a one-to-one or a one-to-many relation with the superordinate tier? -- One-to-one -> Symbolic Association -- One-to-many -> Symbolic Subdivision -- *Symbolic* associations are useful for secondary annotations. For example, Symbolic Association tiers are commonly used for part of speech glossing and Symbolic Subdivision tiers are commonly used for word or morpheme annotations. Symbolic annotations are outlined in default in yellow. --- class: middle, center # Creating Tiers --- class: middle, center <div style="position: relative; padding-bottom: 56.25%; height: 0;"><iframe src="https://www.loom.com/embed/8643ae2c99ee431b8a072244d229c74b" frameborder="0" webkitallowfullscreen mozallowfullscreen allowfullscreen style="position: absolute; top: 0; left: 0; width: 100%; height: 100%;"></iframe></div> --- # Tier Naming Coventions Each tier name should minimally consist of a code for the data type, a separator character (gen. @ or _), and a participant code. Different people and programs have a range of conventions about how naming should be done. Make sure to double-check if a certain format is required for programs and archives etc.. Minimally, names should make it obvious to anyone accessing the file which tier contains what type of data and who produced which data. .center[ ##datatype@participant ] --- # Further Tools ELAN is always under further development. As such, new functions are being added all the time, which you may or may not be aware of. .pull-left[ - Advanced search capabilities - constraints - full regex support - export search results - Multiple file search - multiple find and replace also available! - Inter-annotator reliability - Spell-checking (requires language specification of tiers) ] .pull-right[ - Multiple file processing - creating multiple new files from pre-existing transcriptions - batch cleaning, editing, exporting - statistics across multiple files - Third party programming packages - pympi - phonfieldwork ] --- # Further Sources ELAN Manual: https://archive.mpi.nl/tla/elan/documentation If you are having any issues, their support team is incredibly helpful: https://archive.mpi.nl/forums/c/elan Lots of people have created resources for you to learn how to use ELAN and a simple Google search will bring up many of them. Some I recommend are: Ulrike Mosel’s guide on regex searching: [Searches with Regular Expressions in ELAN corpora](https://www.isfas.uni-kiel.de/de/linguistik/mitarbeitende/prof.-dr.-ulrike-mosel/publications_mosel/elan-regular-expressions) Hedvig Skirgard’s guide for speeding up the transcription workflow: [My ELAN workflow for segmenting and transcription](http://humans-who-read-grammars.blogspot.com/2019/07/my-elan-workflow-for-segmenting-and.html) John Mansfield’s guide on the ELAN-FLEx workflow: [Elan - Flex workflow](http://langwidj.org/linguistics/elan-flex-workflow/) Amalia Skilton’s scripts and guides: [ELAN Scripts](https://blogs.cornell.edu/amaliaskilton/elan-scripts/) Eri Kashima and T. Mark Ellison’s Praat to ELAN workflow: [Elan/Praat Machine Segmenting](https://yammeringon.wordpress.com/2017/05/01/elanpraat-machine-segmenting/) --- class: inverse, center, middle # Short Break
05
:
00