From 465b0a6af2a7ba976a1bcb19f7d9363f201bf455 Mon Sep 17 00:00:00 2001 From: Sacha Chua Date: Sun, 23 Oct 2022 10:22:50 -0400 Subject: Volunteer update --- 2022/organizers-notebook.md | 114 ++++++++++++++++++++++++------------- 2022/organizers-notebook/index.org | 87 ++++++++++++++++++---------- captioning.md | 15 +++-- 3 files changed, 142 insertions(+), 74 deletions(-) diff --git a/2022/organizers-notebook.md b/2022/organizers-notebook.md index 213baf9e..49e6da3d 100644 --- a/2022/organizers-notebook.md +++ b/2022/organizers-notebook.md @@ -23,7 +23,6 @@ This file is automatically exported from [/2022/organizers-notebook/index.org](/ Time-sensitive tasks: - [Send schedule-published email for emacsconf-discuss](#announce-program):needsowner:timesensitive:email: -- [Write volunteer update 2022-10-23](#volunteer-2022-10-23):update: Other projects: @@ -370,7 +369,6 @@ interests you! Ordered chronologically (and therefore by importance). - [Send schedule-published email for emacsconf-discuss](#announce-program):needsowner:timesensitive:email: -- [Write volunteer update 2022-10-23](#volunteer-2022-10-23):update: @@ -422,31 +420,6 @@ emacsconf-discuss list, so as to help avoid generating extra off-topic chatter in the other lists cc’d in this message; thank you. - - -## TODO Write volunteer update 2022-10-23 :update: - -- set up web-based upload, nudging speakers -- Backstage area now open with three talks, info sent to speakers and captioning volunteers, jai sent in the first edited captions -- created BBB rooms and added them to conf.org -- dto signed up for shifts -- playbook drafts - - - - - - - - - - -- tested streaming to gen and dev streams, viewing from watch pages - -requests: - -- html/css/js for watch pages - -next week: - -- caption workflow - - # Projects and other long-running tasks @@ -676,15 +649,12 @@ capitalization and punctuation, but it produces longer captions OpenAPI captions so that people can decide what they like. -### TODO Investigate more granular timestamps for the output from OpenAPI Whisper - - - - -### TODO Compare large, medium, and small models +### DONE Compare large, medium, and small models 12 threads +Original file: 21:16 21 minutes + @@ -692,29 +662,54 @@ OpenAPI captions so that people can decide what they like. + ++ + - - + + + + + + + + + + + + - + - + + - + + +
Large  HoursMultNotes
Large2:498 
MediumMedium 2:03Shorter subtitles5.9 
SmallSmall 0:402More run-on sentences
-Large and medium might do better on a system with a GPU +Large and medium might do better on a system with a GPU. I’ll default to the small model for now. + + +### DONE Commit subed-tsv so that people can try a cleaner output + + +### TODO Investigate more granular timestamps for the output from OpenAPI Whisper + + ### DONE Upload srv2 from YouTube for word-level @@ -2338,7 +2333,7 @@ Probably focus on grabbing the audio first and seeing what’s worth keeping Make a table of the form - +
@@ -3474,6 +3469,45 @@ should the captions be outside the wiki? - SIL fonts choice + + +## DONE Write volunteer update 2022-10-23 :update: + +Hello, folks! Here’s the weekly update on what’s happening backstage +for EmacsConf 2022 in case you notice something that you want to help +out with. =) + +- We’ve e-mailed the speakers instructions for uploading their files through either a web browser or an FTP client, and three speakers have already done so! Those talks are now available in the backstage area (), along with the first set of edited captions (thanks Jai Vetrivelan!). If you don’t have the username and password for the backstage area and you would like to access it, please e-mail me and I’ll send you the details. +- We’ve created a BBB room for each speaker’s live Q&A session. The URLs are in conf.org in the private repository if you need them. +- We’ve drafted some documentation for different volunteer roles. If you’d like to volunteer as a captioner, check-in person (hmm, reception?), Etherpad scribe, IRC monitor, or host, please check out the appropriate link and let me know if I need to add anything to the docs: + - + - + - + - + - +- Thanks to David O’Toole for signing up for some IRC shifts! If you would like to volunteer for a shift, check out . +- We’ve updated our streaming configuration for the General and Development tracks, and have started testing them using mpv and the watch pages. Videos aren’t currently streaming, but you can check out the layout of the watch pages at: + + - + - + - + - + + These pages could probably be a lot prettier and easier to use. If you have some ideas for improving them or if you’d like to work on the HTML/CSS/JS, we’d love your help! +- There are now Q&A waiting rooms with friendly URLs so that it’s easier for people to join the live Q&A when the host decides it’s okay to let everyone in. They’re linked on the watch pages (along with the pads) and they’ll be linked from the talk pages once we’re ready to share them. +- zaeph has been busy tweaking the ffmpeg workflow for reencoding and normalizing videos. Thanks to Ry P. for sharing the res.emacsconf.org server with us - we’ve been using it for all the processing that our laptops can’t handle. +- We experimented with using the OpenAI Whisper speech-to-text toolkit to create the auto-generated captions that captioning volunteers can edit. Looks promising! If you’d like to compare the performance between small, medium, and large models, you can look at the VTT files for the sqlite talk in the backstage area. I’ve also added support for tab-separated values (like Audacity label exports) and a subed-convert command to subed.el, which might give us a more concise format to work with. I’ll work on getting word-level timing data so that our captioning workflow can be even easier. + +Next week, we hope to: + +- improve the prerec and captioning workflows +- get more captions underway + +Lots of good stuff happening! + +Sacha Chua + + # Communications diff --git a/2022/organizers-notebook/index.org b/2022/organizers-notebook/index.org index 213c4f2b..e722c41f 100644 --- a/2022/organizers-notebook/index.org +++ b/2022/organizers-notebook/index.org @@ -163,28 +163,6 @@ P.S. please direct all replies to this post either to myself or to the emacsconf-discuss list, so as to help avoid generating extra off-topic chatter in the other lists cc'd in this message; thank you. -** TODO Write volunteer update 2022-10-23 :update: -:PROPERTIES: -:CUSTOM_ID: volunteer-2022-10-23 -:END: - -- set up web-based upload, nudging speakers -- Backstage area now open with three talks, info sent to speakers and captioning volunteers, jai sent in the first edited captions -- created BBB rooms and added them to conf.org -- dto signed up for shifts -- playbook drafts - - https://emacsconf.org/2022/volunteer/irc - - https://emacsconf.org/2022/volunteer/caption - - https://emacsconf.org/2022/volunteer/pad - - https://emacsconf.org/2022/volunteer/checkin - - https://emacsconf.org/2022/volunteer/host -- tested streaming to gen and dev streams, viewing from watch pages - -requests: -- html/css/js for watch pages -next week: -- caption workflow - * Projects and other long-running tasks :PROPERTIES: :CUSTOM_ID: projects @@ -367,17 +345,29 @@ It looks like OpenAPI needs a little less editing in terms of capitalization and punctuation, but it produces longer captions (likely a 30-second sliding window). I'll try to upload both YT and OpenAPI captions so that people can decide what they like. -*** TODO Investigate more granular timestamps for the output from OpenAPI Whisper -https://stackoverflow.com/questions/73822353/how-can-i-get-word-level-timestamps-in-openais-whisper-asr -*** TODO Compare large, medium, and small models +*** DONE Compare large, medium, and small models +CLOSED: [2022-10-23 Sun 08:32] 12 threads -| Large | | -| Medium | 2:03 | Shorter subtitles -| Small | 0:40 | +Original file: 21:16 21 minutes +| | Hours | Mult | Notes | +| [[https://media.emacsconf.org/2022/backstage/emacsconf-2022-sqlite--using-sqlite-as-a-data-source-a-framework-and-an-example--andrew-hyatt--large.vtt][Large]] | 2:49 | 8 | | +| [[https://media.emacsconf.org/2022/backstage/emacsconf-2022-sqlite--using-sqlite-as-a-data-source-a-framework-and-an-example--andrew-hyatt--medium.vtt][Medium]] | 2:03 | 5.9 | | +| [[https://media.emacsconf.org/2022/backstage/emacsconf-2022-sqlite--using-sqlite-as-a-data-source-a-framework-and-an-example--andrew-hyatt--small.vtt][Small]] | 0:40 | 2 | More run-on sentences | -Large and medium might do better on a system with a GPU +Large and medium might do better on a system with a GPU. I'll default to the small model for now. +*** DONE Commit subed-tsv so that people can try a cleaner output +CLOSED: [2022-10-23 Sun 09:59] +:PROPERTIES: +:Effort: 1:00 +:QUANTIFIED: Emacs +:END: +:LOGBOOK: +CLOCK: [2022-10-23 Sun 08:32]--[2022-10-23 Sun 09:59] => 1:27 +:END: +*** TODO Investigate more granular timestamps for the output from OpenAPI Whisper +https://stackoverflow.com/questions/73822353/how-can-i-get-word-level-timestamps-in-openais-whisper-asr *** DONE Upload srv2 from YouTube for word-level CLOSED: [2022-10-22 Sat 23:16] :PROPERTIES: @@ -2876,6 +2866,45 @@ CLOSED: [2022-10-22 Sat 09:27] - org-reveal config - SIL fonts choice +** DONE Write volunteer update 2022-10-23 :update: +CLOSED: [2022-10-23 Sun 10:22] +:PROPERTIES: +:CUSTOM_ID: volunteer-2022-10-23 +:TO: emacsconf-org@gnu.org +:END: + +Hello, folks! Here's the weekly update on what's happening backstage +for EmacsConf 2022 in case you notice something that you want to help +out with. =) + +- We've e-mailed the speakers instructions for uploading their files through either a web browser or an FTP client, and three speakers have already done so! Those talks are now available in the backstage area (https://media.emacsconf.org/2022/backstage/), along with the first set of edited captions (thanks Jai Vetrivelan!). If you don't have the username and password for the backstage area and you would like to access it, please e-mail me and I'll send you the details. +- We've created a BBB room for each speaker's live Q&A session. The URLs are in conf.org in the private repository if you need them. +- We've drafted some documentation for different volunteer roles. If you'd like to volunteer as a captioner, check-in person (hmm, reception?), Etherpad scribe, IRC monitor, or host, please check out the appropriate link and let me know if I need to add anything to the docs: + - https://emacsconf.org/2022/volunteer/caption + - https://emacsconf.org/2022/volunteer/irc + - https://emacsconf.org/2022/volunteer/pad + - https://emacsconf.org/2022/volunteer/checkin + - https://emacsconf.org/2022/volunteer/host +- Thanks to David O'Toole for signing up for some IRC shifts! If you would like to volunteer for a shift, check out https://emacsconf.org/2022/organizers-notebook/#shifts . +- We've updated our streaming configuration for the General and Development tracks, and have started testing them using mpv and the watch pages. Videos aren't currently streaming, but you can check out the layout of the watch pages at: + - https://emacsconf.org/2022/watch/gen/ + - https://live.emacsconf.org/2022/watch/gen/ + - https://emacsconf.org/2022/watch/dev/ + - https://live.emacsconf.org/2022/watch/dev/ + These pages could probably be a lot prettier and easier to use. If you have some ideas for improving them or if you'd like to work on the HTML/CSS/JS, we'd love your help! +- There are now Q&A waiting rooms with friendly URLs so that it's easier for people to join the live Q&A when the host decides it's okay to let everyone in. They're linked on the watch pages (along with the pads) and they'll be linked from the talk pages once we're ready to share them. +- zaeph has been busy tweaking the ffmpeg workflow for reencoding and normalizing videos. Thanks to Ry P. for sharing the res.emacsconf.org server with us - we've been using it for all the processing that our laptops can't handle. +- We experimented with using the OpenAI Whisper speech-to-text toolkit to create the auto-generated captions that captioning volunteers can edit. Looks promising! If you'd like to compare the performance between small, medium, and large models, you can look at the VTT files for the sqlite talk in the backstage area. I've also added support for tab-separated values (like Audacity label exports) and a subed-convert command to subed.el, which might give us a more concise format to work with. I'll work on getting word-level timing data so that our captioning workflow can be even easier. + +Next week, we hope to: + +- improve the prerec and captioning workflows +- get more captions underway + +Lots of good stuff happening! + +Sacha Chua + * Communications :PROPERTIES: :CUSTOM_ID: comms diff --git a/captioning.md b/captioning.md index 011fce70..9075dfa7 100644 --- a/captioning.md +++ b/captioning.md @@ -14,11 +14,16 @@ as VTT or SRT in Emacs, often starting with autogenerated captions from YouTube (the .vtt or .srt file), but you're welcome to make captions using your favourite tool. -We'll be posting VTT files so that they can be included by the HTML5 video -player (demo: ), so if you use a -different tool that produces another format, any format that can be -converted into that one (like SRT or ASS) is fine. You can e-mail me the -subtitles when you're done, and then I can merge it into the video. +We'll be posting VTT files so that they can be included by the HTML5 +video player (demo: ), so if +you use a different tool that produces another format, any format that +can be converted into that one (like SRT or ASS) is fine. The latest +version of `subed` has a `subed-convert` command that might be useful +for turning WebVTT files into tab-separated values (TSV) and back +again, if you prefer a more concise format. + +You can e-mail me the subtitles when you're done, and then I can merge +it into the video. # Formatting tips -- cgit v1.2.3