@vtkeithg, for the video player, does the filename of the captions need to match the mp4 exactly? (If we remove spaces in filenames to run the process, do we need to add the spaces back in the vtt filename or can we leave it as-is?)
As it happens, it doesn't need to match the name of the mp4 at all. The mp4 can have spaces and the vtt not, or the reverse, or both names can be arbitrary as long as they have the proper extensions. Relevant code is at the bottom of this post.
I did this because I wanted captions to work whenever possible, but keep in mind that differing filenames could be problematic in the future. When this video playback was put together around 2012 for Event Capture, an assumption was made that only one video would be provided for each item. As you know, this was a poor assumption; VTechWorks has had several item submissions where it's desirable to have multiple videos in the same item. I'm not sure, but that might even be true of Event Capture (or whatever the successor is) events.
When we rethink and modify how video works in VTechWorks, if we make changes so that playback works with more than one video per item, we'll also need a way of matching specific captions in an item with specific videos, and the easiest way is with filename.
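To make the filename-matching idea concrete, here is a minimal sketch of pairing captions with videos inside one item. It is not the VTechWorks code; the function names and the rule of ignoring whitespace and letter case when comparing names are assumptions for illustration, chosen so that a vtt whose spaces were removed during processing still pairs with the original mp4.

```python
from pathlib import PurePosixPath

VIDEO_EXT = ".mp4"
CAPTION_EXT = ".vtt"


def normalize_stem(filename: str) -> str:
    """Comparison key: name without its extension, whitespace removed, lowercased."""
    stem = PurePosixPath(filename).stem
    return "".join(stem.split()).lower()


def match_captions(filenames):
    """Map each video filename to its caption filename, or None if none matches.

    Hypothetical rule: a caption belongs to a video when their normalized
    stems are equal.
    """
    captions = {
        normalize_stem(name): name
        for name in filenames
        if name.lower().endswith(CAPTION_EXT)
    }
    return {
        name: captions.get(normalize_stem(name))
        for name in filenames
        if name.lower().endswith(VIDEO_EXT)
    }


if __name__ == "__main__":
    item_files = ["Opening Remarks.mp4", "OpeningRemarks.vtt", "Panel 2.mp4"]
    for video, caption in match_captions(item_files).items():
        print(video, "->", caption)
```

With a rule like this, leaving the spaces out of the vtt filename would still work after a move to multiple videos per item. Stricter matching (exact stems only) would be simpler but would require adding the spaces back.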
I'm not sure how to solve the spaces in the filename problem for this project yet. S3 supports spaces in object keys, but I was having difficulty with them. It's probably a very easy fix, but I don't know what it is yet!
Thanks, @alaw. This was helpful. I've corrected two issues: one in a command that deletes files from S3, which didn't handle spaces properly, and another that tried to start Transcribe jobs with spaces in the job name.
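For the job-name half of that fix, one possible approach is to derive the job name from the filename by replacing any character the service rejects. This is a sketch, not the code in this repository. It assumes the Amazon Transcribe job name constraint is the pattern `^[0-9a-zA-Z._-]+` with a 200-character limit, which is worth checking against the current API reference, and `transcribe_job_name` is a hypothetical helper.

```python
import re

# Assumed constraint on Amazon Transcribe job names (verify against the API docs):
# only letters, digits, '.', '_' and '-', at most 200 characters.
_INVALID_CHARS = re.compile(r"[^0-9a-zA-Z._-]+")
MAX_JOB_NAME_LENGTH = 200


def transcribe_job_name(filename: str) -> str:
    """Build a job name from a filename, replacing each run of invalid characters with '_'."""
    if not filename:
        raise ValueError("filename must not be empty")
    name = _INVALID_CHARS.sub("_", filename)
    return name[:MAX_JOB_NAME_LENGTH]


if __name__ == "__main__":
    print(transcribe_job_name("Opening Remarks 2019.mp4"))
```

Note that two different filenames can map to the same job name this way (for example `a b.mp4` and `a_b.mp4`), so a real implementation might append a timestamp or hash to keep job names unique.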
One thought: if you've already tried the simpler instructions, you may need to run a command to pull the updated image, which works with spaces in the filenames.
I added those instructions in the README as a placeholder. If you don't mind, I'd like to leave this issue open as a reminder to update our documentation for this.
This morning I've updated the instructions in the README.md in the simpler_instructions branch to handle pulling the image if it is outdated, and I'll close this fixed issue about the spaces in filenames now. With the new branch, the command in the run step will always check with the registry first to see if a new image is available; if one is available, it will be pulled. If you already have an up-to-date image on your system, it will just use that.
This step doesn't apply to the original set of instructions because the image was built locally instead of being retrieved from the registry - but I suspect the simpler_instructions will ultimately be easier anyway, and I've submitted a merge request for the simpler instructions.