16 KiB
MUXSA HOWTO
This is the step-by-step documentation how to use muxsa. For an overview on muxsa please watch the Demonstration video (a.k.a. the making of Network Security 2020) and keep the muxsa block diagram in mind.
The author uses muxsa with Debian Linux 11 ("bullseye"). The muxsa scripts and the other software packages used are not very specific to this Linux distribution, so muxsa should work with other distros as well. However, some adjustments to path names or package names might be neccessary.
Install dependencies on Linux
Install the prerequisites (here: on Debian Linux 11) with:
apt-get install libvirt-clients spice-client-gtk imagemagick xdotool netpbm coreutils perl audacity ffmpeg
Create the PNG files from your presentation slides
Install a Virtual Machine with Windows 10 and Powerpoint
The author uses muxsa on a computer with 8 CPU cores and 24 GiB RAM that runs Debian Linux 11 (amd64). In virt-manager, create a virtual machine with 4 CPU cores, 16 GiB RAM, 80 GiB harddisk, display: spice, video: QXL, channel: spice.
Install Windows 10 Enterprise 64bit. Install the spice-guest-tools for Windows from spice-space.org.
Use spicy to access the virtual desktop of your virtual machine. For example, if the name of your virtual machine is win10-office, use
spicy --uri="$(virsh -c qemu:///system "domdisplay win10-office")"
to connect. Make sure that resizing the spicy window will resize the virtual Windows desktop (in spicy: Options/Resize guest to match window size).
Install MS Office / Powerpoint.
Create your Powerpoint presentation
Avoid slide transitions and other animation effects that take longer
than three seconds to complete after hitting PageDown. If your existing
presentation makes use of longer effects, you will have to adjust
the MUXSA_KVM2PNG_SLEEP
parameter, see below.
Create screenshots (PNG files) of your presentation
The muxsa-kvm2png
script will take screenshots, save them into
numbered png files, and press PageDown, until the end of the presentation
is reached.
You might want to create a configuration file:
user@linux:~$ vi ~/.muxsarc
MUXSA_KVM2PNG_VM_NAME="win10-office"
MUXSA_KVM2PNG_EXTRA_Y="26"
MUXSA_KVM2PNG_BUGFIX_SHOOT_TWICE="1"
The first line gives the name of the virtual machine we had created above.
Due to the menu bar in the spicy window, the window has to be resized to
a size slightly larger than the desired output video size. A value of
MUXSA_KVM2PNG_EXTRA_Y="26"
works for me, but with different
window managers and font selections you might need a different value.
If the value is wrong, the muxsa-kvm2png
script will produce a warning.
I have observed that with some versions of libvirt, in some circumstances
the screenshot was garbled. Always taking two screenshots in a row and
using only the second one has worked for me. This can be enabled with
MUXSA_KVM2PNG_BUGFIX_SHOOT_TWICE="1"
.
The script detects the end of the presentation by computing the MD5 digest
over each screenshot. If the script does not stop at the end of the
presenation, but takes screenshots of that last "end of presentation screen"
slide again and again, you will have to compute the MD5 digest value
of that screenshot, e.g., with md5sum 0042.png
and set
MUXSA_KVM2PNG_MD5_LAST_SLIDE
accordingly.
There are further configuration variables that can be set, either in
$HOME/.muxsarc
or by export MUXSA_KVM2PNG_BUGFIX_SHOOT_TWICE="1"
on the command line; see the script for a full list.
If everything works as expected, the script will stop at the end of the
presentation and you will find files 0000.png
, 0001.png
,
0002.png
, 0003.png
, ...
in your current directory.
Create "presenter view" PNG files
Go to the directory with the screenshot PNG files from the previous step
and start muxsa-pv
.
It will create a subdirectory presenter-view
and create a files with
the same name as in the parent directory, as a "presenter view" similar to
powerpoint, i.e. the current slide on the left, the next slide as a preview on
the right.
Record background narration
Hardware
Note well: For the sake of completeness, the author has decided to document the audio hardware used in his home office recording studio. The author has bought this hardware from his own personal money (a similar but different set of hardware has been provided by his employer for use in the office). The author did not receive any free products, compensation, or other incentives from the vendors. The devices were chosen in the summer of 2020, based on expected quality, perceived value-for-money, and availability, when many products were not available or had very long lead times due to the stalled supply chains. Lots of new products have been put on the market since then. The author is not an expert in this field of technology. While the author is pleased with his setup, this is NOT a purchasing recommendation. There are many excellent youtube channels that feature product reviews and comparisons, e.g., Podcastage on youtube, TomBuck on youtube, and JulianKrause on youtube.
The author uses this audio gear:
- Microphone: Rode Procaster
- Wind screen: Rode WS2
- Boom arm and shock mount: Rode PSA1 and Rode PSM1
- Microphone cable: XLR cable male to female, 3 meters
- XLR/USB audio interface: Motu M2
- Headphones: Beyerdynamic DT 770 Pro (32 Ohm)
Connect the microphone to the left XLR socket "IN 1L" of the Motu M2. The Rode Procaster is a dynamic microphone that does not need phantom power. If the red "48 V" button on the Motu M2 is illuminated, push that button to turn phantom power off.
Recording took place in a rather small room with carpet floor and lots of shelves and other furniture in it, i.e., no large flat surfaces except for the ceiling. The author has performed some experiments, putting moving blankets in front of walls and pillows on the desk in order to absorb some sound reflections, but the audible difference was rather limited.
Software drivers
Debian Linux 11 normally uses pulseaudio
to route different audio streams
from microphones to applications and from applications to headsets or
speakers, respecively, as well as for mixing streams and resampling
different sample rates. While these functions are useful for desktop usage,
they add another layer of indirection, which can make our setup more
unpredictable. Therefore, the author has decided to exclude the audio
interface from pulseaudio.
Find the vendor and device ID:
user@linux:~$ lsusb | grep -i unicorn
Bus 001 Device 003: ID 07fd:0008 Mark of the Unicorn
Exclude it from pulseaudio:
root@linux:~# cat > /etc/udev/rules.d/89-pulseaudio-usb.rules
# MOTU M2 USB sound interface: do not show in pulse, direct alsa access
ATTRS{idVendor}=="07fd", ATTRS{idProduct}=="0008", ENV{PULSE_IGNORE}="1"
^D
root@linux:~# shutdown -r now
Install the JACK Audio Connection Kit sound server
root@linux:~# apt-get install jackd jackd1
Start jackd
user@linux:~$ jackd -r -dalsa -dhw:M2 -r48000 -p64 -n3
Test recording
Now that jackd is running, start Audacity.
user@linux:~$ audacity
In the pulldown menu below the "pause" button select "JACK Audio Connection Kit". In the two pulldown menus right to the microphone symbol, select "system" and "1 (Mono) Recording Channel", respectively. In the pulldown menu right to the speaker symbol, select "system".
At the left bottom of the window, set the "Project Rate (Hz)" to 48000.
The next settings depend on your voice and your preferences, but you may use these as a starting point for your own experiments. At the Motu M2 interface set
- at the "IN 1L" side, where the microphone is plugged in
- GAIN to about "3 o'clock position"
- Phantom power off (the red "48V" LED does not light up)
- Direct monitoring on (the blue "MON" LED is lit)
- at the "IN 2R" side, where no microphone is connected to
- GAIN to minimum, i.e., about "7 o'clock position"
- Phantom power off (the red "48V" LED does not light up)
- Direct monitoring off (the blue "MON" LED does not light up)
- Monitor to minimum, i.e., big knob to about "7 o'clock position"
- Headphone volume (small knob at the very right) to about "3 o'clock position"
Adjust the boom arm such that the tip of the microphone points from the side at a 45 degree angle to one corner of your mouth and that the microphone is very close to your mouth, about 50 mm (2 inches) away from your face.
Avoid talking straight at a 0 degree angle into the microphone, as the wind gusts caused when you speak so-called plosives (i.e., words with the letter "p" etc.) may cause unpleasant sounds in the recording. Use your headphones with the direct monitoring feature while you are speaking to identify such unpleasant sounds and learn how to avoid them.
In Audacity, hit the record button and speak some test sentences. Hit stop when you are done. Hit the play button to listen to your recording. While listening to the recording and in particular while editing the recording (see below), it is wise to turn off the direct monitoring feature (blue-lit "MON" button on the Motu M2), to make sure that any sound picked up by the microphone (e.g., keyboard klicks) at that time is not mixed with the playback.
While you are recording or while you are listening to the playback, inspect the volume meter. While you are speaking, the volume bars should go most of the time up to -12 dB, sometimes up to -6 dB when you are exited and loud, and never over -3 dB, even if you clap your hands or do other very loud stuff. Depending on your voice characteristics and the microphone you are using, you might need to adjust the input GAIN knob accordingly.
Record your audio
The core idea of muxsa, i.e, using Audacity's label track to control the slide transitions, is presented in the Demonstration video.
Enable Audacity's scripting module
As a preparation for the next step, in Audacity go to: Edit / Preferences ... / Modules and set "mod-script-pipe" to "enabled". Restart Audacity. Now you should find two named pipes:
/tmp/audacity_script_pipe.from.1000
/tmp/audacity_script_pipe.to.1000
(1000 is the author's user ID, yours might be different, see id -u).
More efficient recording with muxsa-vr
You can use a "wireless presenter", i.e., a small handheld transmitter,
usually with a laser pointer, that is often used for giving presentations.
muxsa-vr
receives the key strokes from that wireless presenter and sends
control commands simultaneously both to Audacity and geeqie, in order to
make recording, playback for review, deleting and re-recording, setting
labels with ascending slide numbers, switching to the next slide, etc.
more convenient.
Start audacity
as described above.
Start the geeqie
image viewer in the directory with the
"presenter view" PNG files and go to the first slide.
Start a console/xterm window with a shell and start muxsa-vr
there.
Keep the mouse/keyboard focus in that window. Press the "i" key on
the computer keyboard to initialize muxsa-vr. Then,
- pressing the "PageDown" key (usually the big right button) on the presenter starts recording a clip, if idle, or stops recording while in recording state
- pressing the "PageUp" key (usually the big left button) plays the most recently recorded clip
- pressing the "F5/ESC" key (usually the small left button), deletes the most recently recorded clip - if you made an error and want to record it again.
- pressing the "dot" key (usually the small right button) moves on to the next slide.
- Further keys may be hit on the computer's main keyboard to control muxsa-vr, press the "h" key for online help.
Depending on the specific hardware of your wireless presenter you might need to make changes to muxsa-vr, which is a shell script.
Audio post-processing
Improving audio quality with effects or filters is a science of its own and it depends on your voice characteristics, the microphone you are using and to some extent also personal taste. Lots of good tutorials can be found on youtube. The author uses these Audacity (version 2.4.2) effects in that specific order; you can use that as a starting point for your own experiments:
1. Noise Reduction
Record some seconds and try to be as silent as possible. Mark this part of the recording, click Effect / Noise Reduction / Step 1 / Get Noise Profile. Then mark the whole track (Ctrl+A), and click Effect / Noise Reduction / Step 2 / OK.
2. High-Pass filter
Mark the whole track (Ctrl+A), and click Effect / Plugin 1 to 15 / High Pass Filter. Set "Frequency (Hz)" to "80" and "Roll-off (dB per octave)" to "6 dB", click OK.
3. Compressor
Mark the whole track (Ctrl+A), and click Effect / Compressor. Set
- Threshold: -12 dB
- Noise Floor: -50 dB
- Ratio: 3:1
- Attack Time: 0.1 secs
- Release Time: 1.0 secs
- Check (enable) Make-up gain for 0 dB after compressing
- Check (enable) Compress based on Peaks
Click OK. Then, click Effect / Amplify. Set "Amplification (dB)" to "-1.5" and click OK.
Reducing the peak amplitude leaves some headroom for further processing, e.g., Equalizers (Effect / Graphic EQ), but the author normally does not use an equalizer with his voice and this microphone.
Further audio editing
If desired, listen to the whole recording again and cut out unwanted noise or silence, see the demo video. As shown in the video, make sure to check (enable) Tracks / Sync-Lock Tracks.
You can also mark noise and click "Generate/Silence" to mute that noise without changing the length of the track.
Heavy breathing should not be muted completely as this sounds strange, but
attenuated, maybe by 12 dB (Effect / Amplify / Amplification: -12 dB).
If you need this attenuation by 12 dB often, assign a hot key:
user@linux:~$ cat > $HOME/.audacity-data/Plug-Ins/attenuate12db.ny
;nyquist plug-in
;version 4
;type process
;name "Attenuate by -12 dB"
;action "Attenuate by -12 dB"
;author "Sebastian Kiesel"
;copyright "Released under terms of the GNU General Public License version 2"
(mult *track* (db-to-linear -12.0))
^D
In Audacity:
- Tools / Add|Remove Plug-Ins -> enable that plugin.
- Edit / Preferences / Keyboard -> assign hotkey (e.g., PGDOWN)
There should be enough pauses before and after slide transitions. As a rule of thumb, use 3 seconds of silence before and after a slide transition. This sounds like eternity while you are working with Audacity but is usually not too long when the video is assembled.
Export audio and label track.
As shown in the demo video, export the audio as soundtrack.m4a
in the
m4a file format and the label track as LabelTrack.txt
.
Save the Audacity project as NarrationForXYZPresentation.aup
for further editing.
Assemble the video
Convert the Audacity Label Track you have saved before to an ffmpeg concat multiplexer control file by using:
user@linux:~$ muxsa-al2fc LabelTrack.txt > slides.concat
Assuming that the PNG files created above, the soundtrack.m4a
and the slides.concat
file are all in the current directory,
use
user@linux:~$ muxsa-pngaac2mp4
to assemble the final video named out.mp4
. A different output
filename can be given as parameter to muxsa-pngaac2mp4
.
Watch the video
You may use any video player to watch the mp4 video, e.g., on Linux use
vlc
or mplayer
.