I'd suggest that providing 'craptions' (as they're known in the deaf community, for reasons that should be readily apparent) goes counter to the objective of World Hearing Day, unless your message is that they're unfit for purpose[1] and people shouldn't consider them as much more than an aid to search engines and translators.
For realtime subtitling, an NRCPD registered speech-to-text reporter is the gold standard. Barakta can elaborate, as this is her preferred means of accessing conferences, formal meetings, etc. Unfortunately, it's such a specialised skill that it makes BSL interpreters look cheap.
If you want fit-for-purpose subtitles without costing the earth, you have to sacrifice the realtime requirement and either stick to a script (acting skills required) or subtitle a recording. Using craptions to do the bulk of the work (particularly the timing) and then correcting them is usually much quicker and easier than creating subtitle data from scratch, which is what they're actually brilliant for.
[1] while
the
accessory
is
usually
pretty
good
at
least
when
the
audio
is
clean
they
still
don't
do
puncture
nation
including
indicating
who
is
speaking
and
the
word
by
word
rendering
makes
them
hard
work
to
follow
especially
if
your
trying
to
follow
the
subject
matter
at
the
same
thyme