-
-
Notifications
You must be signed in to change notification settings - Fork 654
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Make continuous reading work when using SAPI5 voices without bookmark support #17523
Make continuous reading work when using SAPI5 voices without bookmark support #17523
Conversation
@gexgd0419, |
See test results for failed build of commit 75c580bcf6 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice fix @gexgd0419 ! Can you please add a changelog entry
Just found that even when SAPI5 framework reuses the previous stream number,
My code assumes that when |
We would prefer this be handled for all synthesisers in the speech framework. |
What does this mean? This fix affects all SAPI5 and MSSP synthesizers, even those that support bookmark events. But synthesizers that support bookmarks will have "consumed" all stored bookmarks before the end of stream is reached. If you mean SAPI4 synthesizers, I don't know much about that framework. If SAPI4 has some way to tell the client that it has done speaking, then similar approach may be possible. But in the current SAPI4 implementation, |
After some studying, I think that applying similar fix to SAPI4 synthesizers is possible. We will need to introduce another sink class which implements As for the other two speech frameworks that NVDA supports: eSpeak and OneCore, I don't think that they need to be fixed, as currently there's no third-party OneCore voice that I know of. Without a unified audio processing system, this fix can only be applied per framework, and only if the framework supports end of audio notifications. |
Hi @SaschaCowley @seanbudd , what should I do here next? If the SAPI4-related code is deprecated and will be removed in the future, should I remove the part for SAPI4? Also if #17592 is merged, this may need some tweaking. |
@michaelDCurran could you please provide your thoughts here? |
@gexgd0419 On second thought, I think just focusing specifically on SAPI5 is fine here. Don't worry about making it more generic. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good to me, please just add type hints
Can you resolve the merge conflicts from #17592 ? |
Link to issue number:
Fixes #16691
Summary of the issue:
Some SAPI5 synthesizers did not work correctly with continuous reading, stopping speech at the end of a sentence, because they don't properly support bookmark events.
Description of user facing changes
Some of the problematic SAPI5 synthesizers will be able to work correctly with continuous reading.
Description of development approach
NVDA relies on bookmark events, which have to be implemented by the TTS engine itself. But no matter what events the engine supports, StartStream and EndStream events are always generated by the SAPI5 framework.
This fix does the following.
SAPI5 fix
When requested to speak a new sequence, store the bookmarks in a list (deque), and put the bookmark list into a dict
_streamBookmarksNew
with the stream number as its key. This dict stores bookmark lists before the stream actually starts, because the previous stream (which may use the same stream number) may not have been ended whenself.tts.Speak
returns.When
StartStream
event is raised, move the bookmark list of this stream from_streamBookmarksNew
to_streamBookmarks
, which stores bookmark lists of currently speaking streams.When
EndStream
event is raised, bookmarks stored for the current stream will be triggered (throughsynthIndexReached
) beforesynthDoneSpeaking
. This way NVDA will continue to the next line instead of waiting indefinitely.As most synthesizers do support bookmarks, to prevent triggering the same bookmark twice, the bookmark stored in the list will be removed when the
Bookmark
event for it is raised.Thread-safety:
self.tts.Speak
and TTS events are handled in the same thread, so there should be no multithread-related problems.SAPI4 fix
Similar to the SAPI5 fix, which stores bookmarks and replay them when
AudioStop
is raised.However, SAPI4 does not provide "stream numbers" to distinguish each request, yet SAPI4 supports queueing speech requests. The fix assumes that speech requests are always processed in a first-in, first-out manner, and uses a deque
_bookmarkLists
to store bookmark lists from different requests.When
cancel
is called, the fix assumes that all queued speech requests are cancelled immediately, and clears the bookmark lists.Testing strategy:
Tested with the v0.1 version of NaturalVoiceSAPIAdapter. I don't have access to those proprietary voices, so further testing is needed.
Known issues with pull request:
None yet
Code Review Checklist:
@coderabbitai summary