[Feature] Add Voice API support #153

ProgramComputer · 2023-10-04T09:48:46Z

As implied in #143, there are multiple TTS services that LWT can add functionality for especially the common ones; however, the varied authentication required for most can lead to bloat not to mention your desired TTS service not being present.

With this pull request, a generality of TTS-calls is used for customizability on the user-end whether Azure, Google Cloud, ... * as long as the response contains a data: URI.

A key note is that prior authorization is up to the user to configure. The user will have to provide it in the fetch JSON.

Input is the resource itself to fetch including any URL params. Options are the modifiers supplied to the call.

A placeholder of 'lwt_text' is required in the below object.
lwt_lang is an optional placeholder for the lang code.

Leave the textarea empty to disable otherwise default browser SpeechAPI won't run.
The format is below.

{
"input":
,
"options":
}

steps to test

In text-to-speech settings add the request in JSON format to the Voice API Request text area and save.

Then in read mode, click the audio buttons.

limitations

If there is an API word limit, the read browser button may not run at all as API returns empty.
To enable, it is required that LWT TTS cookies be allowed to be stored.

next steps

Currently 'lwt_text' and 'lwt_lang' placeholders are revealed for user customizability but pitch and rate could also be given.

example request

As using subscriber services from Amazon Polly or Google text-to-speech would require authentication, huggingface is used for the example. The following can be pasted for the Japanese language text-to-speech configuration.

{
  "input": "https://skytnt-moe-tts.hf.space/run/predict",
  "options": {
    "method": "POST",
    "body": {
      "data": [
        "lwt_text",
        "鎌倉詩桜",
        1,
        false
      ],
      "event_data": "undefined",
      "fn_index": "5"
    },
    "headers": {
      "Content-Type": "application/json"
    }
  }
}

user contribs

If you would like, reply to the thread with the parameters you used in a successful JSON request call for whichever particular resource such as Azure, Polly, or Google text-to-speech as some resources are authentication tedious.

HugoFara · 2023-12-31T10:31:29Z

Phew, I finally managed to merge it 🥵

Instead of saving it as a cookie, it will be saved in the database as a language entity. The automatic database update will come a bit later but you can change it manually running ALTER TABLE languages ADD COLUMN LgTTSVoiceAPI varchar(2048) NOT NULL.

I also created a discussion on #174 so that users can share their tips. Inside LWT, the documentation is minimal for now, I may expend it later (I plan on scavenging data from #174).

It's a nice feature, great job on this!

Error documentation was also added.

New databse migration strategy. Fixes feeds (#168). Adds missing documentation to Docker (#146, #160). Changes in PHP and JS globals. Fixes reading position was not set. Read text through API (#153, #155). Fixes word was not saved/deleted. Fixes #170 and #69. Updates API (#175). Adds dependency to php-xml (#178, #181). Updates makefile (#179). Adds MeCab support on Mac (#135). Adds the option to hide/show word romanization (#119). Raises URL size limit to 2048 (#144).

ProgramComputer added 3 commits October 4, 2023 09:12

Add textbox and more

464b4cd

replace string with var

6fa7f1c

remove comment and domain may not set cookie if on remote serv

f524441

ProgramComputer changed the base branch from master to dev October 4, 2023 13:39

ProgramComputer changed the title ~~Adds Voice API support and resolves #143~~ [Feature] Add Voice API support and resolves #143 Oct 4, 2023

ProgramComputer added 2 commits October 4, 2023 09:32

Update text_to_speech_settings.php

4cb8fe5

Update user_interactions.js

51b46d2

ProgramComputer marked this pull request as ready for review October 4, 2023 14:43

ProgramComputer changed the title ~~[Feature] Add Voice API support and resolves #143~~ [Feature] Add Voice API support Oct 4, 2023

Update text_to_speech_settings.php

5df2532

HugoFara added enhancement Develop an existing feature ux User Experience could be better labels Dec 25, 2023

ProgramComputer added 3 commits December 26, 2023 17:24

add bracket

e13a72f

Update pgm.js

1a6103a

Merge branch 'dev' into customVoiceApi

3af0aab

HugoFara added new-feature A new feature and removed enhancement Develop an existing feature labels Dec 27, 2023

Merge remote-tracking branch 'origin/dev' into customVoiceApi

7cfd0fa

ProgramComputer mentioned this pull request Dec 30, 2023

[BUG] Mecab required when not set #155

Closed

HugoFara linked an issue Dec 31, 2023 that may be closed by this pull request

Azure TTS and Google Neural2 support for text-to-speech #143

Closed

HugoFara merged commit dbbb52a into HugoFara:dev Dec 31, 2023
4 checks passed

HugoFara added a commit that referenced this pull request Jan 1, 2024

Documents changes for #153.

a293809

ProgramComputer pushed a commit to ProgramComputer/lwt that referenced this pull request Jan 1, 2024

Documents changes for HugoFara#153.

7398e20

HugoFara added a commit that referenced this pull request Jan 2, 2024

Fixes #153: support for MeCab on Mac.

c2cb13b

Error documentation was also added.

ProgramComputer mentioned this pull request Jan 3, 2024

[BUG] Mecab not set #182

Closed

HugoFara mentioned this pull request Apr 1, 2024

Azure TTS and Google Neural2 support for text-to-speech #143

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature] Add Voice API support #153

[Feature] Add Voice API support #153

ProgramComputer commented Oct 4, 2023 •

edited

Loading

HugoFara commented Dec 31, 2023

[Feature] Add Voice API support #153

[Feature] Add Voice API support #153

Conversation

ProgramComputer commented Oct 4, 2023 • edited Loading

steps to test

limitations

next steps

example request

user contribs

HugoFara commented Dec 31, 2023

ProgramComputer commented Oct 4, 2023 •

edited

Loading