
Add kokoro tts #11

Open
thewh1teagle opened this issue Jan 5, 2025 · 9 comments

Comments

@thewh1teagle

Very cool project!
Consider using https://github.com/thewh1teagle/kokoro-onnx.
It supports Raspberry Pi.

@OriNachum
Owner

I used piperTTS.
Is that a good alternative, or is kokoro superior / more fitting?

@thewh1teagle
Author

> I used piperTTS.
> Is that a good alternative, or is kokoro superior / more fitting?

The quality is so much better.
It's a bit heavier: on macOS it runs 4x faster (M1 CPU), and on Windows 2x faster (Ryzen 5 NPU).
On a Raspberry Pi 4 it was much slower.
But I believe that if you have an NPU/GPU, onnxruntime will take advantage of it.

@OriNachum
Owner

I am integrating Nvidia Jetson Orin Nano Super 8GB.

It’s worth adding support, if only for the research and a POC.

@OriNachum
Owner

First, I saw your post on Reddit.
Second, I saw how easy it is to add, and I did most of the work on my Jetson Orin Nano.

Going to implement for Orin Nano tonight.

Kudos for the work; I would love to give acknowledgment and link your repo after I integrate and demo it.

@OriNachum
Owner

@thewh1teagle , testing on Nvidia Jetson Orin Nano Super 8GB.

No TensorRT support, and CUDA support is either missing or inefficient.

I can't check CPU or CUDA specifically, as working with a session requires EspeakConfig.
But streaming works great!

It does feel like I'm choking resources on the Jetson, with thinking and speaking at once.

Should I open an issue in your repo?

@thewh1teagle
Author

> as working with a session requires EspeakConfig.

That shouldn't be related.

> No TensorRT support, no CUDA support or it is inefficient.

CUDA is supported.
Try enabling the CUDA execution provider.

> Should I open an issue in your repo?

Yes, that can be helpful anyway.
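For readers following along, here is a minimal sketch of how provider selection could look when building the onnxruntime session for kokoro-onnx. The provider names are onnxruntime's own; the `pick_providers` helper is hypothetical (not part of kokoro-onnx), and the model filename in the comment is an assumption.

```python
def pick_providers(available):
    """Return an onnxruntime provider list that prefers GPU backends.

    `available` is what onnxruntime.get_available_providers() reports.
    onnxruntime tries providers in the order given, so accelerators
    come first and CPU stays as the always-available fallback.
    """
    preferred = [
        "TensorrtExecutionProvider",  # TensorRT, if built in (e.g. Jetson)
        "CUDAExecutionProvider",      # plain CUDA
        "CPUExecutionProvider",       # fallback
    ]
    chosen = [p for p in preferred if p in available]
    return chosen or ["CPUExecutionProvider"]


# With onnxruntime installed, the session could then be created as:
#   import onnxruntime as ort
#   session = ort.InferenceSession(
#       "kokoro.onnx",  # assumed model filename
#       providers=pick_providers(ort.get_available_providers()),
#   )
```

The order of the list matters: onnxruntime falls through to the next provider when one is unavailable or fails to initialize, so putting `CPUExecutionProvider` last keeps inference working even on machines without CUDA.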

@OriNachum
Owner

I'll give it another look and improve logging.

Thank you for the quick replies and reaching out!

@OriNachum
Owner

OriNachum commented Jan 11, 2025

@thewh1teagle, with a session I can force CUDA.
It runs at 643 MB and took 0.47 seconds for "Hello world".

I needed to pass espeak_config=None to create_with_session.

It looks like a trivial/minor bug; I'll open an issue.

Also, I'm going to add this to my demo and guide before I publish it on Medium and relevant groups.

@OriNachum
Owner

Just pushed an example class to the demo app.
Keeping this open for the refactor on the main project.
