Projects - Article Read-aloud-er
Overview
This project provides a Web (or mobile) service to read documents/articles aloud.
We use
Features
-
Docs imported from va理央宇s sources
-
Continuous Reading with looping among the
Similar alternatives;
-
OneNote Immersive Reader - https://support.microsoft.com/ja-jp/topic/onenote-のイマーシブ-リーダー機能を使用する-10712138-b4ed-4513-958d-d9a1b3038170
Concepts
This section describes the concepts, entities, or pieces of ideas used in this project.
Article
An “article” is a “text content” to be processed by the project. It is provided by one of the supported “(article) directories.”
Notes
-
Learning about APIs
-
OneNote Investigations - OneNote
-
TTS
-
-
Trying Immersive Reader SDK - https://learn.microsoft.com/en-us/azure/applied-ai-services/immersive-reader/quickstarts/client-libraries?pivots=programming-language-nodejs / https://learn.microsoft.com/en-us/azure/applied-ai-services/immersive-reader/reference
-
need to follow https://learn.microsoft.com/en-us/azure/applied-ai-services/immersive-reader/how-to-create-immersive-reader to create tokens
-
NOTE: https://learn.microsoft.com/en-us/azure/applied-ai-services/immersive-reader/reference#readaloudoptions this table has
autoPlay
(capital P) but actual working property name isautoplay.
-
Can we “hack” by messages..?
-
-
Trying OneNote APIs
Considerations
-
over Immersive Reader?
-
continuous/ambient reading i.e. proceeding the the next document automatically
-
-
https://learn.microsoft.com/en-us/azure/applied-ai-services/immersive-reader/
-
Immersive Reader itself is also an option~
-
-
Text modifications for PDF
-
We might workaround this for Immersive Reader by converting PDF to html/text first though.
-
less
-
TOOD: Investigate more
-
-
pdftohtml
-
Does not process “column layout” well (without proper metadata embedded?)
-
But did better than OneNote’s OCR in some cases.
-
-
Reference
-
Document Storage APIs
-
Text-to-Speech APIs/Libs
-
PDF Text Extraction
-
VOIVEVOX
-
OpenTTS
-
https://github.com/synesthesiam/opentts
-
I like
-
marytts: cmu-slt-hsmm [female] (en_us)
? -
larynx:ljspeech-glow_tts
-
larynx:glados-glow_tts
-
-
-