I have an extension in the Chrome store that brings dictation into GMail. In Aug 2019, the Live Transcribe engine was open-sourced, &. What is especially interesting is that it seems it will be using the same language packs and RNNT models as the Recorder and GBoard apks To transcribe videos playing in the browser a new API is slowly being introduced: SODA. > Google is building speech recognition into Chromium, to bring a feature called Live Caption to the browser. Recent (May 2020) news suggests that these models may be coming to Chromium, which would make them widely accessible for offline transcription and dictation, e.g. Since GBoard uses TensorFlow Lite, and the blog post is also mentioning the use of this library, I was wondering if I could get my hands on the model, and import it in my own projects, maybe even using LWTNN. Unfortunately this speech recognizer is only available to Pixel owners at this time. > Especially the offline part is very appealing to me, as it should to any privacy conscious mind. Some hackers have been trying to reuse Google's offline speech recognition models within other software toolkits.
0 Comments
Leave a Reply. |