Speech Note


Experimental app for note taking with speech to text.

    Speech Note converts speech to text using the DeepSpeech library and language models. All voice processing is done entirely on the device. An Internet connection is required only to download models during the app's initial configuration. Speech Note respects your privacy and provides truly offline speech-to-text capability.
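
    To illustrate what local transcription with DeepSpeech looks like, here is a minimal sketch that uses DeepSpeech's Python bindings. It is not taken from the app's source code. The helper names (read_pcm16_mono, transcribe) and the file names in the usage note are hypothetical, and the 16 kHz mono 16-bit input format is an assumption based on the released DeepSpeech models.

```python
import sys
import wave
from array import array

# Assumption: released DeepSpeech models expect 16 kHz, mono, 16-bit PCM audio.
EXPECTED_RATE = 16000


def read_pcm16_mono(wav_path, expected_rate=EXPECTED_RATE):
    """Read a WAV file and return its samples as signed 16-bit integers."""
    with wave.open(wav_path, "rb") as wav:
        if wav.getnchannels() != 1 or wav.getsampwidth() != 2:
            raise ValueError("expected mono 16-bit PCM audio")
        if wav.getframerate() != expected_rate:
            raise ValueError(f"expected {expected_rate} Hz audio")
        frames = wav.readframes(wav.getnframes())
    samples = array("h")
    samples.frombytes(frames)
    if sys.byteorder == "big":
        samples.byteswap()  # WAV data is little-endian
    return samples


def transcribe(model_path, scorer_path, wav_path):
    """Transcribe a WAV file locally; no network access is involved."""
    # Imported lazily so the WAV helper above works without these packages.
    import numpy as np
    from deepspeech import Model

    model = Model(model_path)
    if scorer_path:
        # The external scorer is the language model used during decoding.
        model.enableExternalScorer(scorer_path)
    samples = read_pcm16_mono(wav_path, expected_rate=model.sampleRate())
    return model.stt(np.array(samples, dtype=np.int16))
```

    With a downloaded model and scorer, a call such as transcribe("model.pbmm", "model.scorer", "note.wav") would return the recognized text as a string. The file names here are placeholders.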

    DeepSpeech models for a particular language can be downloaded directly from the app. The following models are currently configured for download:

    • Czech / cs
    • English / en
    • German / de
    • Spanish / es
    • French / fr
    • French (Common Voice) / fr
    • Italian / it
    • Italian (Mozilla Italia) / it
    • Polish / pl
    • Chinese / zh-CN

    The exact sources are listed here.

    The quality of speech recognition depends strongly on the language model. In general it is not perfect, but for some languages it is surprisingly good. I would be grateful for any feedback on how good the transcription is for individual models.

    Known issues:

    • Jolla Tablet: does not work at all because there is no x86 build of the DeepSpeech library
    • Jolla 1: speech transcription is slow and the app sometimes crashes due to low-memory errors
    • PinePhone: very unstable and sometimes crashes the PulseAudio server

    Any comments, ideas, and issue reports are highly appreciated.

    Source code: https://github.com/mkiol/dsnote
    Bugs, Feature requests: https://github.com/mkiol/dsnote/issues or just email: dsnote@mkiol.net

    Application versions:

    Attachment                            Size         Date
    harbour-dsnote-1.0.1-1.aarch64.rpm    1 MB         29/04/2021 - 20:50
    harbour-dsnote-1.0.1-1.armv7hl.rpm    1012.08 KB   29/04/2021 - 20:50
    harbour-dsnote-1.2.0-1.armv7hl.rpm    1018.13 KB   18/09/2021 - 20:31
    harbour-dsnote-1.2.0-1.aarch64.rpm    1.01 MB      18/09/2021 - 20:31
    harbour-dsnote-1.3.0-1.aarch64.rpm    1.01 MB      01/10/2021 - 21:11
    harbour-dsnote-1.3.0-1.armv7hl.rpm    1 MB         01/10/2021 - 21:11

    Changelog: 

    1.3.0

    • Czech language model and translation (many thanks to Lukáš Karas for the contribution)
    • New additional models: French (Common Voice), Italian (Mozilla Italia)

    1.2.0

    • Option to transcribe audio file
    • Minor UI fixes and improvements

    1.0.1

    • Support for Jolla 1, Jolla C and PinePhone (alpha)
    • Speech recognition accuracy is much improved thanks to the DeepSpeech library update to version '0.10.0-alpha.3'
    • Minor UI fixes