    [DEVEL] Saera: Siri clone for Maemo5, Harmattan and Sailfish OS

    taixzo | # 871 | 2016-12-06, 01:53 | Report

    So, I've been looking into other speech synthesis platforms, because eSpeak is very robotic-sounding, especially compared to commercial efforts (Google Now, Siri, Cortana, etc.). As far as I can tell, the best alternative is flite. Flite's default voice is also very robotic-sounding; however, some contributed voices sound much more natural than eSpeak. I'm adding the best-sounding one I can find (cmu_us_clb) and packaging it separately; if Saera sees it installed, it will give you the option to use it instead of eSpeak. (I'm not bundling it by default, because (a) it's over 25 MB and (b) I still need to bundle eSpeak anyway to generate phonemes for voice recognition of song titles, contact names, etc.)
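
    As a rough illustration of the "use flite if it is installed, otherwise fall back to eSpeak" idea, here is a minimal Python sketch. It is not Saera's actual code: the function names and the voice-file path are hypothetical, and it assumes the engines are detected by looking for the `flite` and `espeak` executables on PATH (Saera may well check for the package instead).

```python
import shutil


def available_engines(which=shutil.which):
    """Return the TTS engines found on PATH, preferred engine first.

    `which` is injectable so the lookup can be faked in tests.
    """
    engines = []
    if which("flite"):
        engines.append("flite")
    if which("espeak"):
        engines.append("espeak")
    return engines


def build_command(engine, text, voice_file=None):
    """Build the argument list for speaking `text` with the given engine."""
    if engine == "flite":
        cmd = ["flite"]
        if voice_file:
            # Hypothetical path to a contributed voice such as cmu_us_clb.
            cmd += ["-voice", voice_file]
        return cmd + ["-t", text]
    if engine == "espeak":
        return ["espeak", text]
    raise ValueError("unknown engine: " + engine)


if __name__ == "__main__":
    found = available_engines()
    if found:
        print(build_command(found[0], "Hello from Saera"))
    else:
        print("No speech engine found")
```

    The resulting list could be passed to `subprocess.run`; keeping eSpeak in the list even when flite is present mirrors the point above that eSpeak is still needed for phoneme generation.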

    Here is a video I recorded, comparing eSpeak and flite side-by-side on the same outputs. Unfortunately I don't have a proper microphone, so it's just using the camera mic.



     
    taixzo | # 872 | 2016-12-11, 20:46 | Report

    More work has been done on navigation. I've changed the algorithm to look for your position along a path, instead of merely triggering directions when you get to intersections. This allows for better accuracy, especially with closely-spaced turns, without sacrificing reliability. It also allows the next instruction to be triggered as soon as you turn, rather than waiting until you leave a certain radius of the turn.
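
    To make the "position along a path" idea concrete, here is a small sketch of one common way to do it: project the current position onto every segment of the route and keep the closest one. This is an assumption about the general technique, not the code Saera uses. It works on flat x/y coordinates in metres (real GPS fixes would first need converting from latitude/longitude), and all names are made up for illustration.

```python
import math


def project_onto_segment(p, a, b):
    """Return (t, distance): t in [0, 1] locates the closest point on a-b."""
    ax, ay = a
    bx, by = b
    px, py = p
    dx, dy = bx - ax, by - ay
    seg_len_sq = dx * dx + dy * dy
    if seg_len_sq == 0.0:
        t = 0.0  # degenerate segment: both ends coincide
    else:
        t = ((px - ax) * dx + (py - ay) * dy) / seg_len_sq
        t = max(0.0, min(1.0, t))  # clamp to the segment's ends
    cx, cy = ax + t * dx, ay + t * dy
    return t, math.hypot(px - cx, py - cy)


def locate_on_path(p, path):
    """Return (segment_index, distance_along_path, off_path_distance).

    Returns None if the path has fewer than two points.
    """
    best = None
    travelled = 0.0  # length of all segments before the current one
    for i in range(len(path) - 1):
        a, b = path[i], path[i + 1]
        seg_len = math.hypot(b[0] - a[0], b[1] - a[1])
        t, off = project_onto_segment(p, a, b)
        if best is None or off < best[2]:
            best = (i, travelled + t * seg_len, off)
        travelled += seg_len
    return best


if __name__ == "__main__":
    route = [(0.0, 0.0), (100.0, 0.0), (100.0, 50.0)]  # one right-angle turn
    print(locate_on_path((40.0, 3.0), route))   # still on the first leg
    print(locate_on_path((100.0, 5.0), route))  # just past the turn
```

    Because the segment index changes as soon as the position is closer to the next leg, the next instruction can be triggered right after the turn, and a large off-path distance can be used to detect that the route has been lost.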

    Here is a video, showcasing several things:
    • Internet voice recognition mode
    • Local search
    • Turn-by-turn directions
    • flite speech synthesis



     
    Fuzzillogic | # 873 | 2016-12-11, 21:13 | Report

    For navigation, some graphic info would be welcome too. A simple arrow and distance would help ("↱ 300m").

    For TTS: have you seen (and heard) WaveNet?


     
    taixzo | # 874 | 2016-12-11, 21:39 | Report

    Originally Posted by Fuzzillogic View Post
    For navigation, some graphic info would be welcome too. A simple arrow and distance would help ("↱ 300m").
    These are already present; they show up on the right side of the screen, next to the direction text. I will consider making them more visible, though.

    Originally Posted by Fuzzillogic View Post
    For TTS: have you seen (and heard) WaveNet?
    I have. Unfortunately, WaveNet is not available for download. There are many speech synthesis engines out there that sound better than espeak or flite, but they are all either paid engines, web APIs (which do not work offline and have latency in playback), or research projects like WaveNet which are not available for anyone to use.
    Edit: Also, WaveNet is quite slow; on a laptop, it takes about 90 minutes to synthesize one second of speech. On a phone, it would probably be about 5 times slower still.
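
    To put those figures in perspective, here is the arithmetic spelled out. The inputs are just the estimates quoted above (90 minutes per second of speech on a laptop, roughly 5 times slower on a phone), not measurements, and the 3-second phrase is an arbitrary example.

```python
# Estimates taken from the post above, not measured values.
LAPTOP_MINUTES_PER_SPEECH_SECOND = 90
PHONE_SLOWDOWN = 5


def synthesis_seconds(speech_seconds, slowdown=1):
    """Seconds of compute needed to synthesize `speech_seconds` of audio."""
    return speech_seconds * LAPTOP_MINUTES_PER_SPEECH_SECOND * 60 * slowdown


if __name__ == "__main__":
    laptop = synthesis_seconds(1)
    phone = synthesis_seconds(1, PHONE_SLOWDOWN)
    print("laptop: %d s of compute per second of speech" % laptop)
    print("phone:  %d s of compute per second of speech" % phone)
    print("3-second phrase on a phone: %.1f hours" % (synthesis_seconds(3, PHONE_SLOWDOWN) / 3600))
```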


    Last edited by taixzo; 2016-12-11 at 22:35.

     
    Wikiwide | # 875 | 2017-01-01, 05:50 | Report

    Trying out Saera again, after a long absence. Not a clue what python-beautifulsoup does... How do you set alarms? Because "usual" alarms, set by the clock, do wake up the device even from complete shutdown. Also, the cities to report the time in could be taken from the Clock's list, too, if it helps.

    Thank you. Good day, good night, and good luck.


     
    taixzo | # 876 | 2017-01-30, 04:58 | Report

    Originally Posted by Wikiwide View Post
    Trying out Saera again, after a long absence. Not a clue what python-beautifulsoup does... How do you set alarms? Because "usual" alarms, set by the clock, do wake up the device even from complete shutdown. Also, the cities to report the time in could be taken from the Clock's list, too, if it helps.

    Thank you. Good day, good night, and good luck.
    To set an alarm, say "Set alarm for quarter past nine" or "Wake me at seven twenty five" or something similar. This sets a native clock alarm, which should wake the device even from a powered-off state. (On supported devices, that is; RTC alarms don't work on most non-Jolla SFOS ports.)
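
    For anyone curious how phrases like these can be turned into a time, here is a toy Python sketch. It is only an illustration under simple assumptions, not Saera's parser: it handles "quarter/half past/to <hour>" and "<hour> <minutes>" in number words, ignores AM/PM entirely, and does not show the call that sets the native alarm. All names are hypothetical.

```python
UNITS = {
    "zero": 0, "one": 1, "two": 2, "three": 3, "four": 4, "five": 5,
    "six": 6, "seven": 7, "eight": 8, "nine": 9, "ten": 10, "eleven": 11,
    "twelve": 12, "thirteen": 13, "fourteen": 14, "fifteen": 15,
    "sixteen": 16, "seventeen": 17, "eighteen": 18, "nineteen": 19,
}
TENS = {"twenty": 20, "thirty": 30, "forty": 40, "fifty": 50}


def words_to_number(words):
    """Sum simple number words, e.g. ['twenty', 'five'] gives 25."""
    total = 0
    for word in words:
        if word in UNITS:
            total += UNITS[word]
        elif word in TENS:
            total += TENS[word]
        else:
            raise ValueError("not a number word: " + word)
    return total


def parse_spoken_time(phrase):
    """Return (hour, minute) for a spoken alarm phrase; AM/PM is not resolved."""
    words = phrase.lower().replace("-", " ").split()
    # Drop the lead-in ("set alarm for", "wake me at") if there is one.
    for key in ("for", "at"):
        if key in words:
            words = words[words.index(key) + 1:]
            break
    if not words:
        raise ValueError("no time found in phrase")
    if len(words) >= 3 and words[0] in ("quarter", "half") and words[1] in ("past", "to"):
        minute = 15 if words[0] == "quarter" else 30
        hour = words_to_number(words[2:])
        if words[1] == "to":
            # "quarter to ten" means 9:45; wrap one o'clock back to twelve.
            hour = 12 if hour == 1 else hour - 1
            minute = 60 - minute
    else:
        hour = words_to_number(words[:1])
        minute = words_to_number(words[1:])
    if not (0 <= hour <= 23 and 0 <= minute <= 59):
        raise ValueError("time out of range")
    return hour, minute


if __name__ == "__main__":
    print(parse_spoken_time("Set alarm for quarter past nine"))
    print(parse_spoken_time("Wake me at seven twenty five"))
```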

    I'm gearing up for a big release tomorrow, and am hitting a snag on my Harmattan version: I recompiled Julius for Harmattan, but after I reboot the phone it refuses to run the julius binary, saying "Permission denied", even as root. Permissions are still 755. Copying it to a file with a different name and then copying it back fixes it until the next reboot. Is there anyone with Harmattan experience who might know how to fix this?


     
    taixzo | # 877 | 2017-01-31, 06:56 | Report

    I'm releasing Saera version 2.3 for SFOS only now, to hopefully be followed by the Maemo and Harmattan releases in the next few weeks. What's new here:
    • The updated navigation engine has finally arrived! This includes fixes such as looking at position relative to the path instead of going waypoint-to-waypoint for improved accuracy and less chance of losing the route.
    • When Saera is speaking, any music playing is paused so that information can be clearly heard.
    • Added a Settings page. This allows you to enable online voice recognition, which is disabled by default for privacy, but which when enabled improves recognition accuracy and vocabulary.
    • Fixed display issues on high pixel density screens (Jolla C, OnePlus X, etc.)
    • Added support for flite (which is a separate package, because it's big)
    • Fixed a lot of other miscellaneous bugs

    A few things that I started working on did not make it into this release, as they are not finished yet and I wanted to get this released before the coding competition was over. These include more settings on the settings page, reading out text messages and possibly dictation. I expect to release those sometime soon though.


     
    Fellfrosch | # 878 | 2017-01-31, 10:02 | Report

    Has anyone with the eternal "initializing speech recognition" problem got Saera working (without a factory reset)?


     
    taixzo | # 879 | 2017-01-31, 14:12 | Report

    Originally Posted by Fellfrosch View Post
    Has anyone with the eternal "initializing speech recognition" problem got Saera working (without a factory reset)?
    Could you start Saera from a terminal (run harbour-saera) and let me know what error you are getting?


     
    robthebold | # 880 | 2017-01-31, 16:06 | Report

    I'm running into a dependency problem: espeak isn't found.

    SFOS 2.0.5.6 Onyx (OnePlus X) port. I've run pkcon refresh. I seem to recall espeak was broken before in the 'crashes your device' sense. Has it been pulled from the repos? Is it too early in the morning for me and I'm forgetting something stupid?

    Thanks


     